Summary
Overview
Work History
Education
Skills
Certification
Websites
Timeline
Generic
Vinoth Subbiah

Vinoth Subbiah

Dubai

Summary

19+ years of experienced & result oriented DevOps Architect possessing in-depth experience in managing

cloud-based technology & effectively handling configuration & deployment of infrastructure & services. Gained

hands-on experience in implementing core DevOps concepts such as containerization, Kubernetes virtualization,

version control, cloud computing, database management & administration, load balancing, on- prem and private

Cloud etc., by using a wide variety of technologies while working with multiple DBMS Operating Systems &

programming languages. Drives excellence in every project to deliver outstanding results.

Overview

19
19
years of professional experience
1
1
Certification

Work History

Senior SRE

Emirates NBD
Dubai
08.2023 - Current
  • System Stability & Performance: Ensure the consistent reliability and optimal performance of critical banking systems and applications, ensuring uninterrupted service for users.
  • Incident Management Leadership: Lead the charge in incident response, swiftly identifying and addressing issues to minimize impact and maintain service continuity.
  • Monitoring & Alerting: Spearhead the implementation and management of advanced monitoring tools, such as Pingdom, AppDynamics, and ELK, ensuring immediate alerts for any system anomalies or performance deviations.
  • Service Level Objectives (SLO) Definition: Defined clear SLOs for Tier 1 and 2 applications, establishing benchmarks for system performance and reliability. Regularly communicate these metrics, alongside Service Level Indicators (SLI), to leadership via MS Teams channels.
  • Self-Healing Automation: Pioneered the development and deployment of self-healing automation mechanisms, leveraging tools like Jenkins, Ansible, and Python. This proactive approach effectively mitigates repeat issues, enhancing system resilience and stability.
  • Cloud & Container Management: Oversee and optimize cloud-based environments, particularly in Azure and Openshift, ensuring seamless integration and operational excellence.
  • Java & .NET Application Support: Provide specialized support for Java and .NET applications, addressing unique challenges and ensuring optimal performance across diverse platforms.
  • Disaster Recovery Strategy: Develop and refine comprehensive disaster recovery plans, ensuring the bank's critical functions can swiftly recover from any unforeseen events.
  • Cross-Functional Collaboration: Engage actively with diverse teams, fostering collaboration to ensure aligned objectives and cohesive project outcomes.
  • Documentation & Knowledge Sharing: Maintain thorough documentation of system configurations, processes, and best practices, promoting a culture of continuous learning and knowledge dissemination.
  • Mentorship & Technical Leadership: Provide guidance and mentorship to junior team members, ensuring the team remains at the forefront of industry trends and best practices in banking technology.

Principal Site Reliability Engineer

Granicus - USA ( Payroll in LTIMindtree)
St Paul
12.2019 - 07.2023
RabbitMQZabbixJforg
  • System Architecture & Deployment: Lead the design and execution of resilient, scalable, and highly available cloud-based solutions on platforms such as AWS, Azure, and VMware. Oversee the successful migration of applications while optimizing release cycles.
  • Continuous Integration/Continuous Deployment (CI/CD): Spearhead the development and maintenance of robust CI/CD pipelines leveraging tools like GitLab, Jenkins, and other automation frameworks. Automate infrastructure deployment and streamline configuration management using tools like Ansible and Terraform.
  • Performance Monitoring & Management: Utilize monitoring Zabbixlike Zabbix, LogicMonitor, New Relic, Datadog, Site24/7, Pingdom, and PagerDuty to ensure optimal system performance and availability.
  • Containerization & Networking: Proficiently manage Kubernetes and Docker container environments. Oversee the configuration and optimization of F5 load balancers, CDNs (including Akamai and Imperva), and other critical network infrastructures.
  • Collaborative Development: Engage proactively with software development teams, advocating for reliability, scalability, and performance-centric application designs and practices.
  • Scripting & Tool Development: Create and refine tools and scripts using Python, PowerShell, and other scripting languages to enhance system automation and efficiency.
  • Team Leadership & Mentorship: Provide mentorship and guidance to a global team of over 20 engineers. Foster a culture of continuous learning, ensuring the adoption of best practices and emerging technologies.
  • Problem Resolution: Dive deep into intricate technical challenges, deliver comprehensive root cause analyses (RCAs), and drive actionable solutions to prevent future recurrences.
  • Agile Project Management: Champion Agile methodologies, ensuring timely project deliveries within stipulated budgets. Offer technical leadership to uphold best practices and procedural adherence across teams.
  • Vendor Management: Cultivate and maintain strong vendor partnerships, ensuring adherence to service level agreements (SLAs) and fostering collaborative growth.
  • 24/7 Support: Participate actively in the on-call rotation, ensuring uninterrupted system availability and providing swift resolutions to critical issues.

Automation Engineer

Flydubai
Dubai
01.2016 - 11.2019
Akamaio functional and technical design documentation, ensuring adherence to industry best practices.
  • Cloud Infrastructure: Championed cloud solutions for production servers, focusing on security, compliance, and efficiency.
  • DevOps & CI/CD: Established CI/CD pipelines using GitHub Actions, streamlining application deployment and integration processes.
  • Message Brokers: Integrated RabbitMQ and Kafka, enhancing data processing capabilities and optimizing application performance.
  • Disaster Recovery & Migration: Architected robust disaster recovery solutions within AWS, ensuring business continuity and resilience.
  • Team Collaboration & Access Management: Managed project groups, overseeing code access and fostering collaborative development efforts.
  • Quality Assurance & Reporting: Generated comprehensive reports using tools like SonarQube, detailing test outcomes, and bug metrics.
  • Environment Management: Oversaw the configuration and maintenance of diverse environments within on-premises data centers, ensuring consistent performance and reliability.
  • Monitoring & Troubleshooting: Utilized advanced APM solutions for system monitoring, troubleshooting, and proactive issue resolution.
  • : Implemented Grafana to provide a consolidated view of system metrics, facilitating real-time insights and decision-making.

    Technical Specialist and Automation Engineer

    Mindtree
    CHENNAI
    07.2010 - 12.2015
    • Collaborative Development Approach: Collaborate closely with the Product Management team to prioritize and address issues highlighted during the Problem Management phase.
    • Performance & Infrastructure Management: Offer expertise in devising strategies for performance optimization, ensuring robust disaster recovery solutions, and establishing comprehensive monitoring and access management protocols.
    • User Support & Enhancement: Engage proactively with business users to identify and understand system-related challenges. Conduct in-depth root cause analyses, and collaborate with the technical team to implement effective solutions, be it enhancements or fixes.
    • Project Coordination: Collaborate with cross-functional teams to establish and communicate development milestones, schedules, and ongoing project statuses, ensuring transparency and alignment across the board.
    • Engineering Oversight: Provide technical guidance across various workstreams, emphasizing incident & problem management, change control, as well as adherence to security and compliance standards.
    • Infrastructure Enhancement: Spearhead initiatives to bolster the security and performance of existing infrastructure, fostering collaboration with other departments for cohesive outcomes.
    • Team Leadership & Continuous Learning: Take the lead in fostering a culture of innovation and continuous learning within the team. Stay abreast of industry advancements and ensure that the team is equipped with the latest knowledge and skills.
    • Disaster Recovery Expertise: Strategize and implement comprehensive Disaster Recovery solutions, ensuring the resilience and continuity of operations for a diverse clientele of over 50+ customers.

    Senior System Administrator

    CMCK LLC ( Payroll in HCL Ltd)
    CHENNAI
    09.2008 - 07.2010
    • Virtualization & Deployment Expertise: Collaborated with teams to ensure smooth ESX 3.5 installations, VM deployments, and migration processes. Managed transitions from 2.5 farms to 3.5 farms while adhering to best practices.
    • Hardware & Storage Management: Oversaw the deployment of 30 ESX 3.5 hosts on Dell M600 hardware and integrated Dell EqualLogic iSCSI storage solutions. Implemented RAID configurations for critical server environments to ensure optimal performance and reliability.
    • Server Consolidation Leadership: Led initiatives to consolidate servers, transitioning from physical to virtual environments. Ensured seamless transitions while maintaining system integrity.
    • Standards & Best Practices: Established and documented corporate standards for VMware ESX Server support. Regularly assessed servers and applications for virtualization feasibility, ensuring alignment with industry best practices.
    • Network Administration & Citrix Management: Managed Citrix server PS 4.0 configurations, printer setups, and policy implementations. Ensured continuous web interface accessibility through Juniper and NetScaler solutions.
    • SCSI Device Management: Handled the end-to-end process of SCSI device installations, configurations, and testing, guaranteeing compatibility and optimal performance.
    • Windows Server Oversight: Administered a vast Windows-based network comprising over 800 servers. Specialized in the upkeep and management of Dell PowerEdge R300 servers, ensuring system uptime and reliability..

    Support Engineer

    Dell
    Chennai
    03.2008 - 08.2008
    • System Maintenance & Troubleshooting: Specialized in the maintenance and timely troubleshooting of Dell systems, ensuring optimal performance and minimizing downtime.
    • Platform Support Expertise: Delivered proficient support for both Windows and VMware environments, addressing issues promptly to uphold system reliability and user satisfaction.
    • Datacenter Strategy & Deployment: Played a pivotal role in datacenter planning and execution, ensuring infrastructure alignment with organizational objectives and future scalability.
    • Storage Configuration Proficiency: Expertly handled RAID configurations, optimizing data storage and ensuring data integrity across platforms.
    • Peripheral Device Management: Managed the configuration and integration of SCSI devices, ensuring seamless compatibility and functionality within the system architecture.

    System Administrator

    PRECISION TECHSERVE (P) Ltd
    Trichy
    05.2007 - 03.2008
    • Environment Oversight: Maintained system health and performance.
    • Technical Troubleshooting: Resolved system bugs for uninterrupted operations.
    • Configuration Management: Adapted systems to evolving business needs.
    • Monitoring & Optimization: Implemented real-time system health checks.
    • Platform Expertise: Supported Windows and VMware platforms.
    • Infrastructure Planning: Led datacenter scalability initiatives.
    • Storage & Cluster Management: Managed RAID and cluster configurations.
    • Comprehensive System Management: Oversaw hardware and software maintenance.
    • Security & Compliance: Conducted regular Windows updates and addressed security findings.

    System Administrator

    Accel Frontline Ltd
    DalavoiDalmiapuramDalmiaTrichy
    08.2004 - 05.2007

    FM Engineer at M/S Dalmia Cement (Bharat) Ltd, Dalmiapuram Ariyalur:

    • Oversaw maintenance for 190+ systems and managed 150 printers.

    Engineer at M/s India Cement Corp.Ltd, Dalavoi Works, Perambalur Dt:

    • Managed maintenance for 80+ systems and printers, primarily under Windows NT and Windows98 platforms, spanning from Jan 2005 to Nov 2006.

    Education

    Post Graduate Program - DevOps

    Directorate Of Technical Education
    Online
    04-2023

    Diploma in Computer Technology - Computer Science

    Srinivasa Polytechnic College
    Tiruchirappalli, India
    04-2004

    Skills

    • Jenkins
    • Docker
    • Kubernetes
    • Terraform
    • Chef
    • Puppet
    • Ansible
    • Jforg
    • Nagios
    • Zabbix
    • Azure DevOps
    • New Relic
    • PagerDuty
    • Pingdom
    • Prometheus
    • Helm
    • Python
    • JSON
    • Windows
    • Linux
    • GitHub
    • GitLab
    • WebLogic
    • Tomcat
    • Apache
    • Atlassian Jira
    • RabbitMQ
    • Kafka
    • Nginx
    • VMWare
    • Azure
    • Azure Private Cloud
    • ESXi

    Certification

    • HashiCorp Certified: Terraform Associate (003)
    • Certified Kubernetes Administrator (CKA)
    • AWS Certified SysOps
    • AWS Certified Solution Architect Associate
    • Microsoft Certified: Azure Administrator Associate
    • Microsoft Azure Architect Design
    • Microsoft Certified Professional
    • Microsoft Certified Technology Specialist
    • Microsoft Certified IT Professional
    • Microsoft Certified Implementing
    • Microsoft Azure Infrastructure Solutions
    • Microsoft Certified Server
    • Virtualization with Windows Server Hyper-V and System Center
    • VMware Certified Professional
    • Logicmonitor certified associate
    • PagerDuty Certified Foundational
    • PagerDuty Certified API Specialty
    • PagerDuty Certified Incident Responder
    • NewRelic Full Stack Certified
    • Python Administrator
    • RedHat OpenShift Certificated training

    Timeline

    Senior SRE

    Emirates NBD
    08.2023 - Current

    Principal Site Reliability Engineer

    Granicus - USA ( Payroll in LTIMindtree)
    12.2019 - 07.2023

    Automation Engineer

    Flydubai
    01.2016 - 11.2019

    Technical Specialist and Automation Engineer

    Mindtree
    07.2010 - 12.2015

    Senior System Administrator

    CMCK LLC ( Payroll in HCL Ltd)
    09.2008 - 07.2010

    Support Engineer

    Dell
    03.2008 - 08.2008

    System Administrator

    PRECISION TECHSERVE (P) Ltd
    05.2007 - 03.2008

    System Administrator

    Accel Frontline Ltd
    08.2004 - 05.2007

    Post Graduate Program - DevOps

    Directorate Of Technical Education

    Diploma in Computer Technology - Computer Science

    Srinivasa Polytechnic College
    Vinoth Subbiah