Site Reliability Engineer

Resume

Software Engineer with 15 years of experience in Site Reliability Engineering, Enterprise Software Development and Operations. Proficient in Kubernetes, Java development, scripting and Incident Management. Experienced in migrating massively distributed services to cloud platforms.

EDUCATION

CALIFORNIA STATE UNIVERSITY, FULLERTON, CA

Master of Business Administration Information Systems 2017


UNIVERSITY OF CALICUT, INDIA

Bachelor of Technology Electrical & Electronics Engineering 2006

PROFESSIONAL EXPERIENCE

APPLE INC, CUPERTINO CA

Site Reliability Engineer, Aug. 2017 to Current

    • Cloud Architect: Designed and implemented Kubernetes based cloud solutions on which several hundreds of Apple Maps and Apple Find My iPhone services are running

    • Migrated close to 140 microservices from physical nodes to Kubernetes which resulted in a significant improvement in capacity utilization and mitigated a myriad of operations issues associated with physical nodes such as periodic OS upgrades, disk fill up and other issues associated to hosting multiple apps on same nodes.

    • Evangelize Kubernetes and provide tooling for legacy applications to migrate to Kubernetes platform

    • Wrote Helm charts to spin up standalone Solr cluster on Kubernetes

    • Took ownership large migration projects which needed coordination with multiple stakeholders and drove it to completion under tight timeframes

    • Implemented end-to-end Release automation based on Jenkins Pipeline as code which resulted in 80% time saving and zero human intervention

    • Responsible for maintaining 99.999% service uptime through zero-downtime Software Releases

    • Work in sync with DevOps team to setup CI CD pipelines to enhance developer productivity

    • Create detailed dashboards to monitor app performance using Splunk and Apple internal tools

    • Spin up on-demand Solr, Kafka, Redis and Zookeeper instances on Kubernetes for app developers to experiment

    • Functioned as the Direct Responsible Individual for triaging application crash or performance issues using heap dumps, thread dumps and application logs

    • Participate in on-call rotation and coordinate war rooms

    • Technology / Tools: Kubernetes 1.18, Linux, Jenkins, Jenkins Pipeline as code, Gradle, Helm charts development, Spinnaker, Java 14, Python 3, Bash scripting

STATE STREET IMS (CONSULTANT, SYNTEL INC), IRVINE CA DevOps Engineer/Java Developer, March 2011 – August 2017

  • Setup Jenkins Continuous Integration server for the first time in the organization

  • Wrote build scripts in bash, ANT and Windows batch to build & deploy wide technology stack of projects including Java, Glassfish, Websphere, Tomcat CPP, VB, PowerBuilder and COBOL

  • Develop/maintain in-house Java based build and dependency tools

  • Manage monthly application production releases and do post-release validations. Monitor and maintain nightly builds.

  • Sybase to Oracle migration - Performed Export Transform Load (ETL) of data from Sybase to Oracle using Talend Open Studio v 4.1.2

  • Technology / Tools: Solaris


WELLPOINT ANTHEM (CONSULTANT, UST GLOBAL), WOODLAND HILLS CA

Technical Lead/ Software Engineer, July 2010 – March 2011

  • Developed Java based webservices for WellPoint Online Store for healthcare plans

  • Enhanced admin tool for WellPoint’s pharmacy website

  • Prepare technical design documents including UML design, DB design, Webservice design

  • Java Coding, Unit testing and support for production deployment

  • Wrote Spring batch jobs for DB refresh activities

  • Tools: IBM Rational Application Developer, IBM DB2 client, IBM Portal Server, CVS Software, UNIX, Java

UST GLOBAL, TECHNOPARK TRIVANDRUM, INDIA

Junior Software Engineer / Internship, Oct 2006 – July 2010

  • Developed batch program capable of processing 450K records in multiple threads simultaneously

  • Extensive soft skill development workshops and courses

  • Technology / Tools: UNIX, Java, IBM DB2 client

SKILLS

  • K8s migration

  • Expert in conducting large scale Release Management

  • Experience in setting up Continuous Delivery for microservices for large organizations

  • Linux troubleshooting

Java programming

  • On-call rotation and incident management

  • CISA, SCJP certified