- Featured in:
Find out what is the best resume for you in our Ultimate Resume Format Guide.
Additional Engineering Resume Samples
Site Reliability Engineer Resume Samples
No results found
0-5 years of experience
- Implemented, tested and monitored microservices in the datacenter cloud environments for Cisco-Jasper IOT platform. Performing continuous integration and delivery of new microservices, on-demand trouble shooting of large-scale deployment issues on Linux systems. Started and maintained How-To series of knowledge items, sharing acquired information about installation, integration and deployment for Middleware services on privately hosted and public clouds, including AWS, Google and IBM clouds.
- Provided configuration, maintenance and testing of Jibe pipeline framework for Apple Corp., allowing migration of data between heterogeneous systems and services. Worked on creating Maven based build environment, testing import and export components of the Jibe framework, integration with Kafka services, monitoring data synchronization between Oracle and Mongo databases. Enabled continuous build and deployment automation for hybrid cloud environment, expanding integration coverage for software defined enterprise infrastructure.
- Developed and maintained a toolchain framework for configuring, integrating and testing a set of Middleware tools, used to build corporate VMware products and services. Project resulted in continuous management of the private cloud based distributed repositories, allowing automated test driven synchronization of the toolchain content. Created and managed a virtual test lab environment for testing enterprise services inside multitenant cloud infrastructure.
- Designed and implemented adaptive remote testing framework for installation and customization of multitenant cloud environments, their integration with distributed data sources.
0-5 years of experience
Deploy and monitor Amazon Web Service resources (EC2, VPC, ELB, S3, RDS) using Boto, Terraform and Chef
- Deploy code updates into test and production environments and work to roll environments forward
- Maintain Git repositories for developers and promote topic branch workflow
- Help with Support tickets by reproducing bugs
- Troubleshoot and escalate bugs for our Live server product
0-5 years of experience
Provide systems support by participating in rotational on-call support as well as performing recovery, maintenance and upgrades during weekend and evening hours.
- Serve as an escalation point for other Systems Administrators, Engineers, and other technology teams in the resolution of server and system problems.
- Contribute to the development and maintenance of automation tools used in the management of our infrastructure.
- Plan, schedule, test and perform software installation and upgrades.
- Create and maintain documentation of systems and processes for existing and new systems.
- Build, administer, and troubleshoot all mission critical environments (Production, Stage, Dev, Test, QA)
- Coordinate changes with application owners to ensure minimal user impact.
- Maintain PCI and SOX compliance with required applications and environments.
0-5 years of experience
- Deploy and maintain international server environment for 24/7 critical uptime business product offering in a mixed Windows/Linux environment.
- Leverage automation tools, especially Powershell and Puppet, in order to decrease end-to-end deployment times, reduce downtime, and increase reliability.
- Implement and maintain monitoring solutions at the server and application level in order to increase visibility into day-to-day operations and issues, utilizing SCOM, Nagios, Solarwinds and AppDynamics.
- Lead initiatives to transition critical software services into the Cloud, and provide training for other employees on the Cloud transition process for other portions of the product/organization.
- Act as top-tier on-call support for critical uptime business applications to maintain availability and minimize downtime during outage scenarios.
- Provide training for System Administrators and other Engineers, including brown-bag style trainings, documentation, and one-on-one mentorship.
0-5 years of experience
- Write automation/self-healing scripts in Ruby / BASH / Go to maintain the Bluemix cloud environment
- Manage the stability, operation, and automation of more than 50 Bluemix environments (Cloud Foundry-based cloud platforms)
- Perform primary/secondary on call duties to manage alerts on pager duty and solve issues
- Perform Cloud Foundry deployments to Bluemix using BOSH and Urban Code Deploy
- Create/maintain Slack integration bot which supports the Bluemix SRE team (Ruby/Sinatra)
- Contribute to development pipeline for Urban Code Deploy using Golang
0-5 years of experience
- Front line technical service reliability operators accountable for handling critical customer issues coming in via support phone line and HUB.
- Responsible for first touch incident resolution (via TSG or SOP) or escalation to the appropriate resource within SLA.
- Responsible for monitoring the live service via HUB alerts, Heads up Displays, Manual service checks or customer escalations.
- Accountable for High Priority Bridge Moderation (Spin up bridge, start whiteboard, document sequence of events).
- Document and refine Phone Script, TSGs and SOPs.
- Service Request Management (User Provisioning, Client Invites, Environment requests, Deployments, etc.)
- Responsible for refining Service Center tools and process
0-5 years of experience
- Preparing the Business Process Flow using Bizagi Modeler
- Responsible for setting up ELK (ElasticSearch, Logstash, Kibana) platform, parsing unstructured logs using regular expressions to structured JSON format
- Passing the structured data to ElasticSearch and performing operations on this data
- Analyzing the data on Kibana, Graphana and Graphite and deriving the performance of the products. Stabilizing the servers
- Setting up alerts, handling overloads on server, performing release engineering
- Analyzing, investigating and resolving problems to help smooth product performance. Programming in Visual Studio Code, tracking the progress through JIRA and Git Repositories
0-5 years of experience
- Collect and maintain a complete inventory of all systems. Identify and retire unused systems to recycle resources and reduce maintenance costs.
- Configure and maintain thousands of systems via a set of Chef cookbooks within an Atlassian continuous build and deploy environment (Jira, Confluence, Stash, Bamboo, Git).
- Identify and correct the root cause of various system alarms. Recommend changes to avoid their recurrence.
- Configure and maintain Amazon Web Services (AWS) Cloud Computing environments.