Job Information
TEKsystems Site Reliability Engineer in Ann Arbor, Michigan
Seeking a motivated and skilled Site Reliability Engineer (SRE) to join our team. In this role, you will be instrumental in ensuring the operational uptime and reliability of our applications and services hosted in both Azure and on-premises environments. As a member of the Site Reliability team, your responsibilities will include managing on-call activities, implementing configuration changes, executing upgrades, and performing maintenance across a diverse range of systems.
Responsible for the overall operational uptime of the eCommerce and Corporate RedHat Linux VMWare Guest environments within eCommerce and Corporate systems, as well as the environments in Azure. This position requires a wide base of knowledge from basic Linux administration through capacity planning and middleware, as well as an understanding of public cloud platforms and containerization platforms (preference for Azure and AKS).
Duties and responsibilities:
On-Call Support: Manage major incidents, ensuring swift restoration of services and stability.
Alert Response: Proactively monitor alerts and respond to issues to maintain system reliability.
Release Management: Participate in the release and deployment processes for infrastructure and services.
Platform Management: Maintain a standard platform that is current and extensible for all relevant environments.
Documentation: Ensure provisioning practices and documentation are regularly updated and maintained.
Automation: Identify automation opportunities, engage in related initiatives, and manage content in version control.
Configuration Management: Implement configuration management for both server platforms and service configurations.
Troubleshooting: Diagnose and resolve issues with services and applications.
CI/CD Deployment: Deploy services using CI/CD pipelines and automation tools.
Infrastructure Support: Assist in supporting and upgrading a wide range of infrastructure components.
Skills
Site reliability, Linux, Red hat, AKS, Terraform
Additional Skills & Qualifications
*Education: Bachelor’s degree in Computer Science or a related field, or equivalent experience.
*Advanced English: Exceptional written and verbal English communication skills are essential, as this role requires collaboration with global teams and documentation of the migration process.
*Experience: Minimum of 4-6 years of production application support in high-availability environments.
*System Administration: Familiarity with UNIX/Linux administration, including troubleshooting performance issues and basic network configuration.
*Scripting Skills: Familiarity with scripting languages such as Bash and Python.
*Kubernetes Knowledge: Basic understanding of Kubernetes.
*Configuration Management: Familiarity with tools such as Terraform and Puppet.
*Orchestration Management: Familiarity with Jenkins Pipelines, Github Workflows, Bitbucket and VMware for VM orchestration.
*Data Management: Understanding of YAML and other data serialization formats like JSON and XML.
*Project Management: Ability to work independently on complex projects with minimal supervision.
*Communication Skills: Strong written and verbal communication skills, capable of creating clear documentation for technical and non-technical audiences.
*Process Improvement: Awareness of process and efficiency enhancement methodologies.
Nice to Have:
Understanding of the core tenants of Java (troubleshooting JVMs, garbage collection, heap, etc)
Web service administration
The desire to consistently learn and find better, more efficient ways to do things is critical for the right candidate in this role -- someone who loves to automate things rather than do the same tasks over and over would be ideal. They would rather take someone who is more junior but eager to learn and contribute than someone more advanced who prefers to sit back or work at a slower pace.
Pay and Benefits
The pay range for this position is $50.00 - $65.00/hr.
Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following: • Medical, dental & vision• Critical Illness, Accident, and Hospital• 401(k) Retirement Plan – Pre-tax and Roth post-tax contributions available• Life Insurance (Voluntary Life & AD&D for the employee and dependents)• Short and long-term disability• Health Spending Account (HSA)• Transportation benefits• Employee Assistance Program• Time Off/Leave (PTO, Vacation or Sick Leave)
Workplace Type
This is a hybrid position in Ann Arbor,MI.
Application Deadline
This position is anticipated to close on Apr 23, 2025.
About TEKsystems:
We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company.
The company is an equal opportunity employer and will consider all applications without regards to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.