image
  • Snapboard
  • Activity
  • Reports
  • Campaign
Welcome ,
loadingbar
Loading, Please wait..!!

Sr Software Engineer Site Reliability Engineering

  • ... Posted on: Mar 06, 2026
  • ... Stellent IT LLC
  • ... ATL, Georgia
  • ... Salary: Not Available
  • ... Full-time

Sr Software Engineer Site Reliability Engineering   

Job Title :

Sr Software Engineer Site Reliability Engineering

Job Type :

Full-time

Job Location :

ATL Georgia United States

Remote :

No

Jobcon Logo Job Description :

Job Title: Software Engineer

Location: ATL, Georgia (Hybrid)

Duration : CTH

USC or GC holders only

years and under, sweet spot 5-7 years.

Skills

  • AWS broadly (EC2, but moving away and going to Fargate) ECS, code pipeline and build, deploy, RDS, Dynamo, WAP, CloudFront)
  • GitHub actions moving away from AWS code connect
  • Terraform (must) most of the day there, or CDK is a benefit but terraform the hard requirement
  • Terraform modules strong plus
  • Splunk, new relic, pager duty is great lot of alerting, dashboarding, capacity planning, running internal PaaS

Any dev experience they favor more (from dev to devops seems to work better than ops side moving in because they deal mostly with dev teams passing code and application questions) Dev's use .NET, people have come from Ruby, PHP, more developing and deploying into prod environment is a good background.

Some very large scale project, going through Fargate conversion 1-2 people on project, individual projects security initiatives, implement with business partners firewall rules, WAP and business traffic, longer term things.

Middle experience, pic up project and drive independently, imbed with team, infra, solve, monitoring,

Interview: 1st round with him, quick screen, overview on role, team structure, projects, high level tech questions, not coding more high-level systems design on the spectrum

Second round with tech leads: more systems design, potentially a live coding question things. Theu have an eye on people using AI to answer, culture ft as well.

Job Description:

This role is for an opening for a Senior Site Reliability Engineer (SRE) on the Manheim Logistics SRE team. The SRE team is tasked with designing and maintaining AWS infrastructure and deployment pipelines for Manheim Logistics' 15+ development teams. The team has currently standardized on a Docker-based infrastructure solution and is adding functionality to support new development team requests and architectural patterns (such as Lambda, Step Functions, Fargate, etc). The SRE team has a strong focus on IaC with Terraform and best practices such as least privilege access, proactive monitoring and alerting, etc. This role will work directly with a release train and help with IaC and SRE activites such as improving monitoring/alerting, defining an error budget, assisting with DevSecOps, etc.

As a Senior Site Reliability Engineer at Cox Automotive you will:

  • Strong automation experience- testing, deploying, monitoring, etc.
  • Take complex problems and come up with a technically reasonable solution
  • Experience working with and defining SLOs, error budgets, etc.
  • Have innate curiosity about how things work
  • Design and assist in the authoring of software tools that reliably manage application delivery & performance
  • Design and assist in the setup and maintenance of application monitoring and alerting
  • Engage with engineering teams to ensure best practices are implemented
  • Improve predictability and reliability of software releases, workflows, and operating software.
  • Reduce mean time to recovery (MTTR) by helping troubleshoot, monitor, alert, and automating recovery.

    Qualifications:
  • Bachelor's degree in Computer Science or related field and at least 3-5 years working experience
  • Expertise in software development and architecture/solutioning experience
  • Strong background with Terraform
  • Experience with Amazon AWS technologies especially: ECS and Lambda
  • Experience with monitoring/observability tools such as: New Relic, Splunk, PagerDuty
  • Experience with agile development, continuous integration and automated testing
  • Solid written communication, problem solving, and process management skills

    Preferred Skills:
  • Broad AWS platform skills including Cognito, WAF, Elasticache (Redis), Elasticsearch, SNS, SQS, S3, Systems Manager
  • Experience automating Terraform at scale
  • Experience with Database Server infrastructure (RDS, MySQL, Postgres, etc)
  • .NET core development experience
  • GitHub Actions

Experience with Github, docker, and Linux adminstration experience

Priyanka Yadav

IT Technical Recruiter

Email:

Address:
505 Knolle Court
Saint Augustine, FL 32092

Telephone:
321-497-8237

Jobcon Logo Position Details

Posted:

Mar 06, 2026

Reference Number:

47230-6911

Employment:

Full-time

Salary:

Not Available

City:

ATL

Job Origin:

CIEPAL_ORGANIC_FEED

Share this job:

  • linkedin

Jobcon Logo
A job sourcing event
In Dallas Fort Worth
Aug 19, 2017 9am-6pm
All job seekers welcome!

Sr Software Engineer Site Reliability Engineering    Apply

Click on the below icons to share this job to Linkedin, Twitter!

Job Title: Software Engineer

Location: ATL, Georgia (Hybrid)

Duration : CTH

USC or GC holders only

years and under, sweet spot 5-7 years.

Skills

  • AWS broadly (EC2, but moving away and going to Fargate) ECS, code pipeline and build, deploy, RDS, Dynamo, WAP, CloudFront)
  • GitHub actions moving away from AWS code connect
  • Terraform (must) most of the day there, or CDK is a benefit but terraform the hard requirement
  • Terraform modules strong plus
  • Splunk, new relic, pager duty is great lot of alerting, dashboarding, capacity planning, running internal PaaS

Any dev experience they favor more (from dev to devops seems to work better than ops side moving in because they deal mostly with dev teams passing code and application questions) Dev's use .NET, people have come from Ruby, PHP, more developing and deploying into prod environment is a good background.

Some very large scale project, going through Fargate conversion 1-2 people on project, individual projects security initiatives, implement with business partners firewall rules, WAP and business traffic, longer term things.

Middle experience, pic up project and drive independently, imbed with team, infra, solve, monitoring,

Interview: 1st round with him, quick screen, overview on role, team structure, projects, high level tech questions, not coding more high-level systems design on the spectrum

Second round with tech leads: more systems design, potentially a live coding question things. Theu have an eye on people using AI to answer, culture ft as well.

Job Description:

This role is for an opening for a Senior Site Reliability Engineer (SRE) on the Manheim Logistics SRE team. The SRE team is tasked with designing and maintaining AWS infrastructure and deployment pipelines for Manheim Logistics' 15+ development teams. The team has currently standardized on a Docker-based infrastructure solution and is adding functionality to support new development team requests and architectural patterns (such as Lambda, Step Functions, Fargate, etc). The SRE team has a strong focus on IaC with Terraform and best practices such as least privilege access, proactive monitoring and alerting, etc. This role will work directly with a release train and help with IaC and SRE activites such as improving monitoring/alerting, defining an error budget, assisting with DevSecOps, etc.

As a Senior Site Reliability Engineer at Cox Automotive you will:

  • Strong automation experience- testing, deploying, monitoring, etc.
  • Take complex problems and come up with a technically reasonable solution
  • Experience working with and defining SLOs, error budgets, etc.
  • Have innate curiosity about how things work
  • Design and assist in the authoring of software tools that reliably manage application delivery & performance
  • Design and assist in the setup and maintenance of application monitoring and alerting
  • Engage with engineering teams to ensure best practices are implemented
  • Improve predictability and reliability of software releases, workflows, and operating software.
  • Reduce mean time to recovery (MTTR) by helping troubleshoot, monitor, alert, and automating recovery.

    Qualifications:
  • Bachelor's degree in Computer Science or related field and at least 3-5 years working experience
  • Expertise in software development and architecture/solutioning experience
  • Strong background with Terraform
  • Experience with Amazon AWS technologies especially: ECS and Lambda
  • Experience with monitoring/observability tools such as: New Relic, Splunk, PagerDuty
  • Experience with agile development, continuous integration and automated testing
  • Solid written communication, problem solving, and process management skills

    Preferred Skills:
  • Broad AWS platform skills including Cognito, WAF, Elasticache (Redis), Elasticsearch, SNS, SQS, S3, Systems Manager
  • Experience automating Terraform at scale
  • Experience with Database Server infrastructure (RDS, MySQL, Postgres, etc)
  • .NET core development experience
  • GitHub Actions

Experience with Github, docker, and Linux adminstration experience

Priyanka Yadav

IT Technical Recruiter

Email:

Address:
505 Knolle Court
Saint Augustine, FL 32092

Telephone:
321-497-8237

Loading
Please wait..!!