Sre Engineer Apply
SRE Engineer
Job Description/ Responsibilities
1. Planning and Scope Assessment
a) Evaluate the current IT environment and application stack to identify monitoring and observability needs.
b) Define the scope of the deployment in alignment with organizational goals and technical requirements.
c) Engage stakeholders to prioritize applications, services, and systems to be included in the deployment.
2. Deployment Strategy
a) Develop a detailed deployment roadmap, considering technical dependencies, timelines, and resource availability.
b) Ensure compatibility with the existing infrastructure and identify any gaps requiring resolution before deployment.
3. Phased Rollout
a) Implement Dynatrace in a phased manner to minimize disruption, focusing on high-priority areas first.
b) Test and validate functionality at each stage of the rollout to ensure alignment with performance objectives.
c) Address issues promptly during the rollout to maintain project momentum.
4. Enabling Full Stack Monitoring
a) Deploy and configure full-stack monitoring features, including infrastructure, applications, and user experience monitoring where needed.
b) Ensure that all key performance indicators (KPIs) are tracked and that integrations with relevant tools and systems are in place.
5. Tool Adoption & Upskilling
a) Provide training sessions and resources to IT teams to enhance understanding and usage of Dynatrace.
b) Act as a subject matter expert, offering ongoing support and guidance for tool adoption.
c) Develop documentation and best practices for future use and troubleshooting.
6. Analytics and Alerting
a) Configure Dynatrace analytics to deliver actionable insights into system performance and health.
b) Establish custom alerts to ensure timely responses to incidents and anomalies.
c) Monitor and refine alerting thresholds to reduce noise and improve system reliability.
Qualifications:
Proven experience in deploying Dynatrace or similar observability tools across complex IT environments.
Strong understanding of server stacks, networking, databases, cloud services, and application architectures.
Proficient in scripting and automation to streamline deployment and configuration processes.
Strong problem-solving skills and the ability to address technical issues effectively.
Excellent communication skills to collaborate with stakeholders and train end-users.
Preferred Skills:
Certification in Dynatrace is preferred.
Familiarity with DevOps tools and practices.
Experience with ITIL processes for incident and problem management.