Manager - Red Hat OpenStack
The L3 OpenStack Administrator is responsible for ensuring reliable and secure cloud operations by managing advanced troubleshooting, performance tuning, upgrades, and incident resolution across all OpenStack services. The role focuses on maintaining high availability, optimizing infrastructure components, automating day-2 operations, and supporting complex tenant and platform needs in a large-scale OpenStack environment.

The ideal candidate should have deep knowledge of RHEL, CentOS, Ubuntu, SUSE, and Oracle Linux, along with cloud Linux workloads (AWS, GCP, Azure, OCI), containerization (Docker, Kubernetes, OpenShift), and automation (Ansible, Terraform, Python, Bash).

Major Duties & Responsibilities:
- Design and implement highly available and scalable architecture using Red Hat OpenStack Platform.
- Manage and operate the production OpenStack environment across compute, storage, and networking services.
- Lead deployment, configuration, and lifecycle management of OpenStack clusters (including major/minor upgrades and patching).
- Use Cisco ESC (Elastic Services Controller) for end-to-end VNF lifecycle management, including deployment, monitoring, scaling, healing, and termination on OpenStack environments.
- Perform capacity planning for compute, storage, and networking resources.
- Act as the L3 escalation point for critical production incidents (P1/P2).
- Conduct deep troubleshooting of Nova, Neutron, Cinder, Glance, and Keystone services.
- Diagnose and resolve complex issues related to RabbitMQ, MariaDB Galera, HAProxy, and Pacemaker clusters.
- Optimize performance (CPU pinning, NUMA tuning, hugepages, I/O and network optimization) on Red Hat Enterprise Linux.
- Manage and troubleshoot storage backends, including Red Hat Ceph Storage, iSCSI, and NFS.
- Troubleshoot advanced networking issues (OVS/OVN, VLAN, VXLAN, LACP, bonding).
- Implement automation using Ansible and scripting (Bash/Python).
- Ensure high availability, disaster recovery readiness, and backup strategies.
- Perform root cause analysis (RCA) and provide preventive action plans.
- Define security hardening standards and implement RBAC policies.
- Review and improve operational SOPs and documentation.
- Work with virtualization platforms such as KVM, Proxmox, and VMware.
- Mentor L1/L2 engineers and lead technical discussions during major incident bridges.

Required Knowledge, Skills and Abilities:
- Strong expertise in Red Hat Linux and core OpenStack services, with advanced troubleshooting and performance tuning skills.
- Hands-on experience with KVM virtualization, OVS/OVN networking, VLAN/VXLAN, and Ceph or similar storage.
- Proficiency in automation and scripting using Ansible, Bash, and Python (Terraform/PowerShell optional).
- Working knowledge of monitoring and logging tools such as Prometheus, Grafana, or ELK.
- Solid understanding of cloud security, IAM, multi-tenant operations, and high-availability architectures.
- Strong analytical and problem-solving abilities for mission-critical environments.

Preferred Additional Skills and Abilities:
- Experience with Linux-based Kubernetes clusters (EKS, AKS, GKE, OpenShift, Rancher).
- Understanding of CI/CD pipelines and DevOps tools (Jenkins, Git, GitLab, ArgoCD, Helm).
- Knowledge of big data, logging, and analytics tools (Splunk, ELK Stack, Kafka, Hadoop).
- Familiarity with database management on Linux (MySQL, PostgreSQL, MariaDB, MongoDB).

Key skills and experience expected of the candidate:
- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
- 7+ years of experience in Linux administration with strong exposure to OpenStack operations in production environments.
- Proven experience with virtualization, cloud infrastructure, and networking in large-scale, mission-critical setups.
- Hands-on experience with automation, monitoring, and infrastructure troubleshooting at an L3 level.

