Job Summary
We are looking for an experienced Site Reliability Engineer with 7 to 9 years of expertise in Azure DevOps and cloud infrastructure to join our dynamic team based in Hyderabad or Pune. The ideal candidate will possess strong scripting and programming skills, a deep understanding of Azure environments, and proven experience managing large-scale cloud platforms. This role requires close collaboration with cross-functional teams to ensure high availability, security, and compliance, while continuously enhancing operational processes and service delivery.
Key Responsibilities
Developing and maintaining automation scripts using Bash, PowerShell, and Azure CLI to support cloud infrastructure operations is a core part of this role. You will also write and manage code in Python, C#, and Java to improve platform capabilities and integrations. Utilizing query languages such as SQL and Kusto Query Language (KQL) for data analysis and monitoring is essential.
Managing version control through GitHub and implementing CI/CD pipelines will streamline deployments. You will administer Linux environments, preferably Red Hat, automating system tasks through scripting. Applying object-oriented programming principles to create scalable and maintainable solutions is expected.
Designing, building, operating, and supporting cloud infrastructure and data services at scale, with a focus on Azure services, will be a key responsibility. Collaboration with multiple support groups to deliver reliable, high-availability infrastructure—including web and reverse proxies—is critical. You will ensure data quality, management, controls, and governance align with organizational standards.
Implementing security and compliance measures such as Identity and Access Management (IAM), cloud auditing, and monitoring tools is vital. Troubleshooting complex issues across cloud platforms, services, and application stacks efficiently will be part of your daily tasks. Maintaining thorough documentation, managing change processes, and following agile methodologies are also required.
Finally, you will identify and drive process improvements to boost operational efficiency and service quality.
Required Qualifications
Candidates must have 7 to 9 years of relevant experience in Site Reliability Engineering, Azure DevOps, or related cloud infrastructure roles. Proficiency in scripting languages including Bash, PowerShell, and Azure CLI is mandatory. Programming skills in Python, C#, and Java are essential.
Experience with query languages such as SQL and Kusto Query Language (KQL), along with familiarity with version control systems like GitHub and CI/CD pipelines, is required. Hands-on experience with PowerShell, Terraform, Python, and Windows Command Prompt is expected.
Strong Linux administration and scripting skills, preferably with Red Hat systems, are necessary. A solid understanding of hardware and software fundamentals—covering storage technologies (SSD, HDD, NVMe), CPU architectures, memory, OS basics, and networking stack—is important. Deep knowledge of network protocols and network design is required.
Expertise in cloud infrastructure and platform engineering, particularly Azure, is essential. Experience working with high-availability and high-load infrastructure components is expected. Knowledge of security, compliance, and governance frameworks related to cloud environments is critical. Proven ability to troubleshoot complex technical issues and implement effective solutions is a must.
Strong documentation, change management, and agile development skills complete the required qualifications.
Preferred Qualifications and Benefits
A Bachelor’s degree in Computer Science, Software Engineering, Data Science, or a related STEM field is preferred. Candidates with equivalent industry certifications or substantial experience in Cloud, Data, or Cybersecurity will also be considered.
Experience in highly regulated industries, especially Financial Services, is advantageous. Familiarity with cybersecurity principles and global financial compliance regulations is beneficial. Knowledge of industry cybersecurity standards and frameworks such as OWASP, ISO 2700x, PCI DSS, GLBA, FFIEC, CIS, NIST, and global data security/privacy laws is preferred.
Proficiency with Azure technology services—including Identity, Networking, Compute, Storage, Web, Containers, Databases, Azure Data Factory, Databricks, Synapse Analytics, Power BI, Data Lake Store, Azure Functions, Logic Apps, Azure Monitor, and Log Analytics—is highly desirable.
Experience with infrastructure tools like CosmosDB, Nginx/Apache, Linux, Bash, PowerShell, and observability tools such as Prometheus, Grafana, and Elasticsearch is a plus. Familiarity with automation tools including Terraform, Chef, Ansible, CloudFormation, and ARM templates is beneficial.
Knowledge of streaming platforms like Azure Event Hubs, Kafka, and Spark Streaming, as well as exposure to Security Information and Event Management (SIEM) and Security Orchestration, Automation, and Response (SOAR) technologies—especially cloud-based solutions—is preferred.
We value candidates with a strong work ethic, positive attitude, and passion for continuous learning. Joining UST means becoming part of a global digital transformation leader with over 30,000 employees across 30 countries, delivering innovative technology solutions that impact billions worldwide.
This role offers a challenging and rewarding opportunity to advance your career in cloud infrastructure and platform engineering while contributing to cutting-edge digital transformation initiatives.