Job Description
US Work Authorization Requirement:
Candidates must be legally authorized to work in the United States without employer sponsorship. This includes, but is not limited to, U.S. Citizens, Permanent Residents, and other individuals with valid U.S. work authorization.
Job Summary:
We are looking for a Site Reliability Engineer (SRE) with 12+ years of experience to support and improve the reliability, scalability, and performance of enterprise applications and cloud infrastructure. The ideal candidate will have strong hands-on experience with CI/CD pipelines, Google Cloud Platform (GCP), Linux systems, databases, and API testing, along with a production support mindset.
Key Responsibilities:
Design, build, and maintain CI/CD pipelines using Jenkins
Perform API testing and validation using Postman or Bruno
Write, analyze, and troubleshoot SQL Server queries and stored procedures
Provide advanced Linux support, including shell scripting and AWK usage
Monitor and support applications hosted on Google Cloud Platform (GCP)
Work with BigQuery for data analysis and issue resolution
Manage Google Cloud Storage, including bucket-to-bucket data transfers
Parse and manipulate JSON data for APIs and system integrations
Provide production support, incident management, and root cause analysis
Collaborate with development and DevOps teams to improve system reliability and automation
Required Skills & Experience:
12+ years of IT experience, including time in SRE, DevOps, or production support roles
Strong hands-on experience with Jenkins
Experience with Postman or Bruno for API testing
Strong expertise in SQL Server (complex queries, stored procedures, tuning)
Advanced Linux skills with shell scripting and AWK
Hands-on experience with Google Cloud Platform (GCP)
Experience with BigQuery
Knowledge of Google Cloud Storage and data transfers
Working knowledge of JavaScript (scripting level)
Strong understanding of JSON
Experience supporting production systems and on-call environments
Excellent troubleshooting and communication skills
Preferred / Nice to Have:
Experience with monitoring tools (Grafana, Prometheus, Google Cloud Monitoring, formerly Stackdriver)
Exposure to Infrastructure as Code (Terraform)
Experience in Agile / DevOps environments