Kolkata, West Bengal, India
Job Type: Contract
We are looking for a Junior Site Reliability Engineer (SRE) to help maintain, troubleshoot, and optimize our infrastructure and services. As an SRE, you will be responsible for improving the reliability, scalability, and performance of our applications while working closely with development, security, and operations teams.
Key Responsibilities:
Monitoring & Incident Response:
- Continuously monitor system performance, availability, and health using tools like Prometheus, Grafana, Datadog, or New Relic.
- Investigate and resolve incidents, working with development teams to prevent future occurrences.
- Participate in on-call rotations and incident post-mortems.
Infrastructure & Automation:
- Assist in managing cloud-based (AWS, GCP, or Azure) or on-premise infrastructure.
- Develop and maintain Infrastructure as Code (IaC) using tools like Terraform, Ansible, or CloudFormation.
- Automate deployment and scaling processes using CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI/CD, ArgoCD, etc.).
Performance & Reliability:
- Ensure high availability, scalability, and performance of critical services.
- Implement observability solutions (logs, metrics, tracing) to proactively identify issues.
- Optimize database and application performance.
Security & Compliance:
- Support security initiatives, such as access controls, vulnerability scanning, and patch management.
- Ensure compliance with industry best practices and standards (SOC2, ISO 27001, etc.).
Required Skills & Qualifications:
- Education: Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent experience).
- Experience: 1-2 years in Site Reliability Engineering, DevOps, or System Administration.
- Technical Skills:
- Experience with Linux/Unix administration.
- Knowledge of containerization & orchestration (Docker, Kubernetes, Helm, etc.).
- Familiarity with scripting languages like Python, Bash, or Go.
- Experience with monitoring/logging tools (Prometheus, ELK stack, Grafana, Datadog, etc.).
- Understanding of cloud platforms (AWS, GCP, or Azure).
- Knowledge of networking concepts, DNS, load balancing, and security best practices.
Nice-to-Have (Preferred Skills):
- Experience with SQL/NoSQL databases (PostgreSQL, MySQL, MongoDB, etc.).
- Hands-on experience with serverless architectures (AWS Lambda, Google Cloud Functions, etc.).
- Understanding of service meshes (Istio, Linkerd, etc.).
- Familiarity with version control systems (Git, GitHub, GitLab, etc.).
What We Offer:
- Competitive salary and benefits.
- Opportunities for mentorship and career growth.
- Hands-on experience with cutting-edge technologies.
- Work in a dynamic and collaborative environment.