Indus Net Technologies (INT.) is a global digital engineering and technology consulting company with 1,000+ professionals serving 500+ clients across 45+ countries. Founded in 1997 and headquartered in Kolkata, INT specializes in cloud-native modernization, platform engineering, AI, cybersecurity, data platforms, and digital transformation solutions for enterprises worldwide.
Website: https://intglobal.com/
About the RoleWe are looking for an experienced Site Reliability Engineer (SRE) & Platform Engineer to join our growing technology team. The ideal candidate will be responsible for building, automating, and managing scalable, secure, and highly available cloud infrastructure while driving platform reliability, observability, and operational excellence.
You will work closely with development, DevOps, and engineering teams to design and maintain cloud-native platforms, improve deployment processes, and establish reliability best practices across multiple environments and Kubernetes clusters.
Key Responsibilities- Design, implement, and manage highly available cloud-native infrastructure on AWS.
- Build and maintain Kubernetes platforms across multiple clusters and environments.
- Develop and manage Infrastructure as Code (IaC) using Terraform.
- Implement GitOps practices using ArgoCD for application deployment and lifecycle management.
- Configure and optimize cluster autoscaling using Karpenter and workload scaling using KEDA.
- Manage infrastructure provisioning workflows through Atlantis.
- Define, implement, and monitor SLI/SLO frameworks to improve system reliability and performance.
- Support and maintain AWS Control Tower environments and governance standards.
- Build and enhance observability, monitoring, alerting, and incident management processes.
- Collaborate with engineering teams to improve CI/CD pipelines and platform automation.
- Troubleshoot production issues and drive root cause analysis for critical incidents.
- Ensure platform security, compliance, scalability, and operational efficiency.
- Strong hands-on experience with Kubernetes administration and operations.
- Expertise in AWS cloud services and architecture.
- Extensive experience with Terraform for Infrastructure as Code.
- Hands-on experience with ArgoCD and GitOps methodologies.
- Experience with Karpenter and KEDA for Kubernetes scaling.
- Experience managing infrastructure workflows using Atlantis.
- Strong understanding of SRE principles, including SLI, SLO, and error budgets.
- Experience working with AWS Control Tower.
- Proven experience managing multi-cluster and multi-environment Kubernetes deployments.
- Experience with platform engineering and cloud-native architectures.
- Strong troubleshooting and production support skills.
- Experience with CI/CD tools and automation frameworks.
- Knowledge of monitoring and observability tools such as Prometheus, Grafana, ELK, Datadog, or similar.
- Experience with container security and cloud governance.
- Scripting skills in Python, Bash, or Go.
- Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
- 7–10 years of overall experience in DevOps, Platform Engineering, Cloud Infrastructure, or SRE roles.



