Job Summary:
You’re responsible for building, testing, and maintaining the infrastructure and tools to enable speedy development and release of software. We expect you to have a comprehensive understanding of technology stacks for automation in software development and infrastructure operations. You will collaborate with developer and operations teams to bridge the gap between development and operations, enabling a smooth and efficient software delivery process.
Job Description:
- Develop and maintain infrastructure-as-code (IAC) using tools like Terraform for automated infrastructure provisioning
- Implement CI/CD pipelines to streamline the application deployment process
- Design and implement highly available and fault-tolerant systems, ensuring maximum uptime for critical services
- Implement observability systems to monitor service availability and performance to improve reliability
- Implement centralized logging solutions to facilitate debugging and troubleshooting
- Collaborate with security teams to implement best practices for secure software development and infrastructure
- Roll out fixes and upgrades to software as needed
- Respond to incidents, perform root cause analysis, and implement preventive measures to avoid similar issues in the future
Qualifications:
- Bachelor’s degree in Information Technology, Computer Science, Computer Engineering, or a related field
- Experience with CI/CD tools (e.g., Jenkins, GitLab CI/CD) and automation frameworks.
- Proficiency in cloud platforms such as AWS, Azure, and associated services.
- Knowledge of IAC tools like Terraform or Azure ARM.
- Familiarity with monitoring and logging tools like Prometheus, Grafana, ELK stack, APM, etc.
- Good understanding of IT Operations.
- Strong problem-solving skills and the ability to troubleshoot complex issues.
- Excellent communication and teamwork skills to collaborate effectively across various teams.