Operations · Full-time · Bangkok, Thailand
Job Summary: We are seeking an experienced AWS Site Reliability Engineer (SRE) to join our team. The successful candidate will be responsible for monitoring and maintaining the infrastructure and services that power our cloud-based applications. The ideal candidate will have a strong background in AWS and experience with automation, monitoring, and troubleshooting complex systems.
Key Responsibilities: Maintain the infrastructure and services that support our cloud-based applications on AWS. Work closely with development teams to ensure that applications are designed with scalability, reliability, and security in mind. Automate deployment and configuration processes using tools such as Terraform and Ansible. Develop and implement monitoring and alerting systems to proactively identify and resolve issues before they impact customers. Perform regular system maintenance, including patching, upgrades, and backups. Troubleshoot and resolve complex issues related to networking, security, and application performance. Participate in the on-call rotation to provide 24/7 support for our production environments.
Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. At least 5 years of experience as a Site Reliability Engineer or in a similar role. Strong understanding of AWS services, including EKS, S3, RDS, Lambda, EC2, and others. Hands-on experience with automation tools such as Terraform and Ansible. Experience with monitoring and logging tools such as CloudWatch, ELK Stack, Grafana, and Prometheus. Strong knowledge of networking and security principles, including firewalls, VPNs, and SSL/TLS. Excellent troubleshooting and problem-solving skills. Strong written and verbal communication skills. Ability to work independently and as part of a team.
If you are passionate about cloud infrastructure and have a proven track record of designing and maintaining highly available systems on AWS, we would love to hear from you!