Senior Site Reliability Engineer (Hybrid)

Jakarta, Jakarta   |   Full Time

About Flip 

Rafi, Luqman, and Anjar, who were college friends in Universitas Indonesia, started Flip as a project in 2015 to transfer payments to each other at a fraction of what banks would charge them. They are pioneers in the Indonesian market, with their technology now helping millions of Indonesians, both individuals and businesses, carry out bank-to-bank money transfers through a reliable and seamless app.

After five years of operations, Flip has helped Indonesians transfer money worth several trillions of rupiah and has received double-digit funding from respectable investors such as Sequoia India, Insight Partner, and Insignia. Flip’s ultimate mission is to give Indonesians access to one of the most progressive and fairest financial services in the world.

At Flip, we always strive to provide the fairest place for you to work, learn, and grow with talented and fun people in various opportunities to advance your career and get fair rewards. We believe that we have to treat employees, customers, and all stakeholders fairly and respectfully. Fair treatment for employees means we establish clear goals, facilitate our employees to achieve them, and value their contribution to the company with equitable benefits.

What you'll do:

  • Operate and maintain Flip’s production systems and mission-critical services adhering to the SRE best practices;
  • Operate and manage Flip’s cloud platforms adhering to the SRE best practices;
  • Lead and manage Flip’s incident response, and blameless postmortems, and use the insights to come up with improvements;
  • Lead and manage Flip’s SRE projects to successful completion with a good measurable impact;
  • Collaborate in designing and architecting Flip’s services to improve the availability, scalability, latency, and efficiency with a well-defined SLO, SLI, and SLA;
  • Continuously improve Flip’s overall monitoring and observability solutions;
  • Continuously improve Flip’s software development life cycle by adhering to Continuous Delivery best practices;
  • Continuously improve Flip’s overall security posture, manage and remediate security risks;
  • Continuously improve Flip’s SRE standards, tooling, documents, and processes;
  • Continuously build and contribute to the overall Flip’s SRE automation with the goal of repeatable and scalable solutions;
  • Mentor and guide other team members and champion Flip’s SRE best practices.

What you'll need:

  • 5 years of experience as a Site Reliability Engineer or similar role;
  • Experience designing, operating, and maintaining production-grade applications in distributed virtualized/containerized environments;
  • Experience working on cloud platforms (e.g. Alibaba Cloud, GCP, AWS, etc.);
  • Experience architecting, developing, and troubleshooting distributed systems;
  • Experience programming in one or more languages (e.g., PHP, Java, Python, Golang, Ruby, JavaScript, etc.).

Preferred qualifications:

  • Bachelor's degree in Computer Science or equivalent practical experience;
  • Experience in operating and maintaining complex, large-scale, and critical production applications with business impact;
  • Excellent technical leadership, project management, analytical problem solving, and troubleshooting skills;
  • Excellent understanding of SLO, SLI, SLA, and error budget;
  • Experience with Unix/Linux system administration & networking;
  • Experience with cloud networking (e.g. VPC, Load Balancers, DNS, network connectivity & peering, etc.);
  • Experience with automation, infrastructure as code, orchestration, and CI/CD (e.g. Terraform, Ansible, Packer, Pulumi, GitLab, Jenkins, Circle CI, etc.);
  • Experience with monitoring and observability technologies (e.g. Datadog, Prometheus, Elasticsearch, Kibana, Grafana, Loki, New Relic, etc.);
  • Experience with container technologies (e.g. Docker, Kubernetes, etc.);
  • Experience with open-source server and middleware technologies (e.g. Apache, NGINX, HAProxy, Envoy, Kong, Redis, Kafka, RabbitMQ, etc.);
  • Experience with database technologies (e.g. MySQL, PostgreSQL, MariaDB, MongoDB, ProxySQL, etc.);
  • Experience with web and cloud security (e.g. SSL/TLS, OAuth, SSO, OWASP, IAM, encryption, firewall, etc.).

P.S. if you experience problems when submitting your CV through this platform, you can send it directly to [email protected]

Submit Your Application

You have successfully applied
  • You have errors in applying