About the Job

The Team

As a key member of the Infrastructure team, you will play a critical role in developing and maintaining core systems that ensure the reliability, scalability, and performance of our platform. Your contributions will directly impact the efficiency and stability of our services, ensuring a seamless experience for our users.

Your Role

  • Ensure the availability, performance, security, and scalability of our production systems.
  • Collaborate closely with the engineering team to deploy services across multiple environments.
  • Scale our infrastructure to handle traffic spikes and increased capacity during peak events.
  • Provide recommendations for architectural and process improvements.
  • Maintain and optimize a CI/CD pipeline to facilitate rapid and secure software delivery.
  • Develop and maintain shared tools and infrastructure used by product and engineering teams.
  • Monitor Service Level Indicators (SLIs) and take proactive measures to meet organizational Service Level Objectives (SLOs).
  • Participate in an on-call rotation to address and resolve critical production issues.
  • Drive process improvements and automation to streamline operational workflows and enhance system health.
  • Continuously enhance existing systems and tooling, focusing on testing, monitoring, alerting, security, and release processes.
  • Collaborate with other engineering teams to design reliable and maintainable systems.
  • Define standards, guidelines, and best practices for effective infrastructure management.

Required Qualifications

To excel in this role, you should embody a growth mindset, humility, and a dedication to making a high impact. As our next Senior Software Engineer, Infrastructure, you should bring:

  • Experience: 5+ years of experience in infrastructure engineering, with a proven track record of building scalable systems that can handle high traffic.
  • Education: Bachelor’s degree in Computer Science, a related field, or equivalent work experience.
  • Technical Skills:
    • Proficiency in operating and maintaining distributed systems.
    • Strong knowledge of Linux and Kubernetes.
    • Expertise in public cloud environments (AWS, GCP) using tools such as Kubernetes, Docker, and Terraform.
    • Strong programming skills in Java, Rust, or Python.
    • Expertise in CI/CD workflows and associated tooling.
  • Problem Solving: Ability to approach complex problems from first principles and strive for simple yet effective solutions.
  • Mindset: Passion for learning, exploration, and continuous growth.
  • Communication: Excellent verbal and written communication skills.

Nice-to-Have Qualifications

  • Experience: Previous experience at a hyper-growth startup, demonstrating the ability to work independently and adapt quickly in a fast-paced environment.

This version maintains the essence of the original while omitting specific company references and aligning with your client's needs.