Site Reliability Engineer

  • Remote
  • Full Time
  • Mid Level

Our vision is to enable everyone to craft software with confidence while focusing on impacting the world at the speed of thought. Our DevOps Intelligence Platform includes two products that enable software development teams to achieve their full potential and give management teams visibility on their investment:

  • Codacy: provides software analysis to help developers quantify and act on their software quality, engineering performance, and security
  • Pulse: measures engineering health and performance so teams can continually improve with data-driven insights

We're curious, funny, radically honest yet kind, and we thrive on collaboration and transparency. We're a team of highly dedicated and ambitious domain experts brought together by the mission to help development teams reach their full potential, and driven by having a worldwide impact on software development. 

We are looking for a Site Reliability Engineer to join our Product Team.

What will be your day-to-day?

  • Monitoring: contribute to the improvement of the monitoring and measurement systems that support our operational scale and continuous delivery. This goes from setting up and maintaining the right tools, to help the different engineering teams on the correct instrumentation of their code;
  • Availability: work to measure and increase the mean-time-between-failures and decrease the mean-time-to-repair of public-facing systems;
  • Operations: help the engineering team to operate their systems;
  • Performance, Efficiency & Latency: contribute to the measurement techniques that assist in the performance tuning of the applications stack, use the monitoring systems to help maintain application performance at acceptable levels, and recommend and implement performance improvements across the stack;
  • Security & Risk: participate in the ongoing process to identify and mitigate risk in our systems;
  • Capacity Planning: use our monitoring to advise on capacity requirements;
  • Engineering Tools: create and maintain tools that can help engineering teams improve their day to day work.

What are the skills and experience needed to do the job successfully?

  • Docker;
  • Datadog, APM , Grafana,  Prometheus, Cloudwatch - or similar;
  • Application development experience with at least one programming language (Java, Scala, Go, python...);
  • Experience managing systems with daily deployments that have to handle millions of requests;
  • An understanding that managing systems at scale require end to end infrastructure tools and automation;
  • Broad knowledge of system administration, networking, databases, security, storage and performance and have expertise in at least one of these disciplines;
  • Experience aligning with the goals of the DevOps movement in the sense that teams own the full cycle of the development process from design to operation;
  • Has provided a positive contribution to both operations-focused and development-focused work;
  • Has built and maintained cloud-based applications and infrastructure;
  • Has worked with tools and frameworks for automating infrastructure;
  • Passion for and experience in best practices in systems operations tools and techniques.

What else makes working at Codacy awesome?

  • Competitive Salary. Check our salary calculator at https://www.codacy.com/careers; 
  • Comprehensive health insurance for household members, with dental and vision;
  • Generous learning and development budget;
  • Flexible holidays;
  • Flexible working hours;
  • A remote first work policy (work from anywhere!)
     
Read More

Apply for this position

Required*
Apply with Indeed
We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file