Site Reliability Engineer

Reports to: Director, Site Reliability Engineering & Release Engineering

Works with: Software Development, UX Design, Marketing, Sales, Support, Documentation, Business Operations and Finance

Level: Junior to Intermediate

Location: Ottawa, ON or Rest of Canada (Hybrid or remote work)

An SRE team member at Klipfolio wears many hats and understands the full lifecycle of software, from the first line of code to how it scales up to hundreds of servers, as well as the nuances of the underlying infrastructure that software runs on. The successful applicant will need to demonstrate not only amazing technical abilities, but also a strong aptitude for learning new skills and solving problems when faced with unexpected challenges.

Key responsibilities:

  • Participate in the development, deployment, monitoring and maintenance of Klipfolio products, infrastructure, build systems, and internal development tools
  • Take part in on-call duty (some on-call work required)
  • Assist Customer Success in troubleshooting customers' technical issues in production
  • Debug production and tooling issues across the entire stack, from the browser to the data level

Required Skills:

  • High proficiency in Linux shell scripting
  • Understanding of software testing methodologies, including TDD and BDD
  • Understanding of fundamental DevOps concepts, including Continuous Integration and Continuous Deployment
  • Understanding of containers and containerization technologies, primarily Docker and Kubernetes
  • Knowledge of Python, Java, JavaScript and, ideally, Go and Groovy
  • Knowledge of JUnit, TestNG, Mocha or other similar test frameworks
  • Understanding of fundamental networking concepts such as firewalls, load balancers, and subnetting
  • Understanding of SQL database concepts and some familiarity with database administration
  • Understanding of NoSQL database concepts and some familiarity with database administration
  • Understanding of application logging and telemetry; familiarity with Prometheus, Grafana, and ELK is an asset
  • Understanding of cloud computing concepts, especially around AWS
  • BS/BEng in Computer Science/Software Engineering, a College Diploma in Information Technology, or equivalent experience

We celebrate diversity and are committed to creating an inclusive environment for all employees.

To apply, send your resume by email. Ensure the subject line of your email contains "Site Reliability Engineer".

Klipfolio is a fast-growing, early-stage SaaS company with a proven product and an ambitious vision. Since our $12M Series B funding in January 2017, we have grown to 10,000+ clients in more than 80 countries.

Imagine a world where every team has real-time access to the metrics they need to continually improve and optimize their business each and every day.

Imagine no more: at Klipfolio, we have developed a real-time, cloud-based dashboard solution for teams who want to continuously monitor the health of their business.

Great Health Benefits
Employee Stock Option Plan
Competitive Salaries
3 Weeks Vacation
Flexible Work Hours
$600/Year Fitness Benefit
Free Snacks and Drinks
Pet Friendly
Active Social Committee
Parental Leave
Christmas to New Year's Shutdown
On-site Bike Clinic