Templates
Skill

Debugging & Monitoring

The ability to understand and implement systematic debugging and monitoring practices to diagnose issues

Debugging & Monitoring

Level 1

Understands debugging basics, the appropriate tools to use and is beginning to implement good monitoring practices

  • Debugging: You are able do basic code profiling and optimize obvious bugs

  • Tools: You use out-of-the box tools to instrument and monitor systems with occasional help from senior engineers.

  • Philosophy: You understand the team's monitoring philosophy and work comfortably within it.

Level 2

Is able to systematically debug issues located within a single service, whilst taking greater responsibility for the monitoring systems.

  • Debugging: You can do multi application and multi layer profiling without support and optimize almost all bugs

  • Tools: You customize out of the box tools, and on occasion create small custom tools to instrument and monitor systems.

  • Philosophy: You understand the organization's monitoring philosophy. You help tune and change the monitoring philosophy on your team.

  • Data: You understand the operational data for your team's domain.

Level 3

Uses systematic debugging to diagnose all issues located to a single service and drives monitoring work across the team

  • Debugging: You use systematic debugging to diagnose cross service issues, sometimes with help from more senior engineers.

  • Tools: You can judge whether a new tool is necessary to instrument and monitor systems. You can create that tool if required.

  • Philosophy: You understand the cross-functional and cross-organization implications of your team's monitoring philosophy. You proactively seek to refine the monitoring philosophy of your team.

  • Data: You are aware of the operational data for your team and use it as a basis for driving changes to your team's services to achieve stability and performance improvements.

  • Mentoring: You act as a primary mentor for the rest of their team on running systems.

Level 4

Uses systematic debugging to diagnose all issues within a set of related domains and leads in fostering a culture of observability and monitoring across several teams

  • Debugging: You use systematic debugging to diagnose cross service issues

  • Philosophy: You drive monitoring work across several teams based on the organization's monitoring philosophy.

  • Data: You are aware of the operational data for domains of several teams and use it to drive changes to the teams' services to achieve stability and performance improvements.

  • Profiling: You can solve any profiling challenge at any layer of the stack.

  • Mentoring: You actively seek to mentor any junior engineer or manager on running systems.

Level 5

Leads incident response, and fosters and promotes a culture of observability and monitoring across the organization.

  • Debugging: Uses systematic debugging to diagnose issues across the organization.

  • Philosophy: You lead the organization's monitoring philosophy.

  • Data: Helps teams across the organization use operational data to improve stability and performance of their domains.

  • Mentoring: You are a mentor for senior managers across the organization.