
Importance of identifying and tracking errors in DevOps

Place I like to visit

For almost three years I have been curious about tracking errors in my daily work. There are a lot of observability tools available, such as Datadog, Splunk, and Dynatrace. It would have been easy to use one of those, but I thought, why not build one? Small projects gain an advantage from relying on tools that are built around them. Industry standards are important, but what if a certain configuration doesn't align with general market standards? I think some developers agree with this approach.

The idea of tracking errors is not new, but the way we categorise them is unique to each environment. The nature of these categories depends on the infrastructure used, the network topology, and the development strategy. A tool designed to handle them all might be a bit over-engineered for the purpose, because I think the value it creates by reducing our effort in identifying problems is not greater than the value of the product itself. Again, this is my opinion on projects that are small but need quality checks. For managing tens of thousands of projects, investing in a standard tool may give better returns in the long run.
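
To make the idea of environment-specific categories concrete, here is a minimal sketch of how such a categoriser could look. It is not the tool I built; the rule names, log patterns, and suggested fixes below are all hypothetical and would be replaced by whatever shows up in your own CI logs.

```python
import re
from collections import Counter
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class ErrorRule:
    """One entry in a project-specific error catalogue (hypothetical shape)."""
    category: str
    pattern: "re.Pattern[str]"
    known_fix: str


# Hypothetical rules: the categories and patterns are assumptions for
# illustration and should mirror your own infrastructure and failures.
RULES = [
    ErrorRule(
        "dependency",
        re.compile(r"could not resolve dependenc(y|ies)", re.IGNORECASE),
        "Check the package mirror and the pinned versions.",
    ),
    ErrorRule(
        "network",
        re.compile(r"connection (timed out|refused)", re.IGNORECASE),
        "Retry the job, then check proxy and firewall settings.",
    ),
    ErrorRule(
        "configuration",
        re.compile(r"(missing|invalid) (config|environment variable)", re.IGNORECASE),
        "Compare the job configuration with the project template.",
    ),
]


def classify(log_line: str) -> Optional[ErrorRule]:
    """Return the first rule whose pattern matches the line, or None."""
    for rule in RULES:
        if rule.pattern.search(log_line):
            return rule
    return None


def summarise(log_lines: Iterable[str]) -> dict:
    """Count lines per category; unmatched lines count as 'uncategorised'."""
    counts = Counter()
    for line in log_lines:
        rule = classify(line)
        counts[rule.category if rule else "uncategorised"] += 1
    return dict(counts)
```

Because the rules are plain data, a small project can add a category the day a new kind of failure appears, and the "uncategorised" count shows how much of the log the catalogue does not yet explain.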

What do we get out of all this error identification and tracking? These are some of the practical benefits I have observed with a tool I developed for tracking errors in a continuous integration environment.

  • Proactively identifying errors before they make a huge impact.
  • Faster response to stakeholders about an issue, instead of keeping them waiting.
  • Fewer errors related to configurations and dependencies.
  • Reduced manual effort in finding the root cause (RCA).
  • A better feedback loop for improving our practices and changes, as teams learn from past mistakes.
  • Reduced mean time to repair, because when we already know an error we also maintain the fixes for it.
  • A handbook of things that may go wrong; it is the first place to look when things are not going well.
  • If the tool is developed in the same ecosystem as the product, it can adapt to the product's design constraints, which makes it a vital tool for maintaining quality and standards.

In conclusion, by systematically tracking and analysing errors, teams build a robust CI infrastructure that supports faster, more reliable development cycles, aligning with the goals of DevOps and continuous delivery.
