<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>OpenTelemetry | Endurance Consulting | Fractional CTO &amp; Platform Engineering Leadership</title><link>https://enduranceconsulting.com/tags/opentelemetry/</link><atom:link href="https://enduranceconsulting.com/tags/opentelemetry/index.xml" rel="self" type="application/rss+xml"/><description>OpenTelemetry</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Wed, 15 Apr 2026 00:00:00 +0000</lastBuildDate><image><url>https://enduranceconsulting.com/media/logo_hu_98515048530bb274.png</url><title>OpenTelemetry</title><link>https://enduranceconsulting.com/tags/opentelemetry/</link></image><item><title>OpenTelemetry Adoption: Why Most Organizations Get It Wrong</title><link>https://enduranceconsulting.com/blog/opentelemetry-adoption-strategy/</link><pubDate>Wed, 15 Apr 2026 00:00:00 +0000</pubDate><guid>https://enduranceconsulting.com/blog/opentelemetry-adoption-strategy/</guid><description>&lt;p&gt;After leading observability rationalization projects at Fortune 500 companies and serving as the OpenTelemetry SME at Groundcover, I&amp;rsquo;ve watched dozens of organizations attempt OTEL adoption. Most start with the wrong assumptions.&lt;/p&gt;
&lt;h2 id="the-common-mistake-big-bang-migration"&gt;The Common Mistake: Big Bang Migration&lt;/h2&gt;
&lt;p&gt;Organizations treat OpenTelemetry adoption like a database migration: plan everything, execute once, celebrate. This fails because:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Legacy instrumentation has institutional knowledge baked in&lt;/strong&gt;: your existing dashboards and alerts encode years of production incidents&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Vendor SDKs are wired into everything&lt;/strong&gt;: ripping them out breaks more than observability&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Teams resist learning curves&lt;/strong&gt;: especially when the old tools &amp;ldquo;work fine&amp;rdquo;&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="the-better-approach-strategic-layering"&gt;The Better Approach: Strategic Layering&lt;/h2&gt;
&lt;p&gt;Instead of replacing existing observability, &lt;strong&gt;add OpenTelemetry as a data layer&lt;/strong&gt;:&lt;/p&gt;
&lt;h3 id="phase-1-new-services-only-months-1-3"&gt;Phase 1: New Services Only (Months 1-3)&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Mandate OTEL for all new microservices&lt;/li&gt;
&lt;li&gt;Create golden path instrumentation templates&lt;/li&gt;
&lt;li&gt;Export telemetry in parallel to both your OTEL collector and the legacy backend&lt;/li&gt;
&lt;li&gt;Build confidence without breaking production&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="phase-2-edge-and-mesh-instrumentation-months-3-6"&gt;Phase 2: Edge and Mesh Instrumentation (Months 3-6)&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Instrument API gateways, load balancers, and (if you run one) the service mesh with OTEL&lt;/li&gt;
&lt;li&gt;Capture entry-point and hop-level spans without touching application code&lt;/li&gt;
&lt;li&gt;Push W3C trace-context propagation inward so those spans start linking into end-to-end traces, instead of staying isolated at the edge&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="phase-3-high-value-migrations-months-6-12"&gt;Phase 3: High-Value Migrations (Months 6-12)&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Migrate services where legacy observability is painful&lt;/li&gt;
&lt;li&gt;Target the services that deploy often, break often, and have the worst instrumentation&lt;/li&gt;
&lt;li&gt;Show ROI: faster troubleshooting, lower vendor costs&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="phase-4-observability-pipeline-months-12"&gt;Phase 4: Observability Pipeline (Months 12+)&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Introduce OTEL collector pipelines for sampling, enrichment, routing&lt;/li&gt;
&lt;li&gt;Route less critical data to cheaper backends&lt;/li&gt;
&lt;li&gt;Keep high-value data in premium tools&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="why-this-works"&gt;Why This Works&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;It separates data collection from data destinations.&lt;/strong&gt; Your teams adopt OTEL instrumentation without also migrating dashboards, alerts, and runbooks. You de-risk the transition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;It proves value early.&lt;/strong&gt; By month 4 you have distributed traces across your instrumented paths and edge: request flows most orgs can&amp;rsquo;t see today, and you got there without migrating a single dashboard or alert.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;It strengthens your hand at renewal.&lt;/strong&gt; When your observability vendor sees OpenTelemetry collectors in production, renewal conversations get interesting. You&amp;rsquo;re no longer locked in.&lt;/p&gt;
&lt;h2 id="the-build-vs-buy-decision"&gt;The Build vs. Buy Decision&lt;/h2&gt;
&lt;p&gt;The dirty secret is that &lt;strong&gt;OpenTelemetry is infrastructure, not a product&lt;/strong&gt;. You still need:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Trace storage and query engines&lt;/li&gt;
&lt;li&gt;Dashboarding and visualization&lt;/li&gt;
&lt;li&gt;Alerting and incident management&lt;/li&gt;
&lt;li&gt;SLO tracking and error budgets&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Some organizations build these (Uber, Netflix). Most should buy (Honeycomb, Grafana, Datadog&amp;rsquo;s OTEL support).&lt;/p&gt;
&lt;p&gt;The win isn&amp;rsquo;t avoiding vendors. It&amp;rsquo;s &lt;strong&gt;avoiding vendor lock-in&lt;/strong&gt;.&lt;/p&gt;
&lt;h2 id="real-world-outcomes"&gt;Real-World Outcomes&lt;/h2&gt;
&lt;p&gt;At a Fortune 500 semiconductor manufacturer, we:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Reduced observability spend by 40% over 18 months&lt;/li&gt;
&lt;li&gt;Cut MTTR by 25% through distributed tracing&lt;/li&gt;
&lt;li&gt;Migrated 60% of critical services without a single SEV-1 incident&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;At a major financial services firm:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Consolidated 9 observability vendors down to 3&lt;/li&gt;
&lt;li&gt;Improved signal-to-noise ratio (alert fatigue dropped by 50%)&lt;/li&gt;
&lt;li&gt;Gave teams flexibility to choose backends based on use case&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="where-organizations-need-help"&gt;Where Organizations Need Help&lt;/h2&gt;
&lt;p&gt;Most engineering teams can instrument a service with OTEL. What&amp;rsquo;s hard:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Sampling strategy&lt;/strong&gt;: head-based sampling is cheap but blind; tail-based sampling keeps the errored and slow traces that matter, but forces you to buffer every span until the trace completes. Tuning that tradeoff at scale is the hard part.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Collector pipelines&lt;/strong&gt;: enrichment, routing, backpressure handling, failover&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Schema governance&lt;/strong&gt;: preventing instrumentation drift across 200 microservices&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Organizational change&lt;/strong&gt;: getting SREs to trust new tooling&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;This is where fractional leadership pays off. You need someone who&amp;rsquo;s done this before, can architect the strategy, and doesn&amp;rsquo;t need 6 months to understand your infrastructure.&lt;/p&gt;
&lt;h2 id="bottom-line"&gt;Bottom Line&lt;/h2&gt;
&lt;p&gt;OpenTelemetry adoption is an organizational transformation disguised as a technical problem. Treat it like one.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;strong&gt;Need help with your observability strategy?&lt;/strong&gt; I&amp;rsquo;ve led these transformations at enterprise scale. &lt;a href="https://enduranceconsulting.com/#contact"&gt;Let&amp;rsquo;s talk&lt;/a&gt;.&lt;/p&gt;</description></item></channel></rss>