What 10,000 Slack Messages Reveal About OpenTelemetry Adoption Challenges

Cover image showing Slack message volume over time

The OpenTelemetry community has grown tremendously over the past few years, and with that growth comes valuable insights hidden in our community conversations. We analyzed nearly 10,000 messages from the #otel-collector and #opentelemetry channels on CNCF Slack spanning from May 2019 to December 2025 to understand what challenges users face most often, which components generate the most discussion, and where the community might need additional documentation or tooling improvements.

The Dataset

Our analysis covered 9,966 messages across two of the most active OpenTelemetry Slack channels:

  • #otel-collector: 5,570 messages (56%)
  • #opentelemetry: 4,396 messages (44%)

These messages break down into several categories:

  Category           Percentage
  Questions          46.7%
  Error Reports      25.9%
  Discussions        23.3%
  Configuration       3.0%
  Help Responses      1.0%

The high proportion of questions and error reports (over 72% combined) tells us that these channels serve as critical support resources for the community, and the topics that appear most frequently represent real adoption challenges.

We applied topic modeling using BERTopic to cluster similar messages, then analyzed sentiment and frustration indicators to identify which topics cause the most difficulty. Messages containing error reports, repeated requests for help, or expressions of confusion scored higher on our frustration metric.

Most Discussed Collector Components

Topic modeling revealed clear patterns in which Collector components generate the most community discussion. Here are the top components by message volume:

1. Prometheus Receiver and Exporter (498 messages, 5.0%)

Prometheus integration dominates community discussions. Users frequently ask about:

  • Configuring the Prometheus receiver to scrape metrics
  • Setting up the Prometheus remote write exporter
  • Understanding metric type and metadata preservation across the pipeline
  • Integrating with existing Prometheus infrastructure

This makes sense given Prometheus’s widespread adoption. Many organizations start their OpenTelemetry journey by wanting to integrate with or migrate from existing Prometheus setups. The remote write exporter in particular sees heavy use, as it allows teams to continue using Prometheus as a storage backend while adopting OpenTelemetry for collection and processing.
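For teams in that situation, the shape of the configuration is usually simple. Here is a minimal sketch, assuming a single scrape target and a remote-write-compatible backend; the job name, target address, and remote write URL are placeholders to replace with your own:

    receivers:
      prometheus:
        config:                                   # standard Prometheus scrape config, nested under the receiver's config key
          scrape_configs:
            - job_name: "example-app"             # placeholder job name
              scrape_interval: 30s
              static_configs:
                - targets: ["example-app:8080"]   # placeholder scrape target

    exporters:
      prometheusremotewrite:
        endpoint: "https://prometheus.example.com/api/v1/write"   # placeholder remote write URL
        resource_to_telemetry_conversion:
          enabled: true                           # copy resource attributes onto metric labels

    service:
      pipelines:
        metrics:
          receivers: [prometheus]
          exporters: [prometheusremotewrite]

The scrape configuration is ordinary Prometheus syntax nested under the receiver's config key, which is easy to get wrong when copying from an existing prometheus.yml.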

2. k8sattributes Processor (258 messages, 2.6%)

Kubernetes metadata enrichment is the second most discussed topic. Common challenges include:

  • Pod association and metadata extraction in DaemonSet deployments
  • RBAC permissions for accessing the Kubernetes API
  • Performance implications in large clusters
  • Interaction with the kubeletstats receiver

The complexity of Kubernetes environments and the desire for rich metadata context make this processor essential but sometimes tricky to configure correctly. Users often discover that running the Collector as a DaemonSet requires different pod association rules than running it as a gateway, leading to troubleshooting cycles that could be avoided with clearer guidance.
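As a rough illustration, a DaemonSet (agent) deployment typically looks something like the sketch below. It assumes the Collector's service account already has RBAC permission to read pods, and the metadata list is illustrative rather than exhaustive:

    processors:
      k8sattributes:
        auth_type: serviceAccount
        filter:
          node_from_env_var: KUBE_NODE_NAME   # watch only pods on the local node; requires the downward API to set this env var
        extract:
          metadata:
            - k8s.namespace.name
            - k8s.pod.name
            - k8s.deployment.name
        pod_association:
          - sources:
              - from: resource_attribute
                name: k8s.pod.ip              # match on the pod IP reported by the SDK
          - sources:
              - from: connection              # fall back to the connection's peer address

In a gateway deployment the node filter is usually dropped and pod association relies more heavily on resource attributes, because the connection address points at the previous Collector hop rather than the workload itself.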

3. Tail Sampling Processor (167 messages, 1.7%)

Tail-based sampling generates significant discussion, often with a higher frustration level than other topics. Users struggle with:

  • Policy configuration and interaction between multiple policies
  • Stateful sampling across distributed services
  • Head sampling vs. tail sampling trade-offs
  • Debugging why traces are or aren’t being sampled
  • Understanding the decision wait period and its impact on latency

The stateful nature of tail sampling, which requires collecting all spans of a trace before making a decision, adds operational complexity that head sampling avoids. Many teams end up running both approaches, using head sampling at the SDK level for baseline reduction and tail sampling in the Collector for intelligent retention of interesting traces.
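A hedged sketch of a common starting point (keep errors and slow traces, sample a fraction of the rest); the thresholds and percentages are illustrative:

    processors:
      tail_sampling:
        decision_wait: 10s          # spans arriving after this window miss the sampling decision
        num_traces: 50000           # traces held in memory while waiting for a decision
        policies:
          - name: keep-errors
            type: status_code
            status_code:
              status_codes: [ERROR]
          - name: keep-slow
            type: latency
            latency:
              threshold_ms: 500
          - name: baseline
            type: probabilistic
            probabilistic:
              sampling_percentage: 10

Policies are evaluated independently, and a trace is kept if any one of them samples it, a detail worth keeping in mind when debugging why a particular trace was or was not retained.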

4. Kafka Receiver and Exporter (131 messages, 1.3%)

Kafka integration appears frequently, particularly around:

  • Connection and authentication issues with managed Kafka services such as AWS MSK
  • Topic configuration and consumer group management
  • Message format and serialization
  • High-availability deployment patterns
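To ground the discussion, here is a hedged sketch of a traces exporter talking to a SASL/SCRAM-secured cluster, one common MSK setup; the broker address, topic, and credentials are placeholders:

    exporters:
      kafka:
        brokers: ["b-1.example.kafka.us-east-1.amazonaws.com:9096"]   # placeholder broker address
        topic: otlp_spans
        encoding: otlp_proto
        auth:
          sasl:
            mechanism: SCRAM-SHA-512
            username: ${env:KAFKA_USERNAME}
            password: ${env:KAFKA_PASSWORD}
          tls:
            insecure: false           # verify the broker certificate

The receiver side largely mirrors this configuration, with consumer group settings (group_id) added on top.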

5. Memory Limiter Processor (125 messages, 1.3%)

Resource management is a consistent concern:

  • Proper memory limit configuration relative to container limits
  • GOMEMLIMIT interaction with the memory limiter
  • Debugging memory spikes and OOM situations
  • CPU usage profiling with pprof

Understanding the relationship between Go’s memory management, container limits, and the memory limiter processor requires knowledge that spans multiple domains. The recent addition of GOMEMLIMIT support has helped, but users still need guidance on proper configuration for their specific deployment scenarios.
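As a point of reference, a sketch for a Collector running in a container with a 2 GiB memory limit might look like this; the exact numbers depend on your workload and are placeholders:

    processors:
      memory_limiter:
        check_interval: 1s
        limit_mib: 1600          # soft limit, roughly 80% of the 2 GiB container limit
        spike_limit_mib: 400     # additional headroom reserved for short bursts

The memory limiter is generally placed first in each pipeline so it can refuse new data before the rest of the pipeline allocates memory for it, and setting GOMEMLIMIT on the container to a value slightly below the hard limit gives the Go runtime an extra backstop on top of the processor's checks.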

Top 10 Problem Areas and Pain Points

Beyond component-specific discussions, our frustration analysis identified the topics that cause the most difficulty for users. These represent areas where improved documentation, better error messages, or tooling enhancements could have the highest impact.

1. Connection and Export Failures

The most frustrating experiences relate to OTLP export failures, particularly:

  • DEADLINE_EXCEEDED errors when exporting to backends
  • TLS configuration issues with load balancers
  • gRPC vs. HTTP protocol confusion
  • Connectivity issues behind proxies or in cloud environments
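Several of these issues come down to the difference between the gRPC and HTTP exporters and how each expects its endpoint and TLS settings. A minimal sketch with placeholder endpoints and certificate path:

    exporters:
      otlp:                                        # gRPC exporter; endpoint is host:port
        endpoint: otel-gateway.example.com:4317
        timeout: 30s                               # raise only if the backend is genuinely slow
        tls:
          ca_file: /etc/otelcol/ca.pem             # placeholder CA bundle for a private CA
      otlphttp:                                    # HTTP exporter; endpoint is a full URL
        endpoint: https://otel-gateway.example.com:4318

DEADLINE_EXCEEDED in particular is often a connectivity symptom (wrong port or protocol, a proxy or load balancer dropping gRPC streams) rather than a timeout that simply needs to be raised.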

2. Custom Collector Distributions

Building custom distributions with ocb (OpenTelemetry Collector Builder) generates significant frustration:

  • Version conflicts between components
  • Build failures on specific platforms (Windows MSI builds are a notable pain point)
  • Dependency resolution issues
  • Understanding which components to include
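Most version conflicts can be avoided by pinning every component to the same Collector release. Here is a hedged sketch of a builder manifest; the module versions are placeholders (vX.Y.Z) and should all match the release you are targeting:

    dist:
      name: otelcol-custom
      description: Minimal custom Collector distribution
      output_path: ./otelcol-custom

    receivers:
      - gomod: go.opentelemetry.io/collector/receiver/otlpreceiver vX.Y.Z
    processors:
      - gomod: go.opentelemetry.io/collector/processor/batchprocessor vX.Y.Z
    exporters:
      - gomod: go.opentelemetry.io/collector/exporter/otlpexporter vX.Y.Z
      - gomod: github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter vX.Y.Z

Mixing core and contrib components is fine as long as the versions line up; mismatched versions are a common source of dependency resolution failures.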

3. Configuration Syntax and Validation

Many users struggle with basic configuration:

  • YAML syntax errors that produce cryptic error messages
  • Understanding the relationship between receivers, processors, and exporters
  • Pipeline configuration and data flow
  • Environment variable substitution syntax
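A minimal end-to-end configuration makes the relationships clearer: components are declared at the top level, but do nothing until a pipeline under service references them. The endpoint and variable name here are placeholders:

    receivers:
      otlp:
        protocols:
          grpc:
            endpoint: 0.0.0.0:4317

    processors:
      batch: {}

    exporters:
      otlphttp:
        endpoint: ${env:BACKEND_ENDPOINT}   # expanded from the environment at startup

    service:
      pipelines:
        traces:
          receivers: [otlp]
          processors: [batch]
          exporters: [otlphttp]

A component that is defined but not referenced in any pipeline is never started, which explains a recurring class of "the Collector starts but no data flows" questions.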

4. Context Propagation

Distributed tracing fundamentals cause confusion:

  • B3 vs. W3C trace context formats
  • Baggage propagation across service boundaries
  • Extract and inject operations in SDKs
  • Cross-language propagation issues

5. Attribute and Resource Management

Understanding the data model proves challenging:

  • When to use resource attributes vs. span/metric/log attributes
  • Moving attributes between resource and signal levels
  • Semantic conventions compliance
  • Attribute cardinality and its impact
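One way to make the distinction concrete: resource attributes describe the entity producing the telemetry and apply to everything it emits, while signal-level attributes describe an individual span, metric data point, or log record. A hedged sketch using the resource and attributes processors; the keys and values are illustrative:

    processors:
      resource:
        attributes:
          - key: deployment.environment.name
            value: production
            action: upsert        # attached to the resource, applies to all of its telemetry
      attributes:
        actions:
          - key: http.request.header.authorization
            action: delete        # removed from each individual span or log record

Keeping high-cardinality values such as user or request IDs at the signal level rather than on the resource helps avoid the cardinality problems mentioned above.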

6. OTTL (OpenTelemetry Transformation Language)

While powerful, OTTL generates confusion:

  • Function syntax and available operations
  • Context-specific paths and accessors
  • Debugging transformation failures
  • Performance implications of complex transforms
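A small, hedged example of the transform processor, which is where most users first meet OTTL; the statements are illustrative:

    processors:
      transform:
        error_mode: ignore                 # log and continue on statement errors instead of failing the pipeline
        trace_statements:
          - context: span
            statements:
              - set(attributes["db.system"], "postgresql") where attributes["db.system"] == nil
              - replace_pattern(attributes["db.statement"], "\\d{4,}", "{id}")

The context line matters more than it looks: the same statement behaves differently, or fails to parse, depending on whether it runs in the resource, scope, span, or spanevent context, and that accounts for much of the confusion around paths and accessors.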

7. Kubernetes Operator and Auto-Instrumentation

The Operator simplifies deployment but introduces its own challenges:

  • Instrumentation injection not working as expected
  • Multiple collector deployment modes (DaemonSet vs. Sidecar vs. Deployment)
  • CRD configuration options
  • Troubleshooting injected agents
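For orientation, the two CRDs involved look roughly like the sketch below; the Collector config is trimmed to a minimum, and the endpoint assumes the Operator's default Service naming (the resource name plus a -collector suffix):

    apiVersion: opentelemetry.io/v1beta1
    kind: OpenTelemetryCollector
    metadata:
      name: agent
    spec:
      mode: daemonset                  # or deployment, statefulset, sidecar
      config:
        receivers:
          otlp:
            protocols:
              grpc: {}
        exporters:
          debug: {}
        service:
          pipelines:
            traces:
              receivers: [otlp]
              exporters: [debug]
    ---
    apiVersion: opentelemetry.io/v1alpha1
    kind: Instrumentation
    metadata:
      name: default
    spec:
      propagators: [tracecontext, baggage]
      exporter:
        endpoint: http://agent-collector:4317

Workload pods still need the injection annotation (for example instrumentation.opentelemetry.io/inject-java: "true"), and forgetting it, or placing the Instrumentation resource in a namespace the workload cannot see, is a common cause of "injection isn't working" reports.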

8. Backend Integration

Connecting to observability backends requires effort:

  • Jaeger configuration and migration from legacy setups
  • Vendor-specific exporter configuration
  • Authentication and authorization with managed services
  • Multi-backend routing
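Recent Jaeger versions ingest OTLP natively, so a plain OTLP exporter is usually all a migration requires, and fanning out to more than one backend is a matter of listing multiple exporters in the pipeline. A hedged sketch with placeholder endpoints and header names, assuming an otlp receiver is defined as in the earlier example:

    exporters:
      otlp/jaeger:
        endpoint: jaeger-collector.observability.svc:4317
        tls:
          insecure: true               # common for in-cluster traffic; not for the public internet
      otlphttp/vendor:
        endpoint: https://ingest.vendor.example.com   # placeholder vendor endpoint
        headers:
          api-key: ${env:VENDOR_API_KEY}              # placeholder header name and secret

    service:
      pipelines:
        traces:
          receivers: [otlp]
          exporters: [otlp/jaeger, otlphttp/vendor]

For routing based on attributes, per tenant or per environment, rather than sending everything everywhere, the routing connector is the usual next step.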

9. Docker and Container Deployment

Container-related issues appear regularly:

  • Image selection (contrib vs. core)
  • Version availability on Docker Hub
  • Custom image building
  • Resource limits and performance tuning

10. Queue and Retry Behavior

Understanding the exporter helper’s behavior:

  • Persistent queue configuration and storage
  • Retry policies and backoff behavior
  • Data loss scenarios and prevention
  • Queue sizing for high-volume deployments
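A hedged sketch of the exporter helper settings together with a file_storage extension for a persistent queue; the directory, sizes, and intervals are placeholders:

    extensions:
      file_storage:
        directory: /var/lib/otelcol/queue   # placeholder path; must persist across restarts

    exporters:
      otlp:
        endpoint: otel-gateway.example.com:4317
        sending_queue:
          enabled: true
          num_consumers: 4
          queue_size: 5000
          storage: file_storage             # spill the queue to disk instead of holding it in memory
        retry_on_failure:
          enabled: true
          initial_interval: 5s
          max_interval: 30s
          max_elapsed_time: 300s            # after this, the batch is dropped

    service:
      extensions: [file_storage]

The two main data loss scenarios follow directly from these settings: a batch is dropped once max_elapsed_time is exhausted, and new data is rejected when the queue is full, so both values need to be sized against the longest backend outage you intend to ride out.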

What This Tells Us

Several themes emerge from this analysis:

The Prometheus ecosystem remains central. Organizations aren’t abandoning Prometheus; they’re integrating it with OpenTelemetry. Documentation and tooling that bridges these ecosystems will continue to be valuable.

Kubernetes complexity compounds OTel complexity. The k8sattributes processor and Operator discussions show that Kubernetes environments introduce additional layers of configuration and troubleshooting. Simplified deployment patterns and better defaults could help.

Sampling is conceptually difficult. Tail sampling, despite being well documented, generates ongoing confusion. Interactive tools or visualization of sampling decisions might help users understand and debug their configurations.

Error messages need improvement. Many frustration-heavy discussions start with a cryptic error message. Investing in actionable error messages with suggested fixes would significantly improve the user experience.

The gap between “getting started” and “production ready” is real. Basic tutorials work, but scaling to production with proper memory limits, persistent queues, and multi-backend routing requires significant learning.

Moving Forward

We hope this analysis helps maintainers and SIGs identify areas where documentation improvements would have the highest impact. The data clearly shows that certain topics, particularly around configuration patterns, sampling strategies, and multi-backend deployments, generate recurring questions that better guides could address.

On my end, I have lined up a series of articles that tackle some of these pain points directly, covering topics like decomposing Collector configuration files into manageable pieces, routing telemetry to multiple backends based on tenant or environment, and building effective tail sampling strategies.

Acknowledgments

Thank you to everyone who participates in the OpenTelemetry Slack community. Your questions, error reports, and discussions not only help each other but also provide valuable signal for where the project can improve. A special thanks to the community members who take time to answer questions and share their experiences: the 1% of help responses in our data represent countless hours of volunteer effort that makes this community welcoming for newcomers.


This analysis used topic modeling and sentiment analysis on publicly available Slack messages. Individual messages were aggregated into topics; no personally identifiable information was used in this report.