Observability
Better Stack, Highlight, Axiom, ArcticDB & New Relic seem useful.
Vector seems nice too.
Notes
Links
- What is Observability (2021)
- Why Your Services Need Observability (2021)
- New Relic - Observability platform built to help engineers create more perfect software. (GitHub) (New Relic Helm Charts)
- Elastic Observability - Bring your logs, metrics, and APM traces together at scale.
- What's your preferred cloud application monitoring tool? (2020)
- Dynatrace - Cloud Monitoring.
- Are we observable yet? An introduction to Rust telemetry (2020)
- Suzieq - Framework and application for network observability. (Docs)
- Honeycomb Observability - Observe, Debug and Improve Production.
- Orijtech - Observability, developer tools, cloud. (GitHub)
- Opstrace - Secure Observability Deployed in your Network. (Code) (HN)
- Adaptive Request Concurrency. Resilient observability at scale. (2020)
- Observability, Getting Started - 50 Free Access and Open-Source Solutions (2020)
- tobs - Observability Stack for Kubernetes.
- Sensu - Observability Pipeline that delivers monitoring as code on any cloud. (Code)
- SigNoz - Open source Observability platform. Alternative to DataDog. (Code) (HN) (HN)
- Data Observability: Building Your Own Data Quality Monitors Using SQL (2021)
- Data Observability Universe (2020)
- Observability: What it is and why you need it | Logging, Metrics & Alerts (2021)
- Metaplane - Observability for modern data stacks. (HN)
- Testing vs Observability: Which is right for your data quality needs? (2021)
- Observability: A New Theory Based on the Group of Invariance (2020) (HN)
- Unpacking Observability: The Observability Stack (2021)
- Hydrolix - Elastic cloud data platform built for observability. (1.1 Billion Taxi Rides using Hydrolix on AWS)
- Axiom - Serverless log management solution. (GitHub) (Twitter) (Axiom Elements)
- Free Your Services From Vendor Lock-in With OpenTelemetry (2021)
- A tale of Distributed Context (2021)
- Calyptia - First Mile Observability.
- tracing-filter - Query language for filtering tracing spans and events.
- OpenTelemetry Collector Contrib - Contrib repository for the OpenTelemetry Collector.
- Awesome Observability
- Elementary - Open-source data observability framework for modern data teams. Move fast and be confident about your data.
- Hypertrace - Open source distributed tracing & observability platform. (Code)
- Deepfence - Open source cloud native security observability platform. Linux, K8s, AWS Fargate and more. (Code)
- AppScope - General-Purpose Observable Application Telemetry System. (Code)
- Observability is not only for SREs (HN)
- Tetragon - eBPF-based Security Observability and Runtime Enforcement.
- How We Built Alert Rules, Runbooks, and Dashboards to Observe Our Observability Tool (2022)
- Duo - Easy-to-use observability solution that provides both logging and tracing capabilities for Rust applications.
- Observables - Tiny (850B) and fast reactive observables library via functions. (HN)
- O11y toolkit - Set of utilities that help you debug, augment, and manage your open source observability stack. (Code)
- Vector - High-performance observability data pipeline. (Web)
- OpenTelemetry - OpenTelemetry functions for shells.
- OpenTelemetry Demo - OpenTelemetry Community Demo Application.
- OpenTelemetry Enhancement Proposals
- Prodfiler - Whole-system Continuous Profiling Platform. (Docs)
- The best way to find performance bottlenecks: observing production (2022)
- The four pillars of data observability: metrics, metadata, lineage, and logs
- flow-telemetry - Adding observability to feature flow with OpenTelemetry.
- Coralogix - Full-Stack Observability Platform with In-Stream Data Analytics.
- Metalens - Stream-based visual programming language for systems observability. Live Programming and Visualizing eBPF.
- "Building Observability for 99% Developers" by Jean Yang (2022)
- Open Data Discovery - Open source data discovery and observability platform. (HN)
- Best Practices for Observability, Metrics, Logging (2022)
- Orb - Cloud native orchestration platform for dynamic network observability. (Web)
- Kosli - Track and query every change from commit through to production. (GitHub) (CLI)
- Dave Lucia on Observability at Bitfo (2022)
- Elixir Observability: OpenTelemetry, Lightstep, Honeycomb (2022)
- Glean SDK Docs - Modern cross-platform telemetry client libraries. (Glean Dictionary)
- Cito - Snowflake Observability Software.
- A beginner’s guide to OpenTelemetry (2022)
- The Modern Observability Problem (HN)
- StatsHouse - Highly-available, scalable, multi-tenant monitoring system.
- Vast - Visibility Across Space and Time – The network telemetry engine for data-driven security investigations. (Docs)
- Awesome Monitoring
- Velociraptor - Endpoint visibility and collection tool.
- Stanza - Fast and lightweight log transport and processing agent.
- Using eBPF and predefined inspections to minimize “observability tax” (2022) (HN)
- Odigos - Instant distributed tracing for Kubernetes clusters. (HN)
- Last9 - Providing visibility into microservices.
- Weasel - Gather end-user browser performance data.
- Instana - Enterprise Observability and APM for Cloud-Native Applications. (GitHub)
- Instana Go Collector - Go Distributed Tracing & Metrics Sensor for Instana.
- Dynolog - Performance monitoring daemon for heterogeneous CPU-GPU systems.
- Zinc Observe - Cheap petabyte scale observability platform.
- highlight.io - Open source, full-stack monitoring platform. Error monitoring, session replay, logging and more. (Code) (HN) (HN)
- Digma - Continuous Feedback pipeline, consisting of an analysis backend and an IDE plugin.
- Kindling - eBPF-based Cloud Native Monitoring Tool.
- Rezolus - Tool for collecting detailed systems performance telemetry and exposing burst patterns through high-resolution telemetry.
- Opting In to Transparent Telemetry (2023)
- Getting started with monitoring (2023)
- otel-cli - OpenTelemetry command-line tool for sending events from shell scripts & similar environments.
- Teletrace - Open-Source Tracing Platform.
- Read Every Single Error (2023)
- Infino - Fast and scalable service to store time series and logs - written in Rust.
- OpenObserve - Elasticsearch/Datadog alternative. (HN)
- Performance Optimizer Observation Platform
- End-to-end Tracing
- OpenMeter - Real-Time and Scalable Usage Metering.
- Tesla Fleet Telemetry
- Observability in Practice (2023)
- 0x.Tools - Always-on Profiling for Production Systems.
- OpenTelemetry: A beginner’s handbook to instrument your application (2023)
- OTel Arrow Protocol implementation - Protocol and libraries for sending and receiving OpenTelemetry data using Apache Arrow.
- Scanner - Petabyte-scale security data lake in AWS S3.
- OpenTelemetry in 2023