Pipeline should have middleware and be configured

Currently, the `Datadog::Pipeline` is a global-level class that has mutable state. It is configured via `Datadog::Pipeline.before_flush { }` and used via  `Datadog::Pipeline.process!`.

Some downsides:
 - It is not possible to "reset" the state of the pipeline (once its modified, its permanent until application restart.)
 - Its state is effectively a global variable, which has negative consequences (particularly Ractors)

It would be better if:

1. Pipeline was instantiated, rather than accessed through class methods. Perhaps middleware style?
    ```ruby
    Datadog::Pipeline.new do |p|
      # Custom processor: passed "traces", returns traces
      p.process HealthCheckTraceFilter
      # Shorthand for processing traces in aggregate
      # Return traces to be kept
      p.process { |traces| }
      # Shorthand for filtering each individual trace
      # Return truthy to keep, falsey to remove
      p.trace_filter { |trace| }
      # Shorthand for filtering each individual span
      # Return truthy to keep, falsey to remove
      p.span_filter { |span| }
    end
    ```
2. Pipeline was managed by configuration e.g.
    ```ruby
    Datadog.configure do |c|
      c.pipeline do |p|
        p.process HealthCheckTraceFilter
        # More pipeline middleware here...
      end
    end
    ```

This would make it easier to rebuild and modify the pipeline in a controlled, non-ambiguous way.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Pipeline should have middleware and be configured #1513

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Pipeline should have middleware and be configured #1513

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions