Imagine that we’ve refactored our generateDadJoke program from our Getting Started guide. Now, instead of handling all errors internally, the code can fail with domain-specific issues like network interruptions or provider outages:
The Effect interface defines a value that describes a workflow or job,
which can succeed or fail.
Details
The Effect interface represents a computation that can model a workflow
involving various types of operations, such as synchronous, asynchronous,
concurrent, and parallel interactions. It operates within a context of type
R, and the result can either be a success with a value of type A or a
failure with an error of type E. The Effect is designed to handle complex
interactions with external resources, offering advanced features such as
fiber-based concurrency, scheduling, interruption handling, and scalability.
This makes it suitable for tasks that require fine-grained control over
concurrency and error management.
To execute an Effect value, you need a Runtime, which provides the
environment necessary to run and manage the computation.
Provides a way to write effectful code using generator functions, simplifying
control flow and error handling.
When to Use
Effect.gen allows you to write code that looks and behaves like synchronous
code, but it can handle asynchronous tasks, errors, and complex control flow
(like loops and conditions). It helps make asynchronous code more readable
and easier to manage.
The generator functions work similarly to async/await but with more
explicit control over the execution of effects. You can yield* values from
effects and return the final result at the end.
AiPlan<in Error, in out Provides, in out Requires>.Provider<Completions | Tokenizer>.provide: <AiResponse.AiResponse, NetworkError|ProviderOutage, Completions.Completions>(effect:Effect.Effect<AiResponse.AiResponse, NetworkError|ProviderOutage, Completions.Completions>) => Effect.Effect<...>
Retry the program a fixed number of times on NetworkErrors
Add some backoff delay between retries
Fallback to a different model provider if OpenAi is down
How can we accomplish such logic?
The AiPlan Type
The Effect AI integrations provide a robust method for creating structured execution plans for your LLM interactions through the AiPlan data type. Rather than making a single model call and hoping it succeeds, AiPlan lets you describe how to handle retries, fallbacks, and recoverable errors in a clear, declarative way.
This is especially useful when:
You want to fall back to a secondary model if the primary one is unavailable
You want to retry on transient errors (e.g. network failures)
You want to control timing between retry attempts
Think of an AiPlan as the blueprint for for an LLM interaction, with logic for when to keep trying to interact with one provider and when to switch providers.
interfaceAiPlan<Errors, Provides, Requires> {}
An AiPlan has three generic type parameters:
Errors: Any errors that can be handled during execution of the plan
Provides: The services that this plan can provide (e.g. Completions, Embeddings)
Requires: The services that this plan requires (e.g. OpenAiClient, AnthropicClient)
If you’ve used AiModel before (via OpenAiCompletions.model() or similar), you’ll find AiPlan familiar. In fact, an AiModelis in fact an AiPlan with just a single step.
This means you can start by writing your code with plain AiModels, and as your needs become more complex (e.g. adding retries, switching providers), you can upgrade to AiPlan without changing how the rest of your code works.
Defining a Primary Model
The primary entry point to building up an AiPlan is the AiPlan.fromModel constructor.
This method defines the primary model that you would like to use for an LLM interaction, as well as the rules for retrying it under specific conditions.
The Effect interface defines a value that describes a workflow or job,
which can succeed or fail.
Details
The Effect interface represents a computation that can model a workflow
involving various types of operations, such as synchronous, asynchronous,
concurrent, and parallel interactions. It operates within a context of type
R, and the result can either be a success with a value of type A or a
failure with an error of type E. The Effect is designed to handle complex
interactions with external resources, offering advanced features such as
fiber-based concurrency, scheduling, interruption handling, and scalability.
This makes it suitable for tasks that require fine-grained control over
concurrency and error management.
To execute an Effect value, you need a Runtime, which provides the
environment necessary to run and manage the computation.
Creates a schedule that recurs indefinitely with exponentially increasing
delays.
Details
This schedule starts with an initial delay of base and increases the delay
exponentially on each repetition using the formula base * factor^n, where
n is the number of times the schedule has executed so far. If no factor
is provided, it defaults to 2, causing the delay to double after each
execution.
Provides a way to write effectful code using generator functions, simplifying
control flow and error handling.
When to Use
Effect.gen allows you to write code that looks and behaves like synchronous
code, but it can handle asynchronous tasks, errors, and complex control flow
(like loops and conditions). It helps make asynchronous code more readable
and easier to manage.
The generator functions work similarly to async/await but with more
explicit control over the execution of effects. You can yield* values from
effects and return the final result at the end.
AiPlan<in Error, in out Provides, in out Requires>.Provider<Completions | Tokenizer>.provide: <AiResponse.AiResponse, NetworkError|ProviderOutage, Completions.Completions>(effect:Effect.Effect<AiResponse.AiResponse, NetworkError|ProviderOutage, Completions.Completions>) => Effect.Effect<...>
Attempt to use OpenAi’s "gpt-4o" model up to 3 times
Wait with an exponential backoff between attempts (starting at 100ms)
Only re-attempt the call to OpenAi if the error is a NetworkError
Adding Fallback Models
To make your LLM interactions resilient to provider outages, you can define a fallback model to use via AiPlan.withFallback. This will allow the plan to automatically fallback to another model if the previous step in the execution plan fails.
Use this when:
You want to make your LLM interactions resilient to provider outages
You want to potentially have multiple fallback models
Example (Adding a Fallback to Anthropic from OpenAi)
The Effect interface defines a value that describes a workflow or job,
which can succeed or fail.
Details
The Effect interface represents a computation that can model a workflow
involving various types of operations, such as synchronous, asynchronous,
concurrent, and parallel interactions. It operates within a context of type
R, and the result can either be a success with a value of type A or a
failure with an error of type E. The Effect is designed to handle complex
interactions with external resources, offering advanced features such as
fiber-based concurrency, scheduling, interruption handling, and scalability.
This makes it suitable for tasks that require fine-grained control over
concurrency and error management.
To execute an Effect value, you need a Runtime, which provides the
environment necessary to run and manage the computation.
Creates a schedule that recurs indefinitely with exponentially increasing
delays.
Details
This schedule starts with an initial delay of base and increases the delay
exponentially on each repetition using the formula base * factor^n, where
n is the number of times the schedule has executed so far. If no factor
is provided, it defaults to 2, causing the delay to double after each
execution.
Creates a schedule that recurs indefinitely with exponentially increasing
delays.
Details
This schedule starts with an initial delay of base and increases the delay
exponentially on each repetition using the formula base * factor^n, where
n is the number of times the schedule has executed so far. If no factor
is provided, it defaults to 2, causing the delay to double after each
execution.
Provides a way to write effectful code using generator functions, simplifying
control flow and error handling.
When to Use
Effect.gen allows you to write code that looks and behaves like synchronous
code, but it can handle asynchronous tasks, errors, and complex control flow
(like loops and conditions). It helps make asynchronous code more readable
and easier to manage.
The generator functions work similarly to async/await but with more
explicit control over the execution of effects. You can yield* values from
effects and return the final result at the end.
AiPlan<in Error, in out Provides, in out Requires>.Provider<Completions | Tokenizer>.provide: <AiResponse.AiResponse, NetworkError|ProviderOutage, Completions.Completions>(effect:Effect.Effect<AiResponse.AiResponse, NetworkError|ProviderOutage, Completions.Completions>) => Effect.Effect<...>