Optimization

Base Classes and Data Structures

GradComponent and LossComponent are subclasses of Component that distinguish the gradient and loss components in the optimization process. They also serve as base classes when users want to implement their own components with more customization.

optim.parameter

Parameter is used by Optimizer, Trainer, and AdalComponent for auto-optimization.

optim.optimizer

Base Classes for AdalFlow Optimizers, including Optimizer, TextOptimizer, and DemoOptimizer.

optim.grad_component

Base class for Autograd Components that can be called and backpropagated through.

optim.loss_component

Base class for Autograd Components that can be called and backpropagated through.

optim.types

All data types used by Parameter, Optimizer, AdalComponent, and Trainer.

Few Shot Optimizer

optim.few_shot.bootstrap_optimizer

Adapted and optimized bootstrap few-shot optimizer.

Textual Gradient

optim.text_grad.llm_text_loss

Implementation of TextGrad: Automatic “Differentiation” via Text.

optim.text_grad.text_loss_with_eval_fn

Adapted from text_grad's String Based Function

optim.text_grad.ops

Text-grad operations such as Sum and Aggregate.

optim.text_grad.tgd_optimizer

Text-grad optimizer and prompts.

Trainer and AdalComponent

optim.trainer.adal

AdalComponent provides an interface to compose the different parts (eval_fn, train_step, loss_step, optimizers, backward engine, teacher generator, etc.) to work with Trainer.

optim.trainer.trainer

Ready-to-use trainer for LLM task pipelines.

Overview

class Optimizer[source]

Bases: object

Base class for all optimizers.

proposing: bool = False
params: Iterable[Parameter] | Iterable[Dict[str, Any]]
state_dict()[source]
propose(*args, **kwargs)[source]
step(*args, **kwargs)[source]
revert(*args, **kwargs)[source]
class RandomSampler(dataset: Sequence[T_co] | None = None, default_num_shots: int | None = None)[source]

Bases: Sampler, Generic[T_co]

Simple random sampler to sample from the dataset.

set_dataset(dataset: Sequence[T_co])[source]

Set the dataset for the sampler

random_replace(shots: int, samples: List[Sample[T_co]], replace: bool | None = False) List[Sample[T_co]][source]

Randomly replace shots entries in the given samples.

If replace is True, duplicate checks are skipped.

random_sample(shots: int, replace: bool | None = False) List[Sample][source]

Randomly sample shots samples from the dataset. If replace is True, sample with replacement, meaning the same sample can be drawn multiple times.

call(num_shots: int | None = None, replace: bool | None = False) List[Sample][source]

Abstract method to do the main sampling
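For illustration, a minimal usage sketch of RandomSampler on a toy dataset (the dataset contents and shot counts are placeholders):

dataset = [{"question": f"q{i}", "answer": f"a{i}"} for i in range(10)]
sampler = RandomSampler(dataset=dataset, default_num_shots=5)

samples = sampler.random_sample(shots=3)                    # 3 samples without replacement
samples = sampler.random_replace(shots=1, samples=samples)  # swap 1 of them for another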

class ClassSampler(dataset: Sequence[T_co], num_classes: int, get_data_key_fun: Callable, default_num_shots: int | None = None)[source]

Bases: Sampler, Generic[T_co]

Sample from the dataset based on the class labels.

T_co can be any type of data, e.g., dict, list, etc. with get_data_key_fun to extract the class label.

Example:

dataset = [{"coarse_label": i} for i in range(10)]
sampler = ClassSampler[Dict](dataset, num_classes=6, get_data_key_fun=lambda x: x["coarse_label"])

random_replace(shots: int, samples: List[Sample], replace: bool | None = False, weights_per_class: List[float] | None = None) Sequence[Sample[T_co]][source]

Randomly select shots entries from the given samples and replace them with other samples that have the same class index.

random_sample(num_shots: int, replace: bool | None = False) List[Sample[T_co]][source]

Randomly sample num_shots from the dataset. If replace is True, sample with replacement.

call(num_shots: int, replace: bool | None = False) List[Sample[T_co]][source]

Sample num_shots from the dataset. If replace is True, sample with replacement.

class Sampler(*args, **kwargs)[source]

Bases: Generic[T_co]

dataset: Sequence[object] = None
set_dataset(dataset: Sequence[T_co])[source]

Set the dataset for the sampler

random_replace(*args, **kwargs)[source]

Randomly replace some samples

You can have two arguments, e.g., shots and samples, or shots, samples, and replace.

call(*args, **kwargs) List[Sample[T_co]][source]

Abstract method to do the main sampling

class Parameter(*, id: str | None = None, data: ~optim.parameter.T = None, data_id: str = None, requires_opt: bool = True, role_desc: str = '', param_type: ~adalflow.optim.types.ParameterType = <ParameterType.NONE: none, ''>, name: str = None, instruction_to_optimizer: str = None, instruction_to_backward_engine: str = None, score: float | None = None, eval_input: object = None, successor_map_fn: ~typing.Dict[str, ~typing.Callable] | None = None, data_in_prompt: ~typing.Callable = None)[source]

Bases: Generic[T]

A data container to represent a parameter used for optimization.

A parameter enforces a specific data type and can be updated in place. When parameters are used in a component (i.e., assigned as Component attributes), they are automatically added to the list of its parameters and will appear in the parameters() or named_parameters() methods.

Args:

End users only need to create the Parameter with four arguments and pass it to the prompt_kwargs in the Generator.

  • data (str): the data of the parameter.

  • requires_opt (bool, optional): whether the parameter requires optimization. Default: True.

  • role_desc (str, optional): a description of the parameter's role.

  • param_type (ParameterType, optional): the type of the parameter, including ParameterType.PROMPT for instruction optimization and ParameterType.DEMOS for few-shot optimization.

  • instruction_to_optimizer (str, optional): instruction to the optimizer. Default: None.

  • instruction_to_backward_engine (str, optional): instruction to the backward engine. Default: None.

The parameters users create will automatically be assigned to the variable name/key in prompt_kwargs for easy reading and debugging in the trace_graph.
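For example, a typical prompt parameter passed to a Generator could look like the following minimal sketch (the template, model client, and model choice are illustrative assumptions):

import adalflow as adal
from adalflow.components.model_client import OpenAIClient  # any supported model client

system_prompt = adal.Parameter(
    data="You are a concise assistant. Answer the question directly.",
    requires_opt=True,
    role_desc="The system prompt guiding the assistant's behavior",
    param_type=adal.ParameterType.PROMPT,
)

generator = adal.Generator(
    model_client=OpenAIClient(),                    # assumed client choice
    model_kwargs={"model": "gpt-3.5-turbo"},        # assumed model
    template=r"<SYS>{{system_prompt}}</SYS> {{input_str}}",
    prompt_kwargs={"system_prompt": system_prompt},
)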

References:

  1. https://github.com/karpathy/micrograd/blob/master/micrograd/engine.py

allowed_types = {<ParameterType.DEMOS: demos, 'A few examples to guide the language model.'>, <ParameterType.HYPERPARAM: hyperparam, 'Hyperparameters/args for the component.'>, <ParameterType.INPUT: input, 'The input to the component.'>, <ParameterType.NONE: none, ''>, <ParameterType.PROMPT: prompt, 'Instruction to the language model on task, data, and format.'>}
proposing: bool = False
predecessors: Set[Parameter] = {}
peers: Set[Parameter] = {}
tgd_optimizer_trace: TGDOptimizerTrace = None
id: str = None
data_id: str = None
role_desc: str = ''
name: str = None
param_type: ParameterType
data: T = None
eval_input: object = None
successor_map_fn: Dict[str, Callable] = None
data_in_prompt: Callable = None
gt: object = None
map_to_successor(successor: object) T[source]

Apply the map function to the successor based on the successor’s id.

add_successor_map_fn(successor: object, map_fn: Callable)[source]

Add or update a map function of the value for a specific successor using its id. The successor will know the value of the current parameter.

check_if_already_computed_gradient_respect_to(response_id: str) bool[source]
set_gt(gt: object)[source]
get_gt() object[source]
add_gradient(gradient: Gradient)[source]
reset_gradients()[source]
get_gradients_names() str[source]
get_prompt_data() str[source]
get_gradients_str() str[source]
get_gradient_and_context_text(skip_correct_sample: bool = False) str[source]

Aggregates and returns: 1. the gradients 2. the context text for which the gradients are computed

Sort the gradients from the lowest score to the highest score. Highlight the gradients with the lowest score to the optimizer.

get_gradients_component_schema(skip_correct_sample: bool = False) str[source]

Aggregates and returns: 1. the gradients 2. the context text for which the gradients are computed

Sort the gradients from the lowest score to the highest score. Highlight the gradients with the lowest score to the optimizer.

merge_gradients_for_cycle_components()[source]

Merge gradients that share the same data_id and from_response_component_id into one gradient.

sort_gradients()[source]

Following the rules described in the Gradient class, gradients are sorted by data_id, then response_component_id, then score.

set_predecessors(predecessors: List[Parameter] = None)[source]
set_grad_fn(grad_fn)[source]
get_param_info()[source]

Used to represent the parameter in the prompt.

set_peers(peers: List[Parameter] = None)[source]
trace_optimizer(api_kwargs: Dict[str, Any], response: TGDData)[source]

Trace the inputs and output of a TGD optimizer.

set_eval_fn_input(eval_input: object)[source]

Set the input for the eval_fn.

set_score(score: float)[source]

Set the score of the parameter in the backward pass. For intermediate nodes, there is only one score per eval fn behind this node. For leaf nodes, like DEMO or PROMPT, there will be [batch_size] scores.

This score is only used to relay the score to the demo parameter.

add_dataclass_to_trace(trace: DataClass, is_teacher: bool = True)[source]

Called by the generator.forward to add a trace to the parameter.

It is important to allow updating the trace, as this gives different sampling weights. If the score increases as training goes on, the sample becomes less likely to be sampled, allowing the samples to be more diverse. Otherwise, failed examples would keep being sampled.

add_score_to_trace(trace_id: str, score: float, is_teacher: bool = True)[source]

Called by the generator.backward to add the eval score to the trace.

propose_data(data: T, demos: List[DataClass] | None = None)[source]

Used by the optimizer to set the new data while saving the previous data in case of a revert.

revert_data(include_demos: bool = False)[source]

Revert the data to the previous data.

step_data(include_demos: bool = False)[source]

Use PyTorch’s optimizer syntax to finalize the update of the data.

get_grad_fn()[source]
update_value(data: T)[source]

Update the parameter’s value in-place, checking for type correctness.

get_short_value(n_words_offset: int = 10) str[source]

Returns a short version of the value of the variable. We sometimes use it during optimization, when we want to see the value of the variable, but don’t want to see the entire value. This is sometimes to save tokens, sometimes to reduce repeating very long variables, such as code or solutions to hard problems. :param n_words_offset: The number of words to show from the beginning and the end of the value. :type n_words_offset: int

reset_all_gradients()[source]

Traverse the graph and reset the gradients for all nodes.

static trace_graph(root: Parameter) Tuple[Set[Parameter], Set[Tuple[Parameter, Parameter]]][source]
backward()[source]

Apply the backward pass for all nodes in the graph by reversing the topological order.

static generate_node_html(node: Parameter, output_dir='node_pages')[source]

Generate an HTML page for a specific node.

draw_interactive_html_graph(filepath: str | None = None, nodes: List[Parameter] = None, edges: List[Tuple[Parameter, Parameter]] = None) Dict[str, Any][source]

Generate an interactive graph with pyvis and save as an HTML file.

Parameters:
  • nodes (list) – A list of Parameter objects.

  • edges (list) – A list of edges as tuples (source, target).

  • filepath (str, optional) – Path to save the graph file. Defaults to None.

Returns:

A dictionary containing the graph file path.

Return type:

dict

static wrap_and_escape(text, width=40)[source]

Wrap text to the specified width, considering HTML breaks, and escape special characters.

draw_graph(add_grads: bool = True, full_trace: bool = False, format: Literal['png', 'svg'] = 'png', rankdir: Literal['LR', 'TB'] = 'TB', filepath: str | None = None) Dict[str, Any][source]

Draw the graph of the parameter and its gradients.

Parameters:
  • add_grads (bool, optional) – Whether to add gradients to the graph. Defaults to True.

  • format (str, optional) – The format of the output file. Defaults to “png”.

  • rankdir (str, optional) – The direction of the graph. Defaults to “TB”.

  • filepath (str, optional) – The path to save the graph. Defaults to None.

  • full_trace (bool, optional) – Whether to include more detailed trace such as api_kwargs. Defaults to False.
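For instance, after a backward pass the trace graph can be rendered with a rough sketch like this (the loss variable is assumed to be the output/loss Parameter of a forward pass):

loss.backward()
loss.draw_graph(add_grads=True, format="png", rankdir="TB", filepath="trace_graph")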

draw_output_subgraph(add_grads: bool = True, format: str = 'png', rankdir: str = 'TB', filepath: str = None) Dict[source]

Build and visualize a subgraph containing only OUTPUT parameters.

Parameters:
  • add_grads (bool) – Whether to include gradient edges.

  • format (str) – Format for output (e.g., png, svg).

  • rankdir (str) – Graph layout direction (“LR” or “TB”).

  • filepath (str) – Path to save the graph.

draw_component_subgraph(format: str = 'png', rankdir: str = 'TB', filepath: str = None)[source]

Build and visualize a subgraph containing only OUTPUT parameters.

Parameters:
  • format (str) – Format for output (e.g., png, svg).

  • rankdir (str) – Graph layout direction (“LR” or “TB”).

  • filepath (str) – Path to save the graph.

to_dict()[source]
classmethod from_dict(data: dict)[source]
class OutputParameter(*, id: str | None = None, data: ~optim.parameter.T = None, data_id: str = None, requires_opt: bool = True, role_desc: str = '', param_type: ~adalflow.optim.types.ParameterType = <ParameterType.OUTPUT: output, 'The output of the component.'>, name: str = None, instruction_to_optimizer: str = None, instruction_to_backward_engine: str = None, score: float | None = None, eval_input: object = None, successor_map_fn: ~typing.Dict[str, ~typing.Callable] | None = None, data_in_prompt: ~typing.Callable | None = None, full_response: ~typing.Any | None = None)[source]

Bases: Parameter

The output parameter is the most complex type of parameter in the system.

It will trace the predecessors, set up a grad_fn, store gradients, and trace the forward pass by tracking the component_trace.

allowed_types = {<ParameterType.GENERATOR_OUTPUT: generator_output, 'The output of the generator.'>, <ParameterType.LOSS_OUTPUT: loss, 'The loss value.'>, <ParameterType.OUTPUT: output, 'The output of the component.'>, <ParameterType.SUM_OUTPUT: sum, 'The sum of the losses.'>}
component_trace: ComponentTrace = None
full_response: object = None
trace_forward_pass(input_args: Dict[str, Any], full_response: object, id: str = None, name: str = None)[source]

Trace the forward pass of the parameter, adding the component information to the trace.

trace_api_kwargs(api_kwargs: Dict[str, Any])[source]

Trace the api_kwargs for components like Generator and Retriever that pass to the model client.

to_dict()[source]
classmethod from_dict(data: dict)[source]
class BackwardContext(disable_backward_engine: bool, backward_fn: Callable, backward_engine: BackwardEngine = None, *args, **kwargs)[source]

Bases: object

Represents a context for backward computation.

Parameters:
  • backward_fn (callable) – The backward function to be called during backward computation.

  • args – Variable length argument list to be passed to the backward function.

  • kwargs – Arbitrary keyword arguments to be passed to the backward function.

Variables:
  • backward_fn (callable) – The backward function to be called during backward computation.

  • fn_name (str) – The fully qualified name of the backward function.

  • args – Variable length argument list to be passed to the backward function.

  • kwargs – Arbitrary keyword arguments to be passed to the backward function.

Method __call__(backward_engine: EngineLM) -> Any:

Calls the backward function with the given backward engine and returns the result.

Method __repr__() -> str:

Returns a string representation of the BackwardContext object.

class BootstrapFewShot(params: List[Parameter], raw_shots: int | None = None, bootstrap_shots: int | None = None, dataset: List[DataClass] | None = None, weighted: bool = True, exclude_input_fields_from_bootstrap_demos: bool = False)[source]

Bases: DemoOptimizer

BootstrapFewShot performs few-shot sampling used in few-shot ICL.

It is used to optimize the demos parameters. Based on research from the AdalFlow team and the DSPy library.

Compared with DSPy's version:
  1. We added weighted sampling for both the raw and augmented demos, prioritizing demos that failed in the raw run but succeeded in the augmented run, based on the evaluation score collected while backpropagating the demo samples.

  2. By default, we exclude the input fields from the augmented demos. Our research finds that using the reasoning demonstrations from the teacher model can be more effective in some cases than taking both input and output samples, and is more token efficient.

Reference: - DSPy: Compiling declarative language model calls into state-of-the-art pipelines.
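A minimal usage sketch, assuming the task exposes DEMOS parameters and a train dataset of DataClass samples is already loaded (in practice the Trainer drives propose/step/revert automatically):

demo_params = [p for p in task.parameters() if p.param_type == ParameterType.DEMOS]
optimizer = BootstrapFewShot(
    params=demo_params,
    raw_shots=1,
    bootstrap_shots=1,
    dataset=train_dataset,   # List[DataClass], assumed defined
    weighted=True,
)

optimizer.propose()   # sample new demos onto the parameters
# ... evaluate the pipeline with the proposed demos ...
optimizer.step()      # keep the proposal, or optimizer.revert() to discard it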

add_scores(ids: List[str], scores: List[float], is_teacher: bool = True)[source]

Add scores for each demo via _teacher_scores or _student_scores.

config_shots(raw_shots: int, bootstrap_shots: int)[source]

Initialize the samples for each parameter.

config_dataset(dataset: List[DataClass])[source]
property num_shots: int
sample(augmented_demos: Dict[str, DataClass], demos: Dict[str, DataClass], dataset: List[DataClass], raw_shots: int, bootstrap_shots: int, weighted: bool = True)[source]

Performs weighted sampling; ensure the score is in the range [0, 1]. A higher score means better accuracy.

static samples_to_str(samples: List[DataClass], augmented: bool = False, exclude_inputs: bool = False) str[source]
propose()[source]

Propose a new value while keeping the previous value saved on the parameter.

revert()[source]

Revert to the previous value when the evaluation is worse.

step()[source]

Discard the previous value and keep the proposed value.

class TGDOptimizer(params: Iterable[Parameter] | Iterable[Dict[str, Any]], model_client: ModelClient, model_kwargs: Dict[str, object] = {}, constraints: List[str] = None, optimizer_system_prompt: str = 'You are an excellent prompt engineer tasked with instruction and demonstration tuning a compound LLM system.\nYour task is to refine a variable/prompt based on feedback from a batch of input data points.\n\nThe variable is either input or output of a functional component where the component schema will be provided.\nIf the same DataID has multiple gradients, it means this component/variable is called multiple times in the compound system(with a cycle) in the same order as it appears in the gradient list.\n\nYou Must edit the current variable with one of the following editing methods.\nYou can not rewrite everything all at once:\n\nYou have Four Editing Methods:\n1. ADD new elements(instruction) to address each specific feedback.\n2. ADD Examples (e.g., input-reasoning-answer) for tasks that require strong reasoning skills.\n3. Rephrase existing instruction(for more clarity), Replace existing sample with another, to address the feedback.\n4. DELETE unnecessary words to improve clarity.\n\nThese SIX prompting techniques can be a helpful direction.\n1. Set Context and Role: Establish a specific identity or domain expertise for the AI to guide style, knowledge, and constraints.\n2. Be Specific, Clear, and Grammarly correct: Clearly define instructions, desired format, and constraints to ensure accurate and relevant outputs with regards to the feedback.\n3. Illicit reasoning: "chain-of-thought" (e.g. "think step by step") helps the model reason better.\n4. Examples: Construct examples(e.g., input(optional)-reasoning(required)-answer) especially for tasks that require strong reasoning skills.\n5. Leverage Constraints and Formatting: Explicitly direct how the answer should be structured (e.g., bullet points, tables, or tone).\n6. Self-Consistency / Verification Prompts: Prompt the model to check its own logic for errors, inconsistencies, or missing details.\n\nYour final action/reasoning  = one of FOUR editing method + one of SIX prompting technique.\n\nYou must stick to these instructions:\n1. **MUST Resolve concerns raised in the feedback** while preserving the positive aspects of the original variable.\n2. **Observe past performance patterns** to retain good qualities in the variable and past failed ones to try things differently.\n3. **System Awareness**: When other system variables are given, ensure you understand how this variable works in the whole system.\n4. **Peer Awareness**: This variable works together with Peer variables, ensure you are aware of their roles and constraints.\n5. **Batch Awareness**: You are optimizing a batch of input data, ensure the change applys to the whole batch (except while using demonstration.)\n\n{{output_format_str}}\n\n{% if instruction_to_optimizer %}\n**Additional User Instructions**: {{instruction_to_optimizer}}\n{% endif %}\n', in_context_examples: List[str] = None, max_past_history: int = 3, max_failed_proposals: int = 5, steps_from_last_improvement: int = 0, one_parameter_at_a_time: bool = True)[source]

Bases: TextOptimizer

Textual Gradient Descent (LLM) optimizer for text-based variables.
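A minimal usage sketch, assuming a PROMPT Parameter and a model client are already defined (normally the Trainer constructs and drives this optimizer):

optimizer = TGDOptimizer(
    params=[system_prompt],             # Parameters of type ParameterType.PROMPT
    model_client=OpenAIClient(),        # assumed client choice
    model_kwargs={"model": "gpt-4o"},   # assumed model
)

optimizer.propose()   # ask the LLM for an improved prompt, stored as a proposal
# ... run validation with the proposed prompt ...
optimizer.step()      # accept it, or optimizer.revert() to roll back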

proposing: bool = False
params_history: Dict[str, List[HistoryPrompt]] = {}
failed_proposals: Dict[str, List[HistoryPrompt]] = {}
current_tgd_output: Dict[str, TGDData | None] = {}
params: Iterable[Parameter] | Iterable[Dict[str, Any]]
constraints: List[str]
one_parameter_at_a_time: bool
property constraint_text

Returns a formatted string representation of the constraints.

Returns:

A string containing the constraints in the format “Constraint {index}: {constraint}”.

Return type:

str

increment_steps_from_last_improvement()[source]
reset_steps_from_last_improvement()[source]
add_score_to_params(val_score: float)[source]
add_score_to_current_param(param_id: str, param: Parameter, score: float)[source]
add_history(param_id: str, history: HistoryPrompt)[source]
render_history(param_id: str) List[str][source]
add_failed_proposal()[source]

Save a copy of the current value of the parameter in the failed proposals.

render_failed_proposals(param_id: str) List[str][source]
update_gradient_memory(param: Parameter)[source]
zero_grad()[source]

Clear all the gradients of the parameters.

set_target_param()[source]
propose()[source]

Propose a new value while keeping the previous value saved on the parameter.

revert()[source]

Revert to the previous value when the evaluation is worse.

step()[source]

Discard the previous value and keep the proposed value.

to_dict()[source]
class EvalFnToTextLoss(eval_fn: Callable | BaseEvaluator, eval_fn_desc: str, backward_engine: BackwardEngine | None = None, model_client: ModelClient = None, model_kwargs: Dict[str, object] = None)[source]

Bases: LossComponent

Convert an evaluation function to a text loss.

LossComponent takes an eval function and outputs a score (usually a float in the range [0, 1], the higher the better, unlike the loss function in model training).

In math:

score/loss = eval_fn(y_pred, y_gt)

The gradient/feedback = d(score)/d(y_pred) will be computed using a backward engine.

Gradient_context = GradientContext(
    context=conversation_str,
    response_desc=response.role_desc,
    variable_desc=role_desc,
)

Parameters:
  • eval_fn – The evaluation function that takes a pair of y and y_gt and returns a score.

  • eval_fn_desc – Description of the evaluation function.

  • backward_engine – The backward engine to use for the text prompt optimization.

  • model_client – The model client to use for the backward engine if backward_engine is not provided.

  • model_kwargs – The model kwargs to use for the backward engine if backward_engine is not provided.
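A minimal sketch of wrapping an exact-match evaluator as a text loss (the evaluator import path and its compute_single_item method are assumptions):

from adalflow.eval.answer_match_acc import AnswerMatchAcc  # assumed evaluator

eval_fn = AnswerMatchAcc(type="exact_match").compute_single_item
loss_fn = EvalFnToTextLoss(
    eval_fn=eval_fn,
    eval_fn_desc="exact_match: 1 if str(y) == str(y_gt) else 0",
)
# later, inside AdalComponent.prepare_loss:
#   return loss_fn, {"kwargs": {"y": pred_param, "y_gt": gt_param}}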

forward(kwargs: Dict[str, Parameter], response_desc: str = None, metadata: Dict[str, str] = None, id: str = None, gt: object = None, input: Dict[str, object] = None) Parameter[source]
Parameters:
  • kwargs – The inputs to the eval_fn.

  • response_desc – Description of the output.

  • metadata – Additional notes on the input kwargs.

  • id – The unique identifier for the data point.

  • gt – The ground truth for the evaluation function.

set_backward_engine(backward_engine: BackwardEngine = None, model_client: ModelClient = None, model_kwargs: Dict[str, object] = None)[source]
backward(response: Parameter, eval_fn_desc: str, kwargs: Dict[str, Parameter], ground_truth: object = None, backward_engine: BackwardEngine | None = None, metadata: Dict[str, str] = None, input: Dict[str, object] = None, disable_backward_engine: bool = False)[source]

Ensure backward_engine is set for text prompt optimization. It can be None if you are only doing demo optimization; in that case no gradients are generated and only the score is backpropagated.

class LLMAsTextLoss(prompt_kwargs: Dict[str, str | Parameter], model_client: ModelClient, model_kwargs: Dict[str, object])[source]

Bases: LossComponent

Evaluate the final RAG response using an LLM judge.

The LLM judge will have: - eval_system_prompt: the system prompt to evaluate the response. - y_hat: the response to evaluate. - Optional: y: the correct response to compare against.

The loss will be a Parameter holding the evaluation result and can be used to compute gradients. This loss uses an LLM/Generator as the computation/transformation operator, so its gradient will be computed by the Generator's backward method.

forward(*args, **kwargs) Parameter[source]

Default just wraps the call method.

class Trainer(adaltask: AdalComponent, optimization_order: Literal['sequential', 'mix'] = 'sequential', strategy: Literal['random', 'constrained'] = 'constrained', max_steps: int = 1000, train_batch_size: int | None = 4, num_workers: int = 4, ckpt_path: str = None, batch_val_score_threshold: float | None = 1.0, correct_val_score_threshold: float | None = 0.5, max_error_samples: int | None = 2, max_correct_samples: int | None = 2, max_proposals_per_step: int = 5, train_loader: Any | None = None, train_dataset: Any | None = None, val_dataset: Any | None = None, test_dataset: Any | None = None, raw_shots: int | None = None, bootstrap_shots: int | None = None, weighted_sampling: bool = False, exclude_input_fields_from_bootstrap_demos: bool = False, debug: bool = False, save_traces: bool = False, sequential_order: List[str] = ['text', 'demo'], skip_subset_val: bool = False, disable_backward_gradients: bool = False, disable_backward: bool = False, text_optimizers_config_kwargs: Dict[str, Any] | None = {}, *args, **kwargs)[source]

Bases: Component

Ready-to-use trainer for LLM task pipelines to optimize all types of parameters.

Training set: used for passing the initially proposed prompts or for few-shot sampling. Validation set: used to select the final prompt or samples. Test set: used to evaluate the final prompt or samples.

Parameters:
  • adaltask – AdalComponent: AdalComponent instance

  • strategy – Literal[“random”, “constrained”]: Strategy to use for the optimizer

  • max_steps – int: Maximum number of steps to run the optimizer

  • num_workers – int: Number of workers to use for parallel processing

  • ckpt_path – str: Path to save the checkpoint files, default to ~/.adalflow/ckpt.

  • batch_val_score_threshold – Optional[float]: Threshold for skipping a batch

  • max_error_samples – Optional[int]: Maximum number of error samples to keep

  • max_correct_samples – Optional[int]: Maximum number of correct samples to keep

  • max_proposals_per_step – int: Maximum number of proposals to generate per step

  • train_loader – Any: DataLoader instance for training

  • train_dataset – Any: Training dataset

  • val_dataset – Any: Validation dataset

  • test_dataset – Any: Test dataset

  • few_shots_config – Optional[FewShotConfig]: Few shot configuration

  • save_traces – bool: Save traces for synthetic data generation or debugging

  • debug – bool: Run the trainer in debug mode. If debug is True, the text-gradient graph for prompt parameters will be saved under /ckpt/YourAdalComponentName/debug_text_grads, and the demo graph for demo parameters under /ckpt/YourAdalComponentName/debug_demos.

Note

When you are in debug mode, you can use the get_logger API to show more detailed logs.

Example:

from adalflow.utils import get_logger

get_logger(level="DEBUG")

optimizer: Optimizer = None
ckpt_file: str | None = None
random_seed: int = None
optimization_order: Literal['sequential', 'mix'] = 'sequential'
strategy: Literal['random', 'constrained']
max_steps: int
ckpt_path: str | None = None
adaltask: AdalComponent
num_workers: int = 4
train_loader: Any
val_dataset = None
test_dataset = None
batch_val_score_threshold: float | None = 1.0
correct_val_score_threshold: float | None = 0.5
max_error_samples: int | None = 2
max_correct_samples: int | None = 2
max_proposals_per_step: int = 5
train_batch_size: int | None = 4
debug: bool = False
sequential_order: List[str] = ['text', 'demo']
skip_subset_val: bool = False
disable_backward_gradients: bool = False
disable_backward: bool = False
text_optimizers_config_kwargs: Dict[str, Any] | None = {}
set_random_seed(seed: int)[source]
diagnose(dataset: Any, split: str = 'train', resume_from_ckpt: str = None)[source]

Run an evaluation on the train set to track all error responses and their raw responses, using AdalComponent's default configure_callbacks.

Parameters:
  • dataset (Any) – Dataset to evaluate.

  • split (str) – Split name, defaults to "train"; it is also used as the directory name for saving the logs.

Example:

trainset, valset, testset = load_datasets(max_samples=10)
adaltask = TGDWithEvalFnLoss(
    task_model_config=llama3_model,
    backward_engine_model_config=llama3_model,
    optimizer_model_config=llama3_model,
)

trainer = Trainer(adaltask=adaltask)
diagnose = trainer.diagnose(dataset=trainset)
print(diagnose)
diagnose_report(split: str, acc_score: float | None = None, stats_list: List[Dict] | None = None, log_paths: Dict[str, List[str]] | None = None)[source]
debug_report(text_grad_debug_path: Dict[str, object] | None = None, few_shot_demo_debug_path: Dict[str, object] | None = None)[source]
resume_params_from_ckpt(ckpt_file: str)[source]

Resume the parameters from the checkpoint file

fit(*, adaltask: AdalComponent | None = None, train_loader: Any | None = None, train_dataset: Any | None = None, val_dataset: Any | None = None, test_dataset: Any | None = None, debug: bool = False, save_traces: bool = False, raw_shots: int | None = None, bootstrap_shots: int | None = None, resume_from_ckpt: str | None = None, backward_pass_setup: BackwardPassSetup | None = None) Tuple[str, TrainerResult][source]

train_loader: An iterable or collection of iterables specifying training samples.

Returns:

Checkpoint file and the TrainerResult object

Return type:

Tuple[str, TrainerResult]
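A minimal sketch of a typical fit call (the AdalComponent instance and the datasets are assumed to be defined elsewhere):

trainer = Trainer(
    adaltask=adal_component,       # your AdalComponent subclass instance
    strategy="constrained",
    max_steps=12,
    train_batch_size=4,
)

ckpt_file, results = trainer.fit(
    train_dataset=trainset,
    val_dataset=valset,
    test_dataset=testset,
)
print(ckpt_file)                   # path to the saved checkpoint / TrainerResult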

initial_validation(val_dataset: Any, test_dataset: Any)[source]
gather_trainer_states()[source]
prep_ckpt_file_path(trainer_state: Dict[str, Any] = None)[source]

Prepare the checkpoint root path: ~/.adalflow/ckpt/task_name/.

It also generates a unique checkpoint file name based on the strategy, max_steps, and a unique hash key. For multiple runs but with the same adalcomponent + trainer setup, the run number will be incremented.

class AdalComponent(task: Component, eval_fn: Callable | None = None, loss_eval_fn: Callable | None = None, loss_fn: LossComponent | None = None, backward_engine: BackwardEngine | None = None, backward_engine_model_config: Dict | None = None, teacher_model_config: Dict | None = None, text_optimizer_model_config: Dict | None = None, *args, **kwargs)[source]

Bases: Component

Define a train, eval, and test step for a task pipeline.

This serves the following purposes: 1. Organize all parts for training a task pipeline in one place. 2. Help with debugging and testing before the actual training. 3. Add multi-threading support for training and evaluation.

It does not itself need call, forward, bicall, or __call__, so the base implementations are overwritten.

task: Component
eval_fn: Callable | None
loss_eval_fn: Callable | None
loss_fn: LossComponent | None
backward_engine: BackwardEngine | None
prepare_task(sample: Any, *args, **kwargs) Tuple[Callable, Dict][source]

Tell Trainer how to call the task in both training and inference mode.

Return a task call and kwargs for one training sample.

If you just need to eval, ensure the Callable has the inference mode. If you also need to train, ensure the Callable has the training mode, which returns a Parameter and mainly calls forward for all subcomponents within the task.

Example:

def prepare_task(self, sample: Any, *args, **kwargs) -> Tuple[Callable, Dict]:
    return self.task, {"x": sample.x}
prepare_loss(sample: Any, y_pred: Parameter, *args, **kwargs) Tuple[Callable, Dict][source]

Tell Trainer how to calculate the loss in the training mode.

Return a loss call and kwargs for one loss sample.

Ensure y_pred is a Parameter, and that the real input used for y_gt and y_pred is eval_input. Make sure it is set up.

Example:

# "y" and "y_gt" are arguments needed
#by the eval_fn inside of the loss_fn if it is a EvalFnToTextLoss

def prepare_loss(self, sample: Example, pred: adal.Parameter) -> Dict:
    # prepare gt parameter
    y_gt = adal.Parameter(
        name="y_gt",
        data=sample.answer,
        eval_input=sample.answer,
        requires_opt=False,
    )

    # pred's full_response is the output of the task pipeline which is GeneratorOutput
    pred.eval_input = pred.full_response.data
    return self.loss_fn, {"kwargs": {"y": y_gt, "y_pred": pred}}
prepare_eval(sample: Any, y_pred: Any, *args, **kwargs) float[source]

Tell Trainer how to eval in inference mode. Return the eval_fn and kwargs for one evaluation sample.

Ensure the eval_fn is a callable that takes the predicted output and the ground truth output. Ensure the kwargs are set up correctly.
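For instance, with an eval_fn that takes y and y_gt, a sketch of prepare_eval could look like this (the sample fields and the GeneratorOutput handling are assumptions):

def prepare_eval(self, sample: Example, y_pred: GeneratorOutput) -> Tuple[Callable, Dict]:
    y_label = ""
    if y_pred is not None and y_pred.data is not None:
        y_label = y_pred.data
    return self.eval_fn, {"y": y_label, "y_gt": sample.answer}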

prepare_loss_eval(sample: Any, y_pred: Any, *args, **kwargs) float[source]

Tell Trainer how to eval in inference mode. Return the eval_fn and kwargs for one evaluation sample.

Ensure the eval_fn is a callable that takes the predicted output and the ground truth output. Ensure the kwargs are set up correctly.

configure_optimizers(*args, **text_optimizer_kwargs) List[Optimizer][source]

Note: when you use a text optimizer, ensure you also call configure_backward_engine.

configure_backward_engine(*args, **kwargs)[source]

Configure a backward engine for all GradComponent in the task for bootstrapping examples.

disable_backward_engine()[source]

Disable the backward engine for all GradComponents in the task. No more gradients will be generated.

evaluate_samples(samples: Any, y_preds: List, metadata: Dict[str, Any] | None = None, num_workers: int = 2, use_loss_eval_fn: bool = False) EvaluationResult[source]

Evaluate predictions against the ground truth samples. Run evaluation on samples using parallel processing. Utilizes prepare_eval defined by the user.

Metadata is used for storing context that you can find in the generator input.

Parameters:
  • samples (Any) – The input samples to evaluate.

  • y_preds (List) – The predicted outputs corresponding to each sample.

  • metadata (Optional[Dict[str, Any]]) – Optional metadata dictionary.

  • num_workers (int) – Number of worker threads for parallel processing.

Returns:

An object containing the average score and per-item scores.

Return type:

EvaluationResult

pred_step(batch, batch_idx, num_workers: int = 2, running_eval: bool = False, min_score: float | None = None, use_loss_eval_fn: bool = False) Tuple[List[Parameter], List, Dict[int, float]][source]

Applies only to the eval mode.

Parameters:
  • batch (Any) – The input batch to predict.

  • batch_idx (int) – The index of the batch.

  • num_workers (int) – Number of worker threads for parallel processing.

  • running_eval (bool, optional) – Defaults to False.

Returns:

The predicted outputs, the samples, and the scores.

Return type:

Tuple[List[“Parameter”], List, Dict[int, float]]

train_step(batch, batch_idx, num_workers: int = 2) List[source]

Run a training step and return the predicted outputs. Likely a list of Parameters.

validate_condition(steps: int, total_steps: int) bool[source]

By default, the trainer will validate at every step.

validation_step(batch, batch_idx, num_workers: int = 2, minimum_score: float | None = None, use_loss_eval_fn: bool = False) EvaluationResult[source]
Parameters:
  • batch (Any) – The input batch to validate, can be a whole dataset

  • batch_idx (int) – The index of the batch, or the current step.

  • num_workers (int) – Number of worker threads for parallel processing.

  • minimum_score (Optional[float]) – The maximum potential score must be larger than this value to continue evaluating.

Evaluate a batch, or the whole validation dataset by setting batch=val_dataset. Uses self.eval_fn to evaluate the samples. If you require self.task.eval() to be called before validation, you can override this method as:

def validation_step(self, batch, batch_idx, num_workers: int = 2) -> List:
    self.task.eval()
    return super().validation_step(batch, batch_idx, num_workers)
loss_step(batch, y_preds: List[Parameter], batch_idx, num_workers: int = 2) List[Parameter][source]

Calculate the loss for the batch.

configure_teacher_generator()[source]

Configure a teacher generator for all generators in the task for bootstrapping examples.

You can call configure_teacher_generator_helper to easily configure it by passing the model_client and model_kwargs.

configure_teacher_generator_helper(model_client: ModelClient, model_kwargs: Dict[str, Any], template: str | None = None)[source]

Configure a teacher generator for all generators in the task for bootstrapping examples.

disable_backward_engine_helper()[source]

Disable the backward engine for all gradcomponents in the task.

configure_backward_engine_helper(model_client: ModelClient, model_kwargs: Dict[str, Any], template: str | None = None, backward_pass_setup: BackwardPassSetup | None = None)[source]

Configure a backward engine for all generators in the task for bootstrapping examples.

configure_callbacks(save_dir: str | None = 'traces', *args, **kwargs) List[str][source]

By default we configure the failure generator callback. Users can override this method to add more callbacks.

run_one_task_sample(sample: Any) Any[source]

Run one training sample. Used for debugging and testing.

run_one_loss_sample(sample: Any, y_pred: Any) Any[source]

Run one loss sample. Used for debugging and testing.

configure_demo_optimizer_helper() List[DemoOptimizer][source]

One demo optimizer can handle multiple demo parameters, but the demo optimizer will only have one dataset (the train set) configured by the Trainer.

If users want to use a different train set for different demo optimizers, they can configure it themselves.

configure_text_optimizer_helper(model_client: ModelClient, model_kwargs: Dict[str, Any], **kwargs) List[TextOptimizer][source]

The text optimizer handles the prompt parameter type. One text optimizer can handle multiple text parameters.
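A sketch of composing both helpers inside configure_optimizers (reading the model configs from the constructor arguments is an assumption):

def configure_optimizers(self, *args, **kwargs) -> List[Optimizer]:
    text_optimizers = self.configure_text_optimizer_helper(
        model_client=self.text_optimizer_model_config["model_client"],
        model_kwargs=self.text_optimizer_model_config["model_kwargs"],
    )
    demo_optimizers = self.configure_demo_optimizer_helper()
    return text_optimizers + demo_optimizers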

bicall(*args, **kwargs)[source]

If the user provides a bicall method, then __call__ will automatically dispatch here for both training and inference scenarios. This can internally decide how to handle training vs. inference, or just produce a single unified output type.

call(*args, **kwargs)[source]

User must override this for the inference scenario if bicall is not defined.

forward(*args, **kwargs)[source]

User must override this for the training scenario if bicall is not defined.

class DemoOptimizer(weighted: bool = True, dataset: Sequence[DataClass] = None, exclude_input_fields_from_bootstrap_demos: bool = False, *args, **kwargs)[source]

Bases: Optimizer

Base class for all demo optimizers.

Demo optimizers perform few-shot optimization: they sample raw examples from the train dataset or bootstrap examples from the model's output. They work with a sampler to generate new values for a given text prompt.

If bootstrapping is used, a teacher generator is required to generate the examples.

dataset: Sequence[DataClass]
exclude_input_fields_from_bootstrap_demos: bool = False
use_weighted_sampling(weighted: bool)[source]
config_shots(*args, **kwargs)[source]

Initialize the samples for each parameter.

set_dataset(dataset: Sequence[DataClass])[source]

Set the dataset for the optimizer.

add_scores(ids: List[str], scores: List[float], *args, **kwargs)[source]

Add scores to the optimizer.

class TextOptimizer(*args, **kwargs)[source]

Bases: Optimizer

Base class for all text optimizers.

The text optimizer works via textual gradient descent, a variant of gradient descent that optimizes the text directly. It generates new values for a given text prompt. This includes: - system prompt - output format - prompt template.

zero_grad()[source]

Clear all the gradients of the parameters.

class Gradient(*, from_response: Parameter, to_pred: Parameter, id: str | None = None, score: float | None = None, data_id: str | None = None, data: Any = None)[source]

Bases: DataClass

It handles gradients and feedback.

It tracks the d_from_response_id / d_to_pred_id and the score of the whole response.

If two gradients have the same data_id, a different from_response_id, and the same from_response_component_id, this indicates a cyclic component structure.

context: GradientContext = None
prompt: str | None = None
is_default_copy: bool = False
from_response_component_id: str = None
from_response_id: str = None
to_pred_id: str = None
score: float | None = None
data_id: str | None = None
data: Any = None
order: int | None = None
add_context(context: GradientContext)[source]
add_data(data: Any)[source]
update_from_to(from_response: Parameter, to_pred: Parameter)[source]
add_prompt(prompt: str)[source]
class GradientContext(variable_desc: str, input_output: str, response_desc: str)[source]

Bases: DataClass

GradientContext is used to describe the component’s function and trace its input and output.

To get the component's function description, use GradientContext.to_yaml_signature(). To get the data, use instance.to_yaml().

variable_desc: str
input_output: str
response_desc: str