artefactual.calibration.rates_answers#
Functions
|
Rate answers generated by a model using a judge LLM. |
Classes
|
Configuration for answer rating. |
|
Model for a single result item in the input JSON. |
- class artefactual.calibration.rates_answers.RatingConfig(**data)[source]#
Bases:
BaseModelConfiguration for answer rating.
- model_config: ClassVar[ConfigDict] = {}#
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- class artefactual.calibration.rates_answers.ResultItem(**data)[source]#
Bases:
BaseModelModel for a single result item in the input JSON.
- model_config: ClassVar[ConfigDict] = {}#
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- artefactual.calibration.rates_answers.rate_answers(config)[source]#
Rate answers generated by a model using a judge LLM.
- Return type:
DataFrame
- Args:
config (RatingConfig): Configuration for the rating process.
- Returns:
pd.DataFrame: DataFrame containing uncertainty scores and judgments, indexed by query_id.