RankLLM Release Notes (v0.25.0)
- Release date: July 23, 2025
- Pyserini dependency: v1.2.0
Summary of Changes
- Added support for prompt templates using yaml files. The current default templates can be found here
- After this change, prompts for model coordinators are entirely generated based on the prompt templates so prompt_mode is no longer needed and will be deprecated in v0.30.0 (prompt_mode has been changed to an optional argument for creating model coordinators and analyzing responses)
- Added support for thinking/reasoning models
- The thinking traces for reasoning models can now be toggled off by setting the
is_thinking
argument to True
- A budget for the reasoning token usage can now be specified via the
reasoning_token_budget
argument
- Extended the support for adding few-shot examples to more model coordinators
- For supported model coordinators, the number of few-shot examples to use can be specified via the
num_few_shot_examples
argument and a json file containing the few-shot examples can be used via the few_shot_file
argument
- Improved/added test coverage for all new features and added regression tests to check for silent errors
- Updated documentations:
- Added all of the results from the rankllm paper to 2CR pages
- Updated training python dependencies to 3.10 and moved them to optional dependencies in pyproject, created conda config file to install environment and dependencies
- Added documentation with instructions on using rankllm with external integrations
- Other QoL changes/bug fixes:
- Updated and optimized first stage corpus caching logic and migrated commonly used corpus to HuggingFace Datasets repo and added support for new indices and topics that’s supported by Pyserini
- Added ability to save the model’s inference history via
populate_invocations_history
argument by running CLI command
- Added ability to choose the number of queries to run via
max_queries
argument so tests/experiments can be ran faster with fewer queries
- Bug fix: improved clean response function to effectively remove thinking traces as well as better handling of “fake” digits
- Bug fix: added chat template to rank_vicuna’s tokenizer
- Bug fix: fixed rank_fid’s LLM generation error with newer versions of Huggingface transformers (v4.50+)
Contributors
This Release
Sorted by number of commits:
All Time
All contributors with 3 or more commits, sorted by number of commits, according to GitHub: