Tool call testing by alay2shah · Pull Request #19 · Liquid4All/leap-finetune

alay2shah · 2026-03-30T16:33:30Z

This PR adds tool call format guardrails to the training pipeline.

Detects LFM2, LFM2.5, structured (OpenAI), and foreign formats from a sample of the data
Auto-strips <|tool_response_start|> from role=tool messages to prevent double wrapping (the LFM2 template adds these itself)
Auto-strips <|tool_list_start|> when training LFM2.5 models
Auto-converts OpenAI-style tool_calls fields to pre-baked bracket notation
Hard errors on foreign formats (<tool_call> XML etc.) with a message telling you exactly what to do
Warns on format mismatches it can't auto-fix (e.g. missing <|tool_list_start|> for LFM2)
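As a rough illustration of the detection step listed above, the sketch below scans a sample of conversations for marker tokens and structured fields. The function name `detect_tool_format`, the returned labels, and the `messages`-style row shape are assumptions made for illustration; the PR's actual detection logic is not reproduced here and likely distinguishes more cases (for example LFM2 vs LFM2.5).

```python
from typing import Any

# Markers taken from the PR description; the classification rules are assumed.
FOREIGN_MARKERS = ("<tool_call>",)        # XML-style calls are rejected
TOOL_LIST_MARKER = "<|tool_list_start|>"  # expected in LFM2-style data


def detect_tool_format(samples: list[list[dict[str, Any]]]) -> str:
    """Classify sampled conversations as 'foreign', 'structured', 'lfm2', or 'unknown'."""
    saw_structured = False
    saw_tool_list = False
    for messages in samples:
        for msg in messages:
            content = msg.get("content")
            if not isinstance(content, str):
                content = ""
            if any(marker in content for marker in FOREIGN_MARKERS):
                # A foreign format anywhere in the sample is treated as a hard error.
                return "foreign"
            if msg.get("tool_calls"):
                saw_structured = True
            if TOOL_LIST_MARKER in content:
                saw_tool_list = True
    if saw_structured:
        return "structured"
    if saw_tool_list:
        return "lfm2"
    return "unknown"
```

A caller could raise on `"foreign"`, convert on `"structured"`, and warn when the detected label does not match the model family being trained.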

Pipeline integration is light. The normalization runs as a Ray .map() step between column normalization and filtering — same pattern as existing transforms. Existing files got ~10 lines each. model_name flows from the YAML config through DatasetLoader so the pipeline knows which model family to validate against.
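To make the row-level normalization concrete, here is a minimal sketch of a function that could serve as such a map step. The function name, the `messages` column name, and the `<|tool_response_end|>` closing marker are assumptions for illustration; only `<|tool_response_start|>` is named in the description above.

```python
from typing import Any

TOOL_RESPONSE_START = "<|tool_response_start|>"
TOOL_RESPONSE_END = "<|tool_response_end|>"  # assumed closing marker


def strip_tool_response_wrappers(row: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of the row with wrapper tokens removed from role=tool messages."""
    cleaned = []
    for msg in row["messages"]:
        content = msg.get("content")
        if msg.get("role") == "tool" and isinstance(content, str):
            stripped = (
                content.replace(TOOL_RESPONSE_START, "")
                .replace(TOOL_RESPONSE_END, "")
                .strip()
            )
            # Build a new dict so the input row is left unmodified.
            msg = {**msg, "content": stripped}
        cleaned.append(msg)
    return {**row, "messages": cleaned}
```

With Ray Data, a function of this shape would typically be applied per row via `dataset.map(strip_tool_response_wrappers)`, so the chat template can add the wrapper tokens exactly once.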

Also re-enables preprocess_fn — users can now specify preprocess_fn: "my_module.my_func" in their YAML config for custom dataset transforms.
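One plausible way to turn a dotted string such as "my_module.my_func" into a callable is sketched below. The helper name `resolve_callable` is hypothetical, and the config parser in this PR may resolve the path differently.

```python
import importlib
from typing import Callable


def resolve_callable(dotted_path: str) -> Callable:
    """Import 'package.module.func' and return the function object."""
    module_name, sep, attr = dotted_path.rpartition(".")
    if not sep or not module_name:
        raise ValueError(f"expected 'module.function', got {dotted_path!r}")
    module = importlib.import_module(module_name)
    fn = getattr(module, attr)
    if not callable(fn):
        raise TypeError(f"{dotted_path!r} does not refer to a callable")
    return fn
```

The resolved function would then receive the dataset and return the transformed dataset, matching the contract described in the `preprocess_fn` field comment.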

Rouzbehat78

I added a few edge cases to the pytests where the tool_calls_to_pythonic converter was outputting invalid strings that aren't parsable when a special character appears in the string (", \n, etc.).
I also pushed a small fix: we now use json.dumps for those cases, which makes sure the output strings are parsable.
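The escaping fix described above can be illustrated with a small sketch. The function `tool_call_to_bracket` and its exact output format are assumptions for illustration and are not the repository's `_tool_calls_to_pythonic`; the point is only that rendering string arguments with `json.dumps` escapes quotes, backslashes, and newlines so the result stays parsable.

```python
import ast
import json
from typing import Any


def tool_call_to_bracket(name: str, arguments: dict[str, Any]) -> str:
    """Render one tool call as '[name(key=value, ...)]' with escaped string values."""
    parts = []
    for key, value in arguments.items():
        if isinstance(value, str):
            # json.dumps emits a double-quoted literal with ", \ and newlines escaped.
            rendered = json.dumps(value)
        else:
            rendered = repr(value)
        parts.append(f"{key}={rendered}")
    return f"[{name}({', '.join(parts)})]"


def is_parsable(bracket_call: str) -> bool:
    """Check that the text inside the brackets parses as a single Python call."""
    try:
        node = ast.parse(bracket_call[1:-1], mode="eval").body
    except SyntaxError:
        return False
    return isinstance(node, ast.Call)
```

Naively wrapping a value in quotes (for example with an f-string) breaks as soon as the value itself contains a quote or a newline, which is the kind of edge case the new tests exercise.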

Rouzbehat78 · 2026-04-20T23:15:53Z

    cache_dataset: bool = False
+    # Optional preprocessing function: takes Ray Dataset, returns Ray Dataset
+    # Applied before validation - use for custom filtering, transforms, joins, etc.
+    preprocess_fn: Callable | None = field(default=None, repr=False)


Are we using this anywhere?

Agents sometimes use it, but it's relatively harmless on its own.

alay2shah · 2026-04-21T14:12:21Z

Lgtm, just approve

alay2shah added 10 commits March 30, 2026 00:51

Add test plan for LFM2 vs LFM2.5 tool calling template differences

33c5f91

Add get_model_family() for LFM2 vs LFM2.5 format detection

bf0d92b

Add tool call format detection, validation, and normalization

d4ae828

Add model_name to DatasetLoader and deduplicate quick_validate calls

40d34ce

Add tool call format validation to quick_validate_schema

7347134

Add tool call normalization step to data pipeline

73211c9

Wire model_name and preprocess_fn through config parser

2d1fab3

Add tests for tool call detection, validation, and normalization

e6adcf4

Add tests for model_name passthrough and preprocess_fn config

fff0e39

Add tool calling dataset format docs and table of contents to README

6fa78a3

alay2shah requested a review from Paulescu March 30, 2026 16:33

alay2shah added 2 commits March 30, 2026 16:34

Fix prettier formatting in plan.md

4443baa

Remove plan.md from tracked files

ce28f17

alay2shah requested a review from Rouzbehat78 April 20, 2026 14:13

Rouzbehat78 added 2 commits April 20, 2026 22:44

New edge case test showing that the converter (_tool_calls_to_pythonic) outputs an invalid string that isn't parsable when the arguments include special characters that need escaping with \

335b7dd

Fix OpenAI-format tool calls containing special characters so they are parsable, using the correct escape character \; all unit tests pass now

f882e39

Rouzbehat78 reviewed Apr 20, 2026

linting

3588236

Rouzbehat78 self-requested a review April 21, 2026 17:57

Rouzbehat78 approved these changes Apr 21, 2026

Rouzbehat78 merged commit d017458 into main Apr 21, 2026
1 check passed

Rouzbehat78 deleted the tool-call-testing branch April 21, 2026 17:58

Rouzbehat78 merged 15 commits into main from tool-call-testing


2 participants
