cross_entropy calc #18

DanielVyazhev · 2025-08-02T13:11:32Z

Cross-entropy calculation scripts and folder/dataset

aigoncharov · 2025-08-02T16:15:01Z

src/reasoning_fine_tune/complexity_estimation/entropy/estimate_cross_entropy.py

+from reasoning_fine_tune.utils.seed import set_seed
+from reasoning_fine_tune.utils.validation import validate_mmlu_answer
+
+def estimate_dataset(


Would we want to try parameterizing the function that computes single-token entropy? It seems that if we teach it to accept compute_entropy_from_logits from the outside, we can avoid duplicating code. What do you think?

https://github.com/LabARSS/reasoning-fine-tune/blob/63b070b1b54af534ad1a30f5e2ed34a887a1a8bb/src/reasoning_fine_tune/complexity_estimation/entropy/estimate_single_token_entropy.py#L122

It could then also allow custom field names to be set: field_ans and so on.
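A minimal sketch of the parameterization suggested above, assuming the scoring function receives the logits for one example plus the dataset row. Apart from `compute_entropy_from_logits` and `field_ans`, which are named in the comment, everything here is hypothetical: the `estimate_dataset` signature, the `get_logits` stand-in for the model forward pass, and the `answer_index` field are illustrations only, not the repository's actual API.

```python
import math
from typing import Any, Callable, Dict, List, Sequence

Row = Dict[str, Any]
# Hypothetical signature: (logits for one example, dataset row) -> scalar score.
ScoreFn = Callable[[Sequence[float], Row], float]


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a flat list of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def shannon_entropy(logits: Sequence[float], row: Row) -> float:
    """Entropy (in nats) of the predicted distribution; the row is unused."""
    return -sum(p * math.log(p) for p in softmax(logits) if p > 0.0)


def cross_entropy(logits: Sequence[float], row: Row) -> float:
    """Negative log-probability of the gold answer.

    Assumes a hypothetical `answer_index` field holding the gold token index.
    """
    return -math.log(softmax(logits)[row["answer_index"]])


def estimate_dataset(
    rows: List[Row],
    get_logits: Callable[[Row], Sequence[float]],  # stand-in for the model call
    compute_entropy_from_logits: ScoreFn,
    field_ans: str = "entropy_ans",
) -> List[Row]:
    """Score every row with the injected function and store it under field_ans."""
    results = []
    for row in rows:
        scored = dict(row)  # copy so the input rows are not mutated
        scored[field_ans] = compute_entropy_from_logits(get_logits(row), row)
        results.append(scored)
    return results


# Usage: the same loop serves both metrics; only the callable and field differ.
rows = [{"question": "q1", "answer_index": 0}]
uniform_logits = lambda row: [0.0, 0.0]
entropy_rows = estimate_dataset(rows, uniform_logits, shannon_entropy)
ce_rows = estimate_dataset(
    rows, uniform_logits, cross_entropy, field_ans="cross_entropy_ans"
)
```

With this shape, the cross-entropy script would reduce to passing a different callable and field name instead of copying the estimation loop.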

aigoncharov · 2025-08-02T16:18:35Z

src/reasoning_fine_tune/complexity_estimation/entropy/estimate_cross_entropy.py

+                    return_dict=True
+                )
+
+        last_token_logits = outputs.logits[:, -1,:]  # [batch_size, vocab_size]


This is a really nice find! We should move the single token entropy calculation over to outputs.logits as well.
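For reference, a hedged sketch of what computing single-token entropy from `outputs.logits` could look like. The function name `last_token_entropy` is hypothetical, and the sketch assumes the last position of each sequence holds a real token (for example, left-padded batches); with right padding, position `-1` may be a pad token and the slice would need adjusting.

```python
import torch


def last_token_entropy(logits: torch.Tensor) -> torch.Tensor:
    """Entropy (in nats) of the next-token distribution at the last position.

    logits: [batch_size, seq_len, vocab_size], e.g. outputs.logits
    returns: [batch_size]
    """
    last_token_logits = logits[:, -1, :]  # [batch_size, vocab_size]
    # log_softmax is more stable than softmax followed by log.
    log_probs = torch.log_softmax(last_token_logits, dim=-1)
    return -(log_probs.exp() * log_probs).sum(dim=-1)


# Usage: uniform logits over 4 tokens give entropy log(4) for every sequence.
demo_logits = torch.zeros(2, 3, 4)
demo_entropy = last_token_entropy(demo_logits)
```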

aigoncharov · 2025-08-02T16:21:35Z

Will you do the split for Phi as well?

yndx-vyazhev

add data splits and trainer_state

cross_entropy calc

8e96977

DanielVyazhev requested a review from aigoncharov August 2, 2025 13:11

aigoncharov reviewed Aug 2, 2025

aigoncharov self-assigned this Aug 2, 2025

aigoncharov approved these changes Aug 2, 2025

add data and trainer_state

253b7e7

yndx-vyazhev reviewed Aug 8, 2025

aigoncharov approved these changes Aug 8, 2025
