Refactor entropy estimate #24

SemyonEpanov · 2025-09-01T13:12:15Z

Объединяем дублирующийся код для извлечения энтропий CoT и single_token
#discussion_r2249304593

Добавили батчи для CoT
Добавили единый runner (runner.py), стратегии (strategies.py) и общий интерфейс (estimate_entropy.py)
Добавили (на будущее): обработку датасетов при помощи адаптеров

-- Режим определяется флагом --mode (single_token или cot).

aigoncharov · 2025-09-02T09:51:48Z

src/core/complexity_estimation/entropy/estimate_entropy.py

+            adapter.check_answer_correct)
+
+def main():
+    ap = argparse.ArgumentParser()


Let's remove command-line arguments in favor of passing configs as code

aigoncharov · 2025-09-02T09:59:39Z

src/core/complexity_estimation/entropy/estimate_entropy.py

+    ans = (ans or "").strip().upper()
+    return ans[:1]  # берем первую букву
+
+def build_row_funcs_from_columns(


What is the point of separate prompt_builder and build_row_funcs_from_columns? AFAIU, the end goal is to construct a prompt from the row. Shall we combine them in a single function then that for MMLU dataset accepts a row and outputs a prompt?

aigoncharov · 2025-09-02T10:00:24Z

Could you also refactor the existing experiments to see how the code is applied?

aigoncharov · 2025-09-09T07:33:45Z

src/core/complexity_estimation/entropy/strategies.py

+
+from core.prompts.mmlu_cot_answer import answer_marker as COT_MARKERS
+
+def make_prompt_builder(


Where do we still use it?

refactor: unify single-token & CoT entropy estimation

df4397c

SemyonEpanov force-pushed the refactor-entropy-estimate branch from f5c4497 to df4397c Compare September 1, 2025 13:23

aigoncharov reviewed Sep 2, 2025

View reviewed changes

inline estimate entropy functionality

c8fa018

aigoncharov reviewed Sep 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor entropy estimate #24

Refactor entropy estimate #24

Uh oh!

SemyonEpanov commented Sep 1, 2025

Uh oh!

aigoncharov Sep 2, 2025

Uh oh!

aigoncharov Sep 2, 2025

Uh oh!

aigoncharov commented Sep 2, 2025

Uh oh!

aigoncharov Sep 9, 2025

Uh oh!

Uh oh!


		from core.prompts.mmlu_cot_answer import answer_marker as COT_MARKERS

		def make_prompt_builder(

Refactor entropy estimate #24

Are you sure you want to change the base?

Refactor entropy estimate #24

Uh oh!

Conversation

SemyonEpanov commented Sep 1, 2025

Uh oh!

aigoncharov Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

aigoncharov Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

aigoncharov commented Sep 2, 2025

Uh oh!

aigoncharov Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!