Skip to content

Code Refactoring #2

@purvapruthi

Description

@purvapruthi

Refactor and modularize data generation and evaluation functionalities.

General

  • Make config variable and argument parser compatibility less verbose and smoother.

Data Generation

  • Split module function definitions, creation, application, and train/test split of compositions into separate files.
  • Split the synthetic data generator and loader functionalities.
  • Move equivalence class analysis, composition equivalence metric to the analysis folder.
  • Remove variable prompt length functionality.

Training

  • Remove unused functions in training/trainer.py.
  • Reduce argument loading/config in training/finetuning.py.

Evaluation

  • Reduce the verbosity of evaluation/fixed_evaluator.py.
  • Move plots, representation analysis, equivalence class analysis, and composition equivalence metric calculation in the analysis folder.

Analysis

  • Data-based composition equivalence calculation (CE metric).
  • Model-based equivalence class analysis.
  • Representation analysis.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions