-
-
Notifications
You must be signed in to change notification settings - Fork 201
Feature/synthetic data generation #298
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
HeerakKashyap
wants to merge
10
commits into
nidhaloff:master
Choose a base branch
from
HeerakKashyap:feature/synthetic-data-generation
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Feature/synthetic data generation #298
HeerakKashyap
wants to merge
10
commits into
nidhaloff:master
from
HeerakKashyap:feature/synthetic-data-generation
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
- Implement Model-Agnostic Meta-Learning (MAML) classifier - Add Prototypical Networks for few-shot learning - Create domain adaptation utilities with fine-tuning and MAML methods - Add transfer learning capabilities with feature extraction and fine-tuning - Include utility functions for creating and evaluating few-shot tasks - Add CLI commands: few-shot-learn, domain-adapt, transfer-learn - Update models_dict to include few-shot learning algorithms - Add few_shot_learning as supported model type - Create comprehensive documentation and examples - Add complete test suite for all few-shot learning components - Update README with new features and model table This addresses GitHub issue nidhaloff#237 'Add Support for Few-Shot Learning'
…em (Issue nidhaloff#233) - Implement MLflow-like experiment tracking with ExperimentTracker class - Add model versioning with lineage tracking and metadata management - Create experiment visualization and analysis capabilities - Include SQLite database for experiment and model metadata storage - Add support for metric tracking, parameter logging, and model logging - Implement experiment comparison and visualization tools - Add model lineage visualization and deployment tracking - Include interactive Plotly dashboards for experiment analysis - Support for experiment export and model version management - Add comprehensive documentation and examples This addresses GitHub issue nidhaloff#233 'Create Model Versioning and Experiment Tracking'
- Add SyntheticDataGenerator class for creating test datasets - Support for classification and regression data generation - Quick function for generating sample datasets - Addresses GitHub issue nidhaloff#285 - Add Support for Synthetic Data Generation
69fc9b0 to
7550db2
Compare
HeerakKashyap
pushed a commit
to HeerakKashyap/igel
that referenced
this pull request
Sep 17, 2025
- Add AdvancedFeatureEngineer class with comprehensive feature engineering capabilities - Implement polynomial features, interaction terms, binning, and encoding - Add statistical features with rolling windows and cluster-based features - Include automated feature selection with multiple methods (mutual_info, f_score, pca) - Add comprehensive feature engineering pipeline with configurable options - Include example configuration file and comprehensive demo script - Add test suite for feature engineering functionality - Update package imports to include feature engineering modules - Addresses GitHub issue nidhaloff#298 - Implement Advanced Feature Engineering
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR adds a new module for generating synthetic datasets and a corresponding CLI command, addressing Issue #285.
Key Features
Tabular data generation for classification/regression tasks
Time series data generation with trend and seasonality options
Categorical data generation with configurable categories
Flexible CLI interface with multiple data type options
Usage
Closes #285