Learning new tasks without new data.
This project explores how machine learning models can themselves be generated by other models, without direct task-specific supervision. The core idea is unconventional but powerful: using generative models to create new classifiers by fusing existing ones.
Specifically, we leverage CycleGANs to merge the learned feature representations of two independently trained Convolutional Neural Networks (CNNs):
- One CNN trained to recognize cats
- One CNN trained to recognize the color black
By translating and combining their feature spaces, we generate a new CNN capable of detecting black cats, without ever being trained on a single black cat image.
Result:
The generated model achieves 88% classification accuracy on black cat detection, despite zero direct exposure to black cat data.
This work opens new directions for:
- Learning under data scarcity
- Automated model generation
- Knowledge transfer beyond traditional fine-tuning
- CycleGAN-based Model Fusion: uses CycleGANs to translate and align feature kernels between CNNs trained on unrelated domains.
- Generated CNNs (Zero-shot Task Creation): constructs a task-specific classifier purely from pre-trained models.
- Feature Space Validation: employs UMAP, K-Means, and DBSCAN to analyze and validate learned representations.
- Unsupervised Generalization: demonstrates black cat recognition without labeled black cat data.
Instead of training a model on data, we train a model on other models:

- Train two CNNs on separate concepts:
  - Object: cat
  - Attribute: black
- Extract convolutional kernels from both networks (see the extraction sketch after this list).
- Train CycleGANs to translate kernels between these feature domains.
- Initialize a new CNN using the CycleGAN-generated kernels.
- Evaluate whether this synthesized CNN can recognize black cats.

It can.
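As a concrete illustration of the kernel-extraction step, here is a minimal sketch assuming PyTorch models built from `nn.Conv2d` layers; treating each (output-channel, input-channel) 5×5 slice as one CycleGAN training sample is an assumption consistent with the per-layer kernel counts in the dataset table below, not necessarily the repo's exact procedure.

```python
import torch
import torch.nn as nn

def extract_kernels(model: nn.Module) -> list[torch.Tensor]:
    """Collect every 5x5 filter slice from the model's Conv2d layers.

    Each (out_channel, in_channel) pair contributes one 5x5 kernel, which
    becomes one training sample for the kernel-space CycleGAN.
    (Hypothetical helper; the repo's actual extraction code may differ.)
    """
    kernels = []
    for module in model.modules():
        if isinstance(module, nn.Conv2d):
            w = module.weight.detach().cpu()             # (out_ch, in_ch, 5, 5)
            kernels.extend(w.reshape(-1, *w.shape[2:]))  # flatten to single 5x5 filters
    return kernels

# Usage: kernels_cat = extract_kernels(cat_cnn); kernels_black = extract_kernels(black_cnn)
```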
| Dataset | Samples |
|---|---|
| Black / Random Images | 1,826 (1,745 black, 81 random) |
| Cat / Random Images | 30,405 (29,843 cats, 562 random) |
| Kernel Sets for CycleGAN | 4,498 per convolutional layer |
- 2 Convolutional layers
- Kernel size: 5×5
- Activation: ReLU
- Max-pooling layers
- Trained independently on separate domains (see the sketch after this list)
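A minimal PyTorch sketch of a source CNN matching this description; the channel widths, 64×64 input size, and linear classifier head are illustrative assumptions rather than the repo's exact values.

```python
import torch
import torch.nn as nn

class SourceCNN(nn.Module):
    """Two-layer CNN of the kind trained separately on 'cat' and on 'black'.

    Only the 5x5 kernels, ReLU activations, and max-pooling come from the
    description above; channel widths (16, 32) and the 64x64 RGB input are
    assumptions made for illustration.
    """
    def __init__(self, num_classes: int = 2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.classifier = nn.Linear(32 * 16 * 16, num_classes)  # assumes 64x64 inputs

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.classifier(self.features(x).flatten(1))
```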
- Generator-discriminator architecture
- Learns kernel-space translation, not image translation
- Operates directly on convolutional filters (see the sketch after this list)
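A minimal sketch of one translation direction of such a kernel-space CycleGAN, assuming each training sample is a single 5×5 filter treated as a 1-channel map; the use of small MLPs and their layer sizes are assumptions, not the repo's architecture.

```python
import torch
import torch.nn as nn

class KernelGenerator(nn.Module):
    """Maps a 5x5 kernel from domain A (e.g. 'cat' filters) to domain B ('black' filters)."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Flatten(),                      # (N, 1, 5, 5) -> (N, 25)
            nn.Linear(25, 128), nn.ReLU(),
            nn.Linear(128, 25),
        )
    def forward(self, k: torch.Tensor) -> torch.Tensor:
        return self.net(k).view(-1, 1, 5, 5)

class KernelDiscriminator(nn.Module):
    """Scores whether a 5x5 kernel looks like it came from the target domain."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Flatten(), nn.Linear(25, 64), nn.ReLU(), nn.Linear(64, 1))
    def forward(self, k: torch.Tensor) -> torch.Tensor:
        return self.net(k)

# A full CycleGAN pairs G_ab/G_ba with D_a/D_b and adds a cycle-consistency
# loss ||G_ba(G_ab(k)) - k||_1 on top of the adversarial losses.
```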
- Initialized entirely using CycleGAN-generated kernels
- No gradient updates using black cat images (see the sketch after this list)
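A minimal sketch of this initialization step, reusing the hypothetical `SourceCNN` from the earlier sketch; the ordering convention for the generated kernels is an assumption.

```python
import torch

@torch.no_grad()
def build_generated_cnn(generated_kernels: list[torch.Tensor]) -> "SourceCNN":
    """Copy CycleGAN-generated 5x5 kernels into a new, untrained CNN.

    `generated_kernels` holds one (5, 5) tensor per (out_ch, in_ch) slot,
    in the order the Conv2d layers consume them. No backprop ever touches
    black cat images; the weights come entirely from the generators.
    """
    model = SourceCNN()                        # same architecture as the source CNNs
    it = iter(generated_kernels)
    for module in model.modules():
        if isinstance(module, torch.nn.Conv2d):
            w = module.weight                  # (out_ch, in_ch, 5, 5)
            for o in range(w.shape[0]):
                for i in range(w.shape[1]):
                    w[o, i].copy_(next(it))
    model.eval()                               # evaluation only, no fine-tuning
    return model
```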
- Accuracy
- Precision & Recall
- Cluster Entropy
- Cluster Purity
- Cosine Similarity
- UMAP Visualization (see the metrics sketch after this list)
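A minimal sketch of the clustering-based metrics, assuming feature vectors extracted from the generated CNN are clustered with K-Means; the purity and entropy computations follow the standard definitions, and `n_clusters=2` plus integer class labels are assumptions.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics.pairwise import cosine_similarity

def cluster_purity_and_entropy(features: np.ndarray, labels: np.ndarray, n_clusters: int = 2):
    """Cluster CNN features with K-Means, then score agreement with true labels.

    Purity: fraction of samples assigned to their cluster's majority class.
    Entropy: label entropy inside each cluster, weighted by cluster size.
    `labels` is assumed to be an array of integer class ids.
    """
    assignments = KMeans(n_clusters=n_clusters, n_init=10).fit_predict(features)
    purity, entropy = 0.0, 0.0
    for c in range(n_clusters):
        members = labels[assignments == c]
        if len(members) == 0:
            continue
        weight = len(members) / len(labels)
        counts = np.bincount(members) / len(members)
        purity += counts.max() * weight
        entropy += -(counts[counts > 0] * np.log2(counts[counts > 0])).sum() * weight
    return purity, entropy

# Cosine similarity between mean feature vectors of two sets, e.g. source vs. generated CNN:
# sim = cosine_similarity(feat_a.mean(0, keepdims=True), feat_b.mean(0, keepdims=True))
```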
- The generated CNN successfully clusters black cat images
- UMAP projections show clear separation of semantic concepts
- Cosine similarity confirms meaningful feature alignment
- Demonstrates unsupervised semantic composition

The model learns “black AND cat” without ever seeing a black cat.
Traditional ML assumes:
New task → new labeled data
This project challenges that assumption by showing:
- Tasks can be composed
- Models can be generated, not trained
- Generative models can operate in parameter space, not just data space
This has implications for:
- Low-resource domains
- Privacy-sensitive data
- Automated ML systems
- Foundation model composition
- Hyperparameter optimization for clustering and feature fusion
- Alternative feature-space similarity metrics
- Semantic-aware end-to-end pipelines
- Scaling to deeper CNNs and transformers
- Multi-attribute model composition
- GPU with ≥ 6 GB VRAM (recommended)
- Python 3.11
- PyTorch 2.5
- torchvision
- numpy
- scikit-learn
- seaborn
- matplotlib
- tqdm
Install dependencies:
`pip install -r requirements.txt`

### License

MIT
Generative models, representation learning, and non-traditional ML paradigms.
“Why train on more data when you can train on more models?”