Transformers from Scratch

This repository contains my implementation of Transformers from scratch, inspired by Andrej Karpathy's video. While the original implementation focused on a decoder-only model, I have extended it by adding an encoder as well. The goal of this project is to deeply understand the inner workings of Transformers by building them step by step without relying on high-level libraries like Hugging Face Transformers.

Features

Implements a full Transformer model (Encoder-Decoder architecture)
Single-file implementation (gpt.py) for simplicity
Includes essential components:
- Token Embeddings
- Positional Encodings
- Multi-Head Self-Attention
- Feedforward Layers
- Layer Normalization
- Encoder and Decoder Blocks
Trained on sample text data to demonstrate functionality

Installation

To run the implementation, clone this repository and install the required dependencies:

git clone https://github.com/ahmetz3lka/transformers_from_scratch.git
cd transformers-from-scratch
pip install -r requirements.txt

Usage

Run the script with:

python gpt.py

Modify gpt.py to experiment with different model hyperparameters.

Understanding the Code

The entire Transformer model is implemented within a single file, gpt.py, to keep things simple and easy to follow. The key sections include:

Embedding Layer: Converts tokens into dense vector representations.
Self-Attention Mechanism: Captures relationships between tokens.
Feedforward Network: Adds non-linearity and depth.
Encoder-Decoder Architecture: Implements both parts of the Transformer model.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
gpt.py		gpt.py
readme.md		readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transformers from Scratch

Features

Installation

Usage

Understanding the Code

Future Improvements

References

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Transformers from Scratch

Features

Installation

Usage

Understanding the Code

Future Improvements

References

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages