Thanks for the suggestion! Coincidentally, I am already working on something like that, although a bit more comprehensive and focused on decoder models.
Hi,
I would like to ask what you think about adding a showcase of how to build, train, and test a Transformer "from scratch" with PyTorch, for instance for a translation task such as English to French. I have already done some research and found the following resources on the topic:
Attention is all you need: A Pytorch Implementation
Build your own Transformer from scratch using Pytorch
Transformers from Scratch in PyTorch
Transformers from scratch
From my point of view, it would be nice to see a combination of these approaches. I believe this would also be a great opportunity to demonstrate post-LN (as in the original paper) versus a pre-LN implementation (which tends to train more stably and can give a performance boost).
In my opinion, the first resource would be a good reference for an end-to-end workflow on real text data (although it is not properly maintained), but with the simplicity of the 2nd and 3rd resources (those just use some artificially sampled random data points), all in one single notebook.
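To illustrate the post-LN versus pre-LN point, here is a minimal sketch of the difference in a single encoder layer, built from standard PyTorch modules; the class names, hyperparameters, and layer layout are just assumptions for the sketch, not a definitive implementation:

```python
import torch
import torch.nn as nn

class PostLNEncoderLayer(nn.Module):
    """Post-LN (original paper): sublayer -> residual add -> LayerNorm."""
    def __init__(self, d_model=512, n_heads=8, d_ff=2048, dropout=0.1):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, dropout=dropout, batch_first=True)
        self.ff = nn.Sequential(nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model))
        self.norm1, self.norm2 = nn.LayerNorm(d_model), nn.LayerNorm(d_model)
        self.drop = nn.Dropout(dropout)

    def forward(self, x):
        attn_out, _ = self.attn(x, x, x)
        x = self.norm1(x + self.drop(attn_out))    # normalize AFTER the residual
        x = self.norm2(x + self.drop(self.ff(x)))
        return x

class PreLNEncoderLayer(nn.Module):
    """Pre-LN: LayerNorm -> sublayer -> residual add; trains more stably."""
    def __init__(self, d_model=512, n_heads=8, d_ff=2048, dropout=0.1):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, dropout=dropout, batch_first=True)
        self.ff = nn.Sequential(nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model))
        self.norm1, self.norm2 = nn.LayerNorm(d_model), nn.LayerNorm(d_model)
        self.drop = nn.Dropout(dropout)

    def forward(self, x):
        h = self.norm1(x)                          # normalize BEFORE the sublayer
        x = x + self.drop(self.attn(h, h, h)[0])
        x = x + self.drop(self.ff(self.norm2(x)))
        return x

# Quick shape check on random data, in the spirit of the simpler tutorials above.
x = torch.randn(8, 32, 512)                        # (batch, seq_len, d_model)
print(PostLNEncoderLayer()(x).shape, PreLNEncoderLayer()(x).shape)
```

For comparison, recent PyTorch versions expose the same switch directly via the norm_first flag of nn.TransformerEncoderLayer.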
Kind regards,
Daniel