Skip to content

Conversation

katjasrz
Copy link
Contributor

This repository supports a video + notebook series exploring how to run, optimize, and serve Large Language Models (LLMs) with a focus on latency, throughput, user experience (UX), and NVIDIA GPU acceleration.

@katjasrz katjasrz closed this Aug 12, 2025
@katjasrz
Copy link
Contributor Author

No longer publishing these series

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant