The current curriculum covers topics from basic NLP techniques to the most modern ones that may be helpful for custom training of LLMs:
- NLP Basics: tokenization, text preprocessing, text representations (a toy tokenizer sketch follows this list)
- Text & Language Models: embeddings, n-gram models, RNNs, LSTMs, seq2seq, attention
- Transformers & LLMs: Transformer, pre-training (MLM/CLM), prompting, fine-tuning, PEFT
- Scaling & Optimization: distributed training, MoE, KV-cache, Flash Attention, efficient inference, quantization
- Retrieval & Agents: Information Retrieval, RAG, agent-based systems
- Post-training: alignment, RLHF, DPO
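To give a flavor of the very first topic in the list, here is a minimal word-level tokenizer sketch. It is purely illustrative; the course itself goes well beyond this toward subword and model-specific tokenization.

```python
import re

def simple_tokenize(text: str) -> list[str]:
    """Toy word-level tokenizer: lowercase the text, then split it
    into word tokens and individual punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", text.lower())

print(simple_tokenize("Tokenization splits text into units!"))
# ['tokenization', 'splits', 'text', 'into', 'units', '!']
```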
- German Gritsai @grgera
- Anastasiia Vozniuk @natriistorm
- Ildar Khabutdinov @depinwhite
| Week # | Date | Topic | Lecture | Seminar | Additional | Recording |
|---|---|---|---|---|---|---|
| 1 | February 10 | Intro to NLP & Tokenization | slides | ipynb | materials | TBA |
The rest of the schedule is TBA.
Final mark = 0.3 × (oral answer grade) + 0.7 × (average score for practical assignments)
Both the oral exam and the homework assignments are blocking parts: you need to pass each of them to pass the course.
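As a quick illustration, here is a minimal sketch of how the final mark is computed. The 10-point scale and the `pass_threshold=4.0` blocking cutoff are assumptions made for the example, not official course rules.

```python
def final_mark(oral_grade: float, assignment_scores: list[float],
               pass_threshold: float = 4.0) -> float | None:
    """Illustrative sketch of the grading formula.

    Both parts are blocking: failing either one fails the course.
    The 10-point scale and pass_threshold are assumptions for
    illustration only.
    """
    avg_hw = sum(assignment_scores) / len(assignment_scores)
    if oral_grade < pass_threshold or avg_hw < pass_threshold:
        return None  # a blocking part was failed
    return 0.3 * oral_grade + 0.7 * avg_hw

# Oral = 8, homework average = (7 + 9 + 8) / 3 = 8
# Final = 0.3 * 8 + 0.7 * 8 = 8.0
print(final_mark(8.0, [7.0, 9.0, 8.0]))
```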
- Probability Theory + Statistics
- Machine Learning
- Python (see the Python guide)
- Basic knowledge of NLP
We expect students to know the basics of Natural Language Processing, as the course focuses on more advanced topics. If you are unsure about the basics, we recommend reading these lectures/materials:
