Toy LLM

This project creates a simple language model, based on Markov Chains. This article explains Markov Chains for Natural Language Processing (NLP): geeksforgeeks.org.

In short, it analyses probability that a word appears (in text) after another word, and then randomly chooses a next word, based on the previous one.

This repo implements a model class with helper functions to fit the model from string or .txt file.

Project setup

Install required dependencies:

pip install -r requirements.txt

Then instantiate the model with example text:

from model import MarkovModel

model: MarkovModel = MarkovModel.from_text("A B C A", seed=42)
print(model.predict_n_tokens(10))
# B C A B C A B C A B C

Examples

Here take a look at demo notebooks:

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
examples		examples
models		models
samples		samples
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
model.py		model.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Toy LLM

Project setup

Examples

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Toy LLM

Project setup

Examples

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages