Created NORA. She has been designed from zero. At this point, I have determined the best hyperparamers for her to train. Next step is to help her communicate on discord and see how she handles it.

This commit is contained in:
2025-06-03 23:43:58 -04:00
commit 16289fc942
10 changed files with 955 additions and 0 deletions

33
README.md Normal file
View File

@ -0,0 +1,33 @@
# Nora: Train a Transformer LM from Scratch
> A minimal, from-scratch language model. No pretrained weights—just public-domain books and your GPU (or CPU).
## Overview
Nora is a character-level Transformer language model written entirely in PyTorch. It learns from whatever plaintext `.txt` files you place in `data/books/`. Over time, you can extend Noras codebase (e.g., add reinforcement-learning loops, self-improvement modules, etc.) toward more advanced AI, if you wish.
## Why “Nora”?
- A simple, humanlike female name.
- Short, easy to pronounce.
- As the project scales, “Nora” could theoretically be extended with modules to approach more general intelligence.
## Requirements
- **Python 3.10.6** (Windows 11 or any OS)
- **CUDA-capable GPU** (if you want to train faster; otherwise CPU)
- **PyTorch** (install with `pip install torch torchvision`)
- **tqdm** (`pip install tqdm`)
- **Other Python packages**: `numpy`, `typing`
## Folder Structure
- nora/
- ├── config.py
- ├── tokenizer.py
- ├── data_loader.py
- ├── model.py
- ├── train.py
- ├── utils.py
- ├── main.py
- └── README.md