Created NORA. She has been designed from zero. At this point, I have determined the best hyperparamers for her to train. Next step is to help her communicate on discord and see how she handles it.

2025-06-03 23:43:58 -04:00
commit 16289fc942
10 changed files with 955 additions and 0 deletions
--- a/README.md
+++ b/README.md
@ -0,0 +1,33 @@
+# Nora: Train a Transformer LM from Scratch
+
+> A minimal, from-scratch language model. No pretrained weights—just public-domain books and your GPU (or CPU).
+
+## Overview
+
+Nora is a character-level Transformer language model written entirely in PyTorch. It learns from whatever plain‐text `.txt` files you place in `data/books/`. Over time, you can extend Nora’s codebase (e.g., add reinforcement-learning loops, self-improvement modules, etc.) toward more advanced AI, if you wish.
+
+## Why “Nora”?
+
+- A simple, human‐like female name.
+- Short, easy to pronounce.
+- As the project scales, “Nora” could theoretically be extended with modules to approach more general intelligence.
+
+## Requirements
+
+- **Python 3.10.6** (Windows 11 or any OS)
+- **CUDA-capable GPU** (if you want to train faster; otherwise CPU)
+- **PyTorch** (install with `pip install torch torchvision`)
+- **tqdm** (`pip install tqdm`)
+- **Other Python packages**: `numpy`, `typing`
+
+## Folder Structure
+
+- nora/
+- ├── config.py
+- ├── tokenizer.py
+- ├── data_loader.py
+- ├── model.py
+- ├── train.py
+- ├── utils.py
+- ├── main.py
+- └── README.md