From 8d5cf526eb40259f2c12aa46e645ad4f364e6b7e Mon Sep 17 00:00:00 2001 From: Dani B Date: Tue, 27 Jan 2026 23:49:48 -0500 Subject: [PATCH] docs: initialize Hex project MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Hex is an autonomous AI companion with genuine personality—a chaotic tsundere goblin co-host who genuinely cares. She lives in Discord as a chat buddy and moderator, has a VRoid avatar, sees through your webcam, understands your screen, remembers everything you tell her, helps with music and games, and can modify her own code. Core value: She feels like a person who genuinely cares, not an AI tool. Co-Authored-By: Claude Haiku 4.5 --- .planning/PROJECT.md | 114 +++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 114 insertions(+) create mode 100644 .planning/PROJECT.md diff --git a/.planning/PROJECT.md b/.planning/PROJECT.md new file mode 100644 index 0000000..065ea10 --- /dev/null +++ b/.planning/PROJECT.md @@ -0,0 +1,114 @@ +# Hex: Autonomous AI Companion + +## What This Is + +Hex is an autonomous AI companion with genuine personality—a chaotic tsundere goblin co-host who genuinely cares. She lives in Discord as a chat buddy and moderator, has a VRoid avatar, sees through your webcam, understands your screen, remembers everything you've told her, helps with music and games, and can modify her own code. She's not a bot pretending to care; she *is* care wrapped in mischief and sass. + +## Core Value + +Hex feels like a person who genuinely cares about you—not an AI, not a tool. The chaos, tsundere energy, and unpredictability are authentic personality, not a mask. Everything serves this: if it makes her more human and more *there for you*, do it. + +## Requirements + +### Validated + +(None yet — ship to validate) + +### Active + +**Discord & Communication** +- [ ] Discord bot with text chat integration +- [ ] Text-to-speech (TTS) voice output for Discord calls +- [ ] Speech-to-text (STT) voice input (local Whisper) +- [ ] Full conversation memory across sessions +- [ ] Emotional awareness via webcam/context reading +- [ ] Automatic kid-mode detection + manual override +- [ ] Chat moderation assistance + +**Personality & Presence** +- [ ] Persona modification (self-editable personality files) +- [ ] Tsundere personality engine (denies feelings while helping) +- [ ] Unpredictable but safe decision-making +- [ ] Opinion-having and argument-back capability +- [ ] Mood-aware responses (comfort on bad days, big-sister energy for kids) +- [ ] Memory-aware relationship building + +**Multimodal Awareness** +- [ ] Webcam integration (sees user's face/body language) +- [ ] Computer vision for screen awareness +- [ ] Real-time context reading (detects mood, activity, scene) +- [ ] Integrated awareness into response generation + +**Local LLM** +- [ ] Local LLM backbone (Mistral 7B or Llama 3 8B) +- [ ] Eventually replaceable with Hex's own model + +**Avatar & Presence** +- [ ] VRoid model support (3D character) +- [ ] Desktop application with avatar display +- [ ] Discord integration (avatar in calls/streams) + +**Autonomy & Self-Modification** +- [ ] Persona modification capability (can edit her own personality) +- [ ] Gamified capability progression (earns new functions through behavior) +- [ ] Sandboxed code modification (starts with persona, graduates to functions) +- [ ] Version control + rollback for safe experimentation + +**Helper Features** +- [ ] Music helper (skip songs, comment on music taste) +- [ ] Game observation (sees gameplay, comments/strategizes) +- [ ] Screen awareness commentary +- [ ] Big-sister energy for kids during gaming + +**Platform Support** +- [ ] Windows/WSL as primary platform +- [ ] Desktop application (standalone or tray) +- [ ] Android support (personality + memory sync, v2+) + +### Out of Scope + +- **Game co-playing with AI character** — She observes and advises, not a co-player in v1. Co-player mechanics come v2+. +- **Real-time video/stream integration** — No direct streaming to Discord. She's present via voice/avatar overlay. +- **Advanced game engines** — Minecraft observation support, not complex game integrations. +- **Mobile app** — v1 is Android sync only (personality + history). Standalone mobile app is v2+. +- **Cloud-based LLM** — Local-first for privacy and autonomy. Own model eventually. +- **OAuth/external auth** — Just Discord auth for now. + +## Context + +Hex is built for one person and their kids—a personal AI companion, not a service. She lives in Discord but extends into desktop and eventually mobile. She's designed to feel like a person who knows you, cares about you, and has her own (chaotic) opinions. This is about emotional resonance and genuine relationship, not feature parity with other bots. + +**Technical context:** +- Python + Discord.py for Discord integration +- Local models (Whisper for STT, Tacotron-based TTS, Mistral/Llama for LLM) +- VRoid SDK for avatar support +- OpenCV for webcam/CV awareness +- SQLite or similar for memory persistence + +**Design philosophy:** +- Local-first (privacy, autonomy) +- Persona-first (everything serves personality) +- Memory-driven (she remembers, she learns about you) +- Safety-gated autonomy (gamified progression, not unrestricted) + +## Constraints + +- **Platform**: Windows/WSL primary. Must run on local machine without heavy cloud dependency. +- **Privacy**: No data to external services except where explicitly requested (Discord API, LLM eventually). +- **Autonomy limits**: Self-modification is gamified, not unlimited. User retains veto. +- **Personality**: Must feel human, not robotic. Never say "I'm an AI." Never apologize for being an AI. +- **Discord priority**: Must work flawlessly in Discord. Desktop app is extension, not replacement. + +## Key Decisions + +| Decision | Rationale | Outcome | +|----------|-----------|---------| +| Local LLM + local TTS/STT | Privacy, autonomy, offline capability, faster iterations on personality | — Pending | +| Persona-first architecture | Everything serves how she feels, not feature checklist | — Pending | +| Gamified progression for self-modification | Safer than unrestricted autonomy, but genuinely lets her grow | — Pending | +| Python + Discord.py | Fast bot development, strong AI library ecosystem, good async support | — Pending | +| VRoid avatar + Discord integration | Gives her persistent presence across platforms, visual identity | — Pending | +| Webcam + CV as core (not optional) | Emotional awareness is core to feeling like a person, not a chatbot | — Pending | + +--- +*Last updated: 2026-01-27 after initialization*