mystiatech/Rosie

Go to file

Dani 337a681df3 feat: add VRM model loading functionality

This commit introduces the core VRM loading capabilities for the 3D rendering system. It implements a dedicated VRMLoader class that can parse VRM files (which are binary glTF format) and extract essential data including meshes, materials, textures, and VRM-specific extension metadata. The implementation handles both binary and JSON formatted VRM files while supporting standard glTF accessor data extraction for vertices, normals, UV coordinates, and indices. This enables the application to load and render 3D models with proper material and texture support from VRM files.

2025-09-30 19:24:18 -04:00

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

feat: add VRM model loading functionality

2025-09-30 19:24:18 -04:00

.env.example

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

.gitignore

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

CURRENT_STATUS.md

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

main.py

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

PROJECT_PLAN.md

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

README.md

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

requirements.txt

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

RESEARCH_FINDINGS.md

Initial commit: Desktop Waifu MVP foundation

2025-09-30 18:42:54 -04:00

README.md

Desktop Waifu 🎀

An AI-powered desktop companion with VRM model rendering and Discord integration.

Features

Current (MVP)

✅ Transparent desktop widget
✅ Draggable VRM character
✅ Always-on-top window
✅ Basic state management
⏳ VRM model rendering (in progress)
⏳ Sound effects on interaction
⏳ Text chat interface
⏳ Local LLM integration
⏳ Expression changes based on emotion
⏳ Discord bot integration

Planned

Voice input/output (STT/TTS)
Advanced animations
System integration (notifications, app control)
Memory persistence
Proactive messaging
Cross-platform support

Setup

Prerequisites

Python 3.10+
VRM model file (.vrm)
Local LLM (Ollama, llama.cpp, etc.) - optional for testing

Installation

Clone/Download the project
Install dependencies:

pip install -r requirements.txt

Configure environment:

cp .env.example .env
# Edit .env with your settings

Add your VRM model:
- Place your .vrm file in the models/ folder
- Update VRM_MODEL_PATH in .env
Add sound effects (optional):
- Place .wav files in assets/sounds/
- Examples: squeak.wav, click.wav

Discord Setup (Optional)

Create a Discord bot at https://discord.com/developers/applications
Enable these intents:
- Message Content Intent
- Server Members Intent
Copy bot token to DISCORD_BOT_TOKEN in .env
Invite bot to your server with permissions:
- Send Messages
- Read Message History
- Use Slash Commands

LLM Setup (Optional)

Using Ollama:

# Install Ollama from https://ollama.ai
ollama pull llama2
# Update .env: LLM_API_URL=http://localhost:11434

Using llama.cpp:

pip install llama-cpp-python
# Configure model path in code

Running

python main.py

Usage

Desktop Interactions

Click and drag: Move the waifu around your screen
Double-click: Open chat interface (coming soon)
Right-click: Context menu (coming soon)

Discord Commands

!hello: Greet the waifu
!status: Check current mood/status
@mention: Talk to the waifu in any channel
DM: Send direct messages

Project Structure

Waifu/
├── main.py                 # Entry point
├── requirements.txt        # Dependencies
├── .env                    # Configuration (create from .env.example)
├── PROJECT_PLAN.md         # Development plan
├── RESEARCH_FINDINGS.md    # Technical research
├── models/                 # VRM model files
│   └── .gitkeep
├── assets/
│   └── sounds/             # Sound effects
│       └── .gitkeep
└── src/
    ├── core/
    │   └── state_manager.py    # State synchronization
    ├── ui/
    │   ├── waifu_window.py     # Main window
    │   └── vrm_widget.py       # VRM renderer
    ├── discord_bot/
    │   └── bot.py              # Discord integration
    ├── llm/                    # LLM integration (TODO)
    └── audio/                  # Audio system (TODO)

Development Status

Phase 1: MVP (In Progress)

Project structure
State management system
Transparent window framework
Drag functionality
Basic OpenGL setup
VRM model loading
VRM rendering with MToon shader
Sound effects
Chat interface
LLM integration
Discord bot
Expression control

Phase 2: Enhancement (Planned)

Voice I/O
Advanced animations
System integration
Memory/database
.exe packaging

Technical Details

Architecture

UI: PyQt6 with transparent windows
Rendering: OpenGL via QOpenGLWidget
VRM Parsing: pygltflib
State Management: Custom event-driven system
Discord: discord.py
LLM: Flexible (Ollama, llama.cpp, custom)

VRM Rendering

VRM models are glTF 2.0 with extensions
Custom MToon shader implementation needed
Blend shapes for facial expressions
Bone animations for gestures

See RESEARCH_FINDINGS.md for detailed technical research.

Contributing

This is a personal project, but suggestions and feedback are welcome!

License

TBD

Acknowledgments

VRM Consortium for the VRM specification
PyQt project for the excellent GUI framework
discord.py for Discord integration