EmotionalAI — Licensing & Compliance

Licensing & Compliance

Open-Source Model Usage, Data Sources, and Academic Integrity

Principle: Using open-source models as components is standard academic practice — equivalent to citing published libraries. Our original contribution is the architecture: how signals are fused, how memory is built, how conflicts are detected, and how analysis becomes video prompts. Swap any model — the system still works.

Pipeline Models (Perception Layer)

Component	Model	License	Academic Use	Commercial	Data Source
Face Detection	RetinaFace	MIT	OK	OK	WiderFace dataset (public)
Face Recognition	Facenet512	MIT (via DeepFace)	OK	OK	MS-Celeb-1M / VGGFace2
Facial Emotion (FER)	DeepFace	MIT	OK	OK	FER2013, AffectNet (public benchmarks)
Speech Recognition (ASR)	WhisperX	BSD 2-Clause	OK	OK	OpenAI Whisper weights (MIT)
Word Alignment	wav2vec2 (via WhisperX)	MIT (Meta)	OK	OK	LibriSpeech (public)
Speech Emotion (SER)	audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim	CC BY-NC-SA 4.0	OK	Non-commercial	MSP-Podcast (licensed for research)
Speaker Diarization	pyannote.audio	MIT	OK	OK	VoxCeleb, AMI (public)
Vocal Separation	Demucs (Meta)	MIT	OK	OK	MUSDB18 (public)
Noise Reduction	DeepFilterNet	MIT / Apache 2.0	OK	OK	DNS Challenge (public)
Face Restoration	GFPGAN (Tencent)	Apache 2.0	OK	OK	FFHQ (public)

Note on SER model: The audeering wav2vec2 emotion model uses CC BY-NC-SA 4.0, which permits academic research but not commercial use. This is the only non-commercial component in our pipeline. For commercial deployment, it can be replaced with an Apache-licensed alternative (e.g., emotion2vec) without affecting the system architecture.

Cognitive & Generation Models

Component	Model	License	Academic Use	Commercial	Notes
LLM (data translator)	Qwen2.5 14B (Alibaba)	Apache 2.0	OK	OK	Local deployment, offline, quantized Q4_K_M
Memory Store	ChromaDB	Apache 2.0	OK	OK	Persistent vector database
Video Generation	LTX-Video 2.3 (Lightricks)	Code: Apache 2.0 Weights: LTX Community License	OK	OK (<$10M revenue)	Israeli company, licensed training data
Text Encoder	Gemma 3 12B (Google)	Gemma Terms of Use	OK	OK with terms	Free for dev/research, commercial with terms
Workflow Engine	ComfyUI	GPL 3.0	OK	Copyleft	Used as tool, not embedded in our code

Micro-Expression Models

Model	Paper	Code License	Pretrained Weights	Status
Mimicking Annotation	ACM MM 2022	Research use (author provided)	Available (23GB)	Reproduced: UF1=0.7151
Micron-BERT	CVPR 2023	Research use	Requires university auth	Partial: re-implemented
MOL	TPAMI 2025	Research use	Not provided (Aff-Wild2)	Failed
FDP	TOMM 2025	Research use	Not provided (Aff-Wild2)	Failed

Datasets Used

Dataset	Source	License/Terms	Usage
Gone With The Wind (1939)	Public domain (copyright expired)	Public domain	Primary validation film
MELD	S. Poria et al., 2019	Research use	Emotion classification benchmark (62.94% accuracy)
SMIC-HS	University of Oulu, Finland	Research use	Micro-expression benchmark (UF1=0.7151)
4DME	Multiple sources	Research use	Micro-expression 4-class (UF1=0.6844, MTCNN cropped)
MMEW	Multiple sources	Research use	Micro-expression 7-class (UF1=0.4838)

All datasets obtained through legitimate academic channels. MELD is publicly available (HuggingFace). SMIC-HS, 4DME, MMEW are standard micro-expression research datasets used under research terms. GWTW (1939) is in the public domain. Dataset terms were individually reviewed — 2 dataset agreements from other providers were rejected due to overly restrictive terms (requiring intellectual property transfer).

Our Original Contribution

The following components are 100% original code, not derived from any external model or library:

Multimodal fusion layer — weighted voting with conflict detection, FER degradation tiers
Three-layer memory system — STM → Candidate Buffer → LTM with LSTM surprise promotion
Emotion State Machine — NATURAL/GRADUAL/TRIGGERED/SUDDEN with personality modulation
Scoring engine — 6-dimension white-box formulas with circumplex distance metrics
Prompt engine — 6-dimension pipeline-driven prompt generation with zero fabrication policy
Temporal alignment — FER offset auto-detection, 3-window matching, vectorized binary search
140-clip experimental methodology — factorial design, multi-seed validation, A/B blind test

LTX-Video 2.3 — Licensing Deep Dive

Component	License
Source code (GitHub)	Apache 2.0
Model weights (HuggingFace)	LTX Model Community License

Why we chose LTX-Video — data provenance matters:

Unlike many AI video models that scrape internet data without permission, Lightricks (Israel) obtained their training data through formal licensing agreements with authorized content providers. This is explicitly stated on their official license page. This means:

The model was trained on legally licensed data, not scraped/pirated content
Videos generated by LTX-Video are not subject to copyright claims from training data owners
Academic use is fully permitted under both Apache 2.0 (code) and Community License (weights)
Commercial use is free for organizations under $10M annual revenue
This legal clarity is a key reason we chose LTX-Video over alternatives with ambiguous data sourcing

Official resources: ltx.io | GitHub | HuggingFace | License

← Back to home