QuickTalk¶
When to Use QuickTalk¶
QuickTalk fits realtime lip-sync and low-latency validation. It is a good path when you want a real local model instead of Mock.
Requirements¶
- NVIDIA GPU is recommended.
- The local asset root must contain
checkpoints/. - You need
quicktalk.pth,repair.npy,chinese-hubert-large/, and InsightFaceauxiliary/models/buffalo_l/. - Avatars use OpenTalking's shared avatar flow; templates or caches needed by QuickTalk are generated by deployment commands, upload flows, or session startup.
Prepare Weights¶
The full download commands live in QuickTalk Local Deployment. This page keeps only the layout and configuration essentials.
quicktalk/
checkpoints/
quicktalk.pth
repair.npy
chinese-hubert-large/
pytorch_model.bin
auxiliary/models/buffalo_l/
det_10g.onnx
Prepare Avatar¶
Use the shared flow in Avatar Assets. QuickTalk does not require the avatar manifest to be bound to a dedicated type; if the runtime needs a fixed template video, make sure that asset is reachable from deployment configuration or session initialization.
Configure Backend¶
bash scripts/start_unified.sh \
--backend omnirt \
--model quicktalk \
--omnirt http://127.0.0.1:9000
Start Service¶
Verify¶
uv run opentalking-quicktalk-bench \
--asset-root ./examples/avatars/quicktalk-daytime \
--template-video ./examples/avatars/quicktalk-daytime/quicktalk/template_900.mp4 \
--audio ./assets/test.wav \
--output ./outputs/quicktalk-bench.mp4 \
--device cuda:0
Or verify in the WebUI by selecting the quicktalk model and sending a short prompt.
Troubleshooting¶
connected=false: check the asset path, QuickTalk dependencies, andOPENTALKING_TORCH_DEVICE.- Slow first turn: enable
OPENTALKING_QUICKTALK_WORKER_CACHE=1. - Avatar load failure: make sure the avatar is readable and any configured template asset is reachable.