This backend system consists of several components that enable real-time conversational characters using Pixel Streaming, speech-to-text (STT), text-to-speech (TTS), LLM integration, and lip sync.
Events should arrive in an order that satisfies Anthropic's requirements when persisted, or the SDK should provide guidance or helpers for provider-specific ordering. Currently, frameworks must detect ...
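As a rough illustration of what a provider-specific ordering helper could look like, here is a minimal sketch. Everything in it is hypothetical: the `Event` type, the `order_for_persistence` function, and the rank table are made up for this example, and the rank values are placeholders rather than Anthropic's documented rules. The idea is only that a stable sort keyed on a per-provider rank table can reorder events before they are persisted, while leaving events for unknown providers untouched.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical rank table. The ranks are placeholders for illustration only;
# they are NOT taken from Anthropic's documentation or from any SDK.
PROVIDER_BLOCK_ORDER: Dict[str, Dict[str, int]] = {
    "anthropic": {"thinking": 0, "text": 1, "tool_use": 2},
}


@dataclass(frozen=True)
class Event:
    """Hypothetical event record: a block kind plus an opaque payload."""
    kind: str
    payload: str


def order_for_persistence(events: List[Event], provider: str) -> List[Event]:
    """Return a new list of events ordered by the provider's rank table.

    Providers without a rank table get their events back in arrival order.
    Kinds missing from the table sort after all known kinds. Because
    sorted() is stable, events of equal rank keep their arrival order.
    """
    ranks = PROVIDER_BLOCK_ORDER.get(provider)
    if ranks is None:
        return list(events)
    fallback = len(ranks)  # rank assigned to unknown kinds
    return sorted(events, key=lambda e: ranks.get(e.kind, fallback))


arrived = [
    Event("tool_use", "call-1"),
    Event("text", "hello"),
    Event("thinking", "plan"),
    Event("text", "world"),
]
ordered = order_for_persistence(arrived, "anthropic")
```

A table-driven helper like this keeps the ordering rule in one place per provider, so a framework would not need its own detection logic; whether that matches the real constraints depends on the provider's actual requirements.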