Subtitling and caption production is the technical and creative craft of transcribing dialogue, timing it to video, translating it when needed, and encoding it for distribution. Subtitles are translations of dialogue; captions (CC) are for deaf/hard of hearing and include sound effects, music cues, and speaker IDs. The workflow involves four steps: (1) transcription (convert speech to text), (2) timing (align text with video), (3) localization (translate to target languages), and (4) encoding (embed in video or deliver as separate files in SRT/VTT format).