HealthTech MultiVision · Yakaza

Voice-to-Report Clinical Documentation

Capture physician dictation and produce structured medical reports without breaking hospital workflows.

Voice→

Structured report

Flask

Lightweight backend

Flutter

Cross-platform UI

Problem

Physicians needed to convert spoken dictations directly into structured medical reports without breaking hospital workflows.

Solution

Built a Flask backend and Flutter client that capture physician voice recordings and convert them into structured clinical reports used in hospital workflows.

Architecture

$ render architecture.mmd

flowchart LR
  Mic[Physician Mic] --> App[Flutter Client]
  App -- audio chunks --> API[Flask API]
  API --> ASR[Speech Recognition]
  ASR --> NLP[Section Classifier]
  NLP --> Tmpl[Report Templater]
  Tmpl --> Out[(Structured Report)]
  Out --> Hospital[Hospital EHR]

Technical decisions

$ git log --oneline decisions/

#01

Streaming uploads, not single blobs

Audio captured and streamed in chunks so transcription started while the physician was still talking — perceived latency near zero.

#02

Templated structure, not free text

Outputs follow per-specialty report templates so the EHR import is deterministic and reviewers can scan reports in seconds.

Technologies

Python Flask Flutter Speech Recognition

Have a similar challenge?

Let's talk