MediaPipe

An open-source framework by Google for building multimodal machine learning pipelines, commonly used for real-time face, hand, and body tracking. In accessibility applications, MediaPipe Holistic extracts 3D landmarks from the user's body and hands via webcam, while MediaPipe Face Landmarker outputs 468 facial landmarks and 52 blendshape coefficients for facial movements. These capabilities enable applications like sign language recognition, gesture-controlled interfaces, and avatar-based sign language learning systems where user movements are mapped to virtual characters.

Category: artificial intelligence · computer vision · assistive technology

Related: Motion Capture · Facial Expression Recognition · Sign Language Recognition

Sources

https://doi.org/10.1145/3663547.3746382