Human Pose EstimationΒΆ

Pose Estimation In Action

Use Pose Estimation to track the position of people in images and video. Build an AI-powered fitness coach, immersive AR experiences, and more.

Models Compatible with the API

Technical Specifications

Architecture Format(s) Size Input Output Benchmarks
MobileNet backbone Core ML (iOS), TensorFlow Lite (Android) ~5 MB 353x257-pixel image Position of each person and body part detected, Number of people detected, The confidence associated with each detection 20 FPS on iPhone X, 10 FPS on Pixel 2

Prebuilt Models - Include our models directly in your app and use them with the API.

Name Example Description Keypoints
Human pose_estimation A model that tracks one or more poses in the scene. Identifies 17 body keypoints. Face: nose, eyes, ears; Torso: shoulders, elbows, wrists; Legs: hips, knees, ankles

View all COCO keypoints

Customizing Models for Pose Estimation

If you have your own dataset and would like to train a custom model that is compatible with the Pose Estimation API, sign up for the Standard Plan on Fritz to access training notebooks.