NTU RGB-D dataset

From HandWiki
Revision as of 18:55, 6 March 2023 by StanislovAI (talk | contribs) (simplify)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

The NTU RGB-D (Nanyang Technological University's Red Blue Green and Depth information) dataset is a large dataset containing recordings of labeled human activities .[1] This dataset consists of 56,880 action samples containing 4 different modalities (RGB videos, depth map sequences, 3D skeletal data, infrared videos) of data for each sample.

The dataset consists of 60 labelled actions. Specifically: drink water, eat meal/snack, brushing teeth, brushing hair, drop, pickup, throw, sitting down, standing up (from sitting position), clapping, reading, writing, tear up paper, wear jacket, take off jacket, wear a shoe, take off a shoe, wear on glasses, take off glasses, put on a hat/cap, take off a hat/cap, cheer up, hand waving, kicking something, put something inside pocket / take out something from pocket, hopping (one foot jumping), jump up, make a phone call/answer phone, playing with phone/tablet, typing on a keyboard, pointing to something with finger, taking a selfie, check time (from watch), rub two hands together, nod head/bow, shake head, wipe face, salute, put the palms together, cross hands in front (say stop), sneeze/cough, staggering, falling, touch head (headache), touch chest (stomachache/heart pain), touch back (backache), touch neck (neckache), nausea or vomiting condition, use a fan (with hand or paper)/feeling warm, punching/slapping other person, kicking other person, pushing other person, pat on back of other person, point finger at the other person, hugging other person, giving something to other person, touch other person's pocket, handshaking, walking towards each other and walking apart from each other.

Classifiers

This is a table of some of the machine learning methods used on the database and their error rates, by type of classifier:

Type Paper Preprocessing Description Accuracy (%)
Deep Learning Unconstraint attention mechanism over RGB stream 86.6
Deep Learning Arranging skeletal joints for tree-traversal 77.7
Deep Learning Deep LSTM [1] None 67.3

See also

References

  1. 1.0 1.1 Shahroudy, Amir; Liu, Jun; Ng, Tian-Tsong; Wang, Gang (2016). "NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis". arXiv:1604.02808 [cs.CV].