RESEARCH PRODUCT
KARD - Kinect Activity Recognition Dataset
Marco Morana
subject
Ambient Intelligence, Artificial Intelligence, Computer Science, Interdisciplinary sciences, Other, Activity Recognition, 3D Imaging
description
To cite this dataset, please refer to the following paper:

Human Activity Recognition Process Using 3-D Posture Data. S. Gaglio, G. Lo Re, M. Morana. In IEEE Transactions on Human-Machine Systems. 2014. doi: 10.1109/THMS.2014.2377111

KARD contains 18 activities. Each activity is performed 3 times by 10 different subjects.

1. Horizontal arm wave
2. High arm wave
3. Two hand wave
4. Catch Cap
5. High throw
6. Draw X
7. Draw Tick
8. Toss Paper
9. Forward Kick
10. Side Kick
11. Take Umbrella
12. Bend
13. Hand Clap
14. Walk
15. Phone Call
16. Drink
17. Sit down
18. Stand up

In total, there are 4 (files) x 18 (activities) x 3 (repetitions) x 10 (subjects) = 2160 files.

Each filename is in the form aA_sS_eN_string, where A is a two-digit actionID, S is a two-digit subjectID, and N is the repetition number. The string parameter depends on the type of information provided:

- depthmaps.txt: depth map,
- .mp4: 640x480 RGB video,
- realworld.txt: joint positions in real world coordinates,
- screen.txt: joint positions in screen coordinates and depth value.

For example, the file a04_s03_e02_realworld.txt contains the skeleton joint positions in real world coordinates for the second repetition of action #4 performed by subject #3.

The files containing the skeleton coordinates (realworld.txt and screen.txt) list the 15 joints in consecutive blocks, one block for each frame. Within a block, the joints appear in this order:

line 1: Head
line 2: Neck
line 3: Right Shoulder
line 4: Right Elbow
line 5: Right Hand
line 6: Left Shoulder
line 7: Left Elbow
line 8: Left Hand
line 9: Torso
line 10: Right Hip
line 11: Right Knee
line 12: Right Foot
line 13: Left Hip
line 14: Left Knee
line 15: Left Foot

Each file contains 15xF lines, where F is the number of frames for that sequence, and each line reports three numbers: real world coordinates (x, y, z) for realworld.txt, or screen coordinates and depth value (u, v, depth) for screen.txt.

The dataset is made of 540 sequences, for a total of about 1 hour of video captured at a resolution of 640x480 pixels at 30 fps.
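The naming convention and the 15-lines-per-frame layout described above can be sketched in Python as follows. This is a minimal illustration, not code shipped with the dataset: the names `KardName`, `parse_name`, and `parse_skeleton` are hypothetical, the filename pattern is inferred from the single example a04_s03_e02_realworld.txt, and the separator between the three numbers on each line is not stated in the description, so the sketch assumes whitespace or commas.

```python
import re
from typing import List, NamedTuple, Tuple

# Joint order within each 15-line block, as listed in the description.
JOINTS = [
    "Head", "Neck", "Right Shoulder", "Right Elbow", "Right Hand",
    "Left Shoulder", "Left Elbow", "Left Hand", "Torso",
    "Right Hip", "Right Knee", "Right Foot",
    "Left Hip", "Left Knee", "Left Foot",
]


class KardName(NamedTuple):
    """Fields encoded in a KARD filename (hypothetical helper type)."""
    action: int
    subject: int
    repetition: int
    kind: str  # e.g. "realworld.txt" or "screen.txt"


# Assumption: pattern inferred from the example a04_s03_e02_realworld.txt.
# The underscore before the suffix is optional because the exact name of
# the .mp4 files is not spelled out in the description.
_NAME_RE = re.compile(r"a(\d{2})_s(\d{2})_e(\d+)_?(.+)")


def parse_name(filename: str) -> KardName:
    """Split a filename of the form aA_sS_eN_string into its parts."""
    match = _NAME_RE.fullmatch(filename)
    if match is None:
        raise ValueError(f"not a KARD-style filename: {filename!r}")
    action, subject, repetition, kind = match.groups()
    return KardName(int(action), int(subject), int(repetition), kind)


Triple = Tuple[float, float, float]


def parse_skeleton(text: str) -> List[List[Triple]]:
    """Group the lines of a realworld.txt or screen.txt file into frames.

    Returns one list per frame; each frame holds 15 (a, b, c) triples in
    the order given by JOINTS. Assumes the three numbers on a line are
    separated by whitespace or commas.
    """
    rows: List[Triple] = []
    for line in text.splitlines():
        if not line.strip():
            continue  # tolerate blank lines
        parts = line.replace(",", " ").split()
        if len(parts) != 3:
            raise ValueError(f"expected 3 numbers per line, got: {line!r}")
        rows.append((float(parts[0]), float(parts[1]), float(parts[2])))
    if len(rows) % len(JOINTS) != 0:
        raise ValueError("number of lines is not a multiple of 15")
    step = len(JOINTS)
    return [rows[i:i + step] for i in range(0, len(rows), step)]
```

With these helpers, `parse_name("a04_s03_e02_realworld.txt")` yields action 4, subject 3, repetition 2, and `parse_skeleton(open(path).read())[f][JOINTS.index("Torso")]` would give the torso coordinates in frame `f`, provided the separator assumption holds for the actual files.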
Uncompressed frame images are also available on request.

This dataset is archived at DANS/EASY but is not accessible here. To view the list of files and access them, follow the dataset's DOI link.
year | journal | country | edition | language |
---|---|---|---|---|
2017-05-17 |