Table 5 VisionTool’s detection accuracy on MOCA dataset. A k-fold (k \(=\) 5) approach is used for each view point (i.e., the detectors are trained on fourfolds and the remaining one was predicted). The results reported in the table correspond to the average mAP computed across the different folds.

From: A semi-automatic toolbox for markerless effective semantic feature extraction

View point

mAP\(^{0.5}\)

mAP\(^{0.75}\)

mAP\(_{\text {index}}\)

mAP\(_{\text {little finger}}\)

mAP\(_{\text {hand}}\)

mAP\(_{\text {wrist}}\)

mAP\(_{\text {elbow}}\)

mAP

Lateral

0.969

0.905

0.865

0.845

0.889

0.958

0.988

0.909

Egocentric

0.962

0.929

0.925

0.789

0.963

0.922

0.978

0.915

Frontal

0.957

0.858

0.861

0.907

0.836

0.930

0.992

0.905

All together

0.954

0.904

0.880

0.821

0.912

0.949

0.980

0.908