Real-time sign language fingerspelling recognition using convolutional neural networks from depth map

Byeongkeun Kang, Subarna Tripathi, Truong Q. Nguyen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

98 Scopus citations

Abstract

Sign language recognition is important for natural and convenient communication between deaf community and hearing majority. We take the highly efficient initial step of automatic fingerspelling recognition system using convolutional neural networks (CNNs) from depth maps. In this work, we consider relatively larger number of classes compared with the previous literature. We train CNNs for the classification of 31 alphabets and numbers using a subset of collected depth data from multiple subjects. While using different learning configurations, such as hyper-parameter selection with and without validation, we achieve 99.99% accuracy for observed signers and 83.58% to 85.49% accuracy for new signers. The result shows that accuracy improves as we include more data from different subjects during training. The processing time is 3 ms for the prediction of a single image. To the best of our knowledge, the system achieves the highest accuracy and speed. The trained model and dataset is available on our repository1.

Original languageEnglish
Title of host publicationProceedings - 3rd IAPR Asian Conference on Pattern Recognition, ACPR 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages136-140
Number of pages5
ISBN (Electronic)9781479961009
DOIs
StatePublished - 7 Jun 2016
Event3rd IAPR Asian Conference on Pattern Recognition, ACPR 2015 - Kuala Lumpur, Malaysia
Duration: 3 Nov 20166 Nov 2016

Publication series

NameProceedings - 3rd IAPR Asian Conference on Pattern Recognition, ACPR 2015

Conference

Conference3rd IAPR Asian Conference on Pattern Recognition, ACPR 2015
Country/TerritoryMalaysia
CityKuala Lumpur
Period3/11/166/11/16

Fingerprint

Dive into the research topics of 'Real-time sign language fingerspelling recognition using convolutional neural networks from depth map'. Together they form a unique fingerprint.

Cite this