Chicago Fingerspelling in the Wild Data Sets (ChicagoFSWild, ChicagoFSWild+)
This is the home of a collaborative data collection effort by U. Chicago and TTI-Chicago researchers. To our knowledge, this is the first collection of American Sign Language (ASL) fingerspelling data "in the wild," that is, in naturally occurring (online) video. The collection consists of two data set releases, ChicagoFSWild and ChicagoFSWild+.
The ChicagoFSWild data set contains 7,304 ASL fingerspelling sequences signed by 160 signers, carefully annotated by students who have studied ASL. ChicagoFSWild+ contains 55,232 fingerspelling sequences signed by 260 signers.
@article{fs18slt,
author = {B. Shi and A. Martinez Del Rio and J. Keane and J. Michaux and D. Brentari and G. Shakhnarovich and K. Livescu},
title = {American Sign Language fingerspelling recognition in the wild},
journal = {SLT},
year = {2018},
month = {December}
}
@article{fs18iccv,
author = {B. Shi and A. Martinez Del Rio and J. Keane and D. Brentari and G. Shakhnarovich and K. Livescu},
title = {Fingerspelling recognition in the wild with iterative visual attention},
journal = {ICCV},
year = {2019},
month = {October}
}
SLT'18 paper | ICCV'19 paper
You can download the data sets here:
Files are structured as follows:
ChicagoFSWild.csv
- This is the main data description file. Each line corresponds to a single fingerspelling sequence.
  filename
  - name of the fingerspelling sequence
  url
  - URL of the video from which the sequence was obtained
  start_time
  - start time of the sequence in that video, in the format HH:MM:SS.xxx
  number_of_frames
  - number of frames of the fingerspelling sequence
  width
  - frame width
  height
  - frame height
  label_raw
  - raw labels from the annotators
  label_notes
  - annotator notes
  label_proc
  - processed labels, used for training and testing
  partition
  - partition (train/dev/test) the sequence belongs to
  signer
  - signer identity for this sequence
ChicagoFSWild-Frames.tgz
- This file contains sequences of image frames (in .jpg), identified by filename in ChicagoFSWild.csv.
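As an illustration of how the fields of ChicagoFSWild.csv might be consumed, the following is a minimal Python sketch, not part of the release. It assumes the file is comma-separated, that the columns appear in the order listed above, and that a header row is present (set has_header=False if it is not); none of these details are confirmed here. The helper names (parse_start_time, parse_row, read_records) and the sample row are hypothetical.

```python
import csv
import io

# Field order as listed in the description of ChicagoFSWild.csv.
# Assumption: the real file uses this column order.
FIELDS = (
    "filename", "url", "start_time", "number_of_frames", "width", "height",
    "label_raw", "label_notes", "label_proc", "partition", "signer",
)
INT_FIELDS = ("number_of_frames", "width", "height")


def parse_start_time(text):
    """Convert an HH:MM:SS.xxx string to seconds as a float."""
    hours, minutes, seconds = text.strip().split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)


def parse_row(values):
    """Map one CSV row (a list of strings) onto the documented field names."""
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    record = dict(zip(FIELDS, values))
    for name in INT_FIELDS:
        record[name] = int(record[name])
    # Extra convenience key, not a column of the original file.
    record["start_seconds"] = parse_start_time(record["start_time"])
    return record


def read_records(handle, has_header=True):
    """Yield parsed records from an open CSV file.

    Assumption: whether the real file has a header row is not known here.
    """
    reader = csv.reader(handle)
    if has_header:
        next(reader, None)  # skip the header row
    for values in reader:
        if values:  # ignore blank lines
            yield parse_row(values)


# Hypothetical sample row, not taken from the real data set.
SAMPLE = (
    "filename,url,start_time,number_of_frames,width,height,"
    "label_raw,label_notes,label_proc,partition,signer\n"
    "example/seq_0001,https://example.com/video,00:01:02.500,42,640,360,"
    "A-S-L,,asl,train,0\n"
)

records = list(read_records(io.StringIO(SAMPLE)))
train = [r for r in records if r["partition"] == "train"]
print(train[0]["label_proc"], train[0]["start_seconds"])
```

To read the actual file, pass an open file handle instead of the in-memory sample, for example `open("ChicagoFSWild.csv", newline="")`.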
annotation_instructions.txt
- This text file provides the instructions used by the annotators, which define the conventions used for the raw labels. It is provided for completeness; to reproduce our results, only the label_proc field in the CSV file is needed.
HandAnnotation.csv
- Annotations of hand bounding boxes for a subset of the fingerspelling sequences in ChicagoFSWild
  filename
  - name of the fingerspelling sequence
  partition
  - partition (train/dev) the sequence belongs to, used to train and tune the hand detector
BBox
- A folder of hand bounding boxes
  F/X.txt
  - hand bounding boxes in the frame indexed by X of the fingerspelling sequence F
    x0, y0, x1, y1, L
    - top left corner (x0, y0), bottom right corner (x1, y1), L=1: signing hand(s), L=2: non-signing hand(s)

These data sets are being released publicly in the interest of improving digital interfaces for signers, communication between signers and non-signers, linguistic understanding of American Sign Language, and computer vision research. If you see any of your own videos here and have any concerns with them being included, please contact the Principal Investigators: Karen Livescu (TTI-Chicago), Diane Brentari (U. Chicago), and Greg Shakhnarovich (TTI-Chicago).
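For readers working with the hand annotations, the per-frame bounding-box format described above (x0, y0, x1, y1, L in each F/X.txt file) could be parsed along the following lines. This is a hedged Python sketch, not part of the release: it assumes one box per line, comma-separated numeric values, and an integer label. The HandBox class, the function names, and the sample text are hypothetical.

```python
from dataclasses import dataclass

# Label meanings as given in the BBox description.
LABEL_NAMES = {1: "signing", 2: "non-signing"}


@dataclass
class HandBox:
    x0: float  # top left corner, x
    y0: float  # top left corner, y
    x1: float  # bottom right corner, x
    y1: float  # bottom right corner, y
    label: int  # 1 = signing hand(s), 2 = non-signing hand(s)

    @property
    def width(self):
        return self.x1 - self.x0

    @property
    def height(self):
        return self.y1 - self.y0


def parse_bbox_line(line):
    """Parse one 'x0, y0, x1, y1, L' line into a HandBox.

    Assumption: values are comma-separated and L is an integer.
    """
    parts = [p.strip() for p in line.split(",")]
    if len(parts) != 5:
        raise ValueError(f"expected 5 values, got {len(parts)}: {line!r}")
    x0, y0, x1, y1 = (float(p) for p in parts[:4])
    label = int(parts[4])
    if label not in LABEL_NAMES:
        raise ValueError(f"unknown hand label {label}")
    return HandBox(x0, y0, x1, y1, label)


def parse_bbox_text(text):
    """Parse the contents of one F/X.txt file (assumed one box per line)."""
    return [parse_bbox_line(line) for line in text.splitlines() if line.strip()]


# Hypothetical file contents, not taken from the real data set.
SAMPLE = "10, 20, 110, 140, 1\n200, 30, 260, 120, 2\n"

boxes = parse_bbox_text(SAMPLE)
signing = [b for b in boxes if b.label == 1]
for box in boxes:
    print(LABEL_NAMES[box.label], box.width, box.height)
```

Keeping the label as an integer and mapping it to a name only for display keeps the parsed values close to the on-disk format, which should make it easier to adjust if the real files differ from these assumptions.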