STIP Samples

STIP Samples nbsp; STIP Samples

Dataset Details

Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction, IEEE RA-L & ICRA 2020 (Arxiv)
STIP includes over 900 hours of driving scene videos of front, right, and left cameras, while the vehicle was driving in dense areas of five cities in the United States. The videos were annotated at 2fps with pedestrian bounding boxes and labels of crossing/not-crossing the street, which are respectively shown with green/red boxes in the above videos. We used the JRMOT (JackRabbot real-time Multi-Object Tracker) platform to track the pedestrian and interpolate the annotations for all 20 frame per second. See https://github.com/StanfordVL/STIP/ for the details.

Bibtex

@inproceeding{liu2020spatiotemporal,
   title={Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction},
   author={Bingbin Liu and Ehsan Adeli and Zhangjie Cao and Kuan-Hui Lee and Abhijeet Shenoi and Adrien Gaidon and Juan Carlos Niebles},
   year={2020},
   booktitle={IEEE Robotics and Automation Letters (IEEE RA-L) and International Conference on Robotics and Automation (ICRA)},
   publisher={IEEE}
}

Team

Request STIP Dataset

The dataset access will only be granted to universities and non-profit research organizations, not individuals or commercial entities. You will need to acknowledge that you have carefully read the Data Use Agreement (DUA) and confirm that your organization accepts the terms of the template DUA. An institutional signatory will be required. Submit your request using the following form.