lavila / datasets /README.md
nateraw's picture
Upload . with huggingface_hub
39d5658
# Preparing datasets for LAVILA
Please download the (selected) datasets from the official websites and place or sim-link them under `$LAVILA_ROOT/datasets/`.
```bash
$LAVILA_ROOT/datasets/
CharadesEgo/
EGTEA/
EK100/
Ego4D/
```
## Ego4D
1. Download [Ego4D videos](https://ego4d-data.org/docs/start-here/#download-data) (license is required).
2. Preprocess(TBA)
3. Download annotations
a. Download [egomcq.json](https://drive.google.com/file/d/1-5iRYf4BCHmj4MYQYFRMY4bhsWJUN3rW/view) to `$LAVILA_ROOT/datasets/Ego4D` (if you want to evaluate EgoMCQ).
b. Download [metadata for train split](https://dl.fbaipublicfiles.com/lavila/metadata/ego4d/ego4d_train.pkl) and [val split](https://dl.fbaipublicfiles.com/lavila/metadata/ego4d/ego4d_val.pkl) to `$LAVILA_ROOT/datasets/Ego4D` ((if you want to train LAVILA from scratch).
The fold should look like this:
```bash
$LAVILA_ROOT/datasets/
Ego4D/
ego4d_train.pkl
ego4d_val.pkl
egomcq.json
video_288px/
000786a7-3f9d-4fe6-bfb3-045b368f7d44.mp4/
0.mp4
300.mp4
000a3525-6c98-4650-aaab-be7d2c7b9402.mp4/
0.mp4
...
```
## EPIC-Kitchens-100 (EK-100)
1. Download annotations
```bash
# Assume that you are under `datasets/EK100/`
git clone https://github.com/epic-kitchens/epic-kitchens-100-annotations
```
2. Download videos.
a. For raw videos, please download them from [https://epic-kitchens.github.io/](https://epic-kitchens.github.io/).
b. (Recommended) The raw videos are huge (~1 TB). As an alternative, please check out a [resized version]().
3. (For EK-100 MIR)
a. Generate the relevancy matrix of train/val splits using [the official code](https://github.com/mwray/Joint-Part-of-Speech-Embeddings).
b. (Recommended) The generated result has some randomness. Therefore, we also provide the [replica of train split](https://dl.fbaipublicfiles.com/lavila/metadata/EK100/caption_relevancy_EPIC_100_retrieval_train.pkl) and [val split](https://dl.fbaipublicfiles.com/lavila/metadata/EK100/caption_relevancy_EPIC_100_retrieval_test.pkl). Please put them to the folder `$LAVILA_ROOT/datasets/EK100/epic-kitchens-100-annotations/retrieval_annotations/relevancy/`.
The folder should look like this:
```bash
$LAVILA_ROOT/datasets/
EK100/
epic-kitchens-100-annotations/
EPIC_100_train.csv
EPIC_100_validation.csv
...
retrieval_annotations/relevancy/ # this appears if you do 3.
caption_relevancy_EPIC_100_retrieval_train.pkl
caption_relevancy_EPIC_100_retrieval_test.pkl
video_ht256px/
P01/
P01_01.MP4
P01_02.MP4
...
P01_19.MP4
P02/
P02_01.MP4
P02_02.MP4
...
P02_15.MP4
...
```
## CharadesEgo
1. Download annotations at [https://prior.allenai.org/projects/charades-ego](https://prior.allenai.org/projects/charades-ego).
```bash
### Annotations
# Assume that you are under `datasets/CharadesEgo/`
wget https://ai2-public-datasets.s3-us-west-2.amazonaws.com/charades/CharadesEgo.zip
unzip CharadesEgo.zip && rm CharadesEgo.zip
```
2. Download data (~11GB) at [https://prior.allenai.org/projects/charades-ego](https://prior.allenai.org/projects/charades-ego).
```bash
### Data
wget https://ai2-public-datasets.s3-us-west-2.amazonaws.com/charades/CharadesEgo_v1_480.tar
tar -xvf CharadesEgo_v1_480.tar # Or specify an external path using `-C` and sim-link it to here
rm CharadesEgo_v1_480.tar
```
3. (For fine-tuning CharadesEgo) Download two additional metadata files: [clip-level metadata (train)](https://dl.fbaipublicfiles.com/lavila/metadata/CharadesEgo/metadata_filtered_train.pkl) and [clip-level metadata (val)](https://dl.fbaipublicfiles.com/lavila/metadata/CharadesEgo/metadata_filtered_val.pkl). Put them to the folder `$LAVILA_ROOT/datasets/CharadesEgo/CharadesEgo/`.
The folder should look like this:
```bash
$LAVILA_ROOT/datasets/
CharadesEgo/
CharadesEgo/
CharadesEgo_v1_train_only1st.csv
CharadesEgo_v1_test_only1st.csv
...
metadata_filtered_train.pkl # this appears if you do 3.
metadata_filtered_val.pkl # this appears if you do 3.
CharadesEgo_v1_480/
005BU.mp4
005BUEGO.mp4
...
```
## EGTEA
1. Visit [https://cbs.ic.gatech.edu/fpv/](https://cbs.ic.gatech.edu/fpv/).
2. Download `TRIMMED_ACTION_CLIPS` (~20GB) and `ACTION_ANNOTATIONS` and untar to the current folder `$LAVILA_ROOT/datasets/EGTEA`.
```bash
unzip action_annotation.zip -d EGTEA/ && rm action_annotation.zip
```
The folder should look like this:
```bash
$LAVILA_ROOT/datasets/
EGTEA/
train_split1.txt
test_split1.txt
cropped_clips/
OP01-R01-PastaSalad/
OP01-R01-PastaSalad-1002316-1004005-F024051-F024101.mp4
OP01-R01-PastaSalad-1004110-1021110-F024057-F024548.mp4
OP01-R01-PastaSalad-1022590-1024050-F024539-F024581.mp4
...
OP01-R02-TurkeySandwich/
OP01-R02-TurkeySandwich-102320-105110-F002449-F002529.mp4
OP01-R02-TurkeySandwich-105440-106460-F002528-F002558.mp4
OP01-R02-TurkeySandwich-107332-133184-F002513-F003259.mp4
...
...
```