
Evaluation guidelines

The SceneFun3D training and validation sets can be used to train and evaluate models locally. For evaluation on the validation set, we provide evaluation scripts for each task in the SceneFun3D toolkit here.
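For a quick local sanity check that is independent of the official scripts, you can compare a predicted per-point mask against a ground-truth mask with a simple intersection-over-union computation. The sketch below is illustrative only; the official metrics and input formats are defined by the task-specific evaluation scripts in the toolkit.

```python
import numpy as np

def mask_iou(pred_mask: np.ndarray, gt_mask: np.ndarray) -> float:
    """Intersection-over-union between two boolean per-point masks.

    Informal sanity check only; the official benchmark metrics are
    computed by the task-specific scripts in the SceneFun3D toolkit.
    """
    pred = pred_mask.astype(bool)
    gt = gt_mask.astype(bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        return 1.0  # both masks empty: treat as a perfect match
    return float(np.logical_and(pred, gt).sum() / union)

# Example usage with random placeholder masks over a 10k-point cloud.
rng = np.random.default_rng(0)
pred = rng.random(10_000) > 0.5
gt = rng.random(10_000) > 0.5
print(f"IoU: {mask_iou(pred, gt):.3f}")
```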

Currently, the benchmark is evaluated using version 0.2.0 of the dataset.

Benchmark results are evaluated on the hidden test set, for which we do not provide ground-truth annotations. The benchmark is hosted on EvalAI and can be found here.

Before submitting to the evaluation benchmark, make sure your submission is in the correct format by following the instructions outlined for each task; otherwise, the submission will fail.
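As an informal pre-submission check, you can list the contents of your results archive and verify that every expected file is present before uploading to EvalAI. The flat `<scene_id><suffix>` layout assumed below is a placeholder for illustration; the authoritative file naming and structure are given in the per-task submission instructions.

```python
import zipfile
from pathlib import Path

def check_submission(archive_path: str, expected_ids: list[str], suffix: str = ".txt") -> None:
    """Report missing or unexpected files in a submission archive.

    The flat `<scene_id><suffix>` naming scheme is a placeholder assumption;
    follow the per-task submission instructions for the real format.
    """
    with zipfile.ZipFile(archive_path) as zf:
        names = {Path(name).name for name in zf.namelist() if not name.endswith("/")}
    expected = {f"{scene_id}{suffix}" for scene_id in expected_ids}
    missing = sorted(expected - names)
    unexpected = sorted(names - expected)
    if missing:
        print(f"Missing {len(missing)} file(s), e.g. {missing[:5]}")
    if unexpected:
        print(f"Unexpected {len(unexpected)} file(s), e.g. {unexpected[:5]}")
    if not missing and not unexpected:
        print("Archive contents match the expected file list.")

# Example (placeholder scene IDs): check_submission("submission.zip", ["420673", "420683"])
```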

On the benchmark submission page, you can submit to both the validation and test splits. Since evaluation on the validation split can also be performed locally, submitting to it serves as an optional sanity check. Only test split submissions count towards official benchmarking.

In the sections below, you can find information about evaluation and benchmark submissions, along with a description of each task: