EvalAI is an open source platform for evaluating and comparing machine learning (ML) and artificial intelligence (AI) algorithms at scale.
In recent years, it has become increasingly difficult to compare an algorithm solving a given task with other existing approaches. These comparisons suffer from minor differences in algorithm implementations, the use of non-standard dataset splits, and differing evaluation metrics. By providing a central leaderboard and submission interface, backed by swift and robust map-reduce-based evaluation backends that speed up evaluation on the fly, EvalAI aims to make it easier for researchers to reproduce results from technical papers and perform reliable and accurate quantitative analyses.
Custom evaluation protocols and phases: We allow creation of an arbitrary number of evaluation phases and dataset splits, compatibility with any programming language, and organization of results in both public and private leaderboards.
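For reference, here is a minimal sketch of what a challenge's evaluation script can look like, modeled on the EvalAI-starters template; the phase codename, split name, and metric name below are illustrative placeholders rather than anything EvalAI prescribes:

def evaluate(test_annotation_file, user_submission_file, phase_codename, **kwargs):
    # Compare the participant's predictions against the hidden annotations.
    # A real challenge would compute its own task-specific metrics here.
    with open(test_annotation_file) as gt_file, open(user_submission_file) as pred_file:
        gt = gt_file.read().splitlines()
        pred = pred_file.read().splitlines()
    accuracy = sum(g == p for g, p in zip(gt, pred)) / max(len(gt), 1)

    output = {}
    if phase_codename == "dev":  # placeholder phase codename
        # One entry per dataset split; each maps metric names to leaderboard values.
        output["result"] = [{"val_split": {"Accuracy": accuracy}}]
        output["submission_result"] = output["result"][0]["val_split"]
    return output

Because the script only returns a dictionary of results per split, organizers are free to define as many phases and splits as they need.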
Remote evaluation: Certain large-scale challenges need special compute capabilities for evaluation. In such cases, challenge organizers can add their own cluster of worker nodes to process participant submissions while we take care of hosting the challenge, handling user submissions, and maintaining the leaderboard.
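Conceptually, a remote worker only has to poll the submission queue and report scores back to EvalAI. A rough sketch of such a loop, assuming an SQS-backed queue (the queue URL and the process_submission helper below are hypothetical):

import json
import boto3

sqs = boto3.client("sqs")
QUEUE_URL = "https://sqs.us-east-1.amazonaws.com/123456789012/evalai-submissions"  # hypothetical

def process_submission(message):
    # Hypothetical helper: download the submission file referenced in the
    # message, run the challenge's evaluation code on the organizer's own
    # cluster, and post the scores back through the EvalAI REST API.
    pass

while True:
    response = sqs.receive_message(QueueUrl=QUEUE_URL, MaxNumberOfMessages=1, WaitTimeSeconds=20)
    for msg in response.get("Messages", []):
        process_submission(json.loads(msg["Body"]))
        sqs.delete_message(QueueUrl=QUEUE_URL, ReceiptHandle=msg["ReceiptHandle"])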
Evaluation inside environments: EvalAI lets participants submit their agent's code as Docker images, which are evaluated against test environments on the evaluation server. During evaluation, the worker fetches the image, the test environment, and the model snapshot, and spins up a new container to run the evaluation.
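The container hand-off can be pictured with the Docker SDK for Python; the image name, environment variable, and mount path below are placeholders, not EvalAI's actual contract:

import docker

client = docker.from_env()
image = "registry.example.com/participant-agent:latest"  # placeholder image URI
client.images.pull(image)

# Run the participant's agent against the test environment and collect its output.
container = client.containers.run(
    image,
    detach=True,
    environment={"EVAL_PHASE": "test"},
    volumes={"/data/test-env": {"bind": "/test-env", "mode": "ro"}},
)
container.wait()
logs = container.logs().decode()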
CLI support: evalai-cli brings the functionality of the EvalAI web application to your command line, making the platform more accessible and terminal-friendly.
Portability: EvalAI was designed with scalability and portability in mind from its inception. Most of its components rely on open-source technologies – Docker, Django, Node.js, and PostgreSQL.
Faster evaluation: We warm up the worker nodes at start-up by importing the challenge code and pre-loading the dataset in memory. We also split the dataset into small chunks that are evaluated simultaneously on multiple cores. These simple tricks reduce evaluation time by an order of magnitude in some cases.
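The chunked evaluation can be sketched with Python's multiprocessing module; the chunk size and the per-chunk scoring logic below are purely illustrative:

from multiprocessing import Pool

def eval_chunk(chunk):
    # Score one slice of the dataset; in EvalAI this would call the
    # challenge's own metric code. Here every item trivially scores 1.0.
    return [1.0 for _ in chunk]

def evaluate_in_parallel(dataset, chunk_size=1000, workers=8):
    chunks = [dataset[i:i + chunk_size] for i in range(0, len(dataset), chunk_size)]
    with Pool(workers) as pool:
        per_chunk = pool.map(eval_chunk, chunks)
    scores = [s for chunk_scores in per_chunk for s in chunk_scores]
    return sum(scores) / max(len(scores), 1)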
Our ultimate goal is to build a centralized platform for hosting, participating in, and collaborating on AI challenges organized around the globe, and we hope to help benchmark progress in AI.
Setting up EvalAI on your local machine is easy. You can set up EvalAI using Docker. The steps are:
Install Docker and Docker Compose on your machine.
Get the source code onto your machine via git.
git clone https://github.com/Cloud-CV/EvalAI.git evalai && cd evalai
Build and run the Docker containers. This might take a while.
docker-compose up --build
That's it. Open your web browser and go to http://127.0.0.1:8888. Three users are created by default, as listed below:
SUPERUSER - username: admin, password: password
HOST USER - username: host, password: password
PARTICIPANT USER - username: participant, password: password
If you are facing any issue during installation, please see our common errors during installation page.
If you are using EvalAI for hosting challenges, please cite the following technical report:
@article{EvalAI,
    title   = {EvalAI: Towards Better Evaluation Systems for AI Agents},
    author  = {Deshraj Yadav and Rishabh Jain and Harsh Agrawal and Prithvijit
               Chattopadhyay and Taranjeet Singh and Akash Jain and Shiv Baran
               Singh and Stefan Lee and Dhruv Batra},
    journal = {arXiv preprint arXiv:1902.03570},
    year    = {2019}
}
EvalAI is currently maintained by Rishabh Jain, Deshraj Yadav, and Ram Ramrakhya. A non-exhaustive list of other major contributors includes: Akash Jain, Taranjeet Singh, Shiv Baran Singh, Harsh Agrawal, Prithvijit Chattopadhyay, Devi Parikh, and Dhruv Batra.
If you are interested in contributing to EvalAI, follow our contribution guidelines.
Requirements analysis: As a full-featured AI competition platform, EvalAI lets organizers create challenges. The challenge definition standard is declared in EvalAI-starters; organizers only need to modify it according to the documentation and upload it. Two upload paths are provided: one is to create a fork on GitHub and make the changes there, which the server then pulls; the other is to run run.sh, which packages the locally modified configuration files into a single compressed archive that is then uploaded. Question: how do we parse this archive? Problem analysis: To create a challenge from a compressed archive, we need to define in advance
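A minimal sketch of reading such an archive, assuming the EvalAI-starters layout in which the challenge definition sits at the archive root as challenge_config.yaml (the function name below is hypothetical):

import zipfile
import yaml

def parse_challenge_zip(zip_path):
    with zipfile.ZipFile(zip_path) as archive:
        # Read the challenge definition declared by the organizer.
        config = yaml.safe_load(archive.read("challenge_config.yaml"))
        # The remaining members (evaluation script, annotation files, logo, ...)
        # can be extracted to a working directory for later use.
        members = archive.namelist()
    return config, members

The parsed configuration can then be used to create the challenge, its phases, and its dataset splits in the database.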
Requirement: If no name is set when a file is uploaded, it is stored under a default name; when names collide on repeated saves, _1, _2, and so on are appended to the original name to avoid duplicates. How to link the stored file name to the corresponding record in the database is therefore the question. Analysis: The file field is a FileField, whose specific parameters are upload_to and storage. upload_to determines where the file is saved; for example, when upload_to="uploads/", the actual path is settings.MEDIA_ROOT/uploads.
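A small Django sketch of the idea (the model and helper names are hypothetical): after save(), the FileField records the name that was actually stored on the instance, so even when a _1 or _2 suffix is appended to avoid a clash, the database row still points at the right file:

from django.db import models

def submission_file_path(instance, filename):
    # upload_to may be a callable; the returned path is joined onto MEDIA_ROOT.
    return "submission_files/challenge_{}/{}".format(instance.challenge_id, filename)

class Submission(models.Model):  # hypothetical model for illustration
    challenge_id = models.IntegerField()
    input_file = models.FileField(upload_to=submission_file_path)

# submission = Submission.objects.create(challenge_id=1, input_file=uploaded_file)
# submission.input_file.name  ->  e.g. "submission_files/challenge_1/results_1.json"
# i.e. the de-duplicated storage name is kept on the database record itself.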