1st solution for AutoDL Challenge@NeurIPS, competition rules can be found at AutoDL Competition.

1. Motivation

There exists a series of common and tough problems in the real world, such as limited resources (CPU/ memory), skewed data, hand-craft features, model selection, network architecture details tuning, sensitivity of pre-trained models, sensitivity of hyperparameters and so on. How to solve them wholly and efficiently?

2. Solution

AutoDL concentrates on developing generic algorithms for multi-label classification problems in ANY modalities: image, video, speech, text and tabular data without ANY human intervention. Ten seconds at the soonest, our solution achieved SOTA performances on all the 24 offline datasets and 15 online datasets, beating a number of top players in the world.

3. Table of Contents

1. Motivation
2. Solution
3. Table of Contents
4. Features
5. Evaluation
6. Installation
- 6.1. With pip
7. Quick Tour
8. Public Datasets
- 8.1. Optional: Download public datasets
- 8.2. Public datasets sample info
9. Usage for AutoDL local development and testing
10. Contributing
11. Contact us
12. Join the Community
13. License

4. Features

Full-AutoML/AutoDL: Fully automated Deep Learning without ANY human intervention covering the whole pipelines.
Generic & Universal: Supporting ANY modality(image, video, speech, text, tabular) data, and ANY classification problems including binary-class, multi-class and multi-label problems.
SOTA: Winner solution of AutoDL challenge, involving both tranditional machine learning models and deep learning model backbones.
Out-of-the-Box: You can use the solution out-of-the-box.
Fast: You can train your model in ten seconds at the soonest to get highly competitive performance.
Real-time: You can get the performance feedback(AUC score) in real time.

5. Evaluation

Feedback-phase leaderboard: DeepWisdom Top 1, average rank 1.2, won 4 out of 5 datasets.
Final-phase leaderboard visualization: DeepWisdom Top 1, average rank 1.2, won 7 out of 10 datasets.

6. Installation

This repo is tested on Python 3.6+, PyTorch 1.0.0+ and TensorFlow 2.0.

You should install AutoDL in a virtual environment. If you're unfamiliar with Python virtual environments, check out the user guide.

Create a virtual environment with the version of Python you're going to use and activate it.

Now, if you want to use AutoDL, you can install it with pip.

6.1. With pip

AutoDL can be installed using pip as follows:

pip install autodl-gpu

7. Quick Tour

7.1. Run local test tour

see Quick Tour - Run local test tour, see sample code examples/run_local_test.py.

7.2. Tour of Image Classification

see Quick Tour - Image Classification Demo, see sample code examples/run_image_classification_example.py

7.3. Tour of Video Classification

see Quick Tour - Video Classification Demo, see sample code examples/run_video_classification_example.py.

7.4. Tour of Speech Classification

see Quick Tour - Speech Classification Demo, see sample code examples/run_speech_classification_example.py.

7.5. Tour of Text Classification

see Quick Tour - Text Classification Demo, see sample code examples/run_text_classification_example.py.

7.6. Tour of Tabular Classification

see Quick Tour - Tabular Classification Demo, see sample code examples/run_tabular_classification_example.py.

8. Public Datasets

8.1. Optional: Download public datasets

python download_public_datasets.py

8.2. Public datasets sample info

#	Name	Type	Domain	Size	Source	Data (w/o test labels)	Test labels
1	Munster	Image	HWR	18 MB	MNIST	munster.data	munster.solution
2	City	Image	Objects	128 MB	Cifar-10	city.data	city.solution
3	Chucky	Image	Objects	128 MB	Cifar-100	chucky.data	chucky.solution
4	Pedro	Image	People	377 MB	PA-100K	pedro.data	pedro.solution
5	Decal	Image	Aerial	73 MB	NWPU VHR-10	decal.data	decal.solution
6	Hammer	Image	Medical	111 MB	Ham10000	hammer.data	hammer.solution
7	Kreatur	Video	Action	469 MB	KTH	kreatur.data	kreatur.solution
8	Kreatur3	Video	Action	588 MB	KTH	kreatur3.data	kreatur3.solution
9	Kraut	Video	Action	1.9 GB	KTH	kraut.data	kraut.solution
10	Katze	Video	Action	1.9 GB	KTH	katze.data	katze.solution
11	data01	Speech	Speaker	1.8 GB	--	data01.data	data01.solution
12	data02	Speech	Emotion	53 MB	--	data02.data	data02.solution
13	data03	Speech	Accent	1.8 GB	--	data03.data	data03.solution
14	data04	Speech	Genre	469 MB	--	data04.data	data04.solution
15	data05	Speech	Language	208 MB	--	data05.data	data05.solution
16	O1	Text	Comments	828 KB	--	O1.data	O1.solution
17	O2	Text	Emotion	25 MB	--	O2.data	O2.solution
18	O3	Text	News	88 MB	--	O3.data	O3.solution
19	O4	Text	Spam	87 MB	--	O4.data	O4.solution
20	O5	Text	News	14 MB	--	O5.data	O5.solution
21	Adult	Tabular	Census	2 MB	Adult	adult.data	adult.solution
22	Dilbert	Tabular	--	162 MB	--	dilbert.data	dilbert.solution
23	Digits	Tabular	HWR	137 MB	MNIST	digits.data	digits.solution
24	Madeline	Tabular	--	2.6 MB	--	madeline.data	madeline.solution

9. Usage for AutoDL local development and testing

Git clone the repo

cd <path_to_your_directory>
git clone https://github.com/DeepWisdom/AutoDL.git

Prepare pretrained models. Download model speech_model.h5 and put it to AutoDL_sample_code_submission/at_speech/pretrained_models/ directory.

Optional: run in the exact same environment as on the challenge platform with docker.

CPU

cd path/to/autodl/
docker run -it -v "$(pwd):/app/codalab" -p 8888:8888 evariste/autodl:cpu-latest

GPU

nvidia-docker run -it -v "$(pwd):/app/codalab" -p 8888:8888 evariste/autodl:gpu-latest

Prepare sample datasets, using the toy data in AutoDL_sample_data or download new datasets.
Run local test

python run_local_test.py

The full usage is

python run_local_test.py -dataset_dir='AutoDL_sample_data/miniciao' -code_dir='AutoDL_sample_code_submission'

Then you can view the real-time feedback with a learning curve by opening the HTML page in AutoDL_scoring_output/.

Details can be seen in AutoDL Challenge official starting_kit.

10. Contributing

Feel free to dive in! Open an issue or submit PRs.

11. Contact us

12. Join the Community

Scan QR code and join AutoDL community!

13. License

Apache License 2.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README_EN.md

README_EN.md

1. Motivation

2. Solution

3. Table of Contents

4. Features

5. Evaluation

6. Installation

6.1. With pip

7. Quick Tour

7.1. Run local test tour

7.2. Tour of Image Classification

7.3. Tour of Video Classification

7.4. Tour of Speech Classification

7.5. Tour of Text Classification

7.6. Tour of Tabular Classification

8. Public Datasets

8.1. Optional: Download public datasets

8.2. Public datasets sample info

9. Usage for AutoDL local development and testing

10. Contributing

11. Contact us

12. Join the Community

13. License

Files

README_EN.md

Latest commit

History

README_EN.md

File metadata and controls

1. Motivation

2. Solution

3. Table of Contents

4. Features

5. Evaluation

6. Installation

6.1. With pip

7. Quick Tour

7.1. Run local test tour

7.2. Tour of Image Classification

7.3. Tour of Video Classification

7.4. Tour of Speech Classification

7.5. Tour of Text Classification

7.6. Tour of Tabular Classification

8. Public Datasets

8.1. Optional: Download public datasets

8.2. Public datasets sample info

9. Usage for AutoDL local development and testing

10. Contributing

11. Contact us

12. Join the Community

13. License