“Machine learning - Deep learning project approach and resources”
Project approach
- Define task (Object detection, Colorization of line arts)
- Collect dataset (MS Coco, Public web sites)
- Search for academic datasets and baselines
- Build your own (From Twitter, News, Website …)
- Define the metrics
- Search for established metrics
- Clean and preprocess the data
- Select features and transform data
- One-hot vector, bag of words, bucketize
- Logarithm scale, spectrogram
- Remove noise or outliers
- Remove invalid and duplicate data
- Scaling or whiten data
- Select features and transform data
- Split datasets for training, validation and testing
- Visualize and summarize dataset
- Validate dataset
- Establish a baseline
- Implement a simple model
- Compute metrics as a baseline
- Analyze errors for area of improvements
- Select network structure
- CNN, LSTM…
- Implement a deep network
- Code debugging and validation
- Parameter initialization
- Compute metrics
- Choose hyper-parameters
- Visualize, validate and summarize result
- Analyze errors
- Refine
- Add layers and nodes
- Optimization tricks
- Hyper-parameters fine tunings
- Try other model variants
Deep learning resources
Listing of major papers: [https://github.com/kjw0612/awesome-deep-vision]
Machine learning competition: [https://www.kaggle.com/]
Research paper publication:
CVPR: IEEE Conference on Computer Vision and Pattern Recognition
ICCV: International Conference on Computer Vision
ECCV: European Conference on Computer Vision
NIPS: Neural Information Processing Systems
ICLR: International Conference on Learning Representations \
Dataset listing:
[http://www.cvpapers.com/datasets.html]
[http://riemenschneider.hayko.at/vision/dataset/]
[https://en.wikipedia.org/wiki/List_of_datasets_for_machine_learning_research]