Applied Deep Learning Resources

时间:2022-08-26 23:53:21

Applied Deep Learning Resources

A collection of research articles, blog posts, slides and code snippets about deep learning in applied settings. Including trained models and simple methods that can be used out of the box. Mainly focusing on Convolutional Neural Networks (CNN) but Recurrent Neural Networks (RNN), deep Q-Networks (DQN) and other interesting architectures will also be listed.

CNN

Latest overview of the CNNs can be found from the paper "Deep learning for visual understanding: A review" [linkPDF]

Another decent overview in Nature by LeCun, Bengio and Hinton: "Deep learning" [linkPDF]

ImageNet

ImageNet is the most important image classification and localization competition. Other data sets with results can be found from here: "Discover the current state of the art in objects classification." [link].

Applied Deep Learning Resources

Prediction error of the ImageNet competition has been decreasing rapidly over the last 5 years: Applied Deep Learning Resources

Main network architectures on ImageNet

AlexNet

Original paper: "ImageNet Classification with Deep Convolutional Neural Networks" [PDF]

Properties: 8 weight layers (5 convolutional and 2 fully connected), 60 million parameters, Rectified Linear Units (ReLUs), Local Response Normalization, Dropout

Applied Deep Learning Resources

VGG

Original paper: "Very Deep Convolutional Networks for Large-Scale Image Recognition" [arxiv]

Properties: 19 weight layers, 144m parameters, 3x3 convolution filters, L2 regularised, Dropout, No Local Response Normalization

GoogLeNet

Original paper: "Going deeper with convolutions" [arxiv]

Lates upgrade to the model achieves even better scores with models and import to Torch: "Rethinking the Inception Architecture for Computer Vision" [arxiv], "Torch port of Inception V3" [github]

Properties: 22 layers, 7m parameters, Inception modules, 1x1 conv layers, ReLUs, Dropout, Mid-level outputs

Inception modules:

Applied Deep Learning Resources

ResNet

Original paper: "Deep Residual Learning for Image Recognition" [arxiv]

Very nice slides: "Deep Residual Learning" [PDF]

Github: [github]

Properties: 152 layers, ReLUs, Batch Normalization (See "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift" [arxiv]), less hacks (no dropout), more stable (different number of layers work as well) and lower complexity than VGG.

Main building block of the network:

Applied Deep Learning Resources

Features are also very good and transferable with (faster) R-CNNs (see below):

Applied Deep Learning Resources

Other architectures

  • Deep Learning for 3D shapes: "3D ShapeNets: A Deep Representation for Volumetric Shapes" [PDF]

  • Code and a model for faces: "Free and open source face recognition with deep neural networks." [github]

  • Fast neural networks which can perform arbitrary filters for images: "Deep Edge-Aware Filters" [PDF]

  • Lot's of different models in Caffe's "Model Zoo" [github]

Feature learning and object detection

  • "CNN Features off-the-shelf: an Astounding Baseline for Recognition" [arxiv]

  • First paper about R-CNN: "Rich feature hierarchies for accurate object detection and semantic segmentation" [PDFslides]

  • "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks" [arxivgithubSlides]

Applied Deep Learning Resources

  • "An Empirical Evaluation of Deep Learning on Highway Driving" [arxiv]

Applied Deep Learning Resources

  • "Object Detectors Emerge in Deep Scene CNNs" [arxiv]

Applied Deep Learning Resources

  • Faster and better features: "Efficient Deep Feature Learning and Extraction via StochasticNets" [arxiv]

Other

  • Code and models for automatic captions of images: "Deep Visual-Semantic Alignments for Generating Image Descriptions"[web posterPDFgithub]

Applied Deep Learning Resources

  • Google Deep Dream or neural networks on LSD: "Inceptionism: Going Deeper into Neural Networks" [link,deepdreamer.io/]

Deep dreaming from noise:

Applied Deep Learning Resources

Applied Deep Learning Resources

  • "Automatic Colorization" and it includes a pre-trained model [Link]

Applied Deep Learning Resources

  • "Learning visual similarity for product design with convolutional neural networks" [PDF]

Applied Deep Learning Resources

  • Using images and image descriptions to improve search results: "Images Don’t Lie: Transferring Deep Visual Semantic Features to Large-Scale Multimodal Learning to Rank" [arxiv]

  • "How Google Translate squeezes deep learning onto a phone" [post]

Applied Deep Learning Resources

  • "What a Deep Neural Network thinks about your #selfie" [blog]

Top selfies according to the ConvNet:

Applied Deep Learning Resources

  • "Recommending music on Spotify with deep learning" [github]

  • "DeepStereo: Learning to Predict New Views from the World's Imagery" [arxiv]

Applied Deep Learning Resources

  • Classifying street signs: "The power of Spatial Transformer Networks" [blog] with "Spatial Transformer Networks" [arxiv]

Applied Deep Learning Resources

  • "Pedestrian Detection with RCNN" [PDF]

DQN

  • Original paper: "Playing Atari with Deep Reinforcement Learning" [arxiv]

  • My popular science article about DQN: "Artificial General Intelligence that plays Atari video games: How did DeepMind do it?" [link]

  • DQN for RoboCup: "Deep Reinforcement Learning in Parameterized Action Space" [arxiv]

RNN

  • Original paper of the best RNN architecture: "Long short-term memory" [PDF]

  • Very good tutorial-like introduction to RNNs by Andrej Karpathy: "The Unreasonable Effectiveness of Recurrent Neural Networks" [link]

  • "Visualizing and Understanding Recurrent Networks" [arxiv]

  • "Composing Music With Recurrent Neural Networks" [blog]

Other promising or useful architectures

  • HTMs by Jeff Hawkins: "Continuous online sequence learning with an unsupervised neural network model"​ [arxiv]

  • Word2vec: "Efficient Estimation of Word Representations in Vector Space" [arxivGoogle code]

  • "Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency" [arxiv]

Framework benchmarks

  • "Comparative Study of Caffe, Neon, Theano and Torch for deep learning" [arxiv]

Their summary: From our experiments, we observe that Theano and Torch are the most easily extensible frameworks. We observe that Torch is best suited for any deep architecture on CPU, followed by Theano. It also achieves the best performance on the GPU for large convolutional and fully connected networks, followed closely by Neon. Theano achieves the best performance on GPU for training and deployment of LSTM networks. Finally Caffe is the easiest for evaluating the performance of standard deep architectures.

  • Very good qualitative analysis: zer0n/deepframeworks: [github]

  • Just performance comparison: soumith/convnet-benchmarks: [github]

  • "Deep Learning Libraries by Language" [link]

Other resources

Credits

Most of the snippets have come to my attention via internal mailing lists of Computational Neuroscience Lab at University of Tartu and London-based visual search company Dream It Get It. I am also reading a weekly newsletter by Data Elixir and checking research papers of the two main deep learning conferences: ICML and NIPS.

 

Applied Deep Learning Resources的更多相关文章

  1. (转) Deep Learning Resources

    转自:http://www.jeremydjacksonphd.com/category/deep-learning/ Deep Learning Resources Posted on May 13 ...

  2. why deep learning works

    https://medium.com/towards-data-science/deep-learning-for-object-detection-a-comprehensive-review-73 ...

  3. 深度学习阅读列表 Deep Learning Reading List

    Reading List List of reading lists and survey papers: Books Deep Learning, Yoshua Bengio, Ian Goodfe ...

  4. [C1W4] Neural Networks and Deep Learning - Deep Neural Networks

    第四周:深层神经网络(Deep Neural Networks) 深层神经网络(Deep L-layer neural network) 目前为止我们学习了只有一个单独隐藏层的神经网络的正向传播和反向 ...

  5. 论文笔记:A Review on Deep Learning Techniques Applied to Semantic Segmentation

    A Review on Deep Learning Techniques Applied to Semantic Segmentation 2018-02-22  10:38:12   1. Intr ...

  6. 机器学习(Machine Learning)&深度学习(Deep Learning)资料

    <Brief History of Machine Learning> 介绍:这是一篇介绍机器学习历史的文章,介绍很全面,从感知机.神经网络.决策树.SVM.Adaboost到随机森林.D ...

  7. 机器学习&lpar;Machine Learning&rpar;&amp&semi;深入学习&lpar;Deep Learning&rpar;资料

    <Brief History of Machine Learning> 介绍:这是一篇介绍机器学习历史的文章,介绍很全面,从感知机.神经网络.决策树.SVM.Adaboost 到随机森林. ...

  8. Machine and Deep Learning with Python

    Machine and Deep Learning with Python Education Tutorials and courses Supervised learning superstiti ...

  9. Understanding Convolution in Deep Learning

    Understanding Convolution in Deep Learning Convolution is probably the most important concept in dee ...

随机推荐

  1. JS组件系列——表格组件神器:bootstrap table(二:父子表和行列调序)

    前言:上篇 JS组件系列——表格组件神器:bootstrap table 简单介绍了下Bootstrap Table的基础用法,没想到讨论还挺热烈的.有园友在评论中提到了父子表的用法,今天就结合Boo ...

  2. Some settings of PostgreSQL

    Here are some setting recommendations about checkpoints, some values to set in postgresql.conf. A ch ...

  3. 工作中遇到的问题--使用DTO减少数据字段

    Location中包含如下字段以及AMfgObject中关于创建信息的字段,然而有时使用并不需要传输那么多数据,则对其中字段进行过滤. @Entity@Table(name = "LOCAT ...

  4. 【开源推荐】AllJoyn:打造全球物联网的通用开源框架

    摘要:随着智能设备的发展,物联网逐渐进入了人们的生活.据预测,未来几乎一切东西(超过500亿台设备)都可以互联.高通公司发布了开源项目AllJoyn,这是一个能够使连接设备间进行互操作的通用软件框架和 ...

  5. Bash 字符串处理命令

    字符串长度 str="abc" echo ${#str} 查找子串的位置 str="abc" str1=`expr index $str "a&quo ...

  6. win7设置虚拟wifi

    1. cmd,使用管理员权限打开: 2. 设置:netsh wlan set hostednetwork mode=allow ssid=ABCD key=12345678 mode:是否开启虚拟wi ...

  7. Android url中文编码问题

    最近项目遇见一个很奇葩问题,关于URL问题,项目中加载图片,图片的URL含有中文,但是,我的手机可以加载,没问题,同事也都可以,但是测试手机却不可以,加载失败,找到问题,就是URL含有中文问题. 解决 ...

  8. Android ListView 单条刷新方法实践及原理解析

    对于使用listView配合adapter进行刷新的方法大家都不陌生,先刷新adapter里的数据,然后调用notifydatasetchange通知listView刷新界面. 方法虽然简单,但这里面 ...

  9. 简单的视频采集demo

    打算做个简单的聊天软件,其中一个我没做过的,就是视频采集. 在网上查了许久资料,终于搞清楚了dshow采集视频的流程 参考资料如下: https://msdn.microsoft.com/en-us/ ...

  10. 备注: ubt 16&period;04 安装 gtx 1060 --- 成功运行 tensorflow - gpu

    ---------------------------------------------------------------------------------------------------- ...