ImageNet classification with deep convolutional neural networks

“ImageNet classification with deep convolutional neural networks” (2012) has been cited 194,363 times according to Google Scholar. CitationMap has resolved 1,061 citing papers from institutions across 59 countries.

Advances in Neural Information Processing Systems 25 (NIPS 2012) · 2012

Authors: Alex Krizhevsky (University of Toronto), Ilya Sutskever (University of Toronto), Geoffrey E. Hinton (University of Toronto)

Where this paper is cited

United States · 465
China · 282
United Kingdom · 105
Germany · 57
Canada · 54
Australia · 49
Singapore · 44
South Korea · 31

Top citing institutions

Tsinghua University (46)
Stanford University (37)
Google Research (29)
Facebook AI Research (28)
University of California, Berkeley (26)
Massachusetts Institute of Technology (26)
The Chinese University of Hong Kong (25)
Carnegie Mellon University (24)
Google (24)
UC Berkeley (23)
Nanyang Technological University (20)
Peking University (20)

Papers citing this work (1,061 resolved)

· Vikas Hassija, Vinay Chamola, Atmesh Mahapatra, Abhinandan Singal +6 more
· Muhammad Hussain
· Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian +50 more
VMamba: Visual State Space Model
Advances in Neural Information Processing Systems 37 (NeurIPS 2024) · 2024 · Yue Liu, Yunjie Tian, Yuzhong Zhao, Hongtian Yu +5 more
· Lianghui Zhu, Bencheng Liao, Qian Zhang, Xinlong Wang +2 more
· Tri Dao, Albert Gu
· Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su +11 more
· Xu Ma, Xiyang Dai, Yue Bai, Yizhou Wang +1 more
· Md Mostafijur Rahman, Mustafa Munir, Radu Marculescu
· Daliang Ouyang, Su He, Guozhong Zhang, Mingzhu Luo +3 more
· William Peebles, Saining Xie
Cross-entropy loss functions: Theoretical analysis and applications
40th International Conference on Machine Learning (ICML 2023) · 2023 · Anqi Mao, Mehryar Mohri, Yutao Zhong
Vision-language models for vision tasks: A survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) · 2024 · Jingyi Zhang, Jiaxing Huang, Sheng Jin, Shijian Lu +3 more
A Survey on Multimodal Large Language Models for Autonomous Driving
2024 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW) · 2024 · Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye +18 more
· Timothée Darcet, Maxime Oquab, Julien Mairal, Piotr Bojanowski
Understanding of Machine Learning with Deep Learning: Architectures, Workflow, Applications and Future Directions
Computers · 2023 · Mohammad Mustafa Taye
· Xinyu Liu, Yixuan Yuan, Houwen Peng, Ningxin Zheng +2 more
· Zhuang Liu, Saining Xie, Sanghyun Woo, Shoubhik Debnath +3 more
· Liyuan Wang, Xingxing Zhang, Hang Su, Jun Zhu
· Chuyi Li, Lulu Li, Hongliang Jiang, Kaiheng Weng +14 more
· Shams Forruque Ahmed, Md. Sakib Bin Alam, Maruf Hassan, Mahtabin Rodela Rozbu +8 more
· Xiyang Dai, Bin Xiao, Haiping Wu, Weijian Xu +5 more
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
7th Conference on Robot Learning (CoRL 2023) · 2023 · Li Fei-Fei, Wenlong Huang, Chen Wang, Ruohan Zhang +2 more
· Reabal Najjar
SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2023 · Jiafeng Li, Ying Wen, Lianghua He
· Tao Huang, Xiaohuan Pei, Shan You, Fei Wang +2 more
SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence
Science Advances · 2023 · Wei Fang, Yanqi Chen, Jianhao Ding, Zhaofei Yu +6 more
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025) · 2024 · Ali Hatamizadeh, Jan Kautz
· Leo Grinsztajn, Edouard Oyallon, Gael Varoquaux
A comprehensive survey on pretrained foundation models: A history from BERT to ChatGPT
International Journal of Machine Learning and Cybernetics · 2024 · Philip S. Yu, Yixin Liu, Lichao Sun, Ce Zhou +15 more
MambaOut: Do We Really Need Mamba for Vision?
CVPR · 2024 · Weihao Yu, Xinchao Wang
The GenAI is out of the bottle: generative artificial intelligence from a business model innovation perspective
Review of Managerial Science · 2024 · Dominik K. Kanbach, Louisa Heiduk, Georg Blueher, Maximilian Schreiter
A comprehensive review on ensemble deep learning: Opportunities and challenges
Journal of King Saud University - Computer and Information Sciences · 2023 · Ammar Mohammed, Rania Kora
ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis
International Conference on Learning Representations (ICLR) · 2024 · DongHao Luo, Xue Wang
Artificial intelligence for geoscience: Progress, challenges, and perspectives
The Innovation · 2024 · Tianjie Zhao, Sheng Wang, Chaojun Ouyang, Lizhe Wang +20 more
Reinforcement learning algorithms: A brief survey
Expert Systems with Applications · 2023 · AK Shakya, G Pillai, S Chakrabarty
A Survey of GPT-3 Family Large Language Models Including ChatGPT and GPT-4
Natural Language Processing Journal 6 (2024) 100048 · 2023 · Katikapalli Subramanyam Kalyan
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2023 · Tong Lu, Yu Qiao, Wenhai Wang, Jifeng Dai +8 more
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
CVPR 2024 · 2024 · Xiaohan Ding, Yiyuan Zhang, Yixiao Ge, Sijie Zhao +3 more
DataComp: In search of the next generation of multimodal datasets
Advances in Neural Information Processing Systems 36 (NeurIPS 2023) Datasets and Benchmarks Track · 2023 · Romain Beaumont, Mitchell Wortsman, Ludwig Schmidt, Ranjay Krishna +30 more
A ConvNet for the 2020s
Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 · 2022 · Christoph Feichtenhofer, Saining Xie, Zhuang Liu, Hanzi Mao +2 more
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 · 2023 · Jierun Chen, Shiu-hong Kao, Hao He, Weipeng Zhuo +4 more
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
CVPR 2024 · 2023 · Dai Shi
Advancements in Generative AI: A Comprehensive Review of GANs, GPT, Autoencoders, Diffusion Model, and Transformers
IEEE Access · 2024 · Staphord Bengesi, Hoda El-Sayed, Md Kamruzzaman Sarker, Yao Houkpati +2 more
Neuromorphic computing at scale
Nature · 2025 · Rajkumar Kubendran, Gert Cauwenberghs, Dhireesha Kudithipudi, Catherine Schuman +19 more
Masked Autoencoders Are Scalable Vision Learners
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2022 · Piotr Dollár, Ross Girshick, Kaiming He, Xinlei Chen +4 more
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
CVPR 2023 · 2023 · Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong +4 more
Optical neural networks: progress and challenges
Light: Science & Applications · 2024 · Tingzhao Fu, Jianfa Zhang, Run Sun, Yuyao Huang +4 more
CoCa: Contrastive Captioners are Image-Text Foundation Models
Transactions on Machine Learning Research (TMLR) · 2022 · Vijay Vasudevan, Yonghui Wu, Jiahui Yu, Zirui Wang +2 more
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness
ACM Transactions on… · 2024 · Fali Wang, Zhiwei Zhang, Xianren Zhang, Zongyu Wu +14 more
Empowering biomedical discovery with AI agents
Cell · 2024 · Marinka Zitnik, Shanghua Gao, Ada Fang, Yepeng Huang +9 more
A Comprehensive Review of Deep Learning: Architectures, Recent Advances, and Applications
Information · 2024 · Ibomoiye Domor Mienye, Theo G. Swart
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2023 · Xinlong Wang, Xinggang Wang, Yuxin Fang, Wen Wang +5 more
On the Opportunities and Risks of Foundation Models
arXiv · 2021 · Jure Leskovec, Christopher Ré, Li Fei-Fei, Armin W. Thomas +111 more
Cross-city matters: A multimodal remote sensing benchmark dataset for cross-city semantic segmentation using high-resolution domain adaptation networks
Remote Sensing of Environment · 2023 · Danfeng Hong, Bing Zhang, Hao Li, Yuxuan Li +6 more
A Survey on Kolmogorov-Arnold Network
ACM Computing Surveys · 2024 · Shriyank Somvanshi, Syed Aaqib Javed, Md Monzurul Islam, Diwas Pandit +1 more
Theoretical Understanding of Convolutional Neural Network: Concepts, Architectures, Applications, Future Directions
Computation · 2023 · Mohammad Mustafa Taye
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2022 · Xiangyu Zhang, Jian Sun, Guiguang Ding, Xiaohan Ding +2 more
Foundation models defining a new era in vision: a survey and outlook
IEEE Transactions on Pattern Analysis and Machine Intelligence · 2025 · Muhammad Awais, Muzammal Naseer, Salman Khan, Rao Muhammad Anwer +4 more
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
arXiv preprint · 2023 · Ilya Sutskever, Adrien Ecoffet, Leo Gao, Jan Leike +8 more
Machine Learning Methods for Small Data Challenges in Molecular Science
Chemical Reviews · 2023 · Bozheng Dou, Zailiang Zhu, Ekaterina Merkurjev, Lu Ke +6 more
A Comprehensive Overview and Comparative Analysis on Deep Learning Models: CNN, RNN, LSTM, GRU
arXiv (preprint server), and Machine Learning (journal) · 2023 · Farhad Mortezapour Shiri, Thinagaran Perumal, Norwati Mustapha, Raihani Mohamed +1 more
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Advances in Neural Information Processing Systems 34 (NeurIPS 2021) · 2021 · Ping Luo, Wenhai Wang, Anima Anandkumar, Enze Xie +2 more
Large Separable Kernel Attention: Rethinking the Large Kernel Attention design in CNN
Expert Systems With Applications · 2024 · Kin Wai Lau, Lai-Man Po, Yasar Abbas Ur Rehman, KW Lau +2 more
Data augmentation: A comprehensive survey of modern approaches
Array · 2022 · A. Mumuni, F. Mumuni
Emerging opportunities and challenges for the future of reservoir computing
Nature Communications · 2024 · Min Yan, Can Huang, Peter Bienstman, Peter Tino +2 more
Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Foundations and Trends® in Computer Graphics and Vision · 2024 · Chunyuan Li, Zhe Gan, Zhengyuan Yang, Jianwei Yang +3 more
YOLO advances to its genesis: A decadal and comprehensive review of the You Only Look Once (YOLO) series
Artificial Intelligence Review · 2025 · Ranjan Sapkota, Marco Flores-Calero, Rizwan Qureshi, Chetan Badgujar +8 more
Swin Transformer V2: Scaling Up Capacity and Resolution
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2022 · Li Dong, Zhenda Xie, Zheng Zhang, Yue Cao +8 more
Deep learning-based structural health monitoring
Automation in Construction · 2024 · Young-Jin Cha, Rahmat Ali, John Lewis, Oral Büyüköztürk
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey
arXiv · 2023 · Yunpeng Huang, Jingwei Xu, Zixu Jiang, Junyu Lai +6 more
Machine Learning: Algorithms, Real-World Applications and Research Directions
SN Computer Science · 2021 · Iqbal H. Sarker
Exploring Plain Vision Transformer Backbones for Object Detection
Computer Vision – ECCV 2022 · 2022 · Ross Girshick, Hanzi Mao, Yanghao Li, Kaiming He
MaxViT: Multi-Axis Vision Transformer
Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXIV · 2022 · Han Zhang, Zhengzhong Tu, Hossein Talebi, Feng Yang +3 more
Learning Transferable Visual Models From Natural Language Supervision
Proceedings of the 38th International Conference on Machine Learning · 2021 · Alec Radford, Jong Wook Kim, Ilya Sutskever, Pamela Mishkin +8 more
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) · 2021 · Ping Luo, Tong Lu, Ling Shao, Wenhai Wang +5 more
Advances in Medical Image Segmentation: A Comprehensive Review of Traditional, Deep Learning and Hybrid Approaches
Bioengineering (Basel) · 2024 · Yan Xu, Rixiang Quan, Weiting Xu, Yi Huang +2 more
A Study of CNN and Transfer Learning in Medical Imaging: Advantages, Challenges, Future Scope
Sustainability · 2023 · Ahmad Waleed Salehi, Shakir Khan, Gaurav Gupta, Bayan Ibrahimm Alabduallah +4 more
Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review
Journal of Medical Systems · 2024 · Satoshi Takahashi, Yusuke Sakaguchi, Nobuji Kouno, Ken Takasawa +13 more
Training data-efficient image transformers & distillation through attention
Proceedings of the 38th International Conference on Machine Learning · 2021 · Hugo Touvron, Matthieu Cord, Matthijs Douze, Francisco Massa +3 more
A comprehensive survey of loss functions and metrics in deep learning
Artificial Intelligence Review · 2025 · Juan Terven, Diana-Margarita Cordova-Esparza, Julio-Alejandro Romero-González, Alfonso Ramírez-Pedraza +1 more
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Proceedings of the 40th International Conference on Machine Learning, in Proceedings of Machine Learning Research · 2023 · Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei +9 more
Data-centric Artificial Intelligence: A Survey
ACM Computing Surveys · 2025 · Daochen Zha, Zaid Pervaiz Bhat, Kwei-Herng Lai, Fan Yang +3 more
Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions
SN Computer Science · 2021 · Iqbal H. Sarker
MLP-Mixer: An all-MLP Architecture for Vision
Advances in Neural Information Processing Systems 34 (NeurIPS 2021) · 2021 · Jakob Uszkoreit, Daniel Keysers, Neil Houlsby, Ilya O Tolstikhin +8 more
Creating Large Language Model Applications Utilizing LangChain: A Primer on Developing LLM Apps Fast
International Conference on Applied Engineering and Natural Sciences · 2023 · Oguzhan Topsakal, Tahir Cetin Akinci
HexPlane: A Fast Representation for Dynamic Scenes
CVPR 2023 · 2023 · Ang Cao, Justin Johnson
Visual attention network
Computational Visual Media · 2023 · Meng-Hao Guo, Cheng-Ze Lu, Zheng-Ning Liu, Ming-Ming Cheng +1 more
One-Step Effective Diffusion Network for Real-World Image Super-Resolution
NeurIPS 2024 · 2024 · Lei Zhang, Rongyuan Wu, Lingchen Sun, Zhiyuan Ma
Style Injection in Diffusion: A Training-Free Approach for Adapting Large-Scale Diffusion Models for Style Transfer
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2024 · Jiwoo Chung, Sangeek Hyun, Jae-Pil Heo
No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects
European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) · 2023 · Raja Sunkara, Tie Luo
A Survey on Generative Diffusion Models
IEEE Transactions on Knowledge and Data Engineering · 2024 · Hanqun Cao, Cheng Tan, Zhangyang Gao, Yilun Xu +3 more
InceptionNeXt: When Inception Meets ConvNeXt
CVPR 2024 · 2024 · Weihao Yu, Xinchao Wang, Pan Zhou, Shuicheng Yan
ViViT: A Video Vision Transformer
2021 IEEE/CVF International Conference on Computer Vision (ICCV) · 2021 · Mario Lucic, Anurag Arnab, Mostafa Dehghani, Georg Heigold +2 more
Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators
Nature Machine Intelligence · 2021 · L Lu, P Jin, G Pang, Z Zhang +1 more
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers
38th Conference on Neural Information Processing Systems (NeurIPS 2024) · 2024 · Kaiming He, Xinlei Chen, Lirui Wang, Jialiang Zhao
A Survey on Vision Transformer
IEEE Transactions on Pattern Analysis and Machine Intelligence · 2023 · Kai Han, An Xiao, Jianyuan Guo, Chunjing Xu +9 more
Transfer learning for medical image classification: a literature review.
BMC medical imaging · 2022 · Hee E Kim, Alejandro Cosa-Linan, Nandhini Santhanam, Mahboubeh Jannesari +2 more
Deep learning for medical image segmentation: State-of-the-art advancements and challenges
Informatics in Medicine Unlocked · 2024 · Md. Mohsin Kabir, Md Eshmam Rayed, S. M. Sajibul Islam, Sadia Islam Niha +2 more
Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2024 · Jiazuo Yu, Yunzhi Zhuge, Lu Zhang, Ping Hu +3 more

Showing the top 100 of 1,061 resolved citing papers.
