Skip to main content

ImageNet classification with deep convolutional neural networks

ImageNet classification with deep convolutional neural networks (2012) has been cited 194,363 times according to Google Scholar. CitationMap has resolved 1,061 citing papers from institutions across 62 countries.

Advances in Neural Information Processing Systems 25 (NIPS 2012)2012View paper

Authors: Alex Krizhevsky (University of Toronto), Ilya Sutskever (University of Toronto), Geoffrey E. Hinton (University of Toronto)

See Geoffrey Hinton's full citation map →

Where this paper is cited

United States · 469China · 278United Kingdom · 106Germany · 56Canada · 53Australia · 49Singapore · 45South Korea · 31

Top citing institutions

  • Tsinghua University (47)
  • Stanford University (38)
  • Google Research (29)
  • University of California, Berkeley (28)
  • Massachusetts Institute of Technology (27)
  • Facebook AI Research (27)
  • Google (26)
  • The Chinese University of Hong Kong (25)
  • UC Berkeley (25)
  • Carnegie Mellon University (24)
  • Nanyang Technological University (20)
  • Peking University (20)

Papers citing this work (1,061 resolved)

  1. · Vikas Hassija, Amir Hussain, Simone Scardapane, Kaizhu Huang +151 more

  2. · Muhammad Hussain

  3. · Aston Zhang, Arun Rao, Alan Schelten, Bhargav Kanagal +50 more

  4. VMamba: Visual State Space Model

    Advances in Neural Information Processing Systems 37 (NeurIPS 2024) · 2024 · Jianbin Jiao, Yunfan Liu, Yue Liu, Yunfan Liu +22 more

  5. · Bencheng Liao, Qian Zhang, Xinggang Wang, Xinlong Wang +8 more

  6. · Albert Gu, Tri Dao

  7. · Lewei Lu, Wenhai Wang, Muyan Zhong, Ping Luo +19 more

  8. · Yue Bai, Yizhou Wang, Xiyang Dai, Xu Ma +2 more

  9. · Radu Marculescu, Md Mostafijur Rahman, Mustafa Munir

  10. · Daliang Ouyang, Su He, Guozhong Zhang, Mingzhu Luo +3 more

  11. · William Peebles, Saining Xie, William Peebles, Saining Xie +104 more

  12. Cross-entropy loss functions: Theoretical analysis and applications

    40th International Conference on Machine Learning (ICML 2023) · 2023 · Mehryar Mohri, Anqi Mao, Mehryar Mohri, Yutao Zhong +3 more

  13. Vision-language models for vision tasks: A survey

    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) · 2024 · Shijian Lu, Jiaxing Huang, Jingyi Zhang, Sheng Jin +73 more

  14. A Survey on Multimodal Large Language Models for Autonomous Driving

    2024 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW) · 2024 · Shuqi Mei, Zhipeng Cao, Xinrui Yan, Yang Zhou +33 more

  15. · Timothée Darcet, Maxime Oquab, Julien Mairal, Piotr Bojanowski

  16. Understanding of Machine Learning with Deep Learning: Architectures, Workflow, Applications and Future Directions

    Computers · 2023 · Mohammad Mustafa Taye

  17. · Xinyu Liu, Houwen Peng, Ningxin Zheng, Yuqing Yang +2 more

  18. · Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen +3 more

  19. · Jun Zhu, Liyuan Wang, Hang Su, Xingxing Zhang

  20. · Chuyi Li, Lulu Li, Hongliang Jiang, Kaiheng Weng +14 more

  21. · Shams Forruque Ahmed, Md. Sakib Bin Alam, Maruf Hassan, Mahtabin Rodela Rozbu +83 more

  22. · Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai +5 more

  23. VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

    7th Conference on Robot Learning (CoRL 2023) · 2023 · Jiajun Wu, Yunzhu Li, Ruohan Zhang, Wenlong Huang +55 more

  24. · Reabal Najjar

  25. SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy

    2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2023 · Ying Wen, Lianghua He, Jiafeng Li, Jiafeng Li +155 more

  26. · Tao Huang, Xiaohuan Pei, Shan You, Fei Wang +332 more

  27. SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence

    Science Advances · 2023 · Wei Fang, Yonghong Tian, Ding Chen, Huihui Zhou +19 more

  28. MambaVision: A Hybrid Mamba-Transformer Vision Backbone

    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025) · 2024 · Jan Kautz, Ali Hatamizadeh

  29. · Leo Grinsztajn, Edouard Oyallon, Gael Varoquaux, Leo Grinsztajn +2 more

  30. A comprehensive survey on pretrained foundation models: A history from BERT to ChatGPT

    International Journal of Machine Learning and Cybernetics · 2024 · Jun Yu, Guangjing Wang, Jia Wu, Lichao Sun +35 more

  31. MambaOut: Do We Really Need Mamba for Vision?

    CVPR · 2024 · Weihao Yu, Xinchao Wang

  32. The GenAI is out of the bottle: generative artificial intelligence from a business model innovation perspective

    Review of Managerial Science · 2024 · Dominik K. Kanbach, Louisa Heiduk, Georg Blueher, Maximilian Schreiter

  33. A comprehensive review on ensemble deep learning: Opportunities and challenges

    Journal of King Saud University - Computer and Information Sciences · 2023 · Ammar Mohammed, Rania Kora

  34. ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis

    International Conference on Learning Representations (ICLR) · 2024 · DongHao Luo, Xue Wang, DongHao Luo, Xue Wang +50 more

  35. Artificial intelligence for geoscience: Progress, challenges, and perspectives

    The Innovation · 2024 · Chaojun Ouyang, Tianjie Zhao, Lizhe Wang, Sheng Wang +21 more

  36. Reinforcement learning algorithms: A brief survey

    Expert Systems with Applications · 2023 · AK Shakya, G Pillai, S Chakrabarty

  37. A Survey of GPT-3 Family Large Language Models Including ChatGPT and GPT-4

    Natural Language Processing Journal 6 (2024) 100048 · 2023 · Katikapalli Subramanyam Kalyan

  38. InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

    2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2023 · Zhiqi Li, Xiaowei Hu, Yu Qiao, Wenhai Wang +18 more

  39. UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

    CVPR 2024 · 2024 · Lin Song, Xiaohan Ding, Sijie Zhao, Ying Shan +63 more

  40. DataComp: In search of the next generation of multimodal datasets

    Advances in Neural Information Processing Systems 36 (NeurIPS 2023) Datasets and Benchmarks Track · 2023 · Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase +30 more

  41. A ConvNet for the 2020s

    Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 · 2022 · Hanzi Mao, Trevor Darrell, Saining Xie, Zhuang Liu +8 more

  42. Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

    Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 · 2023 · Shueng-Han Gary Chan, Weipeng Zhuo, Jierun Chen, Hao He +11 more

  43. TransNeXt: Robust Foveal Visual Perception for Vision Transformers

    CVPR 2024 · 2023 · Dai Shi, Dai Shi, Dai Shi, Dai Shi +53 more

  44. Advancements in Generative AI: A Comprehensive Review of GANs, GPT, Autoencoders, Diffusion Model, and Transformers

    IEEE Access · 2024 · Staphord Bengesi, Hoda El-Sayed, Md Kamruzzaman Sarker, Yao Houkpati +2 more

  45. Neuromorphic computing at scale

    Nature · 2025 · Dhireesha Kudithipudi, Catherine Schuman, Craig M. Vineyard, Tej Pandit +19 more

  46. Masked Autoencoders Are Scalable Vision Learners

    2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2022 · Kaiming He, Yanghao Li, Saining Xie, Xinlei Chen +2 more

  47. VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

    CVPR 2023 · 2023 · Yinan He, Limin Wang, Yi Wang, Bingkun Huang +12 more

  48. Optical neural networks: progress and challenges

    Light: Science & Applications · 2024 · Tingzhao Fu, Jianfa Zhang, Run Sun, Yuyao Huang +304 more

  49. CoCa: Contrastive Captioners are Image-Text Foundation Models

    Transactions on Machine Learning Research (TMLR) · 2022 · Jiahui Yu, Zirui Wang, Vijay Vasudevan, Legg Yeung +2 more

  50. A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness

    ACM Transactions on… · 2024 · Qiuhao Lu, Xianfeng Tang, Fali Wang, Tzuhao Mo +208 more

  51. Empowering biomedical discovery with AI agents

    Cell · 2024 · Shanghua Gao, Valentina Giunchiglia, Marinka Zitnik, Ayush Noori +15 more

  52. A Comprehensive Review of Deep Learning: Architectures, Recent Advances, and Applications

    Information · 2024 · Ibomoiye Domor Mienye, Theo G. Swart

  53. EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

    2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2023 · Yuxin Fang, Wen Wang, Binhui Xie, Quan Sun +5 more

  54. On the Opportunities and Risks of Foundation Models

    arXiv · 2021 · Ananya Kumar, Zanele Munyikwa, Isabel Papadimitriou, Mark Krass +224 more

  55. Cross-city matters: A multimodal remote sensing benchmark dataset for cross-city semantic segmentation using high-resolution domain adaptation networks

    Remote Sensing of Environment · 2023 · Danfeng Hong, Bing Zhang, Hao Li, Yuxuan Li +296 more

  56. A Survey on Kolmogorov-Arnold Network

    ACM Computing Surveys · 2024 · Shriyank Somvanshi, Syed Aaqib Javed, Md Monzurul Islam, Diwas Pandit +1 more

  57. Theoretical Understanding of Convolutional Neural Network: Concepts, Architectures, Applications, Future Directions

    Computation · 2023 · Mohammad Mustafa Taye

  58. Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

    The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2022 · Xiaohan Ding, Xiangyu Zhang, Yizhuang Zhou, Jungong Han +2 more

  59. Foundation models defining a new era in vision: a survey and outlook

    IEEE Transactions on Pattern Analysis and Machine Intelligence · 2025 · Muhammad Awais, Muzammal Naseer, Salman Khan, Rao Muhammad Anwer +304 more

  60. Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

    arXiv preprint · 2023 · Collin Burns, Pavel Izmailov, Jan Hendrik Kirchner, Bowen Baker +8 more

  61. Machine Learning Methods for Small Data Challenges in Molecular Science

    Chemical Reviews · 2023 · Zailiang Zhu, Guo-Wei Wei, Bengong Zhang, Jie Liu +9 more

  62. A Comprehensive Overview and Comparative Analysis on Deep Learning Models: CNN, RNN, LSTM, GRU

    arXiv (preprint server), and Machine Learning (journal) · 2023 · Thinagaran Perumal, Farhad Mortezapour Shiri, Norwati Mustapha, Raihani Mohamed +135 more

  63. SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

    Advances in Neural Information Processing Systems 34 (NeurIPS 2021) · 2021 · Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar +2 more

  64. Large Separable Kernel Attention: Rethinking the Large Kernel Attention design in CNN

    Expert Systems With Applications · 2024 · Lai-Man Po, Yasar Abbas Ur Rehman, Kin Wai Lau, KW Lau +2 more

  65. Data augmentation: A comprehensive survey of modern approaches

    Array · 2022 · A. Mumuni, F. Mumuni

  66. Emerging opportunities and challenges for the future of reservoir computing

    Nature Communications · 2024 · Peter Tino, Can Huang, Peter Bienstman, Min Yan +6 more

  67. Multimodal Foundation Models: From Specialists to General-Purpose Assistants

    Foundations and Trends® in Computer Graphics and Vision · 2024 · Lijuan Wang, Jianwei Yang, Chunyuan Li, Linjie Li +3 more

  68. YOLO advances to its genesis: A decadal and comprehensive review of the You Only Look Once (YOLO) series

    Artificial Intelligence Review · 2025 · Ranjan Sapkota, Marco Flores-Calero, Rizwan Qureshi, Chetan Badgujar +8 more

  69. Swin Transformer V2: Scaling Up Capacity and Resolution

    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2022 · Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao +8 more

  70. Deep learning-based structural health monitoring

    Automation in Construction · 2024 · Rahmat Ali, John Lewis, Young-Jin Cha, Rahmat Ali +144 more

  71. Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey

    arXiv · 2023 · Yunpeng Huang, Jingwei Xu, Zixu Jiang, Junyu Lai +606 more

  72. Machine Learning: Algorithms, Real-World Applications and Research Directions

    SN Computer Science · 2021 · Iqbal H. Sarker, Iqbal H. Sarker, Iqbal H. Sarker, Iqbal H. Sarker +2 more

  73. Exploring Plain Vision Transformer Backbones for Object Detection

    Computer Vision – ECCV 2022 · 2022 · Yanghao Li, Hanzi Mao, Ross Girshick, Kaiming He

  74. MaxViT: Multi-Axis Vision Transformer

    Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXIV · 2022 · Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang +3 more

  75. Learning Transferable Visual Models From Natural Language Supervision

    Proceedings of the 38th International Conference on Machine Learning · 2021 · Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh +8 more

  76. Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions

    Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) · 2021 · Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan +5 more

  77. Advances in Medical Image Segmentation: A Comprehensive Review of Traditional, Deep Learning and Hybrid Approaches

    Bioengineering (Basel) · 2024 · Yan Xu, Rixiang Quan, Weiting Xu, Yi Huang +2 more

  78. A Study of CNN and Transfer Learning in Medical Imaging: Advantages, Challenges, Future Scope

    Sustainability · 2023 · Ahmad Waleed Salehi, Shakir Khan, Gaurav Gupta, Bayan Ibrahimm Alabduallah +410 more

  79. Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review

    Journal of Medical Systems · 2024 · Satoshi Takahashi, Yusuke Sakaguchi, Nobuji Kouno, Ken Takasawa +13 more

  80. Training data-efficient image transformers & distillation through attention

    Proceedings of the 38th International Conference on Machine Learning · 2021 · Hugo Touvron, Matthieu Cord, Matthijs Douze, Francisco Massa +7 more

  81. A comprehensive survey of loss functions and metrics in deep learning

    Artificial Intelligence Review · 2025 · Juan Terven, Diana-Margarita Cordova-Esparza, Julio-Alejandro Romero-González, Alfonso Ramírez-Pedraza +1 more

  82. Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

    Proceedings of the 40th International Conference on Machine Learning, in Proceedings of Machine Learning Research · 2023 · Vaibhav Aggarwal, Haoqi Fan, Daniel Bolya, Christoph Feichtenhofer +634 more

  83. Data-centric Artificial Intelligence: A Survey

    ACM Computing Surveys · 2025 · Daochen Zha, Zaid Pervaiz Bhat, Kwei-Herng Lai, Fan Yang +3 more

  84. Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions

    SN Computer Science · 2021 · Iqbal H. Sarker, Iqbal H. Sarker

  85. MLP-Mixer: An all-MLP Architecture for Vision

    Advances in Neural Information Processing Systems 34 (NeurIPS 2021) · 2021 · Ilya O Tolstikhin, Neil Houlsby, Alexander Kolesnikov, Lucas Beyer +8 more

  86. Creating Large Language Model Applications Utilizing LangChain: A Primer on Developing LLM Apps Fast

    International Conference on Applied Engineering and Natural Sciences · 2023 · Oguzhan Topsakal, Tahir Cetin Akinci

  87. HexPlane: A Fast Representation for Dynamic Scenes

    CVPR 2023 · 2023 · Ang Cao, Justin Johnson

  88. Visual attention network

    Computational Visual Media · 2023 · Cheng-Ze Lu, Ming-Ming Cheng, Meng-Hao Guo, Shi-Min Hu +2 more

  89. One-Step Effective Diffusion Network for Real-World Image Super-Resolution

    NeurIPS 2024 · 2024 · Rongyuan Wu, Lingchen Sun, Zhiyuan Ma, Lei Zhang

  90. Style Injection in Diffusion: A Training-Free Approach for Adapting Large-Scale Diffusion Models for Style Transfer

    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2024 · Jiwoo Chung, Sangeek Hyun, Jae-Pil Heo

  91. No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects

    European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) · 2023 · Raja Sunkara, Tie Luo

  92. A Survey on Generative Diffusion Models

    IEEE Transactions on Knowledge and Data Engineering · 2024 · Hanqun Cao, Cheng Tan, Zhangyang Gao, Yilun Xu +3 more

  93. InceptionNeXt: When Inception Meets ConvNeXt

    CVPR 2024 · 2024 · Weihao Yu, Pan Zhou, Shuicheng Yan, Xinchao Wang

  94. ViViT: A Video Vision Transformer

    2021 IEEE/CVF International Conference on Computer Vision (ICCV) · 2021 · Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun +2 more

  95. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators

    Nature Machine Intelligence · 2021 · L Lu, P Jin, G Pang, Z Zhang +95 more

  96. Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers

    38th Conference on Neural Information Processing Systems (NeurIPS 2024) · 2024 · Lirui Wang, Jialiang Zhao, Lirui Wang, Jialiang Zhao +93 more

  97. A Survey on Vision Transformer

    IEEE Transactions on Pattern Analysis and Machine Intelligence · 2023 · Yiman Zhang, Kai Han, Chunjing Xu, Yunhe Wang +23 more

  98. Transfer learning for medical image classification: a literature review.

    BMC medical imaging · 2022 · Hee E Kim, Alejandro Cosa-Linan, Nandhini Santhanam, Mahboubeh Jannesari +2 more

  99. Deep learning for medical image segmentation: State-of-the-art advancements and challenges

    Informatics in Medicine Unlocked · 2024 · Md Eshmam Rayed, S. M. Sajibul Islam, Sadia Islam Niha, Jamin Rahman Jim +2 more

  100. Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) · 2024 · Jiazuo Yu, Yunzhi Zhuge, Lu Zhang, Ping Hu +3 more

Showing the top 100 of 1,061 resolved citing papers — see the full interactive list on Geoffrey Hinton's profile.

Map your own citations

CitationMap turns any Google Scholar profile into an interactive world map of citing institutions — free, no sign-up. Used for EB-1A / O-1 / NIW visa evidence, tenure files, and grant applications.

Create your citation map →