Follow
Jared Casper
Jared Casper
Research Scientist, NVIDIA
Verified email at nvidia.com
Title
Cited by
Cited by
Year
Deep speech 2: End-to-end speech recognition in english and mandarin
D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ...
International conference on machine learning, 173-182, 2016
38482016
Deep Speech: Scaling up end-to-end speech recognition
A Hannun
arXiv preprint arXiv:1412.5567, 2014
27502014
Megatron-lm: Training multi-billion parameter language models using model parallelism
M Shoeybi, M Patwary, R Puri, P LeGresley, J Casper, B Catanzaro
arXiv preprint arXiv:1909.08053, 2019
18032019
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
16262023
Efficient large-scale language model training on gpu clusters using megatron-lm
D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ...
Proceedings of the International Conference for High Performance Computing …, 2021
6572021
Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model
S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ...
arXiv preprint arXiv:2201.11990, 2022
6482022
An effective hybrid transactional memory system with strong isolation guarantees
CC Minh, M Trautmann, JW Chung, A McDonald, N Bronson, J Casper, ...
Proceedings of the 34th annual international symposium on Computer …, 2007
4632007
A practical concurrent binary search tree
NG Bronson, J Casper, H Chafi, K Olukotun
ACM Sigplan Notices 45 (5), 257-268, 2010
3112010
The vector-thread architecture
R Krashinsky, C Batten, M Hampton, S Gerding, B Pharris, J Casper, ...
ACM SIGARCH Computer Architecture News 32 (2), 52, 2004
2722004
Hardware acceleration of database operations
J Casper, K Olukotun
Proceedings of the 2014 ACM/SIGDA international symposium on Field …, 2014
2292014
Reducing activation recomputation in large transformer models
VA Korthikanti, J Casper, S Lym, L McAfee, M Andersch, M Shoeybi, ...
Proceedings of Machine Learning and Systems 5, 341-353, 2023
2002023
A scalable, non-blocking approach to transactional memory
H Chafi, J Casper, BD Carlstrom, A McDonald, CC Minh, W Baek, ...
2007 IEEE 13th International Symposium on High Performance Computer …, 2007
1882007
Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, and Bryan Catanzaro
S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ...
Using deepspeed and megatron to train megatron-turing nlg 530b, a large …, 2022
1442022
Eigenbench: A simple exploration tool for orthogonal TM characteristics
S Hong, T Oguntebi, J Casper, N Bronson, C Kozyrakis, K Olukotun
IEEE International Symposium on Workload Characterization (IISWC'10), 1-11, 2010
972010
A practical FPGA-based framework for novel CMP research
S Wee, J Casper, N Njoroge, Y Tesylar, D Ge, C Kozyrakis, K Olukotun
Proceedings of the 2007 ACM/SIGDA 15th international symposium on Field …, 2007
952007
Atlas: A chip-multiprocessor with transactional memory support
N Njoroge, J Casper, S Wee, Y Teslyar, D Ge, C Kozyrakis, K Olukotun
2007 Design, Automation & Test in Europe Conference & Exhibition, 1-6, 2007
872007
Systems and methods for speech transcription
A Hannun, C Case, J Casper, B Catanzaro, G Diamos, E Elsen, ...
US Patent 10,540,957, 2020
762020
Transactional predication: high-performance concurrent sets and maps for stm
NG Bronson, J Casper, H Chafi, K Olukotun
Proceedings of the 29th ACM SIGACT-SIGOPS symposium on Principles of …, 2010
682010
Deep speech: Scaling up end-to-end speech recognition. arXiv 2014
A Hannun, C Case, J Casper, B Catanzaro, G Diamos, E Elsen, ...
arXiv preprint arXiv:1412.5567, 2014
642014
Hardware acceleration of transactional memory on commodity systems
J Casper, T Oguntebi, S Hong, NG Bronson, C Kozyrakis, K Olukotun
ACM SIGPLAN Notices 46 (3), 27-38, 2011
412011
The system can't perform the operation now. Try again later.
Articles 1–20