Title | Authors | Venue | Cited by | Year |
RadioTalk: A large-scale corpus of talk radio transcripts | D Beeferman, W Brannon, D Roy | Interspeech 2019, 564-568, 2019 | 17 | 2019 |
The data provenance initiative: A large scale audit of dataset licensing & attribution in AI | S Longpre, R Mahari, A Chen, N Obeng-Marnu, D Sileo, W Brannon, ... | arXiv preprint arXiv:2310.16787, 2023 | 14 | 2023 |
Dubbing in practice: A large scale study of human localization with insights for automatic dubbing | W Brannon, Y Virkar, B Thompson | Transactions of the Association for Computational Linguistics 11, 419-435, 2023 | 14 | 2023 |
ConGraT: Self-supervised contrastive pretraining for joint graph and text embeddings | W Brannon, S Fulay, H Jiang, W Kang, B Roy, J Kabbara, D Roy | arXiv preprint arXiv:2305.14321, 2023 | 8 | 2023 |
The Data Provenance Project | S Longpre, R Mahari, N Muennighoff, A Chen, K Perisetla, W Brannon, ... | Proceedings of the 40th International Conference on Machine Learning, 2023 | 2 | 2023 |
Data Authenticity, Consent, and Provenance for AI Are All Broken: What Will It Take to Fix Them? | S Longpre, R Mahari, N Obeng-Marnu, W Brannon, T South, J Kabbara, ... | MIT, 2024 | | 2024 |
Mapping US talk radio: a textual survey at scale | WW Brannon | Massachusetts Institute of Technology, 2020 | | 2020 |