I am currently an assistant professor in the Computer Science Department at Stanford University.
My research uses tools from statistics to make machine learning systems more robust and reliable, especially in challenging tasks involving natural language. The goal of my research is to use robustness and worst-case performance as a lens to understand and make progress on several fundamental challenges in machine learning and natural language processing. A few topics of recent interest are,
Previously, I was a post-doc at Stanford working for John C. Duchi and Percy Liang on tradeoffs between the average and worst-case performance of machine learning models. Before my post-doc, I was a graduate student at MIT co-advised by Tommi Jaakkola and David Gifford, and an undergraduate student at Harvard in statistics and math advised by Edoardo Airoldi.
Most recent publications on Google Scholar.
Approximate Selection with Guarantees using Proxies PDF
Daniel Kang*, Edward Gan*, Peter Bailis, Tatsunori B Hashimoto, Matei Zaharia
International Conference on Very Large Data Bases (VLDB 2020, To Appear)
Improved Natural Language Generation via Loss Truncation PDF
Daniel Kang, Tatsunori B Hashimoto
Annual Meeting of the Association for Computational Linguistics (ACL 2020, To Appear)
Robustness to Spurious Correlations via Human Annotations
Megha Srivastava, Tatsunori B Hashimoto, Percy Liang
International Conference on Machine Learning (ICML 2020, To Appear)
Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-case Generalization PDF
Shiori Sagawa*, Pang Wei Koh*, Tatsunori B Hashimoto, Percy Liang
International Conference on Learning Representations (ICLR 2020)
Distributionally Robust Language Modeling PDF
Yonatan Oren*, Shiori Sagawa*, Tatsunori B Hashimoto*, Percy Liang
Empirical Methods in Natural Language Processing (EMNLP 2019)
Distributionally Robust Losses Against Mixture Covariate Shifts PDF
John C Duchi, Tatsunori B Hashimoto, Hongseok Namkoong
Preprint
Unifying Human and Statistical Evaluation for Natural Language Generation PDF
Tatsunori B Hashimoto*, Hugh Zhang*, Percy Liang
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)
Generating Sentences by Editing Prototypes PDF
Kelvin Guu*, Tatsunori B Hashimoto*, Yonatan Oren, Percy Liang
Transactions of the Association for Computational Linguistics (TACL, presented at ACL 2018)
Fairness Without Demographics in Repeated Loss Minimization PDF
Tatsunori B Hashimoto, Megha Srivastava, Hongseok Namkoong, Percy Liang
Proceedings of the 35th International Conference on Machine Learning (ICML 2018, Best paper runner up)
Word embeddings as metric recovery in semantic spaces PDF
Tatsunori B Hashimoto, David Alvarez-Melis, Tommi S Jaakkola
Transactions of the Association for Computational Linguistics 4 (TACL, presented at ACL 2016)
Metric recovery from directed unweighted graphs PDF
Tatsunori B Hashimoto, Yi Sun, Tommi S Jaakkola
Artificial Intelligence and Statistics (AISTATS 2015), (best poster at NeurIPS 2014 workshop on networks)
Learning Autocomplete Systems as a Communication Game PDF
Mina Lee, Tatsunori B Hashimoto, Percy Liang
Emergent Communication Workshop at Neural Information Processing Systems (NeurIPS 2019)
Inferring Multidimensional Rates of Aging from Cross-Sectional Data PDF
Emma Pierson*, Pang Wei Koh*, Tatsunori B Hashimoto*, Daphne Koller, Jure Leskovec, Nicholas Eriksson, Percy Liang
22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019)
A Retrieve-and-Edit Framework for Predicting Structured Outputs PDF
Tatsunori B Hashimoto, Kelvin Guu, Yonatan Oren, Percy Liang
Advances in Neural Information Processing Systems 31 (NeurIPS 2018, Oral)
Derivative free optimization via repeated classification PDF
Tatsunori B Hashimoto, Steve Yadlowsky, John C Duchi
21st International Conference on Artificial Intelligence and Statistics (AISTATS 2018)
Unsupervised Transformation Learning via Convex Relaxations PDF
Tatsunori B Hashimoto, Percy S Liang, John C Duchi
Advances in Neural Information Processing Systems 30 (NeurIPS 2017)
Continuous representations and models from random walk diffusion limits PDF
Tatsunori B Hashimoto
PhD Thesis, MIT CSAIL, 2016
Learning Population-Level Diffusions with Generative RNNs PDF
Tatsunori B Hashimoto, David Gifford, Tommi S Jaakkola
Proceedings of the 33rd International Conference on Machine Learning (ICML 2016)
From random walks to distances on unweighted graphs PDF
Tatsunori B Hashimoto, Yi Sun, Tommi S Jaakkola
Advances in Neural Information Processing Systems (NeurIPS 2015)
A synergistic DNA logic predicts genome-wide chromatin accessibility PDF
Tatsunori B Hashimoto*, Richard I Sherwood*, Daniel D Kang*, Nisha Rajagopal, Amira A Barkal, Haoyang Zeng, Bart JM Emons, Sharanya Srinivasan, Tommi S Jaakkola, David K Gifford
Genome Research (2016)
Cas9 Functionally Opens Chromatin PDF
Amira A Barkal, Sharanya Srinivasan, Tatsunori B Hashimoto, David K Gifford, Richard I Sherwood
PLoS ONE (2016)
Cloning-free CRISPR PDF
Mandana Arbab, Sharanya Srinivasan, Tatsunori B Hashimoto, Niels Geijsen, Richard I Sherwood
Stem Cell Reports (2015)
GERV: a statistical method for generative evaluation of regulatory variants for transcription factor binding PDF
Haoyang Zeng, Tatsunori B Hashimoto, Daniel D Kang, David K Gifford
Bioinformatics (2015)
Long-term persistence and development of induced pancreatic beta cells generated by lineage conversion of acinar cells PDF
Weida Li, Claudia Cavelti-Weder, Yinying Zhang, Kendell Clement, Scott Donovan, Gabriel Gonzalez, Jiang Zhu, Marianne Stemann, Ke Xu, Tatsunori B Hashimoto, Takatsugu Yamada, Mio Nakanishi, Yuemei Zhang, Samuel Zeng, David Gifford, Alexander Meissner, Gordon Weir, Qiao Zhou
Nature Biotechnology (2014)
Universal count correction for high-throughput sequencing PDF
Tatsunori B Hashimoto, Matthew D Edwards, David K Gifford
PLoS Computational Biology (2014)
Discovery of directional and nondirectional pioneer transcription factors by modeling DNase profile magnitude and shape PDF
Richard I Sherwood*, Tatsunori B Hashimoto*, Charles W O'Donnell*, Sophia Lewis, Amira A Barkal, John Peter Van Hoff, Vivek Karun, Tommi S Jaakkola, David K Gifford
Nature Biotechnology (2014)
Quantifying condition-dependent intracellular protein levels enables high-precision fitness estimates PDF
Kerry A Geiler-Samerotte, Tatsunori B Hashimoto, Michael F Dion, Bogdan A Budnik, Edoardo M Airoldi, D Allan Drummond
PLoS ONE (2013)
Lineage-based identification of cellular states and expression programs PDF
Tatsunori B Hashimoto, Tommi S Jaakkola, Richard Sherwood, Esteban O. Mazzoni, Hynek Wichterle, David Gifford
Bioinformatics (2012)
Finding drug discovery rules of thumb with bump hunting PDF
Tatsunori B Hashimoto, Matthew Segall
Proceedings of the ACS (2010)
BFL: a node and edge betweenness based fast layout algorithm for large scale networks. PDF
Tatsunori B Hashimoto, Masao Nagasaki, Kaname Kojima, Satoru Miyano
BMC Bioinformatics (2009)
NeuralGen Workshop (NAACL 2019): Defining and Evaluating Diversity in Generation
Neyman Seminar (Berkeley Statistics, 8/28): Statistical Challenges in Understanding and Improving Natural Language Generation
This website uses the website design and template by Martin Saveski