Tatsunori Hashimoto

Researcher, Microsoft Semantic Machines

Acting Assistant Professor, Stanford

thashim [AT] stanford.edu

Bio

From fall 2019 to 2020 I will be at Microsoft Semantic Machines as a researcher.

Starting fall 2020, I will an assistant professor at the computer science department at Stanford.

My research uses tools from statistics to make machine learning systems more robust and reliable — especially in challenging tasks involving natural language. The goal of my research is to use robustness and worst-case performance as a lens to understand and make progress on several fundamental challenges in machine learning and natural language processing. A few topics of recent interest are,

Long-tail behavior
How can we ensure that a machine learning system won't fail catastrophically in the wild under changing conditions?
Understanding
A system which understands how to answer questions or generate text should also do so robustly out-of-domain.
Fairness
Machine learning systems which rely on unreliable correlations can result in spurious and harmful predictions.

Previously, I was a post-doc at Stanford working for John C. Duchi and Percy Liang on tradeoffs between the average and worst-case performance of machine learning models. Before my post-doc, I was a graduate student at MIT co-advised by Tommi Jaakkola and David Gifford and a undergraduate student at Harvard in statistics and math advised by Edoardo Airoldi.

Recent Research Areas

Distributionally Robust Models
Training methods to make models perform even in the worst-case on a population.
Diverse Natural Language Generation
Improving and quantifying the tendency for generation systems to memorize generic statements
Representations from Random Walks
Understanding when random-walk based word and graph representations works.

Publications

Most recent publications on Google Scholar.

Approximate Selection with Guarantees using Proxies PDF

Daniel Kang*, Edward Gan*, Peter Bailis, Tatsunori B Hashimoto, Matei Zaharia

International Conference on Very Large Data Bases (VLDB 2020, To Appear)

Improved Natural Language Generation via Loss Truncation PDF

Daniel Kang, Tatsunori B Hashimoto

Annual Meeting of the Association of Computational Linguistics (ACL 2020, To Appear)

Robustness to Spurious Correlations via Human Annotations

Megha Srivastava, Tatsunori B Hashimoto, Percy Liang

International Conference on Machine Learning (ICML 2020, To Appear)

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-case Generalization PDF

Shiori Sagawa*, Pang Wei Koh*, Tatsunori B Hashimoto, Percy Liang

International Conference on Learning Representations (ICLR 2020)

Distributionally Robust Language Modeling PDF

Yonatan Oren*, Shiori Sagawa *, Tatsunori B Hashimoto *, Percy Liang

Empirical Methods in Natural Language Processing (EMNLP 2019)

Distributionally Robust Losses Against Mixture Covariate Shifts PDF

John C Duchi, Tatsunori B Hashimoto, Hongseok Namkoong

Preprint

Unifying Human and Statistical Evaluation for Natural Language Generation PDF

Tatsunori B Hashimoto*, Hugh Zhang*, Percy Liang

Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)

Generating Sentences by Editing Prototypes PDF

Kelvin Guu*, Tatsunori B Hashimoto*, Yonatan Oren, Percy Liang

Transactions of the Association of Computational Linguistics (TACL, presented at ACL 2018)

Fairness Without Demographics in Repeated Loss Minimization PDF

Tatsunori B Hashimoto, Megha Srivastava, Hongseok Namkoong, Percy Liang

Proceedings of the 35th International Conference on Machine Learning (ICML 2018, Best paper runner up)

Word embeddings as metric recovery in semantic spaces PDF

Tatsunori B Hashimoto, David Alvarez-Melis, Tommi S Jaakkola

Transactions of the Association for Computational Linguistics 4 (TACL, presented at ACL 2016)

Metric recovery from directed unweighted graphs PDF

Tatsunori B Hashimoto, Yi Sun, Tommi S Jaakkola

Artificial Intelligence and Statistics (AISTATS 2015), (best poster at NeurIPS 2014 workshop on networks)

Approximate Selection with Guarantees using Proxies PDF

Daniel Kang*, Edward Gan*, Peter Bailis, Tatsunori B Hashimoto, Matei Zaharia

International Conference on Very Large Data Bases (VLDB 2020, To Appear)

Improved Natural Language Generation via Loss Truncation PDF

Daniel Kang, Tatsunori B Hashimoto

Annual Meeting of the Association of Computational Linguistics (ACL 2020, To Appear)

Robustness to Spurious Correlations via Human Annotations

Megha Srivastava, Tatsunori B Hashimoto, Percy Liang

International Conference on Machine Learning (ICML 2020, To Appear)

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-case Generalization PDF

Shiori Sagawa*, Pang Wei Koh*, Tatsunori B Hashimoto, Percy Liang

International Conference on Learning Representations (ICLR 2020)

Learning Autocomplete Systems as a Communication Game PDF

Mina Lee, Tatsunori B Hashimoto, Percy Liang

Emergent Communication Workshop at Neural Information Processing Systems (NeurIPS 2019)

Distributionally Robust Language Modeling PDF

Yonatan Oren*, Shiori Sagawa *, Tatsunori B Hashimoto *, Percy Liang

Empirical Methods in Natural Language Processing (EMNLP 2019)

Distributionally Robust Losses Against Mixture Covariate Shifts PDF

John C Duchi, Tatsunori B Hashimoto, Hongseok Namkoong

Preprint

Inferring Multidimensional Rates of Aging from Cross-Sectional Data PDF

Emma Pierson*, Pang Wei Koh *, Tatsunori B Hashimoto *, Daphne Koller, Jure Lesokevic, Nicholas Eriksson, Percy Liang

22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019)

Unifying Human and Statistical Evaluation for Natural Language Generation PDF

Tatsunori B Hashimoto*, Hugh Zhang*, Percy Liang

Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)

A Retrieve-and-Edit Framework for Predicting Structured Outputs PDF

Tatsunori B Hashimoto, Kelvin Guu, Yonatan Oren, Percy Liang

Advances in Neural Information Processing Systems 31 (NeurIPS 2018, Oral)

Generating Sentences by Editing Prototypes PDF

Kelvin Guu*, Tatsunori B Hashimoto*, Yonatan Oren, Percy Liang

Transactions of the Association of Computational Linguistics (TACL, presented at ACL 2018)

Fairness Without Demographics in Repeated Loss Minimization PDF

Tatsunori B Hashimoto, Megha Srivastava, Hongseok Namkoong, Percy Liang

Proceedings of the 35th International Conference on Machine Learning (ICML 2018, Best paper runner up)

Derivative free optimization via repeated classification PDF

Tatsunori B Hashimoto, Steve Yadlowsky, John C Duchi

21st International Conference on Artificial Intelligence and Statistics (AISTATS 2018)

Unsupervised Transformation Learning via Convex Relaxations PDF

Tatsunori B Hashimoto, Percy S Liang, John C Duchi

Advances in Neural Information Processing Systems 30 (NeurIPS 2017)

Continuous representations and models from random walk diffusion limits PDF

Tatsunori B Hashimoto

PhD Thesis, MIT CSAIL, 2016

Word embeddings as metric recovery in semantic spaces PDF

Tatsunori B Hashimoto, David Alvarez-Melis, Tommi S Jaakkola

Transactions of the Association for Computational Linguistics 4 (TACL, presented at ACL 2016)

Learning Population-Level Diffusions with Generative RNNs PDF

Tatsunori B Hashimoto, David Gifford, Tommi S Jaakkola

Proceedings of the 33rd International Conference on Machine Learning (ICML 2016)

From random walks to distances on unweighted graphs PDF

Tatsunori B Hashimoto, Yi Sun, Tommi S Jaakkola

Advances in Neural Information Processing Systems (NeurIPS 2015)

Metric recovery from directed unweighted graphs PDF

Tatsunori B Hashimoto, Yi Sun, Tommi S Jaakkola

Artificial Intelligence and Statistics (AISTATS 2015), (best poster at NeurIPS 2014 workshop on networks)

A synergistic DNA logic predicts genome-wide chromatin accessibility PDF

Tatsunori B Hashimoto*, Richard I Sherwood*, Daniel D Kang*, Nisha Rajagopal, Amira A Barkal, Haoyang Zeng, Bart JM Emons, Sharanya Srinivasan, Tommi S Jaakkola, David K Gifford

Genome research (2016)

Cas9 Functionally Opens Chromatin PDF

Amira A Barkal, Sharanya Srinivasan, Tatsunori B Hashimoto, David K Gifford, Richard I Sherwood

PloS One (2016)

Cloning-free CRISPR PDF

Mandana Arbab, Sharanya Srinivasan, Tatsunori B Hashimoto, Niels Geijsen, Richard I Sherwood

Stem cell reports (2015)

GERV: a statistical method for generative evaluation of regulatory variants for transcription factor binding PDF

Haoyang Zeng, Tatsunori B Hashimoto, Daniel D Kang, David K Gifford

Bioinformatics (2015)

Long-term persistence and development of induced pancreatic beta cells generated by lineage conversion of acinar cells PDF

Weida Li, Claudia Cavelti-Weder, Yinying Zhang, Kendell Clement, Scott Donovan, Gabriel Gonzalez, Jiang Zhu, Marianne Stemann, Ke Xu, Tatsunori B Hashimoto, Takatsugu Yamada, Mio Nakanishi, Yuemei Zhang, Samuel Zeng, David Gifford, Alexander Meissner, Gordon Weir, Qiao Zhou

Nature Biotechnology (2014)

Universal count correction for high-throughput sequencing PDF

Tatsunori B Hashimoto, Matthew D Edwards, David K Gifford

PLoS computational biology (2014)

Discovery of directional and nondirectional pioneer transcription factors by modeling DNase profile magnitude and shape PDF

Richard I Sherwood *, Tatsunori B Hashimoto *, Charles W O'donnell *, Sophia Lewis, Amira A Barkal, John Peter Van Hoff, Vivek Karun, Tommi S Jaakkola, David K Gifford

Nature Biotechnology (2014)

Quantifying condition-dependent intracellular protein levels enables high-precision fitness estimates PDF

Kerry A Geiler-Samerotte, Tatsunori B Hashimoto, Michael F Dion, Bogdan A Budnik, Edoardo M Airoldi, D Allan Drummond

PloS one (2013)

Lineage-based identification of cellular states and expression programs PDF

Tatsunori B Hashimoto, Tommi S Jaakkola, Richard Sherwood, Esteban O. Mazzoni, Hynek Wichterle, David Gifford

Bioinformatics (2012)

Finding drug discovery rules of thumb with bump hunting PDF

Tatsunori B Hashimoto, Matthew Segall

Proceedings of the ACS (2010)

BFL: a node and edge betweenness based fast layout algorithm for large scale networks. PDF

Tatsunori B Hashimoto, Masao Nagasaki, Kaname Kojima, Satoru Miyano

BMC bioinformatics (2009)

Approximate Selection with Guarantees using Proxies PDF

Daniel Kang*, Edward Gan*, Peter Bailis, Tatsunori B Hashimoto, Matei Zaharia

International Conference on Very Large Data Bases (VLDB 2020, To Appear)

Robustness to Spurious Correlations via Human Annotations

Megha Srivastava, Tatsunori B Hashimoto, Percy Liang

International Conference on Machine Learning (ICML 2020, To Appear)

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-case Generalization PDF

Shiori Sagawa*, Pang Wei Koh*, Tatsunori B Hashimoto, Percy Liang

International Conference on Learning Representations (ICLR 2020)

Distributionally Robust Losses Against Mixture Covariate Shifts PDF

John C Duchi, Tatsunori B Hashimoto, Hongseok Namkoong

Preprint

A Retrieve-and-Edit Framework for Predicting Structured Outputs PDF

Tatsunori B Hashimoto, Kelvin Guu, Yonatan Oren, Percy Liang

Advances in Neural Information Processing Systems 31 (NeurIPS 2018, Oral)

Fairness Without Demographics in Repeated Loss Minimization PDF

Tatsunori B Hashimoto, Megha Srivastava, Hongseok Namkoong, Percy Liang

Proceedings of the 35th International Conference on Machine Learning (ICML 2018, Best paper runner up)

Derivative free optimization via repeated classification PDF

Tatsunori B Hashimoto, Steve Yadlowsky, John C Duchi

21st International Conference on Artificial Intelligence and Statistics (AISTATS 2018)

Unsupervised Transformation Learning via Convex Relaxations PDF

Tatsunori B Hashimoto, Percy S Liang, John C Duchi

Advances in Neural Information Processing Systems 30 (NeurIPS 2017)

Continuous representations and models from random walk diffusion limits PDF

Tatsunori B Hashimoto

PhD Thesis, MIT CSAIL, 2016

Word embeddings as metric recovery in semantic spaces PDF

Tatsunori B Hashimoto, David Alvarez-Melis, Tommi S Jaakkola

Transactions of the Association for Computational Linguistics 4 (TACL, presented at ACL 2016)

Learning Population-Level Diffusions with Generative RNNs PDF

Tatsunori B Hashimoto, David Gifford, Tommi S Jaakkola

Proceedings of the 33rd International Conference on Machine Learning (ICML 2016)

From random walks to distances on unweighted graphs PDF

Tatsunori B Hashimoto, Yi Sun, Tommi S Jaakkola

Advances in Neural Information Processing Systems (NeurIPS 2015)

Metric recovery from directed unweighted graphs PDF

Tatsunori B Hashimoto, Yi Sun, Tommi S Jaakkola

Artificial Intelligence and Statistics (AISTATS 2015), (best poster at NeurIPS 2014 workshop on networks)

Improved Natural Language Generation via Loss Truncation PDF

Daniel Kang, Tatsunori B Hashimoto

Annual Meeting of the Association of Computational Linguistics (ACL 2020, To Appear)

Learning Autocomplete Systems as a Communication Game PDF

Mina Lee, Tatsunori B Hashimoto, Percy Liang

Emergent Communication Workshop at Neural Information Processing Systems (NeurIPS 2019)

Distributionally Robust Language Modeling PDF

Yonatan Oren*, Shiori Sagawa *, Tatsunori B Hashimoto *, Percy Liang

Empirical Methods in Natural Language Processing (EMNLP 2019)

Unifying Human and Statistical Evaluation for Natural Language Generation PDF

Tatsunori B Hashimoto*, Hugh Zhang*, Percy Liang

Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)

A Retrieve-and-Edit Framework for Predicting Structured Outputs PDF

Tatsunori B Hashimoto, Kelvin Guu, Yonatan Oren, Percy Liang

Advances in Neural Information Processing Systems 31 (NeurIPS 2018, Oral)

Generating Sentences by Editing Prototypes PDF

Kelvin Guu*, Tatsunori B Hashimoto*, Yonatan Oren, Percy Liang

Transactions of the Association of Computational Linguistics (TACL, presented at ACL 2018)

Continuous representations and models from random walk diffusion limits PDF

Tatsunori B Hashimoto

PhD Thesis, MIT CSAIL, 2016

Word embeddings as metric recovery in semantic spaces PDF

Tatsunori B Hashimoto, David Alvarez-Melis, Tommi S Jaakkola

Transactions of the Association for Computational Linguistics 4 (TACL, presented at ACL 2016)

Inferring Multidimensional Rates of Aging from Cross-Sectional Data PDF

Emma Pierson*, Pang Wei Koh *, Tatsunori B Hashimoto *, Daphne Koller, Jure Lesokevic, Nicholas Eriksson, Percy Liang

22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019)

Continuous representations and models from random walk diffusion limits PDF

Tatsunori B Hashimoto

PhD Thesis, MIT CSAIL, 2016

Learning Population-Level Diffusions with Generative RNNs PDF

Tatsunori B Hashimoto, David Gifford, Tommi S Jaakkola

Proceedings of the 33rd International Conference on Machine Learning (ICML 2016)

A synergistic DNA logic predicts genome-wide chromatin accessibility PDF

Tatsunori B Hashimoto*, Richard I Sherwood*, Daniel D Kang*, Nisha Rajagopal, Amira A Barkal, Haoyang Zeng, Bart JM Emons, Sharanya Srinivasan, Tommi S Jaakkola, David K Gifford

Genome research (2016)

Cas9 Functionally Opens Chromatin PDF

Amira A Barkal, Sharanya Srinivasan, Tatsunori B Hashimoto, David K Gifford, Richard I Sherwood

PloS One (2016)

Cloning-free CRISPR PDF

Mandana Arbab, Sharanya Srinivasan, Tatsunori B Hashimoto, Niels Geijsen, Richard I Sherwood

Stem cell reports (2015)

GERV: a statistical method for generative evaluation of regulatory variants for transcription factor binding PDF

Haoyang Zeng, Tatsunori B Hashimoto, Daniel D Kang, David K Gifford

Bioinformatics (2015)

Long-term persistence and development of induced pancreatic beta cells generated by lineage conversion of acinar cells PDF

Weida Li, Claudia Cavelti-Weder, Yinying Zhang, Kendell Clement, Scott Donovan, Gabriel Gonzalez, Jiang Zhu, Marianne Stemann, Ke Xu, Tatsunori B Hashimoto, Takatsugu Yamada, Mio Nakanishi, Yuemei Zhang, Samuel Zeng, David Gifford, Alexander Meissner, Gordon Weir, Qiao Zhou

Nature Biotechnology (2014)

Universal count correction for high-throughput sequencing PDF

Tatsunori B Hashimoto, Matthew D Edwards, David K Gifford

PLoS computational biology (2014)

Discovery of directional and nondirectional pioneer transcription factors by modeling DNase profile magnitude and shape PDF

Richard I Sherwood *, Tatsunori B Hashimoto *, Charles W O'donnell *, Sophia Lewis, Amira A Barkal, John Peter Van Hoff, Vivek Karun, Tommi S Jaakkola, David K Gifford

Nature Biotechnology (2014)

Quantifying condition-dependent intracellular protein levels enables high-precision fitness estimates PDF

Kerry A Geiler-Samerotte, Tatsunori B Hashimoto, Michael F Dion, Bogdan A Budnik, Edoardo M Airoldi, D Allan Drummond

PloS one (2013)

Lineage-based identification of cellular states and expression programs PDF

Tatsunori B Hashimoto, Tommi S Jaakkola, Richard Sherwood, Esteban O. Mazzoni, Hynek Wichterle, David Gifford

Bioinformatics (2012)

Finding drug discovery rules of thumb with bump hunting PDF

Tatsunori B Hashimoto, Matthew Segall

Proceedings of the ACS (2010)

BFL: a node and edge betweenness based fast layout algorithm for large scale networks. PDF

Tatsunori B Hashimoto, Masao Nagasaki, Kaname Kojima, Satoru Miyano

BMC bioinformatics (2009)

Talks and slides

NeuralGen Workshop (NAACL 2019): Defining and Evaluating Diversity in Generation

Resume

Acknowledgement

This website uses the website design and template by Martin Saveski