{Reference Type}: Journal Article {Title}: On knowing a gene: A distributional hypothesis of gene function. {Author}: Kwon JJ;Pan J;Gonzalez G;Hahn WC;Zitnik M; {Journal}: Cell Syst {Volume}: 15 {Issue}: 6 {Year}: 2024 Jun 19 {Factor}: 11.091 {DOI}: 10.1016/j.cels.2024.04.008 {Abstract}: As words can have multiple meanings that depend on sentence context, genes can have various functions that depend on the surrounding biological system. This pleiotropic nature of gene function is limited by ontologies, which annotate gene functions without considering biological contexts. We contend that the gene function problem in genetics may be informed by recent technological leaps in natural language processing, in which representations of word semantics can be automatically learned from diverse language contexts. In contrast to efforts to model semantics as "is-a" relationships in the 1990s, modern distributional semantics represents words as vectors in a learned semantic space and fuels current advances in transformer-based models such as large language models and generative pre-trained transformers. A similar shift in thinking of gene functions as distributions over cellular contexts may enable a similar breakthrough in data-driven learning from large biological datasets to inform gene function.