About the Imageomics Institute (NSF OAC-2118240)


Creating a collaborative research, training, and community-facing environment for extracting existing and new biological traits from images of organisms, with the necessary infrastructure for cyber, information, and model development. The Institute will advance Imageomics-enabled biology, accelerate innovations in machine learning, and create digital resources for the researchers and practitioners in biology, data science, and machine learning, as well as the broader scientific community. It will further interdisciplinary training and education, and engage the broader public in the scientific process. Accomplishing these tasks will provide unique insights and enable biological discovery over a wide range of informative organismal attributes - some not yet comprehended or studied - and across multiple scales of biological organization from individuals to species.


To establish a new scientific field called imageomics that harnesses revolutions in data science and computing, as well as the rapidly expanding collections of biological image data, in order to accelerate biological understanding of phenotypic traits extracted from images of organisms. The Imageomics Institute will make the study of the interrelationships, associations, and dynamics of traits computable by using the existing structured biological knowledge to inform the computational methods. It will develop integrated, interdisciplinary approaches to advance the state of the art in knowledge-guided machine learning while rapidly accelerating biological discovery and public engagement in science, forming a virtuous cycle.


  • Digital resources for the scientific community and practitioners in biology, data science, and ML
  • Advancements in Imageomics-enabled biology
  • Innovations in machine learning
  • Advancements in interdisciplinary training and education
  • Engagement of the broader public in the scientific process by leveraging images as the source of data to democratize science

Technical Abstract

The traits that characterize living organisms—in particular, their morphology, physiology, behavior and genetic make-up—enable them to cope with forces of the physical as well as the biological and social environments that impinge on them. Moreover, since function follows form, traits provide the raw material upon which natural selection operates, thus shaping evolutionary trajectories and the history of life. Interestingly, most living organisms, from microscopic microbes to charismatic megafauna, reveal themselves visually and are routinely captured in copious images taken by humans from all walks of life. The resulting massive amount of image data has the potential to further our understanding of how multifaceted traits of organisms shape the behavior of individuals, collectives, populations, and the ecological communities they live in, as well as the evolutionary trajectories of the species they comprise. Images are increasingly the currency for documenting the details of life on the planet, and yet traits of organisms, known or novel, cannot be readily extracted from them. Just like with genomic data two decades ago, our ability to collect data at the moment far outstripts our ability to extract biological insight from it. The Institute will establish a new field of IMAGEOMICS, in which biologists utilize machine learning algorithms (ML) to analyze vast stores of existing image data—especially publicly funded digital collections from national centers, field stations, museums and individual laboratories—to characterize patterns and gain novel insights on how function follows form in all areas of biology to expand our understanding of the rules of life on Earth and how it evolves.

This Institute will introduce structured knowledge from the biological sciences to guide and structure ML algorithms to enable biological trait discovery from images, establishing the field of Imageomics. With images captured and annotated by scientists and the public serving as the basis for the work, the Institute’s convergent approach uses structured biological knowledge to provide scientifically validated inductive biases and rich supervision for ML, and ML will in turn enrich the body of biological knowledge. The resulting ML models and tools will help to make what was hidden visible, so that scientists from a wide range of biological communities can discover and infer the traits of organisms; assess shared similarities and differences between individuals, populations and species; and come to see the world in new ways. Imageomics will accelerate and transform the biomedical, agricultural and basic biological sciences as they seek to understand and control genes that relate to particular phenotypes and enable an overarching understanding of how the genome evolved in tandem with the organismal phenome. Because traits are the essential links between genes and the environment, using ML to help characterize them will lead to emergent understandings of how they function. Harnessing the insights that arise from these new visualizations will stimulate the use of new genetic technologies, such as CRISPR, and more nuanced ecological practices, such as modified land use schemes that emerge from better understanding the connections between individual decision-making within species and their impact on their population dynamics. With the emergence of new and better targeted practices that generate fewer unintended consequences, the new linkages resulting from a better understanding of traits and their consequences will bolster the nation’s bioeconomy. In addition, by leveraging and expanding existing diverse, inclusive and intellectually wide-ranging collaborative networks, the Institute will also educate the next generation of scientists and engage the broader public in scientific inquiry and knowledge discovery so that Imageomics can transform and democratize science for public good.

imageomics | brand-new noun 


A new scientific field in which computational (machine learning) tools built around biological knowledge bases are used by biologists to analyze image data in order to characterize patterns and gain insights into traits and relationships at individual, population and species scales—insights that then get incorporated into the algorithms that run the tools.

Imageomics Institute | brand-new entity 

'i-mi-jə-'ō-miks 'in(t)-stə-tÜt 

The NSF Harnessing the Data Revolution institute (NSF OAC-2118240) established October 1, 2021, to create machine learning tools using publicly funded collections of digital image data from national centers, field stations, museums and individual laboratories that will enable scientists to study how function follows form in all areas of biology and expand public understanding of the rules of Life on Earth and how it evolves. See also: One of 5 inaugural institutes for data-intensive research in science and engineering established by the NSF.