INFRAFRONTIER programmatic data access

Infrafrontier enriched strain data are not only accessible through the web interface but also via a RESTful API, enabling automated access and integration into external analysis pipelines. The API is powered by Apache SOLR, supporting SOLR query syntax and SOLR query parameters , including advanced filtering, faceting, and full-text search capabilities. Data are distributed across multiple Solr cores, each representing a distinct dataset or domain (e.g., cancer-related strains, COVID-19 strains), and are independently queryable. Each core defines a specific schema with dedicated fields that can be used for both targeted searching and result filtering. Detailed information about the available cores, fields, and usage examples is provided in the Solr cores documentation below.

SOLR cores

EMMA cancer strains

This core contains information related to EMMA strains associated to more than 50 different cancer types.

Accessible at emma_cancer_strains


Fields

Parameter

Type

Description

id_str

Integer

Unique identifier for the strain

name_str

String

Name of the strain

do_term

String

Disease Ontology (DO) term

do_id

String

Unique identifier for the disease (DOID)

gene_symbol

String

Symbol for the gene

mp_id

String

Unique identifier for the phenotype (MP ontology)

mp_term

String

Mammalian Phenotype term

Examples


Strain with strain id 1

id_str:1

https://infrafrontier.eu/solr/disease/cancer_resource/select?q=id_str:1


Strains with disease term “prostate cancer”

do_term:”prostate cancer”

https://infrafrontier.eu/solr/disease/cancer_resource/select?q=do_term:%22prostate%20cancer%22


Strains with gene Thrb

gene_symbol:Thrb

https://infrafrontier.eu/solr/disease/cancer_resource/select?q=gene_symbol:Thrb

INFRAFRONTIER® and European Mouse Mutant Archive - EMMA® are registered trademarks at the European Union Intellectual Property Office (EUIPO).