Genomics API

You need to apply an API token if you want to access the data in your programs.

The API token is unique for each user. It will be allocated by CMDB and should not be shared with others. The API token will be used to validate the requests are legal or not.

Requests to the CMDB Genomics API for variant data must be accompanied by an identifier, which can just be the API token. It will return 403 if the token is not right or empty.

Where is my API token?
Typically, you can find the register button of API token in you personal profile page of CMDB. New to CMDB? Create an account.

Overview

The CMDB Genomics API is created to enable researchers to better access CMDB variants data across remote sites. For example, instead of searching variants one by one in the website, the API allows retrieval of variants’ information. CMDB Genomics API currently supports access to variant information via position or RSID.

For more information please see this manual.

Schemas and formats

The CMDB Genomics API is created to enable researchers to better access CMDB variants data across remote sites. For example, instead of searching variants one by one in the website, the API allows retrieval of variants’ information. CMDB Genomics API currently supports access to variant information via position or RSID.

For more information please see this manual.

On the wire, the CMDB Genomics API takes the form of a client and a server exchanging JSON-serialized objects over HTTPS.

Getting Start

Here shows how to request a single variant by API token.

Query Parameters

Parameter Description
variant The objects used to represent variant information.
type The type of data.
query Requests for data
token=$key API token

Supporing endpoints for CMDB entities

CMDB Entitity Base Path
position api/v1.0/variant?type=position
rs api/v1.0/variant?type=rs

Example

Http Request

https://cmdb.bgi.com/cmdb/api/v1.0/variant?type=position&query=chr22-36955161&token=$key

https://cmdb.bgi.com/cmdb/api/v1.0/variant?type=rs&query=rs148755828&token=$key`

Running the query returns

Here is a serialized variant in JSON:

[
  {
    "allele_count": 27, 
    "allele_freq": 0.00218, 
    "allele_num": 10589, 
    "alt": "A",
    "chrom": "4", 
    "filter_status": "PASS", 
    "genes": [
      "ENSG00000131127"
    ], 
    "pos": 370328,
    "ref": "T", 
    "rsid": "rs189432301", 
    "site_quality": 30.12, 
    "transcripts": [
      "ENST00000240499"
    ], 
    "variant_id": "4-370328-T-A", 
    "xstart": 4000370328, 
    "xstop": 4000370328
  }
]

Entry Reference Data

Parameter Description
variant_id chr:pos variant’s position consist of chromosomal id and position
rsid dbSNP RSID
filter_status Whether the variants passed QC or not.
chrom Chromosome (e.g.chr20 or X)
allele_count Alternate allele count
allele_num Total number of alleles(i.e. Total read count in CMDB)
allele_freq Alternate allele frenquencyin CMDB.The graphic displays allele frequency on a discrete:<2/1000,<5/1000,<1%,<5%,<10%,<50%,>50%
site_quality Phred-scaled quality score for the assertion made in ALT.
xstop End position of the variant described in this record.
genes reference genes.
alt List of alternate non-reference alleles.
ref Reference base.Each base must be one of A,C,G,T (case insensitive).
xStart The reference position, with the first base having position 1.
transcripts A transcript is the operational unit of a gene. In a genomic context, transcripts consist of one or more exons, with adjoining exons being separated by introns. The exons/introns are transcribed and then the introns spliced out. Transcripts may or may not encode a protein.
Feature_type Feature that is annotated by this region.

Return Code Description

Return Code Error code description Descrition
404 Unauthorized query not found
403 Bad Request malformed request syntax,invalid request