CTCFBSDB 2.0: A database for CTCF binding sites and genome organization

[Browse experimentally determined binding sites] [Browse binding sites by topological domain ]
[Browse computationally predicted CTCF binding sites ] [Search the database ] [Scan sequences for CTCF motif matches]
CCCTC-binding factor (CTCF) is a versatile transcription regulator that is evolutionarily conserved from fruit fly to human. CTCF binds to different DNA sequences through combinatorial use of its 11 zinc fingers, and performs distinct functions (transcriptional activation or repression, and chromatin insulation) depending on the biological context. Insulators, which act as enhancer blockers and domain borders, are critical regulatory elements for gene expression control. They represent a class of diverged DNA sequences capable of shielding genes against inappropriate cis-regulatory signals from their genomic neighborhood. Recent studies have also linked insulators to epigenetic processes such as imprinting and X-chromosome inactivation.
In eukaryotic genomes, maintenance of distinct chromatin domains is critical for transcription control, and CTCF has been identified as playing a crucial role in the global organization of chromatin architecture. Evidence for this function has been strengthened by Hi-C experiments, which have shown that interacting genomic regions commonly contain CTCF binding sites and that the boundaries of genomic topological domains are enriched for CTCF binding sites.
To analyze this important type of DNA regulatory element, we created a CTCF binding site database (CTCFBSDB), a comprehensive collection of experimentally determined and computationally predicted CTCF binding sites (CTCFBS) from the literature. The database is designed to facilitate studies of insulators and their roles in demarcating functional genomic domains. Currently, the database contains almost 15 million experimentally determined CTCF binding sites across several species. Binding sites were collected from published papers that identified CTCF binding sites using ChIP-seq or similar methods, from the ENCODE project, and from a set of approximately 100 manually curated binding sites identified by low-throughput experiments.
A complete list of the sources used to curate the CTCF binding sites within the database can be found on the Help page.
Download binding sites
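Once binding sites have been downloaded, a range search similar in spirit to the one offered on this site can be approximated locally. The following is only a minimal sketch: it assumes a hypothetical tab-separated layout of chromosome, start, and end with half-open coordinates, which may not match the actual download format, and the names `Site`, `parse_sites`, and `range_search` are illustrative and not part of CTCFBSDB.

```python
from typing import Iterable, List, NamedTuple


class Site(NamedTuple):
    chrom: str
    start: int  # assumed 0-based, inclusive
    end: int    # assumed exclusive


def parse_sites(lines: Iterable[str]) -> List[Site]:
    """Parse 'chrom<TAB>start<TAB>end' records, skipping blank lines and '#' comments."""
    sites = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        chrom, start, end = line.split("\t")[:3]
        sites.append(Site(chrom, int(start), int(end)))
    return sites


def range_search(sites: List[Site], chrom: str, start: int, end: int) -> List[Site]:
    """Return the sites on `chrom` that overlap the half-open interval [start, end)."""
    return [s for s in sites if s.chrom == chrom and s.start < end and s.end > start]


# Hypothetical records for illustration only; these are not real CTCFBSDB entries.
example = [
    "# chrom\tstart\tend",
    "chr1\t100\t120",
    "chr1\t500\t520",
    "chr2\t100\t120",
]
hits = range_search(parse_sites(example), "chr1", 90, 110)
print(hits)
```

A linear scan like this is adequate for small subsets; for millions of sites, sorting by chromosome and start position or using an interval index would be a more practical choice.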
How to cite the CTCFBSDB database
Ziebarth JD, Bhattacharya A, Cui Y (2013) CTCFBSDB 2.0: a database for CTCF-binding sites and genome organization. Nucleic Acids Research. 41(D1):D188-D194.
Bao L, Zhou M, Cui Y (2008) CTCFBSDB: a CTCF binding site database for characterization of vertebrate genomic insulators. Nucleic Acids Research. 36:D83-D87.

Other databases

SomamiR Database


A comprehensive resource that integrates several types of data for use in investigating the impact of somatic and germline mutations on miRNA function in cancer. The database contains somatic mutations that may create or disrupt miRNA target sites and integrates these somatic mutations with germline mutations within the same target sites, genome-wide and candidate gene association studies of cancer, and functional annotations that link genes containing mutations with cancer. Additionally, the database contains a collection of germline and somatic mutations in miRNAs and their targets that have been experimentally shown to impact miRNA function and have been associated with cancer. (Reference)


PolymiRTS Database


Linking polymorphisms in microRNA target sites with complex traits. Polymorphism in microRNA Target Site (PolymiRTS) is a database of naturally occurring DNA variations in putative microRNA (miRNA) target sites. This database characterizes PolymiRTS as a new class of sequence variants underlying gene expression traits and/or higher order mammalian traits. (Reference)