References

  1. Andersson R, Gebhard C, Miguel-Escalada I, et al. An atlas of active enhancers across human cell types and tissues. Nature. 2014;507(7493):455–61. doi: 10.1038/nature12787.
  2. Bailey SD, Zhang X, Desai K, Aid M, Corradin O, Cowper-Sal·lari R, et al. ZNF143 provides sequence specificity to secure chromatin interactions at gene promoters. Nat Commun. 2015;2:6186. doi: 10.1038/ncomms7186.
  3. Bellen HJ, Kane CJO, Wilson C, Grossniklaus U, Pearson RK, Gehring WJ. P-element-mediated enhancer detection: a versatile method to study development in drosophila. Genes Dev. 1989;3(9):1288–300. doi: 10.1101/gad.3.9.1288.
  4. Blanchette M. Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression. Genome Res. 2006;16(5):656–68. doi: 10.1101/gr.4866006.
  5. Bond HM, Scicchitano S, Chiarella E, Amodio N, Lucchino V, Aloisio A, et al. ZNF423: a new player in estrogen receptor-positive breast cancer. Front Endocrinol. 2018;9:255. doi: 10.3389/fendo.2018.00255.
  6. Breiman L, Friedman J, Stone CJ, Olshen RA. Classification and regression trees. The Wadsworth statistics probability series. Belmont, CA: Wadsworth International Group; 1984.
  7. Breiman L. Random forests. Mach Learn. 2001;45:5–32. doi: 10.1023/a:1010933404324.
  8. Bron C, Kerbosch J. Algorithm 457: finding all cliques of an undirected graph. Commun ACM. 1973;16(9):575–7. doi: 10.1145/362342.362367.
  9. Cai X, Hou L, Su N, Hu H, Deng M, Li X. Systematic identification of conserved motif modules in the human genome. BMC Genom. 2010;11:567. doi: 10.1186/1471-2164-11-567.
  10. Cao F, Fullwood MJ. Inflated performance measures in enhancer–promoter interaction-prediction methods. Nat Genet. 2019;51:1196–8. doi: 10.1038/s41588-019-0434-7.
  11. Cao Q, Anyansi C, Hu X, Xu L, Xiong L, Tang W, et al. Reconstruction of enhancer–target networks in 935 samples of human primary cells, tissues and cell lines. Nat Genet. 2017;49:1428. doi: 10.1038/ng.3950.
  12. Chen H, Li C, Peng X, et al. A pan-cancer analysis of enhancer expression in nearly 9000 patient samples. Cell. 2018;173(2):386–39912. doi: 10.1016/j.cell.2018.03.027.
  13. Consortium TEP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74. doi: 10.1038/nature11247.
  14. Corradin O, Saiakhova A, Akhtar-Zaidi B, Myeroff L, Willis J, dot bracelari RC-S, Lupien M, Markowitz S, Scacheri PC. Combinatorial effects of multiple enhancer variants in linkage disequilibrium dictate levels of gene expression to confer susceptibility to common traits. Genome Res. 2013; 24(1):1–13. 10.1101/gr.164079.113.
  15. Crawford GE. Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS) Genome Res. 2005;16(1):123–31. doi: 10.1101/gr.4074106.
  16. Daniel B, Nagy G, Hah N, et al. The active enhancer network operated by liganded RXR supports angiogenic activity in macrophages. Genes Dev. 2014;28(14):1562–77. doi: 10.1101/gad.242685.114.
  17. Danko CG, Hyland SL, Core LJ, et al. Identification of active transcriptional regulatory elements from GRO-seq data. Nat Methods. 2015;12(5):433–8. doi: 10.1038/nmeth.3329.
  18. De Laat W., Duboule D. (2013) Topology of mammalian developmental enhancers and their regulatory landscapes. Nature, 502, 499–506.
  19. Dekker J. Capturing chromosome conformation. Science. 2002;295(5558):1306–11. doi: 10.1126/science.1067799.
  20. Ding J, Cai X, Wang Y, Hu H, Li X. Chipmodule: systematic discovery of transcription factors and their cofactors from chip-seq data. Biocomputing. 2013;2013:320–31.
  21. Ding J, Dhillon V, Li X, Hu H. Systematic discovery of cofactor motifs from ChIP-seq data by SIOMICS. Methods. 2015;79:47–51. doi: 10.1016/j.ymeth.2014.08.006.
  22. Ding J, Hu H, Li X. SIOMICS: a novel approach for systematic identification of motifs in ChIP-seq data. Nucleic Acids Res. 2014;42:e35. doi: 10.1093/nar/gkt1288.
  23. Ding J, Li X, Hu H. Systematic prediction of cis-regulatory elements in the Chlamydomonas reinhardtii genome using comparative genomics. Plant Physiol. 2012;160:613–23. doi: 10.1104/pp.112.200840.
  24. Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, Hu M, Liu JS, Ren B. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485(7398):376–80. doi: 10.1038/nature11082.
  25. Dostie J, Richmond TA, Arnaout RA, et al. Chromosome conformation capture carbon copy (5c): A massively parallel solution for mapping interactions between genomic elements. Genome Res. 2006;16(10):1299–309. doi: 10.1101/gr.5571506.
  26. Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA, Doyle F. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247.
  27. Duren Z, Chen X, Jiang R, Wang Y, Wong WH. Modeling gene regulation from paired expression and chromatin accessibility data. Proc Natl Acad Sci U S A. 2017;114:E4914–23. doi: 10.1073/pnas.1704553114.
  28. Edelman LB, Fraser P. Transcription factories: genetic programming in three dimensions. Curr Opin Genet Dev. 2012;22(2):110–4. doi: 10.1016/j.gde.2012.01.010.
  29. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7. doi: 10.1093/nar/gkh340.
  30. Ernst J, Kellis M. ChromHMM: automating chromatin-state discovery and characterization. Nat Methods. 2012;9(3):215–6. doi: 10.1038/nmeth.1906.
  31. Ernst J, Kheradpour P, Mikkelsen TS, et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature. 2011;473(7345):43–49. doi: 10.1038/nature09906.
  32. Ernst J., Kellis M. (2012) ChromHMM: automating chromatin-state discovery and characterization. Nat. Methods, 9, 215–216.
  33. Forcato M. et al. (2017) Comparison of computational methods for Hi-C data analysis. Nat. Methods, 14, 679–685.
  34. Freund Y., Schapire R.E. (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci., 55, 119–139.
  35. Fullwood M.J. et al. (2009) An oestrogen-receptor-α-bound human chromatin interactome. Nature, 462, 58–64.
  36. Furlong E.E.M., Levine M. Developmental enhancers and chromosome topology. Science. 2018; 361(6409):1341–5. https://doi.org/10.1126/science.aau0320.
  37. Gao T, Qian J. EnhancerAtlas 2.0: an updated resource with enhancer annotation in 586 tissue/cell types across nine species. Nucleic Acids Res. 2020;48:D58–64. doi: 10.1093/nar/gkz980.
  38. Harrow J, Frankish A, Gonzalez JM, et al. GENCODE: The reference human genome annotation for the ENCODE project. Genome Res. 2012;22(9):1760–74. doi: 10.1101/gr.135350.111.
  39. He B, Chen C, Teng L, Tan K. Global view of enhancer-promoter interactome in human cells. Proc Natl Acad Sci. 2014;111(21):2191–9. doi: 10.1073/pnas.1320308111.
  40. Heintzman ND, Stuart RK, Hon G, Fu Y, et al. Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nat Genet. 2007;39(3):311–8. doi: 10.1038/ng1966.
  41. Hnisz D, Day DS, Young RA. Insulated neighborhoods: structural and functional units of mammalian gene control. Cell. 2016;167:1188–200. doi: 10.1016/j.cell.2016.10.024.
  42. Hoffman M.M. et al. (2012) Unsupervised pattern discovery in human chromatin structure through genomic segmentation. Nat. Methods, 9, 473–476.
  43. Hu J, Hu H, Li X. MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs. Nucleic Acids Res. 2008;36:4488–97. doi: 10.1093/nar/gkn407.
  44. Javierre B.M., Burren O.S., Wilder S.P., et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell. 2016; 167(5):1369–138419. https://doi.org/10.1016/j.cell.2016.09.037.
  45. Jin F, Li Y, Dixon JR, et al. A high-resolution map of the three-dimensional chromatin interactome in human cells. Nature. 2013; 503(7475):290–4. https://doi.org/10.1038/nature12644.
  46. Jing F, Zhang SW, Zhang S. Prediction of enhancer-promoter interactions using the cross-cell type information and domain adversarial neural network. BMC Bioinf. 2020;21:507. doi: 10.1186/s12859-020-03844-4.
  47. Johnson DS, Mortazavi A, Myers RM, Wold B. Genome-wide mapping of in vivo protein-DNA interactions. Science. 2007;316(5830):1497–502. doi: 10.1126/science.1141319.
  48. Khan, A., et al., JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework. Nucleic acids research, 2018. 46(D1): p. D260–D266.
  49. Latapy M, Magnien C, Vecchio ND. Basic notions for the analysis of large two-mode networks. Soc Netw. 2008;30(1):31–48. doi: 10.1016/j.socnet.2007.04.006.
  50. Lettice L.A., Horikoshi T., Heaney S.J.H., et al. Disruption of a long-range cis-acting regulator for shh causes preaxial polydactyly. Proc Natl Acad Sci. 2002; 99(11):7548–53. https://doi.org/10.1073/pnas.112212199.
  51. Li G, Ruan X, Auerbach RK, et al. Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell. 2012; 148(1-2):84–98. https://doi.org/10.1016/j.cell.2011.12.014.
  52. Li L, Cheng ASL, Jin VX, Paik HH, Fan M, Li X, et al. A mixture model-based discriminate analysis for identifying ordered transcription factor binding site pairs in gene promoters directly regulated by estrogen receptor-α. Bioinformatics. 2006;22:2210–6. doi: 10.1093/bioinformatics/btl329.
  53. Li X, Zheng Y, Hu H, Li X. Integrative analyses shed new light on human ribosomal protein gene regulation. Sci Rep. 2016; 6(1). 10.1038/srep28619.
  54. Lieberman-Aiden E, van Berkum NL, Williams L, et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science. 2009;326(5950):289–93. doi: 10.1126/science.1181369.
  55. Mahony S, Benos PV. STAMP: a web tool for exploring DNA-binding motif similarities. Nucleic Acids Res. 2007;35:W253–8. doi: 10.1093/nar/gkm272.
  56. Malin J, Aniba MR, Hannenhalli S. Enhancer networks revealed by correlated DNAse hypersensitivity states of enhancers. Nucleic Acids Res. 2013;41(14):6828–38. doi: 10.1093/nar/gkt374.
  57. Mann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat. 1947;18(1):50–60. doi: 10.1214/aoms/1177730491.
  58. Matsubara E, Sakai I, Yamanouchi J, Fujiwara H, Yakushijin Y, Hato T, et al. The role of zinc finger protein 521/early hematopoietic zinc finger protein in erythroid cell differentiation. J Biol Chem. 2009;284:3480–7. doi: 10.1074/jbc.m805874200.
  59. McLean YC, Bristor D, Hiller M, Clarke SL, Schaar BT, Lowe CB, Wenger AM, Bejerano G. GREAT improves functional interpretation of cis-regulatory regions. Nat Biotechnol. 2010;28(5):495–501. doi: 10.1038/nbt.1630.
  60. Mesuraca M, Chiarella E, Scicchitano S, Codispoti B, Giordano M, Nappo G, et al. ZNF423 and ZNF521: EBF1 antagonists of potential relevance in B-lymphoid malignancies. BioMed Res Int. 2015;2015:165238. doi: 10.1155/2015/165238.
  61. Moore JE, Pratt HE, Purcaro MJ, Weng Z. A curated benchmark of enhancer-gene interactions for evaluating enhancer-target gene prediction methods. Genome Biol. 2020;21:17. doi: 10.1186/s13059-019-1924-8.
  62. Mora A. et al. (2016) In the loop: promoter-enhancer interactions and bioinformatics. Brief. Bioinform., 17, 980–995.
  63. Mossing M, Record M. Upstream operators enhance repression of the lac promoter. Science. 1986;233(4766):889–92. doi: 10.1126/science.3090685.
  64. Mumbach MR, Rubin AJ, Flynn RA, Dai C, Khavari PA, Greenleaf WJ, et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat Methods. 2016;13:919–22. doi: 10.1038/nmeth.3999.
  65. Okonechnikov K, Erkek S, Korbel JO, Pfister SM, Chavez L. InTAD: chromosome conformation guided analysis of enhancer target genes. BMC Bioinf. 2019;20:60. doi: 10.1186/s12859-019-2655-2.
  66. Papantonis A, Cook PR. Transcription factories: genome organization and gene regulation. Chem Rev. 2013;113(11):8683–705. doi: 10.1021/cr300513p.
  67. Pedregosa F. et al. (2011) Scikit-learn: machine learning in Python. J. Mach. Learn. Res., 12, 2825–2830.
  68. Pennacchio L.A., Bickmore W., Dean A., Nobrega M.A., Bejerano G. Enhancers: five essential questions. Nat Rev Genet. 2013; 14(4):288–95.
  69. Polikar R. et al. (2000) Acoustics, speech, and signal processing. In Proceedings, 2000 IEEE International Conference on IEEE (ICASSP’00), Vol. 6, pp. 3414–3417.
  70. Pott S, Lieb JD. What are super-enhancers? Nat Genet. 2015;47(1):8–12. doi: 10.1038/ng.3167.
  71. Quinodoz SA, Ollikainen N, Tabak B, Palla A, Schmidt JM, Detmar E, Lai MM, Shishkin AA, Bhat P, Takei Y, et al. Higher-order inter-chromosomal hubs shape 3d genome organization in the nucleus. Cell. 2018; 174(3):744–57.
  72. Rao S.S. et al. (2014) A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell, 159, 1665–1680.
  73. Ren G, Jin W, Cui K, Rodrigez J, Hu G, Zhang Z, et al. CTCF-mediated enhancer-promoter interaction is a critical regulator of cell-to-cell variation of gene expression. Mol Cell. 2017;67:1049–58. doi: 10.1016/j.molcel.2017.08.026.
  74. Robertson G, Hirst M, Bainbridge M, et al. Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nat Methods. 2007;4(8):651–7. doi: 10.1038/nmeth1068.
  75. Roy S, Siahpirani AF, Chasman D, Knaack S, Ay F, Stewart R, et al. A predictive modeling approach for cell line-specific long-range regulatory interactions. Nucleic Acids Res. 2015;43:8694–712. doi: 10.1093/nar/gkv865.
  76. Rödelsperger C, Guo G, Kolanczyk M, Pletschacher A, et al. Integrative analysis of genomic, functional and protein interaction data predicts long-range enhancer-target gene interactions. Nucleic Acids Res. 2010;39(7):2492–502. doi: 10.1093/nar/gkq1081.
  77. Sharan R. Analysis of Biological Networks: Transcriptional Networks – Promoter Sequence Analysis (PDF). Tel Aviv University. Retrieved 30 December 2012.
  78. Singh S, Yang Y, Póczos B, Ma J. Predicting enhancer-promoter interaction from genomic sequence with deep neural networks. Quant Biol. 2019;7:122–37. doi: 10.1007/s40484-019-0154-0.
  79. Smola AJ, Schölkopf B. A tutorial on support vector regression. Stat Comput. 2004;14:199–222. doi: 10.1023/b:stco.0000035301.49549.88.
  80. Stark C, Breitkreutz B, Reguly T, Boucher L, Breitkreutz A, Tyers M. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006;34:D535–9. doi: 10.1093/nar/gkj109.
  81. Talukder A, Hu H, Li X. An intriguing characteristic of enhancer-promoter interactions. BMC Genom. 2021;22:163. doi: 10.1186/s12864-021-07440-5.
  82. Talukder A, Saadat S, Li X, Hu H. EPIP: a novel approach for condition-specific enhancer–promoter interaction prediction. Bioinformatics. 2019. 10.1093/bioinformatics/btz641.
  83. Tang Z, Luo OJ, Li X, Zheng M, Zhu JJ, Szalaj P, et al. CTCF-mediated human 3D genome architecture reveals chromatin topology for transcription. Cell. 2015;163:1611–27. doi: 10.1016/j.cell.2015.11.024.
  84. Thurman R.E. et al. (2012) The accessible chromatin landscape of the human genome. Nature, 489, 75–82.
  85. Tibshirani R. Regression shrinkage and selection via the lasso. J Roy Stat Soc B. 1996;58:267–88. doi: 10.1111/j.2517-6161.1996.tb02080.x.
  86. Vakoc CR, Letting DL, Gheldof N, Sawado T, Bender MA, Groudine M, et al. Proximity among distant regulatory elements at the beta-globin locus requires GATA-1 and FOG-1. Mol Cell. 2005;17:453–62. doi: 10.1016/j.molcel.2004.12.028.
  87. Visel A, Minovitsky S, Dubchak I, Pennacchio LA. VISTA enhancer browser–a database of tissue-specific human enhancers. Nucleic Acids Res. 2007;35(Database):88–92. doi: 10.1093/nar/gkl822.
  88. Wang J, Dai X, Berry LD, Cogan JD, Liu Q, Shyr Y. HACER: an atlas of human active enhancers to interpret regulatory variants. Nucleic Acids Res. 2019;47:D106–12. doi: 10.1093/nar/gky864.
  89. Wang S, Hu H, Li X. Shared distal regulatory regions may contribute to the coordinated expression of human ribosomal protein genes. Genomics. 2020;112:2886–93. doi: 10.1016/j.ygeno.2020.03.028.
  90. Wang Y, Goodison S, Li X, Hu H. Prognostic cancer gene signatures share common regulatory motifs. Sci Rep. 2017;7:1–9. doi: 10.1038/s41598-017-05035-3.
  91. Wang Y, Li X, Hu H. H3k4me2 reliably defines transcription factor binding regions in different cells. Genomics. 2014;103(2-3):222–8. doi: 10.1016/j.ygeno.2014.02.002.
  92. Weber F, de Villiers J, Schaffner W. An SV40 “enhancer trap” incorporates exogenous enhancers or generates enhancers from its own sequences. Cell. 1984;36(4):983–92. doi: 10.1016/0092-8674(84)90048-5.
  93. Weintraub AS, Li CH, Zamudio AV, Sigova AA, Hannett NM, Day DS, et al. YY1 is a structural regulator of enhancer-promoter loops. Cell. 2017;171:1573–88. doi: 10.1016/j.cell.2017.11.008.
  94. Weirauch MT, Yang A, Albu M, Cote AG, Montenegro-Montero A, Drewe P, et al. Determination and inference of eukaryotic transcription factor sequence specificity. Cell. 2014;158:1431–43. doi: 10.1016/j.cell.2014.08.009.
  95. Whalen S, Truty RM, Pollard KS. Enhancer-promoter interactions are encoded by complex genomic signatures on looping chromatin. Nat Genet. 2016;48(5):488–96. doi: 10.1038/ng.3539.
  96. Whyte WA, Orlando DA, Hnisz D, Abraham BJ, Lin CY, Kagey MH, Rahl PB, Lee TI, Young RA. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell. 2013;153(2):307–19. doi: 10.1016/j.cell.2013.03.035.
  97. Won K-J, Ren B, Wang W. Genome-wide prediction of transcription factor binding sites using an integrated model. Genome Biol. 2010;11(1):7. doi: 10.1186/gb-2010-11-1-r7.
  98. Wong KC, Li Y, Peng C. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells. Bioinformatics. 2016;32:321–4. doi: 10.1093/bioinformatics/btv555.
  99. Xi W, Beer MA. Local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy. PLoS Comput Biol. 2018;14:e1006625. doi: 10.1371/journal.pcbi.1006625.
  100. Zeng W, Wu M, Jiang R. Prediction of enhancer-promoter interactions via natural language processing. BMC Genom. 2018;19:13–22. doi: 10.1186/s12864-018-4459-6.
  101. Zhang J, Lee D, Dhiman V, Jiang P, Xu J, McGillivray P, et al. An integrative ENCODE resource for cancer genomics. Nat Commun. 2020;11:3696. doi: 10.1038/s41467-020-14743-w.
  102. Zhang K, Li N, Ainsworth RI, Wang W. Systematic identification of protein combinations mediating chromatin looping. Nat Commun. 2016;7:1–11. doi: 10.1038/ncomms12249.
  103. Zhang X, Branciamore S, Gogoshin G, Rodin AS, Riggs AD. Analysis of high-resolution 3D intrachromosomal interactions aided by Bayesian network modeling. Proc Natl Acad Sci U S A. 2017;114:E10359–68. doi: 10.1073/pnas.1620425114.
  104. Zhao C, Li X, Hu H. PETModule: a motif module based approach for enhancer target gene prediction. Sci Rep. 2016; 6(1). 10.1038/srep30043.
  105. Zheng Y, Li X, Hu H. Comprehensive discovery of DNA motifs in 349 human cells and tissues reveals new features of motifs. Nucleic Acids Res. 2014;43(1):74–83. doi: 10.1093/nar/gku1261.
  106. Zheng Y, Li X, Hu H. PreDREM: a database of predicted DNA regulatory motifs from 349 human cell and tissue samples. Database. 2015;2015. 10.1093/database/bav007.
  107. Zheng Y. et al. (2015) Comprehensive discovery of DNA motifs in 349 human cells and tissues reveals new features of motifs. Nucl. Acids Res., 43, 74–83.
  108. Zhuang Z, Shen X, Pan W. A simple convolutional neural network for prediction of enhancer–promoter interactions with DNA sequence data. Bioinformatics. 2019;35:2899–906. doi: 10.1093/bioinformatics/bty1050.