Please use this identifier to cite or link to this item: http://dx.doi.org/10.25673/121789
Title: Cross-platform motif discovery and benchmarking to explore binding specificities of poorly studied human transcription factors
Author(s): Vorontsov, Ilya E.
Plescher, Marie-Luise
Grau, JanLook up in the Integrated Authority File of the German National Library
Große, IvoLook up in the Integrated Authority File of the German National Library
Issue Date: 2025
Type: Article
Language: English
Abstract: A sequence motif representing the DNA-binding specificity of a transcription factor (TF) is commonly modelled with a positional weight matrix (PWM). Focusing on understudied human TFs, we processed results of 4,237 experiments for 394 TFs, assayed using five different experimental platforms. By human curation, we approved a subset of experiments that yielded consistent motifs across platforms and replicates, and evaluated quantitatively the cross-platform performance of PWMs obtained with ten motif discovery tools. Notably, nucleotide composition and information content are not correlated with motif performance and do not help in detecting underperformers, while motifs with low information content, in many cases, describe well the binding specificity assessed across different experimental platforms. By combining multiple PMWs into a random forest, we demonstrate the potential of accounting for multiple modes of TF binding. Finally, we present the Codebook Motif Explorer (https://mex.autosome.org), cataloguing motifs, benchmarking results, and the underlying experimental data.
URI: https://opendata.uni-halle.de//handle/1981185920/123740
http://dx.doi.org/10.25673/121789
Open Access: Open access publication
License: (CC BY 4.0) Creative Commons Attribution 4.0(CC BY 4.0) Creative Commons Attribution 4.0
Journal Title: Communications biology
Publisher: Springer Nature
Publisher Place: London
Volume: 8
Original Publication: 10.1038/s42003-025-08909-9
Page Start: 1
Page End: 19
Appears in Collections:Open Access Publikationen der MLU

Files in This Item:
File Description SizeFormat 
s42003-025-08909-9.pdf5.36 MBAdobe PDFThumbnail
View/Open