MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect

Version: 1.1.1 (Built on Jan 24, 2025)

MAVE-NN [1] enables the rapid quantitative modeling of genotype-phenotype (G-P) maps from the data produced by multiplex assays of variant effect (MAVEs). Such assays include deep mutational scanning (DMS) experiments on proteins, massively parallel reporter assays (MPRAs) on DNA or RNA regulatory sequences, and more. MAVE-NN conceptualizes G-P map inference as a problem in information compression; this problem is then solved by training a neural network using a TensorFlow backend. To learn more about this modeling strategy, please see our manuscript in Genome Biology.

MAVE-NN is written for Python 3 and is provided under an MIT open source license. The documentation provided here is meant to help users quickly get MAVE-NN working for their own research needs. Please do not hesitate to contact us with any questions or suggestions for improvements. For technical assistance or to report bugs, please create an issue on the MAVE-NN GitHub repository. For more general correspondence, please contact Justin Kinney (Email: jkinney@cshl.edu).