Resources (work in progress)

Group guide


Learning Resources

A curated collection of external resources, tutorials, and tools that we recommend for learning about computational chemistry, machine learning, bioinformatics, and chemical informatics.

Education

Educational materials, courses, and tutorials to get you started

Cheminformatics

Cheminformatics Material: Resources, Code, and Other things I use to teach Cheminformatics

Bioinformatics

Bioinformatics Material: Resources, Code, and Other things I use to teach Bioinformatics

  • awosome-bioinformatics - A curated list of resources for learning bioinformatics.

  • UC Davis Bioinformatics Core Training - UC Davis Bioinformatics Core Training Training course repository.

  • Structural-Bioinformatics - This repository includes the slides and the practicals for the course of Structural Bioinformatics of the MBB/QB degrees at the University of Milano.

  • Scikit-bio - A community-driven Python library for bioinformatics, providing versatile data structures, algorithms and educational resources.

In Silico Drug Design and Discovery

Materials for Drug Design and Discovery Computing Techniques

  • Awesome Drug Discovery - A meticulously curated resource list focused on computational methods for drug discovery.

  • TeachOpenCADD - A teaching platform for computer-aided drug design (CADD) using open source packages and data.

  • Open Source Molecular Modeling - Catalog of Open Source Molecular Modeling Projects.

  • AutoDock Vina - An open-source program for doing molecular docking.

  • DOCK - DOCK3 and DOCK6

  • rDock - A fast and versatile Open Source docking program that can be used to dock small molecules against proteins and nucleic acids.

  • Amber - Amber is a suite of biomolecular simulation programs.

  • GROMACS - A free and open-source software suite for high-performance molecular dynamics and output analysis.

  • OpenMM - A high-performance toolkit for molecular simulation. Use it as an application, a library, or a flexible programming environment.

  • MDAnalysis - A Python library for analyzing molecular dynamics (MD) simulations.

  • MDTraj - A python library that allows users to manipulate molecular dynamics (MD) trajectories.

  • Open-Source PyMOL - Open-source foundation of the user-sponsored PyMOL molecular visualization system.

  • UCSF Chimera - A program for the interactive visualization and analysis of molecular structures and related data, including density maps, trajectories, and sequence alignments.

  • UCSF ChimeraX - UCSF ChimeraX (or simply ChimeraX) is the next-generation molecular visualization program from the Resource for Biocomputing, Visualization, and Informatics (RBVI), following UCSF Chimera.

Blogs

Regular updates and insights from experts

Github

Research Hub

  • AI4MolConformation - List of molecules ( small molecules, RNA, peptide, protein, enzymes, antibody, and PPIs) conformations and molecular dynamics (force fields) using generative artificial intelligence and deep learning

  • GenAI4Mol - List of Molecular and Material design (molecular conformation generation) using Generative AI and Deep Learning

  • GenAI4Protein - List of papers about Protein Design using Deep Learning

  • AI for Chemical Spectra - Artificial intelligence and generative deep learning are used to unlock infrared spectroscopy(IR), mass spectroscopy spectroscopy, NMR spectroscopy, and chemical spectroscopy.

Tools

Interactive tools, utilities, and kits

  • Biopython - Biopython is a set of freely available tools for biological computation written in Python by an international team of developers.

  • CDPKit - The Chemical Data Processing Toolkit

  • MolPipeline - A Python package for processing molecules with RDKit in scikit-learn

  • scikit-mol - scikit-learn classes for molecular vectorization using RDKit.

  • mmpdb - A package to identify matched molecular pairs and use them to predict property changes.

  • mols2grid - An interactive molecule viewer for 2D structures, based on RDKit.

  • mordred - A molecular descriptor calculator.

  • ColabFold - Making protein folding accessible to all.

  • DockQ - A single continuous quality measure for Protein, Nucleic Acids and Small Molecule Docking Models.

  • reaction_utils - Utilities for working with datasets of chemical reactions, reaction templates and template extraction.

  • SMARTSPlus - Interactive SMARTS pattern matching tool

Software

Open-source (AI-powered) software packages

  • RDKit - Open source toolkit for cheminformatics

  • Open Babel - Open Babel is a chemical toolbox designed to speak the many languages of chemical data.

  • Psi4 - Open-Source Quantum Chemistry – an electronic structure package in C++ driven by Python.

  • PySCF - Python module for quantum chemistry.

  • AlphaFold 2 - Open source code for AlphaFold 2.

  • AlphaFold 3 - AlphaFold 3 inference pipeline.

  • Boltz - Official repository for the Boltz biomolecular interaction models.

  • Open Chemistry - Permissively licensed cross-platform tools that provide applications for computational chemistry, materials science, and related areas.

  • Open Reaction Database (ORD) - Open-access schema and infrastructure for structuring and sharing organic reaction data

Books

Comprehensive textbooks and reference materials