Hub members Have many expertise, covering most of the fields in bioinformatics and biostatistics. You'll find below a non-exhaustive list of these expertise

Search by keywords | Search by organisms

Searched keyword : Sofware development and engineering

Related people (14)

Thomas BIGOT

Group : GIPhy - Embedded : Biology of Infection

I joined the C3BI Hub in 2016 after a curriculum widely dedicated to Bioinformatics studies, and more precisely to Phylogeny and Evolution, topics of my PhD thesis. At Institut Pasteur, I am involved in projects dealing with sequences homology : alignments, hmm profiles, making homologous family databases, kmers signatures. I am also a developer (Python / C++) with a solid interest in optimization as well as in developing usable tools for final user such as automated pipeline for metagenomics sequence analysis. I’m currently embedded in Marc Eloit’s team (80% of my work time). My main task in this team is to develop strategies to identify, in their metagenomics samples, new pathogens, or new combination pathogen / symptoms. The rest of my time, I manage small projects and participate to the Hub life. I am currently experimenting with functional programming (for now, using Python) and its applicability to bioinformatics issues.

AlgorithmicsScientific computingSofware development and engineeringParallel computingGraph theory and analysis
Projects (11)


One of my projects consists in developing GRAVITY, a java tool based on Cytoscape to integrate genetic variants within protein-protein interaction networks to allow the visual and statistical interpretation of next-generation sequencing data, ultimately helping geneticists and clinicians to identify causal variants and better diagnose their patients. I’m also involved in several other projects in the lab, taking part in the design of pipelines for the processing and the analysis of genomics data, including SNP arrays, whole-exome and whole-genome sequencing data. This means being confronted to the big data problematic, the unit having to manage hundreds of terabytes of genomics data. Finally, I am now analysing these data in order to identify possible causes for autism, to help clinicians with their diagnosis but also to better understand the biological mechanisms at play in this complex disease. This is done through the project aiming at understanding the genetic architecture of autism in the Faroe Islands, and also with the newly starting IMI2 European project AIMS2-Trials.

AlgorithmicsData managementData VisualizationGenomicsMachine learningProteomicsGenome analysisBiostatisticsProgram developmentScientific computingApplication of mathematics in sciencesExploratory data analysisSofware development and engineeringData and text miningGenetics

Projects (0)

    Thomas COKELAER

    Group : PLATEFORM - Detached : Biomics

    I joined the Bioinformatics and Biostatistics Hub at Institut Pasteur in 2016 where I am currently developing pipelines related to NGS for the Biomics Pôle. I have an interdisciplinary research experience: after a PhD in Astronomy (gravitational wave data analysis), I joined several research institute to work in the fields of plant modelling (INRIA, Montpellier, 2008-2011), System Biology — in particular logical modelling (EMBL-EBI Cambridge, U.K., 2011-2015), and drug discovery (Sanger Institute, Cambridge, U.K.), 2015). On a daily basis, I use data analysis and machine learning techniques within high-quality software to tackle scientific problems.

    AlgorithmicsData managementData VisualizationGenome assemblyGenomicsMachine learningModelingScientific computingDatabases and ontologiesSofware development and engineeringData and text miningIllumina HiSeqGraph theory and analysisIllumina MiSeq

    Projects (2)

    Kenzo-Hugo HILLION

    Group : WINTER - Hub Core

    After a Master degree in Genetics at Magistère Européen de Génétique, Paris Diderot, I did a second Master in bioinformatics at University of Nantes where I focused my work on the study of mapping strategy for allele specific analysis at the bioinformatics platform of Institut Curie. I then joined Institut Pasteur to work on an ELIXIR project related to the registry through the development of a dedicated tool and the participation of several workshops and hackathons. As an engineer of the bioinformatics and Biostatistics Hub, I am involved in several projects from Differential Analysis of RNA-seq data to Metagenomics. I am also in charge of the maintenance of the Galaxy Pasteur instance.

    ChIP-seqEpigenomicsGenomicsSequence analysisProgram developmentDatabases and ontologiesSofware development and engineeringGeneticsData integrationRead mappingWorkflow and pipeline developmentConfocal Microscopy

    Projects (4)

    Etienne KORNOBIS

    Group : PLATEFORM - Detached : Epigenetic regulation

    After a PhD in Biology in 2011 on population genetics and phylogeography on amazing little amphipods (Crangonyx, Crymostygius) at the University of Reykjavik (Iceland), I pursued my interest in Bioinformatics and Evolutionary Biology in various post-docs in Spain (MNCN Madrid, UB Barcelona). During this time, I investigated transcriptomic landscapes for various non-model species (groups Conus, Junco and Caecilians) using de novo assemblies and participated in the development of TRUFA, a web platform for de novo RNA-seq analysis. In July 2016, I integrated the Revive Consortium and the Epigenetic Regulation unit at Pasteur Institute, where my main focus were transcriptomic and epigenetic analyses on various thematics using short and long reads technologies, with a special interest in alternative splicing events detection. I joined the Bioinformatics and Biostatistics Hub in January 2018. My latest interests are long reads technologies, alternative splicing and achieving reproducibility in Bioinformatics using workflow managers, container technologies and literate programming.

    Data managementData VisualizationSequence analysisTranscriptomicsWeb developmentGenome analysisProgram developmentExploratory data analysisSofware development and engineeringGeneticsEvolutionRead mappingWorkflow and pipeline developmentPopulation geneticsMotifs and patterns detectionGrid and cloud computing
    HumanInsect or arthropodOther animalAnopheles gambiae (African malaria mosquito)Mouse
    Projects (3)

    Pierre LECHAT

    Group : ALPS - Hub Core

    I have been involved in genomic projects for prokaryotic and human genetic studies (GWAS) since 1998. Currently, I am working on novel visualization techniques to explore large and highly complex data sets. I have develop a web based graphical user interface, called SynTView ( to visualize biological features in comparative genomic studies. The tool allows interactive visualization of microbial genomes to investigate massive amounts of information efficiently. The software is characterized by the presentation of synthetic organisations of microbial genomes and the visualization of polymorphism data. I am extending this work into designing novel dynamic views for comparative analysis of viruses in emerging disease.

    Data VisualizationDatabaseSofware development and engineeringComparative metagenomicsOrthology and paralogy analysis

    Projects (27)

    Rachel LEGENDRE

    Group : GORE - Hub Core

    Rachel Legendre is a bioinformatics engineer. She completed her master degree in apprenticeship for two years at INRA in Jouy-en-Josas in the Genetic Animal department. She was involved in a project aiming at the detection and the expression analysis of micro-RNA involved in an equine disease. In 2012, she joined the Genomic, Structure and Translation Team at Paris-Sud (Paris XI) university. She worked principally on Ribosome Profiling data analysis, a new technique that allows to identify the position of the ribosome on the mRNA at the nucleotide level. Since november 2015, she worked at Institut Pasteur. During 4 years, she was detached to the Biomics Platform, where she was in charge of the bioinformatics analyses for transcriptomics and epigenomics projects. She was also involved in Long Reads (PacBio and Nanopore) developments with other bioinformaticians of Biomics. Since november 2019, she has joined the Hub of Bioinformatics and Biostatistics, et more precisely the Genome Organization Regulation and Expression group.

    AlgorithmicsChIP-seqEpigenomicsNon coding RNATranscriptomicsGenome analysisProgram developmentScientific computingSofware development and engineeringIllumina HiSeqRead mappingSequencingWorkflow and pipeline developmentChromatin accessibility assaysPac BioRibosome profiling
    BacteriaFungiParasiteHumanInsect or arthropodOther animal
    Projects (25)

    Blaise LI

    Group : ALPS - Hub Core

    I obtained a PhD in phylogeny in 2008 at the Muséum National d’Histoire Naturelle in Paris, then worked as a post-doc in Torino (Italy, 2009 – 2011) and Faro (Portugal, 2011 – 2013) where I worked on methodological aspects of phylogeny. In 2013, I have been hired as research engineer in bioinformatics at the Institut de Génétique Humaine in Montpellier where I wrote tools to analyse high-throughput sequencing data, especially small RNA-seq. This is also the kind of job I do now at Institut Pasteur, since 2016. I enjoy programming in Python, I’m interested in evolutionary biology, and I find teaching the UNIX command-line and other practical computer skills a rewarding activity. I’m also particularly involved in a course introducing PhD students (and sometimes other staff at Institut Pasteur) to R programming and basic descriptive statistics. The course support is available on-line and can hopefully be studied autonomously: One of my main activities is the development of automated data analysis workflows using Snakemake. My published work is available here:

    GenomicsNon coding RNATranscriptomicsSofware development and engineeringGeneticsWorkflow and pipeline development
    Insect or arthropodOther animalDrosophila melanogaster (Fruit fly)C. elegans
    Projects (10)

    Nicolas MAILLET

    Group : SINGLE - Embedded : Structural Virology

    After a PhD in bioinformatics at Inria/IRISA, Université de Rennes 1, Rennes (France), under the supervision of Dominique Lavenier and Pierre Peterlongo, I did a postdoc in bioinformatics at Laboratory of Ecology and Evolution of Plankton in Stazione Zoologica Anton Dohrn of Naples, Italy. Both my thesis and my postdoc were about the Tara Oceans projet and the development of new software to analyze huge quantities of raw reads coming from metagenomics sample. I am currently occupying a research engineer position at the Hub as leader of ALPS group and focus on several different computing problems including metagenomics, protein assembly and several short term developments.

    AlgorithmicsData managementProteomicsDatabaseProgram developmentScientific computingSofware development and engineeringComparative metagenomics

    Projects (8)

    Rachel TORCHET

    Group : WINTER - Hub Core

    In 2012 I completed my master degree at the MicroScope Platform located at Genoscope (the French National Sequencing Center). I was involved in a project aiming at the management of evolution projects which rely on the Next Generation Sequencing (NGS) technologies to try to decipher the dynamics of genomic changes as well as the molecular bases and the mechanisms underlying adaptative evolution of micro-organisms (Remigi et al. 2014). Since November 2014, I joined the Bioinformatics and Biostatistics HUB at Institut Pasteur. I participated to the creation and updates of the C3BI website. I joined the WINTER group where I’m in charge of web and interface development projects. I have completed an UX-Design training to add extra value to my front-end development skills. I design and develop bioinformatics tools and interfaces that are users oriented.

    Data VisualizationWeb developmentDatabaseGenome analysisScientific computingDatabases and ontologiesSofware development and engineeringWorkflow and pipeline development

    Projects (12)

    Related projects (5)

    Providing correlationPlus software to the scientific community for analysis of dynamical correlations in biological macromolecules

    Molecular dynamics simulations and elastic network models are two widely used computational methods for investigation of dynamics of biological macromolecules. These methods can reveal dynamical correlations between residues, nucleotides, domains and chains of biological macromolecules. Even though analyses of these correlations are employed frequently, there is not an application and API that can facilitate the analysis and the visualization of them. A coherent API/app can accelerate the analysis process and reveal details of allosteric interactions. We developed a Python package called correlationPlus that can facilitate and accelerate the dynamical correlation analyses. The package contains both an API and a command line interface. It analyzes raw dynamical correlation maps and plots 2D heatmaps. It can extract the correlation map of individual chains automatically. The correlations can be projected onto PDB structures with correlationPlus and they can be visualized by the popular molecular visualization software VMD. Several studies showed that graph theoretical analysis of dynamical correlations can reveal active sites and domains within proteins. correlationPlus provides a purely Python framework to calculate several graph theoretical centrality measures such as degree, betweenness, closeness, current flow closeness, and current flow betweenness etc. In addition to 2D figures of the centralities, the centrality measure in question can also be projected onto the protein structure with correlationPlus for 3D inspection by VMD. To make correlationPlus app and API available to the scientific community, we need to package and make it distributable. As in many scientific software, correlationPlus also depend on many excellent libraries such as numpy, matplotlib, prody etc. Installation of correlationPlus with pip and/or conda can help the users to install correlationPlus by satisfying the requirements automatically. In this way, the end-users can analyze dynamical correlations rapidly. Unfortunately, we do not have any expertise in the packaging and distribution of Python packages. As a result, we need technical expertise of C3BI for packaging and making correlationPlus distributable to the scientific community.

    Project status : Closed