Hub members Have many expertise, covering most of the fields in bioinformatics and biostatistics. You'll find below a non-exhaustive list of these expertise
Searched keyword : Genome analysis
Related people (20)
Developing and evaluating bioinformatic tools for: – next generation sequencing data – genome analysis & comparison Specialties:Genome & Transcriptome Bioinformatics
Data managementData VisualizationGenomicsNon coding RNASequence analysisTranscriptomicsGenome analysisBiostatisticsProgram developmentScientific computingData and text miningBiosensors and biomarkersEpidemiology and public health
- Identification of non-coding RNAs under the control of the PerR regulators(Nadia BENAROUDJ - Biology of Spirochetes) - In Progress
- Tissue-resident stromal cell heterogeneity(Lucie PEDUTO - Stroma, Inflammation and Tissue Repair) - In Progress
- Role of small non coding RNAs in the adaptive response to oxidative stress in pathogenic Leptospira(NADIA BENAROUDJ - Biology of Spirochetes) - In Progress
As a computational biologist I have been involved in various projects seeking to answer different biological questions. Those projects have allowed me to define my main research interest, namely the evolutionary study of the emergence, storage and modulation of information in biological systems assisted by computational methods. During my research career I have acquired extensive experience in the analysis of sequence data at the DNA and protein level. I’m trained both in NGS bioinformatic protocols (ChIP-seq, ATAC-seq, RNA-seq, genome assembly) and fine detail sequence analysis. Most importantly, I have gained proficiency in the use of the statistical models that are at the basis of the quantitative analysis of low and high throughput sequence data. Additionally, my experience as a lecturer and instructor has taught me that training researchers about the formal basis of bioinformatic methodologies is the key for a successful collaboration between wet and dry lab. Likewise, I have gained valuable skills by working within two international consortia (TARA Oceans project and TRANSNET): the ability to collaborate with multidisciplinary groups and to coordinate younger researchers.
AlgorithmicsGenomicsSequence analysisTranscriptomicsGenome analysisGeneticsEvolutionInteractomics
- basic alignment/visualisation pipeline(Pablo NAVARRO - Epigenetics of Stem Cells) - In Progress
- Regulation of IL-23 Receptor expression(Elisabetta BIANCHI - Immunoregulation) - In Progress
- Identifying new population(s) of NK cells involved in memory to bacterial infection(Melanie HAMON - Chromatin and Infection) - In Progress
One of my projects consists in developing GRAVITY, a java tool based on Cytoscape to integrate genetic variants within protein-protein interaction networks to allow the visual and statistical interpretation of next-generation sequencing data, ultimately helping geneticists and clinicians to identify causal variants and better diagnose their patients. I’m also involved in several other projects in the lab, taking part in the design of pipelines for the processing and the analysis of genomics data, including SNP arrays, whole-exome and whole-genome sequencing data. This means being confronted to the big data problematic, the unit having to manage hundreds of terabytes of genomics data. Finally, I am now analysing these data in order to identify possible causes for autism, to help clinicians with their diagnosis but also to better understand the biological mechanisms at play in this complex disease. This is done through the project aiming at understanding the genetic architecture of autism in the Faroe Islands, and also with the newly starting IMI2 European project AIMS2-Trials.
AlgorithmicsData managementData VisualizationGenomicsMachine learningProteomicsGenome analysisBiostatisticsProgram developmentScientific computingApplication of mathematics in sciencesExploratory data analysisSofware development and engineeringData and text miningGenetics
| work as a research engineer in the ßioinƒormatics and ßiostatistics HUß of the |nstitut Pasteur. Holder of a PhD in bioinƒormatics, my main interest is on ƒast but robust phylogenetic inƒerence algorithms and methods ƒrom large genome-scaled datasets. |n consequence, | am oƒten involved in related bioinƒormatics projects, such as perƒorming de novo or ab initio genome assemblies, designing and processing core genome †yping schemes, building and analysing phylogenomics datasets, or implementing and distributing novel tools and methods.
AlgorithmicsClusteringGenome assemblyGenomicsGenotypingPhylogeneticsTaxonomyGenome analysisProgram developmentEvolutionSequence homology analysis
- Long-term preservation of a Campylobacter fetus strain and genomic stability(Dominique CLERMONT - Collection of the Institut Pasteur (CIP)) - In Progress
- DNMT and RNMT in Leishmania(Gerald SPAETH - Molecular Parasitology and Signaling) - In Progress
- Séquençage à haut débit (NGS) et traitement de séquences ADN des domaines variables d’anticorps simple chaine d’alpaga (domaines VHH ou Nanobodies®)(Margarida GOMES - Antibody Engineering) - In Progress
After a PhD in informatics on graph analysis (metabolic networks and sRNA-mRNA interaction graphs) at the LaBRI (Université de Bordeaux), I joined the DSIMB team (INTS) for a post-doc on structural modeling. Then, I performed a second post-doc at Metagenopolis – INRA Jouy-en-Josas, where I was initiated to the analysis of metagenomic data. I was recruited at the HUB in 2015, and since I pursue the development of methods dedicated to the treatment of metagenomic data by combining either the treatment of sequencing data, the statistics, the protein structural modeling and the graph analysis.
AlgorithmicsClusteringGenome assemblyGenomicsMetabolomicsModelingNon coding RNASequence analysisStructural bioinformaticsTargeted metagenomicsDatabaseGenome analysisBiostatisticsProgram developmentScientific computingDatabases and ontologiesExploratory data analysisData and text miningIllumina HiSeqComparative metagenomicsRead mappingIllumina MiSeqSequence homology analysisGene predictionMultidimensional data analysisSequencingShotgun metagenomics
- Regulation of nasopharyngeal microbiome in normal healthy individuals(James DI SANTO - Innate Immunity) - Pending
- Changes in cellular pathways during SARS-CoV-2 infection upon drug treatment(Bjoern MEYER - Viral Populations and Pathogenesis) - Pending
- Identification, structural and functional analyses of peptides from SARS-Cov2 binding with high affinity to MHC-E and impact on NK cell responses in SARS-Cov2 infection(Huot NICOLAS - HIV, Inflammation and Persistence) - Pending
After a PhD in Microbiology on bacterial toxin-antitoxin systems at the Free University of Brussels, I joined the Institut Pasteur for a 3 years postdoc in Eduardo Rocha’s lab. During this period, I performed comparative genomics and pylogenetic analysis on bacterial conjugation and type IV secretion systems. Then, I worked 2 years in Olivier Tenaillon’s team on the modelling and evolution of organismal complexity. I joined the HUB in 2015, and I am involved in phylogenetic and comparative genomics projects.
GenomicsPhylogeneticsSequence analysisGenome analysisGeneticsEvolutionPopulation genetics
- Evaluation of the mutation rate per site and dN/dS in the genomes of Yersinia enterocolitica(Cyril SAVIN - Yersinia) - Awaiting Publication
- Phylogenetic analysis of HHD-PDZ containing proteins(Nicolas WOLFF - Channel Receptors) - Awaiting Publication
- Phylogenetic analysis of insect-specific flaviviruses(Artem BAIDALIUK - Insect-Virus Interactions) - Closed
Bernd Jagla received his PhD in bioinformatics (department of Biology, Chemistry, and Parmacy) from the Free University in Berlin, Germany in 1999. Before joining the Institut Pasteur, he worked for almost ten years in New York City, including as an associate research scientist in the Joint Centers for System Biology (Columbia University) and at the Columbia University Screening Center led by Dr J.E. Rothman. He joined the Institut Pasteur in 2009 to take charge of the bioinformatic needs at the Transcriptome et Epigenome platform, focusing on Next Generation Sequencing. As of 2016 he is member of the C3BI – HUB Team detached to the Human immunology center (CIH) and provides support for cytometry, next generation sequencing, and microarray data analysis. His areas of interest include the quality assurance and data analysis and visualization at the facility. He also has strong expertise in developing algorithms for function prediction from sequence data, image analysis, analysis of mass spectrometry data, workflow management systems. While at Pasteur he developed: KNIME extensions for Next Generation Sequencing (Link) Post Alignment Visualization and Characterization of High-Throughput Sequencing Experiments (Link) Post Alignment statistics of Illumina reads (Link)
AlgorithmicsChIP-seqData managementData VisualizationImage analysisMachine learningSequence analysisDatabaseGenome analysisBiostatisticsProgram developmentScientific computingData and text miningIllumina HiSeqGraphics and Image ProcessingIllumina MiSeqHigh Throughput ScreeningFlow cytometry/cell sortingPac Bio
I am seeking to apply my knowledge in computer science and statistics to understand real world data. I have interdisciplinary background spanning complex systems, Big Data, machine learning, biostatistics and genomics. I have completed a PhD in which I applied clustering and PCA to epigenomics data and discovered new insights on the coupling between replication and epigenetics. I worked at Dataiku, a dynamic start up in which I was actively engaged to help their clients to build their Big Data strategy and draw value from their data. I studied the human microbiota during two years at MetaGenoPolis (MGP), an innovative research center. We aim at improving human health by developing strategies (eg. nutritional, therapeutical, preventive…) to restore dysbiosed microbiota with our industrial and academical partners. I currently work in the statistical genetics group at the Pasteur Institut where I apply my software development and data science skills to quantify the impact of the human genome variation on diverse health parameters.
ClusteringData managementGenomicsGenome analysisExploratory data analysisGeneticsComparative metagenomicsDimensional reductionMultidimensional data analysis
2015 – . – Institut Pasteur, Paris, France – Unit : Bioinformatics and Biostatistics HUB 2012 – 2015 – Institut Pasteur, Paris, France – Unit : Molecular Genetics of Yeasts Supervisor : Prof. B. Dujon 2012 – Institut Pasteur, Paris, France – Unit : Integrated Mycobacterial Pathogenomics Supervisor: Dr. R. Brosch Education 2012– MSc. Bioinformatics – Université Paris Diderot (Paris VII)
Genome assemblySequence analysisGenome analysisOrthology and paralogy analysisRead mappingSequence homology analysisDNA structure analysisGenome rearrangementsMotifs and patterns detection
- Comparative genomics of Listeria monocytogenes isolates(Marc LECUIT - Biology of Infection) - Awaiting Publication
- Duplications in bacteriophage genomes.(Luisa DE SORDI - Molecular Biology of Gene in Extremophiles) - Awaiting Publication
- De novo sequencing and analysis of three unassigned species of non tuberculous mycobacteria.(RIM GHARBI - Integrated Mycobacterial Pathogenomics) - Awaiting Publication
After a PhD in Biology in 2011 on population genetics and phylogeography on amazing little amphipods (Crangonyx, Crymostygius) at the University of Reykjavik (Iceland), I pursued my interest in Bioinformatics and Evolutionary Biology in various post-docs in Spain (MNCN Madrid, UB Barcelona). During this time, I investigated transcriptomic landscapes for various non-model species (groups Conus, Junco and Caecilians) using de novo assemblies and participated in the development of TRUFA, a web platform for de novo RNA-seq analysis. In July 2016, I integrated the Revive Consortium and the Epigenetic Regulation unit at Pasteur Institute, where my main focus were transcriptomic and epigenetic analyses on various thematics using short and long reads technologies, with a special interest in alternative splicing events detection. I joined the Bioinformatics and Biostatistics Hub in January 2018. My latest interests are long reads technologies, alternative splicing and achieving reproducibility in Bioinformatics using workflow managers, container technologies and literate programming.
Data managementData VisualizationSequence analysisTranscriptomicsWeb developmentGenome analysisProgram developmentExploratory data analysisSofware development and engineeringGeneticsEvolutionRead mappingWorkflow and pipeline developmentPopulation geneticsMotifs and patterns detectionGrid and cloud computing
HumanInsect or arthropodOther animalAnopheles gambiae (African malaria mosquito)Mouse
- Build a software to decipher Gephyrin alternative transcripts obtained with long read sequencing(allemand ERIC - Epigenetic Regulation) - In Progress
- Transcriptomics of Anopheles – Plasmodium vivax interactions towards identification of malaria transmission blocking targets(Catherine BOURGOUIN - Functional Genetics of Infectious Diseases) - In Progress
- Mapping of Enhancers from transcriptome data(Christian MUCHARDT - Epigenetic Regulation) - In Progress
Rachel Legendre is a bioinformatics engineer. She completed her master degree in apprenticeship for two years at INRA in Jouy-en-Josas in the Genetic Animal department. She was involved in a project aiming at the detection and the expression analysis of micro-RNA involved in an equine disease. In 2012, she joined the Genomic, Structure and Translation Team at Paris-Sud (Paris XI) university. She worked principally on Ribosome Profiling data analysis, a new technique that allows to identify the position of the ribosome on the mRNA at the nucleotide level. Since november 2015, she worked at Institut Pasteur. During 4 years, she was detached to the Biomics Platform, where she was in charge of the bioinformatics analyses for transcriptomics and epigenomics projects. She was also involved in Long Reads (PacBio and Nanopore) developments with other bioinformaticians of Biomics. Since november 2019, she has joined the Hub of Bioinformatics and Biostatistics, et more precisely the Genome Organization Regulation and Expression group.
AlgorithmicsChIP-seqEpigenomicsNon coding RNATranscriptomicsGenome analysisProgram developmentScientific computingSofware development and engineeringIllumina HiSeqRead mappingSequencingWorkflow and pipeline developmentChromatin accessibility assaysPac BioRibosome profiling
BacteriaFungiParasiteHumanInsect or arthropodOther animal
- basic alignment/visualisation pipeline(Pablo NAVARRO - Epigenetics of Stem Cells) - In Progress
- The master regulator of virulence gene expression in Group B Streptococcus(Arnaud FIRON - Biology of Gram-Positive Pathogens) - Pending
- Transcriptional analysis of injured skeletal muscle(Eleonora ROSSI - Stroma, Inflammation and Tissue Repair) - Closed
After a Master degree in bioinformatics and biostatistics, I did a PhD in computer science / bioinformatics at University Paris-Sud (now in University Paris-Saclay), where I worked on integration and analysis of comparative genomics data. After a postdoc in Lausanne, Switzerland where I worked on small-RNA sequencing data, I joined GenoSplice where I was responsible for the development of bioinformatics projects related to next generation sequencing. I joined Institut Pasteur in Nov. 2015, to work in the Evolutionary Bioinformatics Unit and participate in the development of new tools and algorithms that are able to tackle efficiently the ever increasing amount of sequencing data.
AlgorithmicsData managementPhylogeneticsSequence analysisDatabaseGenome analysisProgram developmentScientific computingDatabases and ontologiesSequencingWorkflow and pipeline development
After a PhD in biochemistry of the rapeseed proteins, during which I developed my first automated scripts for handling data processing and analysis, I join Danone research facility center for developing multivariate models for the prediction of milk protein composition using infrared spectrometry.
As I was already developing my own informatics tools, I decided to join the course of informatic for biology of the Institut Pasteur in 2007. At the end of the course I was recruited by the Institute and integrate the unit of “génétique des interactions macromoléculaires” of Alain Jacquier. Within this group, I learn to handle sequencing data and I developed processing and analysis tools using python and R. I also create a genome browser and database system for storing, retrieving and visualizing microarray data. After 8 years within the Alain Jacquier’s lab, I join the Hub of bioinformatics and biostatistics as co-head of the team.
ClusteringData managementSequence analysisTranscriptomicsWeb developmentDatabaseGenome analysisProgram developmentScientific computingExploratory data analysisData and text miningIllumina HiSeqRead mappingLIMSIllumina MiSeqHigh Throughput ScreeningMultidimensional data analysisWorkflow and pipeline developmentRibosome profilingMotifs and patterns detection
- SHERLOCK4HAT - WP1.1(Brice ROTUREAU - Group: Trypanosome transmission) - In Progress
- Remettre les servers Genolist comme LegioList, TuberclListe, Colibri etc en service(Carmen BUCHRIESER - Biology Of Intracellular Bacteria) - Closed
- Identification of eukaryotic 5'UTRs(Arnaud ECHARD - Membrane Traffic and Cell Division) - Closed
Professional Experience Today - Institut Pasteur,Paris - HUB Team 2017 - Bioinformatician 2001 - 2017 - Institut Pasteur,Paris; CIB/DSI - Engineer 1997 - 2000 Thesis: NMR and molecular modelisation, CEA, Saclay,
Data managementSequence analysisTranscriptomicsGenome analysisProgram developmentScientific computing
FungiCandida albicansCryptococcus gattiiCryptococcus neoformans
- Characterization of Salmonella mutants(FRANCOISE NOREL - Biochemistry of Macromolecular Interactions) - In Progress
- Etude des TSS alternatif chez Cryptococcus(Guilhem JANBON - RNA Biology of Fungal Pathogens) - In Progress
- maintenance du site de la Collection des cyanobactéries(Bénédicte BENEDIC - Collection of Cyanobacteria) - Closed
Graduated in “Structural Genomics and Bioinformatics”, I mainly worked during almost 6 years at the Genoscope (CEA) in the LABGeM team, within the microbial annotation platform MicroScope. I specifically focused on functional annotation and microbial metabolic pathways prediction and reconstruction, through pipeline implementation, database modeling and web interface development. Broadly, interactions in the MicroScope platform allowed me to tackle the whole annotation process: from genome assembly and gene prediction to network reconstruction. I also performed several comparative genomics analyses. As a member of the “Hub team”, I now take part to various projects, linked to HTS data, on different subjects (lncRNAs and stem cells, HIV integration and DNA structure, Ribosomal protein genes and genome evolution, Natural Antisense Transcripts in compact genomes…).
Data managementGenomicsSequence analysisWeb developmentDatabaseGenome analysisDatabases and ontologiesOrthology and paralogy analysisRead mappingSequence homology analysisGene prediction
- Setup of bioinformatic pipelines for paleo(meta)genomics(Nicolás RASCOVAN - Department of Genomes and Genetics) - In Progress
- Multiparametric immunophenotyping of whole blood in IFN-treated multiple sclerosis patients(Priyanka DEVI - Cytokine Signaling) - Awaiting Publication
- Genomic DNA sequencing of Burkholderia ambifaria Q53 strain isolated from peanut rizospheric soil(Mathilde BEN ASSAYA - Structural Microbiology) - In Progress
Dr. Natalia Pietrosemoli is an Engineer with a M. Sc. in Modeling and Simulation of Complex Realities from the International Center for Theoretical Physics, ICTP and the International School of Advanced Studies, SISSA (Triest, Italy). During her M. Sc. internships she mostly worked in modeling, optimization, combinatorics and information theory applied to medical imaging. In 2012 she got a Ph. D in Computational Biology from the School of Bioengineering of Rice University (Houston, TX, US), where she specialized in computational structural biology and functional genomics. Her doctoral thesis “Protein functional features extracted with from primary sequences : a focus on disordered regions”, contributed to a better understanding of the functional and evolutionary role of intrinsic disorder in protein plasticity, complexity and adaptation to stress conditions. As part of her Ph. D., Natalia was a visiting scholar in two labs in Madrid: the Structural Computational Biology Group at the Spanish National Cancer Research Centre (CNIO), where she mainly worked in sequence analysis and the functional-structural relationships of proteins, and the Computational Systems Biology Group at the Spanish National Centre for Biotechnology (CNB-CSIC ), where she studied the functional implications of intrinsically disordered proteins at the genomic level for several organisms, collaborating with different experimental and theoretical groups. In 2013, she joined the Swiss Institute of Bioinformatics as a postdoctoral fellow in the Bioinformactics Core Facility. Her main project consisted in the molecular classification of a rare type of lymphoma, which involved the integration of transcriptomic, clinical and mutational data for the identification of molecular markers for classification, diagnosis and prognosis. This work was performed in collaboration with the Pathology Institute at the University Hospital of Lausanne (CHUV). In November of 2015 Natalia joined the Hub Team @ Pasteur C3BI as a Senior Bioinformatician. Natalia is especially interested in the integrative analysis of different omics data, both at large-scale and for small datasets, and loves collaborating in interdisciplinary environments and having feedback from her fellow experimental colleagues. Currently, she’s coordinating several projects performing functional and pathway analysis at the genomic level. By grouping genes, proteins and other biological molecules into the pathways they are involved in, the complexity of the analyses is significantly reduced, while the explanatory power increases with respect to having a list of differentially expressed genes or proteins.
AlgorithmicsData managementGenomicsImage analysisMachine learningModelingProteomicsSequence analysisStructural bioinformaticsTranscriptomicsDatabaseGenome analysisBiostatisticsScientific computingDatabases and ontologiesApplication of mathematics in sciencesData and text miningGeneticsGraphics and Image ProcessingBiosensors and biomarkersClinical researchCell biology and developmental biologyInteractomicsBioimage analysis
- Global BioID-based SARS-CoV-2 proteins proximal interactome unveils novel ties between viral polypeptides and host factors involved in multiple COVID19-associated mechanisms(Yves JACOB - Molecular Genetics of RNA Viruses) - In Progress
- Mitochondrial polarization identifies functionally mature human NK cells(Laura SURACE - Innate Immunity) - Awaiting Publication
- Proteomic analysis of the intracellular compartments containing Brucella abortus(Javier PIZARRO-CERDA - Yersinia) - In Progress
Najwa has been a postdoctoral fellow funded by the PTR project OM-Nega of the Institut Pasteur. Since January 2018 she has become the permanent bioinformatician of the group as part of the Hub team C3BI of the Institut Pasteur.
GenomicsSequence analysisDatabaseGenome analysisEvolutionOrthology and paralogy analysis
In 2012 I completed my master degree at the MicroScope Platform located at Genoscope (the French National Sequencing Center). I was involved in a project aiming at the management of evolution projects which rely on the Next Generation Sequencing (NGS) technologies to try to decipher the dynamics of genomic changes as well as the molecular bases and the mechanisms underlying adaptative evolution of micro-organisms (Remigi et al. 2014). Since November 2014, I joined the Bioinformatics and Biostatistics HUB at Institut Pasteur. I participated to the creation and updates of the C3BI website. I joined the WINTER group where I’m in charge of web and interface development projects. I have completed an UX-Design training to add extra value to my front-end development skills. I design and develop bioinformatics tools and interfaces that are users oriented.
Data VisualizationWeb developmentDatabaseGenome analysisScientific computingDatabases and ontologiesSofware development and engineeringWorkflow and pipeline development
- BIGSdb-Pasteur web pages and design renewal(Federica PALMA - Biological Resource Center of Institut Pasteur (CRBIP)) - In Progress
- Page web modélisation covid-19 pasteur(Simon CAUCHEMEZ - Mathematical Modelling Of Infectious Diseases) - Pending
- French GWAS Covid webpage(Hugues ASCHARD - Statistical Genetics) - In Progress
Related projects (3)
Whole genome sequencing is revolutionizing the surveillance of foodborne and waterborne bacterial pathogens. The speed with which public health laboratories obtain information after the onset of symptoms and the regular sharing of this information between public health laboratories and epidemiologists are critical for the successful use of information to detect outbreaks early and to identify their source. For this purpose, this project aims at providing the most relevant bacterial genomic information in a timely-manner by integrating different validated in silico tools (core genome MLST, CRISPR, O and H molecular serotype, ....) into a single automated analysis pipeline.
Bacteria of Shigella species are responsible for bacillary dysentery (shigellosis) in humans. A characteristic feature of these bacteria, used for identification purposes, is their inability to use la
Identification of a secondary mutation which likely appeared during the construction of a mutant strain of Salmonella serotype Typhimurium