Support information
- Category: Genome assembly
- Coordination: support.sigenae@inrae.fr
- Service: since 2018
- Project example: GenoFish
Genome assembly
Sequencing technology evolution toward long high quality reads enables today to assemble large genomes
(1Gb and more) for a few thousand euros. Which makes it possible for a team or even a single researcher
to produce the reference best adapted for her/his daily work. Genome assembly is know to be like
building a jigsaw puzzle, the larger the pieces the easier the work. Long read assembly will produce
contigs which depending on the read quality will have to be corrected (Nanopore or PacBio CLR) or not
(PacBio HiFi). Contig correction is performed by aligning long and/or short reads and updating bases in
the reference which do not match with the alignment. Contigs are then scaffolded in chromosomes using
BioNano optical maps, Hi-C ligation reads or a genetic map.
Sigenae has assembled over fifteen fish genome as well as the bee genome. Some of these assemblies
where quite complicated due to slow evolution and recent duplication as for example the sterlet
sturgeon genome (Nature Ecology & Evolution,
2020). We have taken part in the assemblies of the
SeqOccin project. And we have also been in charge of the assemblies of the
GenoFish project of which some genomes are references at the NCBI or Ensembl like the
javafish, the
yellow perch or the
european perch.
Assembly projects recently published