Skip to content

The project generated a De novo whole genome assembly of two Hippobosca species. The repository contains all the scripts, software, and other materials used in the project.

License

Notifications You must be signed in to change notification settings

fredrickkebaso/Hippobosca_Genome_Assembly_And_Annotation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

De-novo-Genome-Assembly-and-chemosensory-gene-annotation-in-Hippobosca-variegata-and-Hippobosca camelina

Background of the study

Hippoboscids (keds/flies) are obligate hematophagous or blood-sucking ectoparasites that infest birds, mammals, and rarely humans. They vector pathogens such as Anaplasma camelii and Acanthocheilonema dracunculoides. The keds require about 1.5mg of blood in a span of 6 hours at an interval of 13 minutes as such heavy infestation would lead to anemia when unchecked. The frequent and rapid feeding intervals result in numerous painful bites on the host, prompting the host to scratch against rough surfaces. The scratches lead to the development of wounds, creating a conducive environment for myiasis, a condition where larvae infest the host's wounds. The occurrence of myiasis adds to the economic burden for farmers, as it results in increased production costs and substantial losses when livestock succumb to the negative effects of this infestation.

The control strategies of vector-borne diseases (VBDs) and their effects target the vector itself, thereby controlling the transmission rates of the vectored pathogens. Most vectors, particularly Dipterans or Insects, have developed a robust chemosensory system that plays a vital role in their ability to locate hosts for feeding, mates for reproduction, and predator evasion, which ultimately contributes to their successful establishment ecologically. To effectively suppress pathogen transmission to hosts, strategies have been devised to exploit the vector's chemosensory system in order to trap and repel the vectors themselves.

In this study, we hypothesize that Hippoboscus, a dipteran, relies extensively on its chemosensory system to locate its host, facilitate reproduction, and potentially transmit pathogens. A genome assembly for the Hippobosca species is currently unavailable in the genome database. This limits access to molecular insights on Hippobosca species, particularly the chemosensory system.

This study therefore focused on generating a whole genome assembly for two Hippobosca species: Hippobosca variegata and Hippobosca camelina and describing their chemosensory system.

Screenshot from 2023-08-27 15-24-57

H.camelina; Adapted from ICIPE Insect of the month, photo by Dr. Dan Masiga.; H.variegata - Adapted from Namibia discussion forum.

The availing of the genome assembly will provide the knowledge base upon which vector competence, epidemiology and effective control strategies of the hippoboscus flies will be based.

Study Objectives

Overall objective

  • To construct a whole genome assembly of H. variegata and H. camelina and describe the genes responsible for chemosensing in Hippobosca.

Specific objectives

  1. To generate whole genome assembly for H. variegata and H. camelina.

  2. To identify the genomic architecture and gene content of the chemosensory genes in the two Hippobosca fly species.

Analysis workflow

De novo genome assembly workflow

Workflows-De-novo assembly drawio

Functional annotation Workflow

Workflows-Functional annotation drawio

Software and Packages

All tools used have been documented here

Results

Objective 1. Assembled Genome Features

Metric Hippoboscus variegata Hippoboscus camelina
Genome size (mb) 147.25 148.44
Contig N50 (kb) 3kb 3kb
Number of contigs 177664 179543
GC content 29.40% 28.60%
Percentage gaps 0.74% 0.08%

Table Showing the genome quality metrics for each of the Hippoboscus species. The metric column represents the genome attribute, and the Hippoboscus variegata and Hippoboscus camelina column shows the genome score per feature.

Objective 2. Chemosensory genes annotated for Hippoboscus Species

Sample predicted Gustatory Receptor for H. camelina

H camelina_Gustatory_receptor

Figure 1: Positions of the transmembrane domains of the predicted Putative Gustatory receptor of Hippobosca camelina. The numerical values ranging from 1 to 5 correspond to the existing domains. The horizontal bar symbolizes the cellular membrane.

hippoboscus_only_chem_genes

Figure 2: A group bar chart showing the predicted chemosensory gene counts per gene familly in H. variegata and H. camelina gene counts. The Y-axis shows the actual gene counts per gene family. The X-axis shows the gene family; Chemosensory Specific Proteins (CSPs), Gustatory Proteins(GRs), Ionotropic Receptors (IRs), Odorant Binding Proteins (OBPs), and Odorant Receptors (ORs).

chemosensory_genes_hippoboscus_vs_Glossina

Figure 3: A group bar chart showing the predicted chemosensory gene counts per gene familly in H. variegata and H. camelina alongside _Glossina morsitans _ gene. The Y-axis shows the actual gene counts per gene family. The X-axis shows the gene family; Chemosensory Specific Proteins (CSPs), Gustatory Proteins(GRs), Ionotropic Receptors (IRs), Odorant Binding Proteins (OBPs), and Odorant Receptors (ORs).

Phylogenetic Clustering

Phylogenetic tree for the predicted Hippoboscus chemosensory genes

Gene_tree13

Figure 4: shows the phylogenetic clustering of all the predicted putative chemosensory genes for Hippoboscus species and selected models from the closest species. Each chemosensory gene family is colored distinctly as defined by the key on the right of the figure. The tree leaves indicate the aligned genes with the prefix GMOY, representing selected models from Glossina morsitans yale isolate, GBRI- Glossina Brevipalpis, and FBp - Drosophila melanogaster. The Hippoboscus individual gene names (temporal working gene names) are prefixed as H. variegata (hv) and H. camelina as hc for all aligned gene sequences.

References

  1. Zhang, Q., Zhou, Q., Han, S., Li, Y., Wang, Y., & He, H. (2023). The genome of sheep ked (Melophagus ovinus) reveals potential mechanisms underlying reproduction and narrower ecological niches. BMC Genomics, 24(1). https://doi.org/10.1186/s12864-023-09155-1

  2. Bezerra-Santos, M. A., & Otranto, D. (2020). Keds, the enigmatic flies and their role as vectors of pathogens. Acta Tropica, 209(April), 105521. https://doi.org/10.1016/j.actatropica.2020.105521

  3. Bargul, J. L., Kidambasi, K. O., Getahun, M. N., Villinger, J., Copeland, R. S., Muema, J. M., Carrington, M., & Masiga, D. K. (2021). Transmission of ‘candidatus anaplasma camelii’ to mice and rabbits by camel-specific keds, hippobosca camelina. PLoS Neglected Tropical Diseases, 15(8), 1–20. https://doi.org/10.1371/journal.pntd.0009671

  4. Macharia, R., Mireji, P., Murungi, E., Murilla, G., Christoffels, A., Aksoy, S., & Masiga, D. (2016). Genome-Wide Comparative Analysis of Chemosensory Gene Families in Five Tsetse Fly Species. PLoS Neglected Tropical Diseases, 10(2). https://doi.org/10.1371/journal.pntd.0004421

About

The project generated a De novo whole genome assembly of two Hippobosca species. The repository contains all the scripts, software, and other materials used in the project.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published