Genomic rearrangements play a significant role in disease,evolution,and tumorigenesis(Mani and Chinnaiyan,2010).While single nucleotide polymorphisms(SNPs)and small insertions/deletions contribute substantially to genetic variation,discerning the independent impact of structural variations poses a challenge.
Background The reliance on a solitary linear reference genome has imposed a significant constraint on our compre-hensive understanding of genetic variation in animals.This constraint is particularly pronounced for non-reference sequences(NRSs),which have not been extensively studied.Results In this study,we constructed a pig pangenome graph using 21 pig assemblies and identified 23,831 NRSs with a total length of 105 Mb.Our findings revealed that NRSs were more prevalent in breeds exhibiting greater genetic divergence from the reference genome.Furthermore,we observed that NRSs were rarely found within coding sequences,while NRS insertions were enriched in immune-related Gene Ontology terms.Notably,our investigation also unveiled a close association between novel genes and the immune capacity of pigs.We observed substantial differences in terms of frequencies of NRSs between Eastern and Western pigs,and the heat-resistant pigs exhibited a substantial number of NRS insertions in an 11.6 Mb interval on chromosome X.Additionally,we discovered a 665 bp insertion in the fourth intron of the TNFRSF19 gene that may be associated with the ability of heat tolerance in South-ern Chinese pigs.Conclusions Our findings demonstrate the potential of a graph genome approach to reveal important functional features of NRSs in pig populations.
Jian MiaoXingyu WeiCaiyun CaoJiabao SunYuejin XuZhe ZhangQishan WangYuchun PanZhen Wang
Objective:To address the phylogenetic and phylogeographic relationship between different lineages of Anopheles(An.)subpictus species complex in most parts of the Asian continent by maximum utilization of Internal Transcriber Spacer 2(ITS2)and cytochrome C oxidase I(COI)sequences deposited at the GenBank.Methods:Seventy-five ITS2,210 COI and 26 concatenated sequences available in the NCBI database were used.Phylogenetic analysis was performed using Bayesian likelihood trees,whereas median-joining haplotype networks and time-scale divergence trees were generated for phylogeographic analysis.Genetic diversity indices and genetic differentiation were also calculated.Results:Two genetically divergent molecular forms of An.subpictus species complex corresponding to sibling species A and B are established.Species A evolved around 37-82 million years ago in Sri Lanka,India,and the Netherlands,and species B evolved around 22-79 million years ago in Sri Lanka,India,and Myanmar.Vietnam,Thailand,and Cambodia have two molecular forms:one is phylogenetically similar to species B.Other forms differ from species A and B and evolved recently in the above mentioned countries,Indonesia and the Philippines.Genetic subdivision among Sri Lanka,India,and the Netherlands is almost absent.A substantial genetic differentiation was obtained for some populations due to isolation by large geographical distances.Genetic diversity indices reveal the presence of a long-established stable mosquito population,at mutation-drift equilibrium,regardless of population fluctuations.Conclusions:An.subpictus species complex consists of more than two genetically divergent molecular forms.Species A is highly divergent from the rest.Sri Lanka and India contain only species A and B.
Lihini Sandaleka MuthukumaranaMethsala Madurangi WedageSamanthika RathnayakeNissanka Kolitha De Silva
Learning modality-fused representations and processing unaligned multimodal sequences are meaningful and challenging in multimodal emotion recognition.Existing approaches use directional pairwise attention or a message hub to fuse language,visual,and audio modalities.However,these fusion methods are often quadratic in complexity with respect to the modal sequence length,bring redundant information and are not efficient.In this paper,we propose an efficient neural network to learn modality-fused representations with CB-Transformer(LMR-CBT)for multimodal emotion recognition from unaligned multi-modal sequences.Specifically,we first perform feature extraction for the three modalities respectively to obtain the local structure of the sequences.Then,we design an innovative asymmetric transformer with cross-modal blocks(CB-Transformer)that enables complementary learning of different modalities,mainly divided into local temporal learning,cross-modal feature fusion and global self-attention representations.In addition,we splice the fused features with the original features to classify the emotions of the sequences.Finally,we conduct word-aligned and unaligned experiments on three challenging datasets,IEMOCAP,CMU-MOSI,and CMU-MOSEI.The experimental results show the superiority and efficiency of our proposed method in both settings.Compared with the mainstream methods,our approach reaches the state-of-the-art with a minimum number of parameters.