0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Characterizing Microsatellite Distribution Patterns Across Distinct Gene Categories in Human
Authors :
Elahe Mehrazin
1
Mahmoud Naghibzadeh
2
Sara Jamali
3
1- Dept. of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
2- Dept. of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
3- Dept. of Medical Genetics, Hormozgan University of Medical Sciences, Bandar Abbas, Iran
Keywords :
Genome،Gene،Microsatellite،Coding Sequence،Protein-Coding genes،Simple Sequence Repeat (SSR)،Distribution pattern،Long non-coding RNAs،protein
Abstract :
Microsatellites are genomic regions composed of short repeat units (typically 1–6 base pairs) that are tandemly repeated multiple times and are distributed throughout the genome of various organisms, including humans. With the growing understanding of their roles in the human genome, such as involvement in the development of diseases like hereditary neurological disorders and certain colorectal tumors, as well as their use as genetic markers in population studies and forensic science, numerous algorithms have been developed to identify these sequences across the genome. To date, many studies using such algorithms have statistically analyzed the distribution patterns of microsatellites across different genomic regions, though most of them have focused on exonic, intronic, intergenic, and coding regions. In this study, we used SQL and the powerful Power BI tool to investigate the distribution patterns of microsatellite sequences in the human genome, with a specific emphasis on different types of genes and coding regions. The results revealed that the distribution pattern of microsatellites varies among different gene types. Protein-coding genes contained the highest number of microsatellites, whereas small nucleolar RNA (snoRNA) and microRNA (miRNA) genes included very few, and some gene types, such as small nuclear RNA (snRNA) and transfer RNA (tRNA), were completely devoid of them. Interestingly, protein-coding genes had the highest frequency of microsatellite occurrences, including long repeats over 100 nucleotides. Distribution analysis in coding regions showed that among all repeats, trinucleotide sequences such as ACG, CAG, and GAG were the most frequently found in Known messenger RNA (mRNA), contributing to the formation of repeat-rich polypeptides composed of threonine, glutamine, and glutamic acid. We also provided a list of protein-coding genes with the highest number of protein products encoded from microsatellite-containing regions.
Papers List
List of archived papers
Evaluation of Efficient Electrocardiomatrix-based Identification Using Deep Learning Methods
Amirhossein Safari - Narges Mokhtari - Mohsen Hooshmand - Sadegh Sadeghi - Peyman Pahlevani
TriMAE: Fashion visual search with Triplet Masked Auto Encoder Vision Transformer
Lachin Zamani - Reza Azmi
IranITJobs2021: a Dataset for Analyzing Iranian Online IT Job Advertisements Collected Using a New Crowdsourcing Process
Fakhroddin Noorbehbahani - Nikta Akbarpour - Mohammad Reza Saeidi
Distilled BERT Model In Natural Language Processing
Yazdan Zandiye Vakili - Avisa Fallah - Hedieh Sajedi
Introducing E4MT and LMBNC: Persian pre-processing utilities
Zakieh Shakeri - Mehran Ziabary - Behrooz Vedadian - Fatemeh Azadi - Saeed Torabzadeh - Arian Atefi
Energy-Aware Dynamic Digital Twin Placement in Mobile Edge Computing
Mahdi Hematyar - Zeinab Movahedi
A Language-Independent Approach to Classification of Textual File Fragments: Case Study of Persian, English, and Chinese Languages
Fatemeh Mansouri Hanis - Hamidreza Khoshvaghti - Mehdi Teimouri - Hadi Veisi
Semantic Segmentation Using Region Proposals and Weakly-Supervised Learning
Maryam Taghizadeh - Abdolah Chalechale
A Deep CNN Model Based Ensemble Approach for Semantic and Instance Segmentation of Indoor Environment
Sajad Rezaei - Jafar Tanha - Zahra Jafari - SeyedEhsan Roshan - Mohammad-Amin Memar Kochebagh
Age Estimation Based on Facial Images Using Hybrid Features and Particle Swarm Optimization
NILOUFAR MEHRABI - SAYED PEDRAM HAERI BOROUJENI
more
Samin Hamayesh - Version 43.7.0