WorldWideScience

Sample records for non-synonymous coding snps

  1. In-silico analysis of non-synonymous-SNPs of STEAP2: To provoke the progression of prostate cancer

    Directory of Open Access Journals (Sweden)

    Naveed Muhammad

    2016-01-01

    Full Text Available As a novel biomarker from the STEAP family, STEAP2 encodes six transmembrane epithelial antigens to prostate cancer. The overexpression of STEAP2 is predicted as the second most common cancer in the world that is responsible for male cancer-related deaths. Nonsynonymous SNPs are important group of SNPs which lead to alternations in encoded polypeptides. Changes in the amino acid sequence of gene products can lead to abnormal tissue function. The present study firstly sorted out those SNPs which exist in the coding region of STEAP2 and evaluated their impact through computational tools. Secondly, the three-dimensional structure of STEAP2 was formed through I-TASSER and validated by different software. Genomic data has been retrieved from the 1000 Genome project and Ensembl and subsequently analysed using computational tools. Out of 177 non-synonymous single nucleotide polymorphisms (nsSNPs within the coding region, 42 mis-sense SNPs have been predicted as deleterious by all analyses. Our research shows a welldesigned computational methodology to inspect the prostate cancer associated nsSNPs. It can be concluded that these nsSNPs can play their role in the up-regulation of STEAP2 which further leads to progression of prostate cancer. It can benefit scientists in the handling of cancerassociated diseases related to STEAP2 through developing novel drug therapies.

  2. Bioinformatic Analysis of Deleterious Non-Synonymous Single Nucleotide Polymorphisms (nsSNPs in the Coding Regions of Human Prion Protein Gene (PRNP

    Directory of Open Access Journals (Sweden)

    Kourosh Bamdad

    2016-12-01

    Full Text Available Background & Objective: Single nucleotide polymorphisms are the cause of genetic variation to living organisms. Single nucleotide polymorphisms alter residues in the protein sequence. In this investigation, the relationship between prion protein gene polymorphisms and its relevance to pathogenicity was studied. Material & Method: Amino acid sequence of the main isoform from the human prion protein gene (PRNP was extracted from UniProt database and evaluated by FoldAmyloid and AmylPred servers. All non-synonymous single nucleotide polymorphisms (nsSNPs from SNP database (dbSNP were further analyzed by bioinformatics servers including SIFT, PolyPhen-2, I-Mutant-3.0, PANTHER, SNPs & GO, PHD-SNP, Meta-SNP, and MutPred to determine the most damaging nsSNPs. Results: The results of the first structure analyses by FoldAmyloid and AmylPerd servers implied that regions including 5-15, 174-178, 180-184, 211-217, and 240-252 were the most sensitive parts of the protein sequence to amyloidosis. Screening all nsSNPs of the main protein isoform using bioinformatic servers revealed that substitution of Aspartic acid with Valine at position 178 (ID code: rs11538766 was the most deleterious nsSNP in the protein structure. Conclusion:  Substitution of the Aspartic acid with Valine at position 178 (D178V was the most pathogenic mutation in the human prion protein gene. Analyses from the MutPred server also showed that beta-sheets’ increment in the secondary structure was the main reason behind the molecular mechanism of the prion protein aggregation.

  3. In silico analysis of consequences of non-synonymous SNPs of Slc11a2 gene in Indian bovines

    Directory of Open Access Journals (Sweden)

    Shreya M. Patel

    2015-09-01

    Full Text Available The aim of our study was to analyze the consequences of non-synonymous SNPs in Slc11a2 gene using bioinformatic tools. There is a current need of efficient bioinformatic tools for in-depth analysis of data generated by the next generation sequencing technologies. SNPs are known to play an imperative role in understanding the genetic basis of many genetic diseases. Slc11a2 is one of the major metal transporter families in mammals and plays a critical role in host defenses. In this study, we performed a comprehensive analysis of the impact of all non-synonymous SNPs in this gene using multiple tools like SIFT, PROVEAN, I-Mutant and PANTHER. Among the total 124 SNPs obtained from amplicon sequencing of Slc11a2 gene by Ion Torrent PGM involving 10 individuals of Gir cattle and Murrah buffalo each, we found 22 non-synonymous. Comparing the prediction of these 4 methods, 5 nsSNPs (G369R, Y374C, A377V, Q385H and N492S were identified as deleterious. In addition, while tested out for polar interactions with other amino acids in the protein, from above 5, Y374C, Q385H and N492S showed a change in interaction pattern and further confirmed by an increase in total energy after energy minimizations in case of mutant protein compared to the native.

  4. The effects of non-synonymous single nucleotide polymorphisms (nsSNPs) on protein-protein interactions.

    Science.gov (United States)

    Yates, Christopher M; Sternberg, Michael J E

    2013-11-01

    Non-synonymous single nucleotide polymorphisms (nsSNPs) are single base changes leading to a change to the amino acid sequence of the encoded protein. Many of these variants are associated with disease, so nsSNPs have been well studied, with studies looking at the effects of nsSNPs on individual proteins, for example, on stability and enzyme active sites. In recent years, the impact of nsSNPs upon protein-protein interactions has also been investigated, giving a greater insight into the mechanisms by which nsSNPs can lead to disease. In this review, we summarize these studies, looking at the various mechanisms by which nsSNPs can affect protein-protein interactions. We focus on structural changes that can impair interaction, changes to disorder, gain of interaction, and post-translational modifications before looking at some examples of nsSNPs at human-pathogen protein-protein interfaces and the analysis of nsSNPs from a network perspective. © 2013.

  5. LS-SNP/PDB: annotated non-synonymous SNPs mapped to Protein Data Bank structures.

    Science.gov (United States)

    Ryan, Michael; Diekhans, Mark; Lien, Stephanie; Liu, Yun; Karchin, Rachel

    2009-06-01

    LS-SNP/PDB is a new WWW resource for genome-wide annotation of human non-synonymous (amino acid changing) SNPs. It serves high-quality protein graphics rendered with UCSF Chimera molecular visualization software. The system is kept up-to-date by an automated, high-throughput build pipeline that systematically maps human nsSNPs onto Protein Data Bank structures and annotates several biologically relevant features. LS-SNP/PDB is available at (http://ls-snp.icm.jhu.edu/ls-snp-pdb) and via links from protein data bank (PDB) biology and chemistry tabs, UCSC Genome Browser Gene Details and SNP Details pages and PharmGKB Gene Variants Downloads/Cross-References pages.

  6. Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants

    DEFF Research Database (Denmark)

    Li, Yingrui; Vinckenbosch, Nicolas; Tian, Geng

    2010-01-01

    data, we derived the allele frequency spectrum of cSNPs with a minor allele frequency greater than 0.02. We identified a 1.8-fold excess of deleterious, non-syonomyous cSNPs over synonymous cSNPs in the low-frequency range (minor allele frequencies between 2% and 5%). This excess was more pronounced...

  7. Common non-synonymous SNPs associated with breast cancer susceptibility

    DEFF Research Database (Denmark)

    Milne, Roger L; Burwinkel, Barbara; Michailidou, Kyriaki

    2014-01-01

    Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility variants, although most studies have been underpowered to detect associations of a realistic magnitude. We assessed 41 common non-synonymous single-nucleotide polymorphisms (ns......SNPs) for which evidence of association with breast cancer risk had been previously reported. Case-control data were combined from 38 studies of white European women (46 450 cases and 42 600 controls) and analyzed using unconditional logistic regression. Strong evidence of association was observed for three ns...... associations reached genome-wide statistical significance in a combined analysis of available data, including independent data from nine genome-wide association studies (GWASs): for ATXN7-K264R, OR = 1.07 (95% CI = 1.05-1.10, P = 1.0 × 10(-8)); for AKAP9-M463I, OR = 1.05 (95% CI = 1.04-1.07, P = 2.0 × 10...

  8. Predicting the effects of non-synonymous amino acid variants on ...

    African Journals Online (AJOL)

    amino acid change) coding SNPs to cause functional impact on protein at the PRLR locus of cattle and chicken using the MEGA MD bioinformatics tool. In cattle, sixteen out of the first twenty non synonymous amino substitutions obtained: V5A, T9V, ...

  9. Determining effects of non-synonymous SNPs on protein-protein interactions using supervised and semi-supervised learning.

    Directory of Open Access Journals (Sweden)

    Nan Zhao

    2014-05-01

    Full Text Available Single nucleotide polymorphisms (SNPs are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs have been found near or inside the protein-protein interaction (PPI interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor. Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1 a 2-class problem (strengthening/weakening PPI mutations, (2 another 2-class problem (mutations that disrupt/preserve a PPI, and (3 a 3-class classification (detrimental/neutral/beneficial mutation effects. In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the

  10. Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning

    Science.gov (United States)

    Zhao, Nan; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry

    2014-01-01

    Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of

  11. Common non-synonymous SNPs associated with breast cancer susceptibility: findings from the Breast Cancer Association Consortium

    OpenAIRE

    Milne, Roger L.; Burwinkel, Barbara; Michailidou, Kyriaki; Arias-Perez, Jose-Ignacio; Zamora, M. Pilar; Menéndez-Rodríguez, Primitiva; Hardisson, David; Mendiola, Marta; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Dennis, Joe; Wang, Qin; Bolla, Manjeet K.; Swerdlow, Anthony

    2014-01-01

    Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility variants, although most studies have been underpowered to detect associations of a realistic magnitude. We assessed 41 common non-synonymous single-nucleotide polymorphisms (nsSNPs) for which evidence of association with breast cancer risk had been previously reported. Case-control data were combined from 38 studies of white European women (46 450 cases and 42 600 controls) ...

  12. Common non-synonymous SNPs associated with breast cancer susceptibility: findings from the Breast Cancer Association Consortium

    OpenAIRE

    Milne, Roger L.; Burwinkel, Barbara; Michailidou, Kyriaki; Arias-Perez, Jose-Ignacio; Zamora, M. Pilar; Menéndez-Rodríguez, Primitiva; Hardisson, David; Mendiola, Marta; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Dennis, Joe; Wang, Qin; Bolla, Manjeet K.; Swerdlow, Anthony

    2014-01-01

    Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility\\ud variants, although most studies have been underpowered to detect associations of a realistic magnitude.\\ud We assessed 41 common non-synonymous single-nucleotide polymorphisms (nsSNPs) for which\\ud evidence of association with breast cancer risk had been previously reported. Case-control data were combined\\ud from 38 studies of white European women (46 450 cases and 42 60...

  13. Targeted Metabolic Engineering Guided by Computational Analysis of Single-Nucleotide Polymorphisms (SNPs)

    DEFF Research Database (Denmark)

    Udatha, D B R K Gupta; Rasmussen, Simon; Sicheritz-Pontén, Thomas

    2013-01-01

    The non-synonymous SNPs, the so-called non-silent SNPs, which are single-nucleotide variations in the coding regions that give "birth" to amino acid mutations, are often involved in the modulation of protein function. Understanding the effect of individual amino acid mutations on a protein...

  14. Common non-synonymous SNPs associated with breast cancer susceptibility: findings from the Breast Cancer Association Consortium

    OpenAIRE

    Milne, Roger L; Burwinkel, Barbara; Michailidou, Kyriaki; Arias-Perez, Jose-Ignacio; Zamora, M Pilar; Menéndez-Rodríguez, Primitiva; Hardisson, David; Mendiola, Marta; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Dennis, Joe; Wang, Qin; Bolla, Manjeet K; Swerdlow, Anthony

    2014-01-01

    Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility variants, although most studies have been underpowered to detect associations of a realistic magnitude We assessed 41 common non-synonymous single-nucleotide polymorphisms (nsSNPs) for which evidence of association with breast cancer risk had been previously reported. Case-control data were combined from 38 studies of white European women (46 450 cases and 42 600 controls) a...

  15. Partition dataset according to amino acid type improves the prediction of deleterious non-synonymous SNPs

    International Nuclear Information System (INIS)

    Yang, Jing; Li, Yuan-Yuan; Li, Yi-Xue; Ye, Zhi-Qiang

    2012-01-01

    Highlights: ► Proper dataset partition can improve the prediction of deleterious nsSNPs. ► Partition according to original residue type at nsSNP is a good criterion. ► Similar strategy is supposed promising in other machine learning problems. -- Abstract: Many non-synonymous SNPs (nsSNPs) are associated with diseases, and numerous machine learning methods have been applied to train classifiers for sorting disease-associated nsSNPs from neutral ones. The continuously accumulated nsSNP data allows us to further explore better prediction approaches. In this work, we partitioned the training data into 20 subsets according to either original or substituted amino acid type at the nsSNP site. Using support vector machine (SVM), training classification models on each subset resulted in an overall accuracy of 76.3% or 74.9% depending on the two different partition criteria, while training on the whole dataset obtained an accuracy of only 72.6%. Moreover, the dataset was also randomly divided into 20 subsets, but the corresponding accuracy was only 73.2%. Our results demonstrated that partitioning the whole training dataset into subsets properly, i.e., according to the residue type at the nsSNP site, will improve the performance of the trained classifiers significantly, which should be valuable in developing better tools for predicting the disease-association of nsSNPs.

  16. Common non-synonymous SNPs associated with breast cancer susceptibility: findings from the Breast Cancer Association Consortium.

    Science.gov (United States)

    Milne, Roger L; Burwinkel, Barbara; Michailidou, Kyriaki; Arias-Perez, Jose-Ignacio; Zamora, M Pilar; Menéndez-Rodríguez, Primitiva; Hardisson, David; Mendiola, Marta; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Dennis, Joe; Wang, Qin; Bolla, Manjeet K; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk; Ko, Yon-Dschun; Brauch, Hiltrud; Hamann, Ute; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Tajima, Kazuo; Li, Jingmei; Brand, Judith S; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lambrechts, Diether; Peuteman, Gilian; Christiaens, Marie-Rose; Smeets, Ann; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katazyna; Hartman, Mikael; Hui, Miao; Yen Lim, Wei; Wan Chan, Ching; Marme, Federick; Yang, Rongxi; Bugert, Peter; Lindblom, Annika; Margolin, Sara; García-Closas, Montserrat; Chanock, Stephen J; Lissowska, Jolanta; Figueroa, Jonine D; Bojesen, Stig E; Nordestgaard, Børge G; Flyger, Henrik; Hooning, Maartje J; Kriege, Mieke; van den Ouweland, Ans M W; Koppert, Linetta B; Fletcher, Olivia; Johnson, Nichola; dos-Santos-Silva, Isabel; Peto, Julian; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha J; Long, Jirong; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Schmidt, Marjanka K; Broeks, Annegien; Cornelissen, Sten; Braaf, Linde; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Noh, Dong-Young; Simard, Jacques; Dumont, Martine; Goldberg, Mark S; Labrèche, France; Fasching, Peter A; Hein, Alexander; Ekici, Arif B; Beckmann, Matthias W; Radice, Paolo; Peterlongo, Paolo; Azzollini, Jacopo; Barile, Monica; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Hopper, John L; Schmidt, Daniel F; Makalic, Enes; Southey, Melissa C; Hwang Teo, Soo; Har Yip, Cheng; Sivanandan, Kavitta; Tay, Wan-Ting; Shen, Chen-Yang; Hsiung, Chia-Ni; Yu, Jyh-Cherng; Hou, Ming-Feng; Guénel, Pascal; Truong, Therese; Sanchez, Marie; Mulot, Claire; Blot, William; Cai, Qiuyin; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Bogdanova, Natalia; Dörk, Thilo; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Zhang, Ben; Couch, Fergus J; Toland, Amanda E; Yannoukakos, Drakoulis; Sangrajrang, Suleeporn; McKay, James; Wang, Xianshu; Olson, Janet E; Vachon, Celine; Purrington, Kristen; Severi, Gianluca; Baglietto, Laura; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Ahmed, Shahana; Shah, Mitul; Pharoah, Paul D P; Hall, Per; Giles, Graham G; Benítez, Javier; Dunning, Alison M; Chenevix-Trench, Georgia; Easton, Douglas F

    2014-11-15

    Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility variants, although most studies have been underpowered to detect associations of a realistic magnitude. We assessed 41 common non-synonymous single-nucleotide polymorphisms (nsSNPs) for which evidence of association with breast cancer risk had been previously reported. Case-control data were combined from 38 studies of white European women (46 450 cases and 42 600 controls) and analyzed using unconditional logistic regression. Strong evidence of association was observed for three nsSNPs: ATXN7-K264R at 3p21 [rs1053338, per allele OR = 1.07, 95% confidence interval (CI) = 1.04-1.10, P = 2.9 × 10(-6)], AKAP9-M463I at 7q21 (rs6964587, OR = 1.05, 95% CI = 1.03-1.07, P = 1.7 × 10(-6)) and NEK10-L513S at 3p24 (rs10510592, OR = 1.10, 95% CI = 1.07-1.12, P = 5.1 × 10(-17)). The first two associations reached genome-wide statistical significance in a combined analysis of available data, including independent data from nine genome-wide association studies (GWASs): for ATXN7-K264R, OR = 1.07 (95% CI = 1.05-1.10, P = 1.0 × 10(-8)); for AKAP9-M463I, OR = 1.05 (95% CI = 1.04-1.07, P = 2.0 × 10(-10)). Further analysis of other common variants in these two regions suggested that intronic SNPs nearby are more strongly associated with disease risk. We have thus identified a novel susceptibility locus at 3p21, and confirmed previous suggestive evidence that rs6964587 at 7q21 is associated with risk. The third locus, rs10510592, is located in an established breast cancer susceptibility region; the association was substantially attenuated after adjustment for the known GWAS hit. Thus, each of the associated nsSNPs is likely to be a marker for another, non-coding, variant causally related to breast cancer risk. Further fine-mapping and functional studies are required to identify the underlying risk-modifying variants and the genes through which they act. © The

  17. Single nucleotide polymorphisms (SNPs in coding regions of canine dopamine- and serotonin-related genes

    Directory of Open Access Journals (Sweden)

    Lingaas Frode

    2008-01-01

    Full Text Available Abstract Background Polymorphism in genes of regulating enzymes, transporters and receptors of the neurotransmitters of the central nervous system have been associated with altered behaviour, and single nucleotide polymorphisms (SNPs represent the most frequent type of genetic variation. The serotonin and dopamine signalling systems have a central influence on different behavioural phenotypes, both of invertebrates and vertebrates, and this study was undertaken in order to explore genetic variation that may be associated with variation in behaviour. Results Single nucleotide polymorphisms in canine genes related to behaviour were identified by individually sequencing eight dogs (Canis familiaris of different breeds. Eighteen genes from the dopamine and the serotonin systems were screened, revealing 34 SNPs distributed in 14 of the 18 selected genes. A total of 24,895 bp coding sequence was sequenced yielding an average frequency of one SNP per 732 bp (1/732. A total of 11 non-synonymous SNPs (nsSNPs, which may be involved in alteration of protein function, were detected. Of these 11 nsSNPs, six resulted in a substitution of amino acid residue with concomitant change in structural parameters. Conclusion We have identified a number of coding SNPs in behaviour-related genes, several of which change the amino acids of the proteins. Some of the canine SNPs exist in codons that are evolutionary conserved between five compared species, and predictions indicate that they may have a functional effect on the protein. The reported coding SNP frequency of the studied genes falls within the range of SNP frequencies reported earlier in the dog and other mammalian species. Novel SNPs are presented and the results show a significant genetic variation in expressed sequences in this group of genes. The results can contribute to an improved understanding of the genetics of behaviour.

  18. In-Silico Computing of the Most Deleterious nsSNPs in HBA1 Gene.

    Directory of Open Access Journals (Sweden)

    Sayed AbdulAzeez

    Full Text Available α-Thalassemia (α-thal is a genetic disorder caused by the substitution of single amino acid or large deletions in the HBA1 and/or HBA2 genes.Using modern bioinformatics tools as a systematic in-silico approach to predict the deleterious SNPs in the HBA1 gene and its significant pathogenic impact on the functions and structure of HBA1 protein was predicted.A total of 389 SNPs in HBA1 were retrieved from dbSNP database, which includes: 201 non-coding synonymous (nsSNPs, 43 human active SNPs, 16 intronic SNPs, 11 mRNA 3' UTR SNPs, 9 coding synonymous SNPs, 9 5' UTR SNPs and other types. Structural homology-based method (PolyPhen and sequence homology-based tool (SIFT, SNPs&Go, PROVEAN and PANTHER revealed that 2.4% of the nsSNPs are pathogenic.A total of 5 nsSNPs (G60V, K17M, K17T, L92F and W15R were predicted to be responsible for the structural and functional modifications of HBA1 protein. It is evident from the deep comprehensive in-silico analysis that, two nsSNPs such as G60V and W15R in HBA1 are highly deleterious. These "2 pathogenic nsSNPs" can be considered for wet-lab confirmatory analysis.

  19. A computational prospect to aspirin side effects: aspirin and COX-1 interaction analysis based on non-synonymous SNPs.

    Science.gov (United States)

    Marjan, Mojtabavi Naeini; Hamzeh, Mesrian Tanha; Rahman, Emamzadeh; Sadeq, Vallian

    2014-08-01

    Aspirin (ASA) is a commonly used nonsteroidal anti-inflammatory drug (NSAID), which exerts its therapeutic effects through inhibition of cyclooxygenase (COX) isoform 2 (COX-2), while the inhibition of COX-1 by ASA leads to apparent side effects. In the present study, the relationship between COX-1 non-synonymous single nucleotide polymorphisms (nsSNPs) and aspirin related side effects was investigated. The functional impacts of 37 nsSNPs on aspirin inhibition potency of COX-1 with COX-1/aspirin molecular docking were computationally analyzed, and each SNP was scored based on DOCK Amber score. The data predicted that 22 nsSNPs could reduce COX-1 inhibition, while 15 nsSNPs showed increasing inhibition level in comparison to the regular COX-1 protein. In order to perform a comparing state, the Amber scores for two Arg119 mutants (R119A and R119Q) were also calculated. Moreover, among nsSNP variants, rs117122585 represented the closest Amber score to R119A mutant. A separate docking computation validated the score and represented a new binding position for ASA that acetyl group was located within the distance of 3.86Å from Ser529 OH group. This could predict an associated loss of activity of ASA through this nsSNP variant. Our data represent a computational sub-population pattern for aspirin COX-1 related side effects, and provide basis for further research on COX-1/ASA interaction. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. Three-cohort targeted gene screening reveals a non-synonymous TRKA polymorphism associated with schizophrenia

    DEFF Research Database (Denmark)

    van Schijndel, Jessica E; van Loo, Karen M J; van Zweeden, Martine

    2009-01-01

    selected non-synonymous single-nucleotide polymorphisms (SNPs) in three independent Caucasian schizophrenia case-control cohorts (USA, Denmark and Norway). A meta-analysis revealed ten non-synonymous SNPs that were nominally associated with schizophrenia, nine of which have not been previously linked...... attractive candidate for further study concerns SNP rs6336 (q=0.12) that causes the substitution of an evolutionarily highly conserved amino acid residue in the kinase domain of the neurodevelopmentally important receptor TRKA. Thus, TRKA signaling may represent a novel susceptibility pathway...

  1. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.

    Science.gov (United States)

    Cingolani, Pablo; Platts, Adrian; Wang, Le Lily; Coon, Melissa; Nguyen, Tung; Wang, Luan; Land, Susan J; Lu, Xiangyi; Ruden, Douglas M

    2012-01-01

    We describe a new computer program, SnpEff, for rapidly categorizing the effects of variants in genome sequences. Once a genome is sequenced, SnpEff annotates variants based on their genomic locations and predicts coding effects. Annotated genomic locations include intronic, untranslated region, upstream, downstream, splice site, or intergenic regions. Coding effects such as synonymous or non-synonymous amino acid replacement, start codon gains or losses, stop codon gains or losses, or frame shifts can be predicted. Here the use of SnpEff is illustrated by annotating ~356,660 candidate SNPs in ~117 Mb unique sequences, representing a substitution rate of ~1/305 nucleotides, between the Drosophila melanogaster w(1118); iso-2; iso-3 strain and the reference y(1); cn(1) bw(1) sp(1) strain. We show that ~15,842 SNPs are synonymous and ~4,467 SNPs are non-synonymous (N/S ~0.28). The remaining SNPs are in other categories, such as stop codon gains (38 SNPs), stop codon losses (8 SNPs), and start codon gains (297 SNPs) in the 5'UTR. We found, as expected, that the SNP frequency is proportional to the recombination frequency (i.e., highest in the middle of chromosome arms). We also found that start-gain or stop-lost SNPs in Drosophila melanogaster often result in additions of N-terminal or C-terminal amino acids that are conserved in other Drosophila species. It appears that the 5' and 3' UTRs are reservoirs for genetic variations that changes the termini of proteins during evolution of the Drosophila genus. As genome sequencing is becoming inexpensive and routine, SnpEff enables rapid analyses of whole-genome sequencing data to be performed by an individual laboratory.

  2. Annotating pathogenic non-coding variants in genic regions.

    Science.gov (United States)

    Gelfman, Sahar; Wang, Quanli; McSweeney, K Melodi; Ren, Zhong; La Carpia, Francesca; Halvorsen, Matt; Schoch, Kelly; Ratzon, Fanni; Heinzen, Erin L; Boland, Michael J; Petrovski, Slavé; Goldstein, David B

    2017-08-09

    Identifying the underlying causes of disease requires accurate interpretation of genetic variants. Current methods ineffectively capture pathogenic non-coding variants in genic regions, resulting in overlooking synonymous and intronic variants when searching for disease risk. Here we present the Transcript-inferred Pathogenicity (TraP) score, which uses sequence context alterations to reliably identify non-coding variation that causes disease. High TraP scores single out extremely rare variants with lower minor allele frequencies than missense variants. TraP accurately distinguishes known pathogenic and benign variants in synonymous (AUC = 0.88) and intronic (AUC = 0.83) public datasets, dismissing benign variants with exceptionally high specificity. TraP analysis of 843 exomes from epilepsy family trios identifies synonymous variants in known epilepsy genes, thus pinpointing risk factors of disease from non-coding sequence data. TraP outperforms leading methods in identifying non-coding variants that are pathogenic and is therefore a valuable tool for use in gene discovery and the interpretation of personal genomes.While non-coding synonymous and intronic variants are often not under strong selective constraint, they can be pathogenic through affecting splicing or transcription. Here, the authors develop a score that uses sequence context alterations to predict pathogenicity of synonymous and non-coding genetic variants, and provide a web server of pre-computed scores.

  3. Identification and analysis of Single Nucleotide Polymorphisms (SNPs in the mosquito Anopheles funestus, malaria vector

    Directory of Open Access Journals (Sweden)

    Hemingway Janet

    2007-01-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are the most common source of genetic variation in eukaryotic species and have become an important marker for genetic studies. The mosquito Anopheles funestus is one of the major malaria vectors in Africa and yet, prior to this study, no SNPs have been described for this species. Here we report a genome-wide set of SNP markers for use in genetic studies on this important human disease vector. Results DNA fragments from 50 genes were amplified and sequenced from 21 specimens of An. funestus. A third of specimens were field collected in Malawi, a third from a colony of Mozambican origin and a third form a colony of Angolan origin. A total of 494 SNPs including 303 within the coding regions of genes and 5 indels were identified. The physical positions of these SNPs in the genome are known. There were on average 7 SNPs per kilobase similar to that observed in An. gambiae and Drosophila melanogaster. Transitions outnumbered transversions, at a ratio of 2:1. The increased frequency of transition substitutions in coding regions is likely due to the structure of the genetic code and selective constraints. Synonymous sites within coding regions showed a higher polymorphism rate than non-coding introns or 3' and 5'flanking DNA with most of the substitutions in coding regions being observed at the 3rd codon position. A positive correlation in the level of polymorphism was observed between coding and non-coding regions within a gene. By genotyping a subset of 30 SNPs, we confirmed the validity of the SNPs identified during this study. Conclusion This set of SNP markers represents a useful tool for genetic studies in An. funestus, and will be useful in identifying candidate genes that affect diverse ranges of phenotypes that impact on vector control, such as resistance insecticide, mosquito behavior and vector competence.

  4. Characterization of coding synonymous and non-synonymous variants in ADAMTS13 using ex vivo and in silico approaches.

    Directory of Open Access Journals (Sweden)

    Nathan C Edwards

    Full Text Available Synonymous variations, which are defined as codon substitutions that do not change the encoded amino acid, were previously thought to have no effect on the properties of the synthesized protein(s. However, mounting evidence shows that these "silent" variations can have a significant impact on protein expression and function and should no longer be considered "silent". Here, the effects of six synonymous and six non-synonymous variations, previously found in the gene of ADAMTS13, the von Willebrand Factor (VWF cleaving hemostatic protease, have been investigated using a variety of approaches. The ADAMTS13 mRNA and protein expression levels, as well as the conformation and activity of the variants have been compared to that of wild-type ADAMTS13. Interestingly, not only the non-synonymous variants but also the synonymous variants have been found to change the protein expression levels, conformation and function. Bioinformatic analysis of ADAMTS13 mRNA structure, amino acid conservation and codon usage allowed us to establish correlations between mRNA stability, RSCU, and intracellular protein expression. This study demonstrates that variants and more specifically, synonymous variants can have a substantial and definite effect on ADAMTS13 function and that bioinformatic analysis may allow development of predictive tools to identify variants that will have significant effects on the encoded protein.

  5. Transcriptome-Wide Single Nucleotide Polymorphisms (SNPs for Abalone (Haliotis midae: Validation and Application Using GoldenGate Medium-Throughput Genotyping Assays

    Directory of Open Access Journals (Sweden)

    Rouvay Roodt-Wilding

    2013-09-01

    Full Text Available Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and Single Nucleotide Polymorphisms (SNPs . Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%−69% conversion rate (percentage polymorphic markers with a global genotyping success rate of 76%−85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174 were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50 located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.

  6. Transcriptome-wide single nucleotide polymorphisms (SNPs) for abalone (Haliotis midae): validation and application using GoldenGate medium-throughput genotyping assays.

    Science.gov (United States)

    Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay

    2013-09-23

    Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.

  7. Non-Synonymous Polymorphisms in the FCN1 Gene Determine Ligand-Binding Ability and Serum Levels of M-Ficolin

    DEFF Research Database (Denmark)

    Ammitzbøll, Christian Gytz; Kjær, Troels Rønn; Steffensen, Rudi

    2012-01-01

    The innate immune system encompasses various recognition molecules able to sense both exogenous and endogenous danger signals arising from pathogens or damaged host cells. One such pattern-recognition molecule is M-ficolin, which is capable of activating the complement system through the lectin...... of non-synonymous SNPs on protein function....

  8. Non-synonymous polymorphisms in the P2RX ( 4 ) are related to bone mineral density and osteoporosis risk in a cohort of Dutch fracture patients

    DEFF Research Database (Denmark)

    Wesselius, Anke; Bours, Martijn Jl; Jørgensen, Niklas R

    2013-01-01

    of these two receptors on osteoporosis risk. Patients with fracture (690 females and 231 males, aged ≥50 years) were genotyped for three non-synonymous P2X ( 4 ) R SNPs. Bone mineral density (BMD) was measured at the total hip, lumbar spine, and femoral neck. Subject carrying the variant allele of the Tyr315...... of non-synonymous polymorphisms in the P2RX ( 4 ) and the risk of osteoporosis, suggesting a role of the P2X ( 4 ) R in the regulation of bone mass....

  9. In silico Analysis of the Functional and Structural Impacts of Non-synonymous Single Nucleotide Polymorphisms in the Human Paraxonase 1 Gene

    Directory of Open Access Journals (Sweden)

    Sudip Paul

    2015-09-01

    Full Text Available Computational approaches could help in identifying deleterious non-synonymous single nucleotide polymorphisms (nsSNPs in a disease related gene which is a difficult and laborious task through laboratory experiments. In the present study, we analyzed the impacts of nsSNPs on structure and function of Paraxonase 1 (PON1 using different bioinformatics tools. The human PON1 protein sequence and its corresponding gene's SNP information were collected from UniProt and dbSNP databases, respectively. We utilized SIFT, Polyphen, I-Mutant 2.0, MutPred, SNP and GO, PhD-SNP and PANTHER tools in order to examine the total 39 nsSNPs occurring in the PON1 coding region. We filtered the most pathological mutations by combining the scores of the aforementioned servers and found 8 SNPs (G344C, S302L, W281C, D279Y, H134R, F120S, L90P, C42R as deleterious and disease causing. The PDB structure of PON1 protein was obtained from RCSB Protein Data Bank (PDB ID: 1V04. The deleterious SNPs in native PON1 were introduced using Swiss-PDB Viewer package and changes in free energy were observed for six out of eight mutant structures. Two SNPs, S302L (substitution of serine to leucine at 302 position in amino acid sequence and L90P (substitution of leucine to proline at 90 position in amino acid sequence caused the highest energy increase amongst all. The findings implicate that these nsSNPs would be analyzed further in detail to enumerate their possible association with the protein deteriorating and disease causal potentialities.

  10. A second generation human haplotype map of over 3.1 million SNPs.

    Science.gov (United States)

    Frazer, Kelly A; Ballinger, Dennis G; Cox, David R; Hinds, David A; Stuve, Laura L; Gibbs, Richard A; Belmont, John W; Boudreau, Andrew; Hardenbol, Paul; Leal, Suzanne M; Pasternak, Shiran; Wheeler, David A; Willis, Thomas D; Yu, Fuli; Yang, Huanming; Zeng, Changqing; Gao, Yang; Hu, Haoran; Hu, Weitao; Li, Chaohua; Lin, Wei; Liu, Siqi; Pan, Hao; Tang, Xiaoli; Wang, Jian; Wang, Wei; Yu, Jun; Zhang, Bo; Zhang, Qingrun; Zhao, Hongbin; Zhao, Hui; Zhou, Jun; Gabriel, Stacey B; Barry, Rachel; Blumenstiel, Brendan; Camargo, Amy; Defelice, Matthew; Faggart, Maura; Goyette, Mary; Gupta, Supriya; Moore, Jamie; Nguyen, Huy; Onofrio, Robert C; Parkin, Melissa; Roy, Jessica; Stahl, Erich; Winchester, Ellen; Ziaugra, Liuda; Altshuler, David; Shen, Yan; Yao, Zhijian; Huang, Wei; Chu, Xun; He, Yungang; Jin, Li; Liu, Yangfan; Shen, Yayun; Sun, Weiwei; Wang, Haifeng; Wang, Yi; Wang, Ying; Xiong, Xiaoyan; Xu, Liang; Waye, Mary M Y; Tsui, Stephen K W; Xue, Hong; Wong, J Tze-Fei; Galver, Luana M; Fan, Jian-Bing; Gunderson, Kevin; Murray, Sarah S; Oliphant, Arnold R; Chee, Mark S; Montpetit, Alexandre; Chagnon, Fanny; Ferretti, Vincent; Leboeuf, Martin; Olivier, Jean-François; Phillips, Michael S; Roumy, Stéphanie; Sallée, Clémentine; Verner, Andrei; Hudson, Thomas J; Kwok, Pui-Yan; Cai, Dongmei; Koboldt, Daniel C; Miller, Raymond D; Pawlikowska, Ludmila; Taillon-Miller, Patricia; Xiao, Ming; Tsui, Lap-Chee; Mak, William; Song, You Qiang; Tam, Paul K H; Nakamura, Yusuke; Kawaguchi, Takahisa; Kitamoto, Takuya; Morizono, Takashi; Nagashima, Atsushi; Ohnishi, Yozo; Sekine, Akihiro; Tanaka, Toshihiro; Tsunoda, Tatsuhiko; Deloukas, Panos; Bird, Christine P; Delgado, Marcos; Dermitzakis, Emmanouil T; Gwilliam, Rhian; Hunt, Sarah; Morrison, Jonathan; Powell, Don; Stranger, Barbara E; Whittaker, Pamela; Bentley, David R; Daly, Mark J; de Bakker, Paul I W; Barrett, Jeff; Chretien, Yves R; Maller, Julian; McCarroll, Steve; Patterson, Nick; Pe'er, Itsik; Price, Alkes; Purcell, Shaun; Richter, Daniel J; Sabeti, Pardis; Saxena, Richa; Schaffner, Stephen F; Sham, Pak C; Varilly, Patrick; Altshuler, David; Stein, Lincoln D; Krishnan, Lalitha; Smith, Albert Vernon; Tello-Ruiz, Marcela K; Thorisson, Gudmundur A; Chakravarti, Aravinda; Chen, Peter E; Cutler, David J; Kashuk, Carl S; Lin, Shin; Abecasis, Gonçalo R; Guan, Weihua; Li, Yun; Munro, Heather M; Qin, Zhaohui Steve; Thomas, Daryl J; McVean, Gilean; Auton, Adam; Bottolo, Leonardo; Cardin, Niall; Eyheramendy, Susana; Freeman, Colin; Marchini, Jonathan; Myers, Simon; Spencer, Chris; Stephens, Matthew; Donnelly, Peter; Cardon, Lon R; Clarke, Geraldine; Evans, David M; Morris, Andrew P; Weir, Bruce S; Tsunoda, Tatsuhiko; Mullikin, James C; Sherry, Stephen T; Feolo, Michael; Skol, Andrew; Zhang, Houcan; Zeng, Changqing; Zhao, Hui; Matsuda, Ichiro; Fukushima, Yoshimitsu; Macer, Darryl R; Suda, Eiko; Rotimi, Charles N; Adebamowo, Clement A; Ajayi, Ike; Aniagwu, Toyin; Marshall, Patricia A; Nkwodimmah, Chibuzor; Royal, Charmaine D M; Leppert, Mark F; Dixon, Missy; Peiffer, Andy; Qiu, Renzong; Kent, Alastair; Kato, Kazuto; Niikawa, Norio; Adewole, Isaac F; Knoppers, Bartha M; Foster, Morris W; Clayton, Ellen Wright; Watkin, Jessica; Gibbs, Richard A; Belmont, John W; Muzny, Donna; Nazareth, Lynne; Sodergren, Erica; Weinstock, George M; Wheeler, David A; Yakub, Imtaz; Gabriel, Stacey B; Onofrio, Robert C; Richter, Daniel J; Ziaugra, Liuda; Birren, Bruce W; Daly, Mark J; Altshuler, David; Wilson, Richard K; Fulton, Lucinda L; Rogers, Jane; Burton, John; Carter, Nigel P; Clee, Christopher M; Griffiths, Mark; Jones, Matthew C; McLay, Kirsten; Plumb, Robert W; Ross, Mark T; Sims, Sarah K; Willey, David L; Chen, Zhu; Han, Hua; Kang, Le; Godbout, Martin; Wallenburg, John C; L'Archevêque, Paul; Bellemare, Guy; Saeki, Koji; Wang, Hongguang; An, Daochang; Fu, Hongbo; Li, Qing; Wang, Zhen; Wang, Renwu; Holden, Arthur L; Brooks, Lisa D; McEwen, Jean E; Guyer, Mark S; Wang, Vivian Ota; Peterson, Jane L; Shi, Michael; Spiegel, Jack; Sung, Lawrence M; Zacharia, Lynn F; Collins, Francis S; Kennedy, Karen; Jamieson, Ruth; Stewart, John

    2007-10-18

    We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.

  11. Predicting deleterious nsSNPs: an analysis of sequence and structural attributes

    Directory of Open Access Journals (Sweden)

    Saqi Mansoor AS

    2006-04-01

    Full Text Available Abstract Background There has been an explosion in the number of single nucleotide polymorphisms (SNPs within public databases. In this study we focused on non-synonymous protein coding single nucleotide polymorphisms (nsSNPs, some associated with disease and others which are thought to be neutral. We describe the distribution of both types of nsSNPs using structural and sequence based features and assess the relative value of these attributes as predictors of function using machine learning methods. We also address the common problem of balance within machine learning methods and show the effect of imbalance on nsSNP function prediction. We show that nsSNP function prediction can be significantly improved by 100% undersampling of the majority class. The learnt rules were then applied to make predictions of function on all nsSNPs within Ensembl. Results The measure of prediction success is greatly affected by the level of imbalance in the training dataset. We found the balanced dataset that included all attributes produced the best prediction. The performance as measured by the Matthews correlation coefficient (MCC varied between 0.49 and 0.25 depending on the imbalance. As previously observed, the degree of sequence conservation at the nsSNP position is the single most useful attribute. In addition to conservation, structural predictions made using a balanced dataset can be of value. Conclusion The predictions for all nsSNPs within Ensembl, based on a balanced dataset using all attributes, are available as a DAS annotation. Instructions for adding the track to Ensembl are at http://www.brightstudy.ac.uk/das_help.html

  12. Application of computational algorithms to assess the functionality of non-synonymous substitutions in MHC DRB gene of Nigerian goats

    Directory of Open Access Journals (Sweden)

    Yakubu Abdulmojeed

    2017-01-01

    Full Text Available The Major Histocompatibility Complex (MHC contains highly variable multi-gene families, which play a key role in the adaptive immune response within vertebrates. Among the Capra MHC class II genes, the expressed DRB locus is highly polymorphic, particularly in exon 2, which encodes the antigen-binding site. Models of variable non-synonymous/synonymous rate ratios among sites may provide important insights into functional constraints at different amino acid sites and may be used to detect sites under positive selection. Many non-synonymous single nucleotide polymorphisms (nsSNPs at the DRB locus in goats are suspected to impact protein function. This study, therefore, aimed at comparing the efficiency of six computational approaches to predict the likelihood of a particular non-synonymous (amino acid change coding SNP to cause a functional impact on the protein. This involved the use of PANTHER, SNAP, SIFT, PolyPhen-2, PROVEAN and nsSNPAnalyzer bioinformatics analytical tools in detecting harmful and beneficial effects at H57G, Y89R, V104D and Y112I substitutions in the peptide binding region of the DRB gene of Nigerian goats. The results from PANTHER analysis revealed that H57G, Y89R and Y112I substitutions (Pdeleterious= 0.113, 0.204 and 0.472, respectively were beneficial; while that of V104D was deleterious (Pdeleterious= 0.756, an indication that it was non-neutral. As regards the SNAP approach, H57G and Y89R substitutions were returned neutral with expected accuracy of 53 and 69%, respectively while V104D and Y112I substitutions were harmful. H57G and Y89R substitutions were also found harmless in the SIFT analysis. However, only H57G (PROVEAN and V104D (nsSNPAnalyzer amino acid substitutions were found to be beneficial. Interestingly, the predicted 3D structures of both native and mutant DRB protein appeared similar as validated by Ramachandran plots. The consensus reached by PANTHER, SNAP, SIFT and PolyPhen-2 approaches on the neutrality

  13. Prediction of the damage-associated non-synonymous single nucleotide polymorphisms in the human MC1R gene.

    Science.gov (United States)

    Hepp, Diego; Gonçalves, Gislene Lopes; de Freitas, Thales Renato Ochotorena

    2015-01-01

    The melanocortin 1 receptor (MC1R) is involved in the control of melanogenesis. Polymorphisms in this gene have been associated with variation in skin and hair color and with elevated risk for the development of melanoma. Here we used 11 computational tools based on different approaches to predict the damage-associated non-synonymous single nucleotide polymorphisms (nsSNPs) in the coding region of the human MC1R gene. Among the 92 nsSNPs arranged according to the predictions 62% were classified as damaging in more than five tools. The classification was significantly correlated with the scores of two consensus programs. Alleles associated with the red hair color (RHC) phenotype and with the risk of melanoma were examined. The R variants D84E, R142H, R151C, I155T, R160W and D294H were classified as damaging by the majority of the tools while the r variants V60L, V92M and R163Q have been predicted as neutral in most of the programs The combination of the prediction tools results in 14 nsSNPs indicated as the most damaging mutations in MC1R (L48P, R67W, H70Y, P72L, S83P, R151H, S172I, L206P, T242I, G255R, P256S, C273Y, C289R and R306H); C273Y showed to be highly damaging in SIFT, Polyphen-2, MutPred, PANTHER and PROVEAN scores. The computational analysis proved capable of identifying the potentially damaging nsSNPs in MC1R, which are candidates for further laboratory studies of the functional and pharmacological significance of the alterations in the receptor and the phenotypic outcomes.

  14. Prediction of the damage-associated non-synonymous single nucleotide polymorphisms in the human MC1R gene.

    Directory of Open Access Journals (Sweden)

    Diego Hepp

    Full Text Available The melanocortin 1 receptor (MC1R is involved in the control of melanogenesis. Polymorphisms in this gene have been associated with variation in skin and hair color and with elevated risk for the development of melanoma. Here we used 11 computational tools based on different approaches to predict the damage-associated non-synonymous single nucleotide polymorphisms (nsSNPs in the coding region of the human MC1R gene. Among the 92 nsSNPs arranged according to the predictions 62% were classified as damaging in more than five tools. The classification was significantly correlated with the scores of two consensus programs. Alleles associated with the red hair color (RHC phenotype and with the risk of melanoma were examined. The R variants D84E, R142H, R151C, I155T, R160W and D294H were classified as damaging by the majority of the tools while the r variants V60L, V92M and R163Q have been predicted as neutral in most of the programs The combination of the prediction tools results in 14 nsSNPs indicated as the most damaging mutations in MC1R (L48P, R67W, H70Y, P72L, S83P, R151H, S172I, L206P, T242I, G255R, P256S, C273Y, C289R and R306H; C273Y showed to be highly damaging in SIFT, Polyphen-2, MutPred, PANTHER and PROVEAN scores. The computational analysis proved capable of identifying the potentially damaging nsSNPs in MC1R, which are candidates for further laboratory studies of the functional and pharmacological significance of the alterations in the receptor and the phenotypic outcomes.

  15. Study of five novel non-synonymous polymorphisms in human brain-expressed genes in a Colombian sample.

    Science.gov (United States)

    Ojeda, Diego A; Forero, Diego A

    2014-10-01

    Non-synonymous single nucleotide polymorphisms (nsSNPs) in brain-expressed genes represent interesting candidates for genetic research in neuropsychiatric disorders. To study novel nsSNPs in brain-expressed genes in a sample of Colombian subjects. We applied an approach based on in silico mining of available genomic data to identify and select novel nsSNPs in brain-expressed genes. We developed novel genotyping assays, based in allele-specific PCR methods, for these nsSNPs and genotyped them in 171 Colombian subjects. Five common nsSNPs (rs6855837; p.Leu395Ile, rs2305160; p.Thr394Ala, rs10503929; p.Met289Thr, rs2270641; p.Thr4Pro and rs3822659; p.Ser735Ala) were studied, located in the CLOCK, NPAS2, NRG1, SLC18A1 and WWC1 genes. We reported allele and genotype frequencies in a sample of South American healthy subjects. There is previous experimental evidence, arising from genome-wide expression and association studies, for the involvement of these genes in several neuropsychiatric disorders and endophenotypes, such as schizophrenia, mood disorders or memory performance. Frequencies for these nsSNPSs in the Colombian samples varied in comparison to different HapMap populations. Future study of these nsSNPs in brain-expressed genes, a synaptogenomics approach, will be important for a better understanding of neuropsychiatric diseases and endophenotypes in different populations.

  16. Screening of Missense SNPs in Coding Regions of COX-2 as a Key Enzyme Involved in Cancer

    Directory of Open Access Journals (Sweden)

    Sodabeh Jahanbakhsh-Godehkahriz

    2013-09-01

    Full Text Available Background & Objectives: Non-synonymous single nucleotide polymorphism (nsSNPs which results in disruption of protein function are used as markers in linkage and association of human proteins that might be involved in diseases and cancers .   Methods: To study the functional effect of nsSNP in cyclooxygenase-2 (COX2 amino acids, the nucleotide sequences encoding COX-2 gene in cancers were extracted from the NCBI (gi|223941909 data bank (283 cases and analyzed by SIFT, I-Mutant 2.0, SNP and GO, PANTHER and FASTSNP servers. These servers involve programs that predict the effects of amino acid substitution on protein function, stability and missense .   Results: COX-2 is an essential enzyme for the production of pro-inflammatory prostaglandins which are relevant to cancer development and progression. The substitutions in some positions such as R228H and S428A of COX-2 in most of cancers linked to reformed protein function through disruption in enzyme active site.   Conclusion: Amino acid substitutions as a consequence of COX-2 nsSNPs have important role in human disease. Substitutions which are located in catalytic domain are important for the enzymatic function of COX-2 and associated with higher expression of COX-2.

  17. Prediction by graph theoretic measures of structural effects in proteins arising from non-synonymous single nucleotide polymorphisms.

    Directory of Open Access Journals (Sweden)

    Tammy M K Cheng

    Full Text Available Recent analyses of human genome sequences have given rise to impressive advances in identifying non-synonymous single nucleotide polymorphisms (nsSNPs. By contrast, the annotation of nsSNPs and their links to diseases are progressing at a much slower pace. Many of the current approaches to analysing disease-associated nsSNPs use primarily sequence and evolutionary information, while structural information is relatively less exploited. In order to explore the potential of such information, we developed a structure-based approach, Bongo (Bonds ON Graph, to predict structural effects of nsSNPs. Bongo considers protein structures as residue-residue interaction networks and applies graph theoretical measures to identify the residues that are critical for maintaining structural stability by assessing the consequences on the interaction network of single point mutations. Our results show that Bongo is able to identify mutations that cause both local and global structural effects, with a remarkably low false positive rate. Application of the Bongo method to the prediction of 506 disease-associated nsSNPs resulted in a performance (positive predictive value, PPV, 78.5% similar to that of PolyPhen (PPV, 77.2% and PANTHER (PPV, 72.2%. As the Bongo method is solely structure-based, our results indicate that the structural changes resulting from nsSNPs are closely associated to their pathological consequences.

  18. In silico analysis of single nucleotide polymorphism (SNPs in human β-globin gene.

    Directory of Open Access Journals (Sweden)

    Mohammed Alanazi

    Full Text Available Single amino acid substitutions in the globin chain are the most common forms of genetic variations that produce hemoglobinopathies--the most widespread inherited disorders worldwide. Several hemoglobinopathies result from homozygosity or compound heterozygosity to beta-globin (HBB gene mutations, such as that producing sickle cell hemoglobin (HbS, HbC, HbD and HbE. Several of these mutations are deleterious and result in moderate to severe hemolytic anemia, with associated complications, requiring lifelong care and management. Even though many hemoglobinopathies result from single amino acid changes producing similar structural abnormalities, there are functional differences in the generated variants. Using in silico methods, we examined the genetic variations that can alter the expression and function of the HBB gene. Using a sequence homology-based Sorting Intolerant from Tolerant (SIFT server we have searched for the SNPs, which showed that 200 (80% non-synonymous polymorphism were found to be deleterious. The structure-based method via PolyPhen server indicated that 135 (40% non-synonymous polymorphism may modify protein function and structure. The Pupa Suite software showed that the SNPs will have a phenotypic consequence on the structure and function of the altered protein. Structure analysis was performed on the key mutations that occur in the native protein coded by the HBB gene that causes hemoglobinopathies such as: HbC (E→K, HbD (E→Q, HbE (E→K and HbS (E→V. Atomic Non-Local Environment Assessment (ANOLEA, Yet Another Scientific Artificial Reality Application (YASARA, CHARMM-GUI webserver for macromolecular dynamics and mechanics, and Normal Mode Analysis, Deformation and Refinement (NOMAD-Ref of Gromacs server were used to perform molecular dynamics simulations and energy minimization calculations on β-Chain residue of the HBB gene before and after mutation. Furthermore, in the native and altered protein models, amino acid

  19. In Silico survey of functional coding variants in human AEG-1 gene ...

    African Journals Online (AJOL)

    Background and aims: Non-synonymous (ns)SNPs represent typical genetic variations that may potentially affect the structure or function of expressed proteins and therefore could have an impact on complex disorders. A computational-based (In Silico) analysis has been done to evaluate the phenotypic effect of nsSNPs in ...

  20. Proportionally more deleterious genetic variation in European than in African populations

    DEFF Research Database (Denmark)

    Lohmueller, Kirk E; Indap, Amit R; Schmidt, Steffen

    2008-01-01

    of functional SNPs considered, including synonymous, non-synonymous, predicted 'benign', predicted 'possibly damaging' and predicted 'probably damaging' SNPs. This result is wholly consistent with previous work showing higher overall levels of nucleotide variation in African populations than in Europeans. EA...... individuals, in contrast, have significantly more genotypes homozygous for the derived allele at synonymous and non-synonymous SNPs and for the damaging allele at 'probably damaging' SNPs than AAs do. For SNPs segregating only in one population or the other, the proportion of non-synonymous SNPs...

  1. Prediction of disease causing non-synonymous SNPs by the Artificial Neural Network Predictor NetDiseaseSNP.

    Directory of Open Access Journals (Sweden)

    Morten Bo Johansen

    Full Text Available We have developed a sequence conservation-based artificial neural network predictor called NetDiseaseSNP which classifies nsSNPs as disease-causing or neutral. Our method uses the excellent alignment generation algorithm of SIFT to identify related sequences and a combination of 31 features assessing sequence conservation and the predicted surface accessibility to produce a single score which can be used to rank nsSNPs based on their potential to cause disease. NetDiseaseSNP classifies successfully disease-causing and neutral mutations. In addition, we show that NetDiseaseSNP discriminates cancer driver and passenger mutations satisfactorily. Our method outperforms other state-of-the-art methods on several disease/neutral datasets as well as on cancer driver/passenger mutation datasets and can thus be used to pinpoint and prioritize plausible disease candidates among nsSNPs for further investigation. NetDiseaseSNP is publicly available as an online tool as well as a web service: http://www.cbs.dtu.dk/services/NetDiseaseSNP.

  2. Impact of SNPs on Protein Phosphorylation Status in Rice (Oryza sativa L.

    Directory of Open Access Journals (Sweden)

    Shoukai Lin

    2016-11-01

    Full Text Available Single nucleotide polymorphisms (SNPs are widely used in functional genomics and genetics research work. The high-quality sequence of rice genome has provided a genome-wide SNP and proteome resource. However, the impact of SNPs on protein phosphorylation status in rice is not fully understood. In this paper, we firstly updated rice SNP resource based on the new rice genome Ver. 7.0, then systematically analyzed the potential impact of Non-synonymous SNPs (nsSNPs on the protein phosphorylation status. There were 3,897,312 SNPs in Ver. 7.0 rice genome, among which 9.9% was nsSNPs. Whilst, a total 2,508,261 phosphorylated sites were predicted in rice proteome. Interestingly, we observed that 150,197 (39.1% nsSNPs could influence protein phosphorylation status, among which 52.2% might induce changes of protein kinase (PK types for adjacent phosphorylation sites. We constructed a database, SNP_rice, to deposit the updated rice SNP resource and phosSNPs information. It was freely available to academic researchers at http://bioinformatics.fafu.edu.cn. As a case study, we detected five nsSNPs that potentially influenced heterotrimeric G proteins phosphorylation status in rice, indicating that genetic polymorphisms showed impact on the signal transduction by influencing the phosphorylation status of heterotrimeric G proteins. The results in this work could be a useful resource for future experimental identification and provide interesting information for better rice breeding.

  3. Association of P2Y(2) receptor SNPs with bone mineral density and osteoporosis risk in a cohort of Dutch fracture patients

    DEFF Research Database (Denmark)

    Wesselius, Anke; Bours, Martijn J L; Henriksen, Zanne

    2013-01-01

    The P2Y(2) receptor is a G-protein-coupled receptor with adenosine 5'-triphosphate (and UTP) as natural ligands. It is thought to be involved in bone physiology in an anti-osteogenic manner. As several non-synonymous single nucleotide polymorphisms (SNPs) have been identified within the P2Y(2) re...

  4. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff

    Science.gov (United States)

    Cingolani, Pablo; Platts, Adrian; Wang, Le Lily; Coon, Melissa; Nguyen, Tung; Wang, Luan; Land, Susan J.; Lu, Xiangyi; Ruden, Douglas M.

    2012-01-01

    We describe a new computer program, SnpEff, for rapidly categorizing the effects of variants in genome sequences. Once a genome is sequenced, SnpEff annotates variants based on their genomic locations and predicts coding effects. Annotated genomic locations include intronic, untranslated region, upstream, downstream, splice site, or intergenic regions. Coding effects such as synonymous or non-synonymous amino acid replacement, start codon gains or losses, stop codon gains or losses, or frame shifts can be predicted. Here the use of SnpEff is illustrated by annotating ~356,660 candidate SNPs in ~117 Mb unique sequences, representing a substitution rate of ~1/305 nucleotides, between the Drosophila melanogaster w1118; iso-2; iso-3 strain and the reference y1; cn1 bw1 sp1 strain. We show that ~15,842 SNPs are synonymous and ~4,467 SNPs are non-synonymous (N/S ~0.28). The remaining SNPs are in other categories, such as stop codon gains (38 SNPs), stop codon losses (8 SNPs), and start codon gains (297 SNPs) in the 5′UTR. We found, as expected, that the SNP frequency is proportional to the recombination frequency (i.e., highest in the middle of chromosome arms). We also found that start-gain or stop-lost SNPs in Drosophila melanogaster often result in additions of N-terminal or C-terminal amino acids that are conserved in other Drosophila species. It appears that the 5′ and 3′ UTRs are reservoirs for genetic variations that changes the termini of proteins during evolution of the Drosophila genus. As genome sequencing is becoming inexpensive and routine, SnpEff enables rapid analyses of whole-genome sequencing data to be performed by an individual laboratory. PMID:22728672

  5. Phosphorylation states of cell cycle and DNA repair proteins can be altered by the nsSNPs

    International Nuclear Information System (INIS)

    Savas, Sevtap; Ozcelik, Hilmi

    2005-01-01

    Phosphorylation is a reversible post-translational modification that affects the intrinsic properties of proteins, such as structure and function. Non-synonymous single nucleotide polymorphisms (nsSNPs) result in the substitution of the encoded amino acids and thus are likely to alter the phosphorylation motifs in the proteins. In this study, we used the web-based NetPhos tool to predict candidate nsSNPs that either introduce or remove putative phosphorylation sites in proteins that act in DNA repair and cell cycle pathways. Our results demonstrated that a total of 15 nsSNPs (16.9%) were likely to alter the putative phosphorylation patterns of 14 proteins. Three of these SNPs (CDKN1A-S31R, OGG1-S326C, and XRCC3-T241M) have already found to be associated with altered cancer risk. We believe that this set of nsSNPs constitutes an excellent resource for further molecular and genetic analyses. The novel systematic approach used in this study will accelerate the understanding of how naturally occurring human SNPs may alter protein function through the modification of phosphorylation mechanisms and contribute to disease susceptibility

  6. SNPs in the coding region of the metastasis-inducing gene MACC1 and clinical outcome in colorectal cancer

    Directory of Open Access Journals (Sweden)

    Schmid Felicitas

    2012-07-01

    Full Text Available Abstract Background Colorectal cancer is one of the main cancers in the Western world. About 90% of the deaths arise from formation of distant metastasis. The expression of the newly identified gene metastasis associated in colon cancer 1 (MACC1 is a prognostic indicator for colon cancer metastasis. Here, we analyzed for the first time the impact of single nucleotide polymorphisms (SNPs in the coding region of MACC1 for clinical outcome of colorectal cancer patients. Additionally, we screened met proto-oncogene (Met, the transcriptional target gene of MACC1, for mutations. Methods We sequenced the coding exons of MACC1 in 154 colorectal tumors (stages I, II and III and the crucial exons of Met in 60 colorectal tumors (stages I, II and III. We analyzed the association of MACC1 polymorphisms with clinical data, including metachronous metastasis, UICC stages, tumor invasion, lymph node metastasis and patients’ survival (n = 154, stages I, II and III. Furthermore, we performed biological assays in order to evaluate the functional impact of MACC1 SNPs on the motility of colorectal cancer cells. Results We genotyped three MACC1 SNPs in the coding region. Thirteen % of the tumors had the genotype cg (rs4721888, L31V, 48% a ct genotype (rs975263, S515L and 84% a gc or cc genotype (rs3735615, R804T. We found no association of these SNPs with clinicopathological parameters or with patients’ survival, when analyzing the entire patients’ cohort. An increased risk for a shorter metastasis-free survival of patients with a ct genotype (rs975263 was observed in younger colon cancer patients with stage I or II (P = 0.041, n = 18. In cell culture, MACC1 SNPs did not affect MACC1-induced cell motility and proliferation. Conclusion In summary, the identification of coding MACC1 SNPs in primary colorectal tumors does not improve the prediction for metastasis formation or for patients’ survival compared to MACC1 expression analysis alone. The ct genotype (rs

  7. Prediction of Disease Causing Non-Synonymous SNPs by the Artificial Neural Network Predictor NetDiseaseSNP

    DEFF Research Database (Denmark)

    Johansen, Morten Bo; Gonzalez-Izarzugaza, Jose Maria; Brunak, Søren

    2013-01-01

    We have developed a sequence conservation-based artificial neural network predictor called NetDiseaseSNP which classifies nsSNPs as disease-causing or neutral. Our method uses the excellent alignment generation algorithm of SIFT to identify related sequences and a combination of 31 features...

  8. Effect of two non-synonymous ecto-5'-nucleotidase variants on the genetic architecture of inosine 5'-monophosphate (IMP) and its degradation products in Japanese Black beef.

    Science.gov (United States)

    Uemoto, Yoshinobu; Ohtake, Tsuyoshi; Sasago, Nanae; Takeda, Masayuki; Abe, Tsuyoshi; Sakuma, Hironori; Kojima, Takatoshi; Sasaki, Shinji

    2017-11-13

    Umami is a Japanese term for the fifth basic taste and is an important sensory property of beef palatability. Inosine 5'-monophosphate (IMP) contributes to umami taste in beef. Thus, the overall change in concentration of IMP and its degradation products can potentially affect the beef palatability. In this study, we investigated the genetic architecture of IMP and its degradation products in Japanese Black beef. First, we performed genome-wide association study (GWAS), candidate gene analysis, and functional analysis to detect the causal variants that affect IMP, inosine, and hypoxanthine. Second, we evaluated the allele frequencies in the different breeds, the contribution of genetic variance, and the effect on other economical traits using the detected variants. A total of 574 Japanese Black cattle were genotyped using the Illumina BovineSNP50 BeadChip and were then used for GWAS. The results of GWAS showed that the genome-wide significant single nucleotide polymorphisms (SNPs) on BTA9 were detected for IMP, inosine, and hypoxanthine. The ecto-5'-nucleotidase (NT5E) gene, which encodes the enzyme NT5E for the extracellular degradation of IMP to inosine, was located near the significant region on BTA9. The results of candidate gene analysis and functional analysis showed that two non-synonymous SNPs (c.1318C > T and c.1475 T > A) in NT5E affected the amount of IMP and its degradation products in beef by regulating the enzymatic activity of NT5E. The Q haplotype showed a positive effect on IMP and a negative effect on the enzymatic activity of NT5E in IMP degradation. The two SNPs were under perfect linkage disequilibrium in five different breeds, and different haplotype frequencies were seen among breeds. The two SNPs contribute to about half of the total genetic variance in IMP, and the results of genetic relationship between IMP and its degradation products showed that NT5E affected the overall concentration balance of IMP and its degradation products

  9. Non-Synonymous Single-Nucleotide Polymorphisms and Physical Activity Interactions on Adiposity Parameters in Malaysian Adolescents

    Directory of Open Access Journals (Sweden)

    Nur Lisa Zaharan

    2018-04-01

    Full Text Available BackgroundSeveral non-synonymous single-nucleotide polymorphisms (nsSNPs have been shown to be associated with obesity. Little is known about their associations and interactions with physical activity (PA in relation to adiposity parameters among adolescents in Malaysia.MethodsWe examined whether (a PA and (b selected nsSNPs are associated with adiposity parameters and whether PA interacts with these nsSNPs on these outcomes in adolescents from the Malaysian Health and Adolescents Longitudinal Research Team study (n = 1,151. Body mass indices, waist–hip ratio, and percentage body fat (% BF were obtained. PA was assessed using Physical Activity Questionnaire for Older Children (PAQ-C. Five nsSNPs were included: beta-3 adrenergic receptor (ADRB3 rs4994, FABP2 rs1799883, GHRL rs696217, MC3R rs3827103, and vitamin D receptor rs2228570, individually and as combined genetic risk score (GRS. Associations and interactions between nsSNPs and PAQ-C scores were examined using generalized linear model.ResultsPAQ-C scores were associated with % BF (β = −0.44 [95% confidence interval −0.72, −0.16], p = 0.002. The CC genotype of ADRB3 rs4994 (β = −0.16 [−0.28, −0.05], corrected p = 0.01 and AA genotype of MC3R rs3827103 (β = −0.06 [−0.12, −0.00], p = 0.02 were significantly associated with % BF compared to TT and GG genotypes, respectively. Significant interactions with PA were found between ADRB3 rs4994 (β = −0.05 [−0.10, −0.01], p = 0.02 and combined GRS (β = −0.03 [−0.04, −0.01], p = 0.01 for % BF.ConclusionHigher PA score was associated with reduced % BF in Malaysian adolescents. Of the nsSNPs, ADRB3 rs4994 and MC3R rs3827103 were associated with % BF. Significant interactions with PA were found for ADRB3 rs4994 and combined GRS on % BF but not on measurements of weight or circumferences. Targeting body fat represent prospects for molecular studies and lifestyle intervention

  10. Non-Synonymous Single-Nucleotide Polymorphisms and Physical Activity Interactions on Adiposity Parameters in Malaysian Adolescents.

    Science.gov (United States)

    Zaharan, Nur Lisa; Muhamad, Nor Hanisah; Jalaludin, Muhammad Yazid; Su, Tin Tin; Mohamed, Zahurin; Mohamed, M N A; A Majid, Hazreen

    2018-01-01

    Several non-synonymous single-nucleotide polymorphisms (nsSNPs) have been shown to be associated with obesity. Little is known about their associations and interactions with physical activity (PA) in relation to adiposity parameters among adolescents in Malaysia. We examined whether (a) PA and (b) selected nsSNPs are associated with adiposity parameters and whether PA interacts with these nsSNPs on these outcomes in adolescents from the Malaysian Health and Adolescents Longitudinal Research Team study ( n  = 1,151). Body mass indices, waist-hip ratio, and percentage body fat (% BF) were obtained. PA was assessed using Physical Activity Questionnaire for Older Children (PAQ-C). Five nsSNPs were included: beta-3 adrenergic receptor (ADRB3) rs4994, FABP2 rs1799883, GHRL rs696217, MC3R rs3827103, and vitamin D receptor rs2228570, individually and as combined genetic risk score (GRS). Associations and interactions between nsSNPs and PAQ-C scores were examined using generalized linear model. PAQ-C scores were associated with % BF (β = -0.44 [95% confidence interval -0.72, -0.16], p  = 0.002). The CC genotype of ADRB3 rs4994 (β = -0.16 [-0.28, -0.05], corrected p  = 0.01) and AA genotype of MC3R rs3827103 (β = -0.06 [-0.12, -0.00], p  = 0.02) were significantly associated with % BF compared to TT and GG genotypes, respectively. Significant interactions with PA were found between ADRB3 rs4994 (β = -0.05 [-0.10, -0.01], p  = 0.02) and combined GRS (β = -0.03 [-0.04, -0.01], p  = 0.01) for % BF. Higher PA score was associated with reduced % BF in Malaysian adolescents. Of the nsSNPs, ADRB3 rs4994 and MC3R rs3827103 were associated with % BF. Significant interactions with PA were found for ADRB3 rs4994 and combined GRS on % BF but not on measurements of weight or circumferences. Targeting body fat represent prospects for molecular studies and lifestyle intervention in this population.

  11. Strand bias in complementary single-nucleotide polymorphisms of transcribed human sequences: evidence for functional effects of synonymous polymorphisms

    Directory of Open Access Journals (Sweden)

    Majewski Jacek

    2006-08-01

    Full Text Available Abstract Background Complementary single-nucleotide polymorphisms (SNPs may not be distributed equally between two DNA strands if the strands are functionally distinct, such as in transcribed genes. In introns, an excess of A↔G over the complementary C↔T substitutions had previously been found and attributed to transcription-coupled repair (TCR, demonstrating the valuable functional clues that can be obtained by studying such asymmetry. Here we studied asymmetry of human synonymous SNPs (sSNPs in the fourfold degenerate (FFD sites as compared to intronic SNPs (iSNPs. Results The identities of the ancestral bases and the direction of mutations were inferred from human-chimpanzee genomic alignment. After correction for background nucleotide composition, excess of A→G over the complementary T→C polymorphisms, which was observed previously and can be explained by TCR, was confirmed in FFD SNPs and iSNPs. However, when SNPs were separately examined according to whether they mapped to a CpG dinucleotide or not, an excess of C→T over G→A polymorphisms was found in non-CpG site FFD SNPs but was absent from iSNPs and CpG site FFD SNPs. Conclusion The genome-wide discrepancy of human FFD SNPs provides novel evidence for widespread selective pressure due to functional effects of sSNPs. The similar asymmetry pattern of FFD SNPs and iSNPs that map to a CpG can be explained by transcription-coupled mechanisms, including TCR and transcription-coupled mutation. Because of the hypermutability of CpG sites, more CpG site FFD SNPs are relatively younger and have confronted less selection effect than non-CpG FFD SNPs, which can explain the asymmetric discrepancy of CpG site FFD SNPs vs. non-CpG site FFD SNPs.

  12. In silico analysis of SNPs of SYK gene Involved in Oral Cancer

    Directory of Open Access Journals (Sweden)

    Sarita Swain

    2017-12-01

    Full Text Available Oral cancer is the sixth most common cancer in the world. Oral cancer is the cancer of the oral cavity and pharynx, including cancer of the lip, tongue, salivary glands, gum, floor and other areas of the mouth. The aim of the study is to identify SNPs using dbSNP and predict the effect of mutation using Predict SNP. The association of genes is done by STRING. The disease and drugs associated with the genes are obtained from Webgestalt. The prediction of binding site is done by CASTp. The interaction of ligand and protein is done by using Autodock and Visualised through Discovery studio, pymol, Ligplot. From this report we found that oral cancer differs from person to person based on their genes and genetic interactions and expressions which recommend the clinicians to go for personalized medicine rather that generalized medicine for the patients with oral cancer. Seeking the importance of genetic background of oral cancer patients further studies can be done by mining of non-synonymous SNPs associated with genes for causing oral cancer.

  13. Tuning protein expression using synonymous codon libraries targeted to the 5' mRNA coding region

    DEFF Research Database (Denmark)

    Goltermann, Lise; Borch Jensen, Martin; Bentin, Thomas

    2011-01-01

    intermediate expression levels of green fluorescent protein in Escherichia coli. At least in one case, no apparent effect on protein stability was observed, pointing to RNA level effects as the principal reason for the observed expression differences. Targeting a synonymous codon library to the 5' coding...

  14. Is really endogenous ghrelin a hunger signal in chickens? Association of GHSR SNPs with increase appetite, growth traits, expression and serum level of GHRL, and GH.

    Science.gov (United States)

    El-Magd, Mohammed Abu; Saleh, Ayman A; Abdel-Hamid, Tamer M; Saleh, Rasha M; Afifi, Mohammed A

    2016-10-01

    Chicken growth hormone secretagogue receptor (GHSR) is a receptor for ghrelin (GHRL), a peptide hormone produced by chicken proventriculus, which stimulates growth hormone (GH) release and food intake. The purpose of this study was to search for single nucleotide polymorphisms (SNPs) in exon 2 of GHSR gene and to analyze their effect on the appetite, growth traits and expression levels of GHSR, GHRL, and GH genes as well as serum levels of GH and GHRL in Mandara chicken. Two adjacent SNPs, A239G and G244A, were detected in exon 2 of GHSR gene. G244A SNP was non-synonymous mutation and led to replacement of lysine amino acid (aa) by arginine aa, while A239G SNP was synonymous mutation. The combined genotypes of A239G and G244A SNPs produced three haplotypes; GG/GG, GG/AG, AG/AG, which associated significantly (P4 to 16w. Chickens with the homozygous GG/GG haplotype showed higher growth performance than other chickens. The two SNPs were also correlated with mRNA levels of GHSR and GH (in pituitary gland), and GHRL (in proventriculus and hypothalamus) as well as with serum level of GH and GHRL. Also, chickens with GG/GG haplotype showed higher mRNA and serum levels. This is the first study to demonstrate that SNPs in GHSR can increase appetite, growth traits, expression and level of GHRL, suggesting a hunger signal role for endogenous GHRL. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. Clustered coding variants in the glutamate receptor complexes of individuals with schizophrenia and bipolar disorder.

    Directory of Open Access Journals (Sweden)

    René A W Frank

    2011-04-01

    Full Text Available Current models of schizophrenia and bipolar disorder implicate multiple genes, however their biological relationships remain elusive. To test the genetic role of glutamate receptors and their interacting scaffold proteins, the exons of ten glutamatergic 'hub' genes in 1304 individuals were re-sequenced in case and control samples. No significant difference in the overall number of non-synonymous single nucleotide polymorphisms (nsSNPs was observed between cases and controls. However, cluster analysis of nsSNPs identified two exons encoding the cysteine-rich domain and first transmembrane helix of GRM1 as a risk locus with five mutations highly enriched within these domains. A new splice variant lacking the transmembrane GPCR domain of GRM1 was discovered in the human brain and the GRM1 mutation cluster could perturb the regulation of this variant. The predicted effect on individuals harbouring multiple mutations distributed in their ten hub genes was also examined. Diseased individuals possessed an increased load of deleteriousness from multiple concurrent rare and common coding variants. Together, these data suggest a disease model in which the interplay of compound genetic coding variants, distributed among glutamate receptors and their interacting proteins, contribute to the pathogenesis of schizophrenia and bipolar disorders.

  16. Potentially functional SNPs (pfSNPs as novel genomic predictors of 5-FU response in metastatic colorectal cancer patients.

    Directory of Open Access Journals (Sweden)

    Jingbo Wang

    Full Text Available 5-Fluorouracil (5-FU and its pro-drug Capecitabine have been widely used in treating colorectal cancer. However, not all patients will respond to the drug, hence there is a need to develop reliable early predictive biomarkers for 5-FU response. Here, we report a novel potentially functional Single Nucleotide Polymorphism (pfSNP approach to identify SNPs that may serve as predictive biomarkers of response to 5-FU in Chinese metastatic colorectal cancer (CRC patients. 1547 pfSNPs and one variable number tandem repeat (VNTR in 139 genes in 5-FU drug (both PK and PD pathway and colorectal cancer disease pathways were examined in 2 groups of CRC patients. Shrinkage of liver metastasis measured by RECIST criteria was used as the clinical end point. Four non-responder-specific pfSNPs were found to account for 37.5% of all non-responders (P<0.0003. Five additional pfSNPs were identified from a multivariate model (AUC under ROC = 0.875 that was applied for all other pfSNPs, excluding the non-responder-specific pfSNPs. These pfSNPs, which can differentiate the other non-responders from responders, mainly reside in tumor suppressor genes or genes implicated in colorectal cancer risk. Hence, a total of 9 novel SNPs with potential functional significance may be able to distinguish non-responders from responders to 5-FU. These pfSNPs may be useful biomarkers for predicting response to 5-FU.

  17. regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.

    Science.gov (United States)

    Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong

    2017-09-01

    While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.

  18. In silico screening, genotyping, molecular dynamics simulation and activity studies of SNPs in pyruvate kinase M2.

    Directory of Open Access Journals (Sweden)

    Ponnusamy Kalaiarasan

    Full Text Available Role of, 29-non-synonymous, 15-intronic, 3-close to UTR, single nucleotide polymorphisms (SNPs and 2 mutations of Human Pyruvate Kinase (PK M2 were investigated by in-silico and in-vitro functional studies. Prediction of deleterious substitutions based on sequence homology and structure based servers, SIFT, PANTHER, SNPs&GO, PhD-SNP, SNAP and PolyPhen, depicted that 19% emerged common between all the mentioned programs. SNPeffect and HOPE showed three substitutions (C31F, Q310P and S437Y in-silico as deleterious and functionally important. In-vitro activity assays showed C31F and S437Y variants of PKM2 with reduced activity, while Q310P variant was catalytically inactive. The allosteric activation due to binding of fructose 1-6 bisphosphate (FBP was compromised in case of S437Y nsSNP variant protein. This was corroborated through molecular dynamics (MD simulation study, which was also carried out in other two variant proteins. The 5 intronic SNPs of PKM2, associated with sporadic breast cancer in a case-control study, when subjected to different computational analyses, indicated that 3 SNPs (rs2856929, rs8192381 and rs8192431 could generate an alternative transcript by influencing splicing factor binding to PKM2. We propose that these, potentially functional and important variations, both within exons and introns, could have a bearing on cancer metabolism, since PKM2 has been implicated in cancer in the recent past.

  19. Single Nucleotide Polymorphisms of Gene and Association with Non-specific Digestive Disorder in Rabbit

    Directory of Open Access Journals (Sweden)

    Yun-Fu Liu

    2013-08-01

    Full Text Available The NLRP12 (NLR family, pyrin domain containing 12 serves as a suppressor factor in the inflammatory response and protects the host against inflammation-induced damage. In the present study, we aimed to study the polymorphisms of NLRP12 gene and its association with susceptibility to non-specific digestive disorder (NSDD in rabbits. We re-sequenced the entire coding region of the rabbit NLRP12 gene and detected a total of 19 SNPs containing 14 synonymous and five non-synonymous variations. Among them, the coding SNP (c.1682A>G, which would carry a potential functional implication, was subsequently subjected to genotyping for case-control association study (272 cases and 267 controls. The results revealed that allele A was significantly protective against NSDD with an odds ratio value of 0.884 (95% confidence interval, 0.788 to 0.993; p = 0.038. We also experimentally induced NSDD in growing rabbits by feeding a fibre-deficient diet and subsequently investigated NLRP12 mRNA expression. The mRNA expression of NLRP12 in healthy status was significantly higher than that in severe NSDD (p = 0.0016. The highest expression was observed in individuals carrying the protective genotype AA (p = 0.0108. These results suggested that NLRP12 was significantly associated with the NSDD in rabbits. However, the precise molecular mechanism of NLRP12 involving in the development of rabbit NSDD requires further research.

  20. SNiPlay: a web-based tool for detection, management and analysis of SNPs. Application to grapevine diversity projects.

    Science.gov (United States)

    Dereeper, Alexis; Nicolas, Stéphane; Le Cunff, Loïc; Bacilieri, Roberto; Doligez, Agnès; Peros, Jean-Pierre; Ruiz, Manuel; This, Patrice

    2011-05-05

    High-throughput re-sequencing, new genotyping technologies and the availability of reference genomes allow the extensive characterization of Single Nucleotide Polymorphisms (SNPs) and insertion/deletion events (indels) in many plant species. The rapidly increasing amount of re-sequencing and genotyping data generated by large-scale genetic diversity projects requires the development of integrated bioinformatics tools able to efficiently manage, analyze, and combine these genetic data with genome structure and external data. In this context, we developed SNiPlay, a flexible, user-friendly and integrative web-based tool dedicated to polymorphism discovery and analysis. It integrates:1) a pipeline, freely accessible through the internet, combining existing softwares with new tools to detect SNPs and to compute different types of statistical indices and graphical layouts for SNP data. From standard sequence alignments, genotyping data or Sanger sequencing traces given as input, SNiPlay detects SNPs and indels events and outputs submission files for the design of Illumina's SNP chips. Subsequently, it sends sequences and genotyping data into a series of modules in charge of various processes: physical mapping to a reference genome, annotation (genomic position, intron/exon location, synonymous/non-synonymous substitutions), SNP frequency determination in user-defined groups, haplotype reconstruction and network, linkage disequilibrium evaluation, and diversity analysis (Pi, Watterson's Theta, Tajima's D).Furthermore, the pipeline allows the use of external data (such as phenotype, geographic origin, taxa, stratification) to define groups and compare statistical indices.2) a database storing polymorphisms, genotyping data and grapevine sequences released by public and private projects. It allows the user to retrieve SNPs using various filters (such as genomic position, missing data, polymorphism type, allele frequency), to compare SNP patterns between populations, and to

  1. Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus).

    Science.gov (United States)

    Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

    2015-07-27

    Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1-8) were identified and genotyped via direct sequencing covering most of the coding region and 3'UTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3'UTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs.

  2. Evaluation of novel SNPs and haplotypes within the ATBF1 gene and their effects on economically important production traits in cattle

    Directory of Open Access Journals (Sweden)

    H. Xu

    2017-08-01

    Full Text Available AT motif binding factor 1 (ATBF1 gene can promote the expression level of the growth hormone 1 (GH1 gene by binding to the enhancers of the POU1F1 and PROP1 genes; thus, it affects the growth and development of livestock. Considering that the ATBF1 gene also has a close relationship with the Janus kinase–signal transductor and activator of transcription (JAK–STAT pathway, the objective of this work was to identify novel single-nucleotide polymorphism (SNP variations and their association with growth traits in native Chinese cattle breeds. Five novel SNPs within the ATBF1 gene were found in 644 Qinchuan and Jinnan cattle for first time using 25 pairs of screening and genotyping primers. The five novel SNPs were named as AC_000175:g.140344C>G (SNP1, g.146573T>C (SNP2, g.205468C>T (SNP3, g.205575A>G (SNP4 and g.297690Csynonymous coding SNPs, while SNP5 was a missense coding SNP, and the other SNPs were intronic. Haplotype analysis found 18 haplotypes in the two breeds, and three and five closely linked loci were revealed in Qinchuan and Jinnan breeds, respectively. Association analysis revealed that SNP1 was significantly associated with the height across the hip in Qinchuan cattle. SNP2 was found to be significantly related to chest circumference and body side length traits in Jinnan cattle. SNP3 was found to have significant associations with four growth traits in Qinchuan cattle. Moreover, the different combined genotypes, SNP1–SNP3, SNP1–SNP4 and SNP2–SNP5 were significantly associated with the growth traits in cattle. These findings indicated that the bovine ATBF1 gene had marked effects on growth traits, and the growth-trait-related loci can be used as DNA markers for maker-assisted selection (MAS breeding programs in cattle.

  3. Enrichment of risk SNPs in regulatory regions implicate diverse tissues in Parkinson's disease etiology.

    Science.gov (United States)

    Coetzee, Simon G; Pierce, Steven; Brundin, Patrik; Brundin, Lena; Hazelett, Dennis J; Coetzee, Gerhard A

    2016-07-27

    Recent genome-wide association studies (GWAS) of Parkinson's disease (PD) revealed at least 26 risk loci, with associated single nucleotide polymorphisms (SNPs) located in non-coding DNA having unknown functions in risk. In order to explore in which cell types these SNPs (and their correlated surrogates at r(2) ≥ 0.8) could alter cellular function, we assessed their location overlap with histone modification regions that indicate transcription regulation in 77 diverse cell types. We found statistically significant enrichment of risk SNPs at 12 loci in active enhancers or promoters. We investigated 4 risk loci in depth that were most significantly enriched (-logeP > 14) and contained 8 putative enhancers in the different cell types. These enriched loci, along with eQTL associations, were unexpectedly present in non-neuronal cell types. These included lymphocytes, mesendoderm, liver- and fat-cells, indicating that cell types outside the brain are involved in the genetic predisposition to PD. Annotating regulatory risk regions within specific cell types may unravel new putative risk mechanisms and molecular pathways that contribute to PD development.

  4. Exploration of structural stability in deleterious nsSNPs of the XPA gene: A molecular dynamics approach

    Directory of Open Access Journals (Sweden)

    N NagaSundaram

    2011-01-01

    Full Text Available Background: Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this approach, we have used the existing in silico methods to explore the mutation-structure-function relationship in the XPA gene. Materials and Methods: We used the Sorting Intolerant From Tolerant (SIFT, Polymorphism Phenotyping (PolyPhen, I-Mutant 2.0, and the Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function and evaluated the impact of mutation on protein stability by Molecular Dynamics simulations. Results: By comparing the scores of all the four in silico methods, nsSNP with an ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our Molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPA gene. Conclusion: Based on the in silico methods score, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that deleterious nsSNP at position C108F would play a significant role in causing disease by the XPA gene. Our approach would present the application of in silico tools in understanding the functional variation from the perspective of structure, evolution, and phenotype.

  5. Synonymous codon bias and functional constraint on GC3-related DNA backbone dynamics in the prokaryotic nucleoid.

    Science.gov (United States)

    Babbitt, Gregory A; Alawad, Mohammed A; Schulze, Katharina V; Hudson, André O

    2014-01-01

    While mRNA stability has been demonstrated to control rates of translation, generating both global and local synonymous codon biases in many unicellular organisms, this explanation cannot adequately explain why codon bias strongly tracks neighboring intergene GC content; suggesting that structural dynamics of DNA might also influence codon choice. Because minor groove width is highly governed by 3-base periodicity in GC, the existence of triplet-based codons might imply a functional role for the optimization of local DNA molecular dynamics via GC content at synonymous sites (≈GC3). We confirm a strong association between GC3-related intrinsic DNA flexibility and codon bias across 24 different prokaryotic multiple whole-genome alignments. We develop a novel test of natural selection targeting synonymous sites and demonstrate that GC3-related DNA backbone dynamics have been subject to moderate selective pressure, perhaps contributing to our observation that many genes possess extreme DNA backbone dynamics for their given protein space. This dual function of codons may impose universal functional constraints affecting the evolution of synonymous and non-synonymous sites. We propose that synonymous sites may have evolved as an 'accessory' during an early expansion of a primordial genetic code, allowing for multiplexed protein coding and structural dynamic information within the same molecular context. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Synonymous genes explore different evolutionary landscapes.

    Directory of Open Access Journals (Sweden)

    Guillaume Cambray

    2008-11-01

    Full Text Available The evolutionary potential of a gene is constrained not only by the amino acid sequence of its product, but by its DNA sequence as well. The topology of the genetic code is such that half of the amino acids exhibit synonymous codons that can reach different subsets of amino acids from each other through single mutation. Thus, synonymous DNA sequences should access different regions of the protein sequence space through a limited number of mutations, and this may deeply influence the evolution of natural proteins. Here, we demonstrate that this feature can be of value for manipulating protein evolvability. We designed an algorithm that, starting from an input gene, constructs a synonymous sequence that systematically includes the codons with the most different evolutionary perspectives; i.e., codons that maximize accessibility to amino acids previously unreachable from the template by point mutation. A synonymous version of a bacterial antibiotic resistance gene was computed and synthesized. When concurrently submitted to identical directed evolution protocols, both the wild type and the recoded sequence led to the isolation of specific, advantageous phenotypic variants. Simulations based on a mutation isolated only from the synthetic gene libraries were conducted to assess the impact of sub-functional selective constraints, such as codon usage, on natural adaptation. Our data demonstrate that rational design of synonymous synthetic genes stands as an affordable improvement to any directed evolution protocol. We show that using two synonymous DNA sequences improves the overall yield of the procedure by increasing the diversity of mutants generated. These results provide conclusive evidence that synonymous coding sequences do experience different areas of the corresponding protein adaptive landscape, and that a sequence's codon usage effectively constrains the evolution of the encoded protein.

  7. Enrichment of risk SNPs in regulatory regions implicate diverse tissues in Parkinson’s disease etiology

    Science.gov (United States)

    Coetzee, Simon G.; Pierce, Steven; Brundin, Patrik; Brundin, Lena; Hazelett, Dennis J.; Coetzee, Gerhard A.

    2016-01-01

    Recent genome-wide association studies (GWAS) of Parkinson’s disease (PD) revealed at least 26 risk loci, with associated single nucleotide polymorphisms (SNPs) located in non-coding DNA having unknown functions in risk. In order to explore in which cell types these SNPs (and their correlated surrogates at r2 ≥ 0.8) could alter cellular function, we assessed their location overlap with histone modification regions that indicate transcription regulation in 77 diverse cell types. We found statistically significant enrichment of risk SNPs at 12 loci in active enhancers or promoters. We investigated 4 risk loci in depth that were most significantly enriched (−logeP > 14) and contained 8 putative enhancers in the different cell types. These enriched loci, along with eQTL associations, were unexpectedly present in non-neuronal cell types. These included lymphocytes, mesendoderm, liver- and fat-cells, indicating that cell types outside the brain are involved in the genetic predisposition to PD. Annotating regulatory risk regions within specific cell types may unravel new putative risk mechanisms and molecular pathways that contribute to PD development. PMID:27461410

  8. High-confidence assessment of functional impact of human mitochondrial non-synonymous genome variations by APOGEE.

    Directory of Open Access Journals (Sweden)

    Stefano Castellana

    2017-06-01

    Full Text Available 24,189 are all the possible non-synonymous amino acid changes potentially affecting the human mitochondrial DNA. Only a tiny subset was functionally evaluated with certainty so far, while the pathogenicity of the vast majority was only assessed in-silico by software predictors. Since these tools proved to be rather incongruent, we have designed and implemented APOGEE, a machine-learning algorithm that outperforms all existing prediction methods in estimating the harmfulness of mitochondrial non-synonymous genome variations. We provide a detailed description of the underlying algorithm, of the selected and manually curated training and test sets of variants, as well as of its classification ability.

  9. Investigation of Polymorphisms in Coding Region of OsHKT1 in Relation to Salinity in Rice

    Directory of Open Access Journals (Sweden)

    Pham Quynh-Hoa

    2016-11-01

    Full Text Available Rice (Oryza sativa is sensitive to salinity, but the salt tolerance level differs among cultivars, which might result from natural variations in the genes that are responsible for salt tolerance. High-affinity potassium transporter (HKTs has been proven to be involved in salt tolerance in plants. Therefore, we screened for natural nucleotide polymorphism in the coding sequence of OsHKT1, which encodes the HKT protein in eight Vietnamese rice cultivars differing in salt tolerance level. In total, seven nucleotide substitutions in coding sequence of OsHKT1 were found, including two non-synonymous and five synonymous substitutions. Further analysis revealed that these two non-synonymous nucleotide substitutions (G50T and T1209A caused changes in amino acids (Gly17Val and Asp403Glu at signal peptide and the loop of the sixth transmembrane domain, respectively. To assess the potential effect of these substitutions on the protein function, the 3D structure of HKT protein variants was modelled by using PHYRE2 webserver. The results showed that no difference was observed when compared those predicted 3D structure of HKT protein variants with each other. In addition, the codon bias of synonymous substitutions cannot clearly show correlation with salt tolerance level. It might be interesting to further investigate the functional roles of detected non-synonymous substitutions as it might correlate to salt tolerance in rice.

  10. Contrasting association of a non-synonymous leptin receptor gene polymorphism with Wegener's granulomatosis and Churg-Strauss syndrome.

    Science.gov (United States)

    Wieczorek, Stefan; Holle, Julia U; Bremer, Jan P; Wibisono, David; Moosig, Frank; Fricke, Harald; Assmann, Gunter; Harper, Lorraine; Arning, Larissa; Gross, Wolfgang L; Epplen, Joerg T

    2010-05-01

    There is evidence that the leptin/ghrelin system is involved in T-cell regulation and plays a role in (auto)immune disorders such as SLE, RA and ANCA-associated vasculitides (AAVs). Here, we evaluate the genetic background of this system in WG. We screened variations in the genes encoding leptin, ghrelin and their receptors, the leptin receptor (LEPR) and the growth hormone secretagogue receptor (GHSR). Three single nucleotide polymorphisms (SNPs) in each gene region were analysed in 460 German WG cases and 878 ethnically matched healthy controls. A three-SNP haplotype of GHSR was significantly associated with WG [P = 0.0067; corrected P-value (P(c)) = 0.026; odds ratio (OR) = 1.30; 95% CI 1.08, 1.57], as was one non-synonymous SNP in LEPR (Lys656Asn, P = 0.0034; P(c) = 0.013; OR = 0.72; 95% CI 0.58, 0.90). These four SNPs were re-analysed in independent cohorts of 226 German WG cases and 519 controls. While the GHSR association was not confirmed, allele frequencies of the LEPR SNP were virtually identical to those from the initial cohorts. Analysis of this SNP in the combined WG and control panels revealed a significant association of the LEPR 656Lys allele with WG (P = 0.00032; P(c) = 0.0013; OR = 0.72; 95% CI 0.60, 0.86). Remarkably, the Lys656Asn SNP showed contrasting allele distribution in two cohorts of 108 and 88 German cases diagnosed with Churg-Strauss syndrome (CSS, combined P = 0.0067; OR = 1.41; 95% CI 1.10, 1.81), whereas identical allele frequencies were revealed when comparing British WG and microscopic polyangiitis cases. While GHSR has to be further evaluated, these data provide profound evidence for an association of the LEPR Lys656Asn SNP with AAV, resulting in opposing effects in WG and CSS.

  11. Functional non-synonymous variants of ABCG2 and gout risk.

    Science.gov (United States)

    Stiburkova, Blanka; Pavelcova, Katerina; Zavada, Jakub; Petru, Lenka; Simek, Pavel; Cepek, Pavel; Pavlikova, Marketa; Matsuo, Hirotaka; Merriman, Tony R; Pavelka, Karel

    2017-11-01

    Common dysfunctional variants of ATP binding cassette subfamily G member 2 (Junior blood group) (ABCG2), a high-capacity urate transporter gene, that result in decreased urate excretion are major causes of hyperuricemia and gout. In the present study, our objective was to determine the frequency and effect on gout of common and rare non-synonymous and other functional allelic variants in the ABCG2 gene. The main cohort recruited from the Czech Republic consisted of 145 gout patients; 115 normouricaemic controls were used for comparison. We amplified, directly sequenced and analysed 15 ABCG2 exons. The associations between genetic variants and clinical phenotype were analysed using the t-test, Fisher's exact test and a logistic and linear regression approach. Data from a New Zealand Polynesian sample set and the UK Biobank were included for the p.V12M analysis. In the ABCG2 gene, 18 intronic (one dysfunctional splicing) and 11 exonic variants were detected: 9 were non-synonymous (2 common, 7 rare including 1 novel), namely p.V12M, p.Q141K, p.R147W, p.T153M, p.F373C, p.T434M, p.S476P, p.D620N and p.K360del. The p.Q141K (rs2231142) variant had a significantly higher minor allele frequency (0.23) in the gout patients compared with the European-origin population (0.09) and was significantly more common among gout patients than among normouricaemic controls (odds ratio = 3.26, P gout (42 vs 48 years, P = 0.0143) and a greater likelihood of a familial history of gout (41% vs 27%, odds ratio = 1.96, P = 0.053). In a meta-analysis p.V12M exerted a protective effect from gout (P gout. Non-synonymous allelic variants of ABCG2 had a significant effect on earlier onset of gout and the presence of a familial gout history. ABCG2 should thus be considered a common and significant risk factor for gout. © The Author 2017. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  12. Coding variants in NOD-like receptors: An association study on risk and survival of colorectal cancer.

    Science.gov (United States)

    Huhn, Stefanie; da Silva Filho, Miguel I; Sanmuganantham, Tharmila; Pichulik, Tica; Catalano, Calogerina; Pardini, Barbara; Naccarati, Alessio; Polakova-Vymetálkova, Veronika; Jiraskova, Katerina; Vodickova, Ludmila; Vodicka, Pavel; Löffler, Markus W; Courth, Lioba; Wehkamp, Jan; Din, Farhat V N; Timofeeva, Maria; Farrington, Susan M; Jansen, Lina; Hemminki, Kari; Chang-Claude, Jenny; Brenner, Hermann; Hoffmeister, Michael; Dunlop, Malcolm G; Weber, Alexander N R; Försti, Asta

    2018-01-01

    Nod-like receptors (NLRs) are important innate pattern recognition receptors and regulators of inflammation or play a role during development. We systematically analysed 41 non-synonymous single nucleotide polymorphisms (SNPs) in 21 NLR genes in a Czech discovery cohort of sporadic colorectal cancer (CRC) (1237 cases, 787 controls) for their association with CRC risk and survival. Five SNPs were found to be associated with CRC risk and eight with survival at 5% significance level. In a replication analysis using data of two large genome-wide association studies (GWASs) from Germany (DACHS: 1798 cases and 1810 controls) and Scotland (2210 cases and 9350 controls) the associations found in the Czech discovery set were not confirmed. However, expression analysis in human gut-related tissues and immune cells revealed that the NLRs associated with CRC risk or survival in the discovery set were expressed in primary human colon or rectum cells, CRC tissue and/or cell lines, providing preliminary evidence for a potential involvement of NLRs in general in CRC development and/or progression. Most interesting was the finding that the enigmatic development-related NLRP5 (also known as MATER) was not expressed in normal colon tissue but in colon cancer tissue and cell lines. Future studies may show whether regulatory variants instead of coding variants might affect the expression of NLRs and contribute to CRC risk and survival.

  13. Non-Protein Coding RNAs

    CERN Document Server

    Walter, Nils G; Batey, Robert T

    2009-01-01

    This book assembles chapters from experts in the Biophysics of RNA to provide a broadly accessible snapshot of the current status of this rapidly expanding field. The 2006 Nobel Prize in Physiology or Medicine was awarded to the discoverers of RNA interference, highlighting just one example of a large number of non-protein coding RNAs. Because non-protein coding RNAs outnumber protein coding genes in mammals and other higher eukaryotes, it is now thought that the complexity of organisms is correlated with the fraction of their genome that encodes non-protein coding RNAs. Essential biological processes as diverse as cell differentiation, suppression of infecting viruses and parasitic transposons, higher-level organization of eukaryotic chromosomes, and gene expression itself are found to largely be directed by non-protein coding RNAs. The biophysical study of these RNAs employs X-ray crystallography, NMR, ensemble and single molecule fluorescence spectroscopy, optical tweezers, cryo-electron microscopy, and ot...

  14. Detecting non-coding selective pressure in coding regions

    Directory of Open Access Journals (Sweden)

    Blanchette Mathieu

    2007-02-01

    Full Text Available Abstract Background Comparative genomics approaches, where orthologous DNA regions are compared and inter-species conserved regions are identified, have proven extremely powerful for identifying non-coding regulatory regions located in intergenic or intronic regions. However, non-coding functional elements can also be located within coding region, as is common for exonic splicing enhancers, some transcription factor binding sites, and RNA secondary structure elements affecting mRNA stability, localization, or translation. Since these functional elements are located in regions that are themselves highly conserved because they are coding for a protein, they generally escaped detection by comparative genomics approaches. Results We introduce a comparative genomics approach for detecting non-coding functional elements located within coding regions. Codon evolution is modeled as a mixture of codon substitution models, where each component of the mixture describes the evolution of codons under a specific type of coding selective pressure. We show how to compute the posterior distribution of the entropy and parsimony scores under this null model of codon evolution. The method is applied to a set of growth hormone 1 orthologous mRNA sequences and a known exonic splicing elements is detected. The analysis of a set of CORTBP2 orthologous genes reveals a region of several hundred base pairs under strong non-coding selective pressure whose function remains unknown. Conclusion Non-coding functional elements, in particular those involved in post-transcriptional regulation, are likely to be much more prevalent than is currently known. With the numerous genome sequencing projects underway, comparative genomics approaches like that proposed here are likely to become increasingly powerful at detecting such elements.

  15. SNP2Structure: A Public and Versatile Resource for Mapping and Three-Dimensional Modeling of Missense SNPs on Human Protein Structures

    Directory of Open Access Journals (Sweden)

    Difei Wang

    2015-01-01

    Full Text Available One of the long-standing challenges in biology is to understand how non-synonymous single nucleotide polymorphisms (nsSNPs change protein structure and further affect their function. While it is impractical to solve all the mutated protein structures experimentally, it is quite feasible to model the mutated structures in silico. Toward this goal, we built a publicly available structure database resource (SNP2Structure, https://apps.icbi.georgetown.edu/snp2structure focusing on missense mutations, msSNP. Compared with web portals with similar aims, SNP2Structure has the following major advantages. First, our portal offers direct comparison of two related 3D structures. Second, the protein models include all interacting molecules in the original PDB structures, so users are able to determine regions of potential interaction changes when a protein mutation occurs. Third, the mutated structures are available to download locally for further structural and functional analysis. Fourth, we used Jsmol package to display the protein structure that has no system compatibility issue. SNP2Structure provides reliable, high quality mapping of nsSNPs to 3D protein structures enabling researchers to explore the likely functional impact of human disease-causing mutations.

  16. Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus)

    Science.gov (United States)

    Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

    2015-01-01

    Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1–8) were identified and genotyped via direct sequencing covering most of the coding region and 3ʹUTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3ʹUTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs. PMID:26225956

  17. A synonymous polymorphic variation in ACADM exon 11 affects splicing efficiency and may affect fatty acid oxidation

    DEFF Research Database (Denmark)

    Bruun, Gitte Hoffmann; Doktor, Thomas Koed; Andresen, Brage Storstein

    2013-01-01

    beta-oxidation of medium-chain fatty acids. We examined the functional basis for this association and identified linkage between rs211718 and the intragenic synonymous polymorphic variant c.1161A>G in ACADM exon 11 (rs1061337). Employing minigene studies we show that the c.1161A allele is associated......, perhaps due to improved splicing. This study is a proof of principle that synonymous SNPs are not neutral. By changing the binding sites for splicing regulatory proteins they can have significant effects on pre-mRNA splicing and thus protein function. In addition, this study shows that for a sequence...

  18. Patterns of mutation and selection at synonymous sites in Drosophila

    DEFF Research Database (Denmark)

    Singh, Nadia D; Bauer DuMont, Vanessa L; Hubisz, Melissa J

    2007-01-01

    , when applied to 18 coding sequences in 3 species of Drosophila, confirmed an earlier report that the Notch gene in Drosophila melanogaster was evolving under selection in favor of those codons defined as unpreferred in this species. This finding opened the possibility that synonymous sites may...... be subject to a variety of selective pressures beyond weak selection for increased frequencies of the codons currently defined as "preferred" in D. melanogaster. To further explore patterns of synonymous site evolution in Drosophila in a lineage-specific manner, we expanded the application of the maximum...... likelihood framework to 8,452 protein coding sequences with well-defined orthology in D. melanogaster, Drosophila sechellia, and Drosophila yakuba. Our analyses reveal intragenomic and interspecific variation in mutational patterns as well as in patterns and intensity of selection on synonymous sites. In D...

  19. New insights into the Lake Chad Basin population structure revealed by high-throughput genotyping of mitochondrial DNA coding SNPs.

    Directory of Open Access Journals (Sweden)

    María Cerezo

    Full Text Available BACKGROUND: Located in the Sudan belt, the Chad Basin forms a remarkable ecosystem, where several unique agricultural and pastoral techniques have been developed. Both from an archaeological and a genetic point of view, this region has been interpreted to be the center of a bidirectional corridor connecting West and East Africa, as well as a meeting point for populations coming from North Africa through the Saharan desert. METHODOLOGY/PRINCIPAL FINDINGS: Samples from twelve ethnic groups from the Chad Basin (n = 542 have been high-throughput genotyped for 230 coding region mitochondrial DNA (mtDNA Single Nucleotide Polymorphisms (mtSNPs using Matrix-Assisted Laser Desorption/Ionization Time-Of-Flight (MALDI-TOF mass spectrometry. This set of mtSNPs allowed for much better phylogenetic resolution than previous studies of this geographic region, enabling new insights into its population history. Notable haplogroup (hg heterogeneity has been observed in the Chad Basin mirroring the different demographic histories of these ethnic groups. As estimated using a Bayesian framework, nomadic populations showed negative growth which was not always correlated to their estimated effective population sizes. Nomads also showed lower diversity values than sedentary groups. CONCLUSIONS/SIGNIFICANCE: Compared to sedentary population, nomads showed signals of stronger genetic drift occurring in their ancestral populations. These populations, however, retained more haplotype diversity in their hypervariable segments I (HVS-I, but not their mtSNPs, suggesting a more ancestral ethnogenesis. Whereas the nomadic population showed a higher Mediterranean influence signaled mainly by sub-lineages of M1, R0, U6, and U5, the other populations showed a more consistent sub-Saharan pattern. Although lifestyle may have an influence on diversity patterns and hg composition, analysis of molecular variance has not identified these differences. The present study indicates that

  20. The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate

    Directory of Open Access Journals (Sweden)

    Stone Eric A

    2011-08-01

    Full Text Available Abstract Background Protein sequences are subject to a mosaic of constraint. Changes to functional domains and buried residues, for example, are more apt to disrupt protein structure and function than are changes to residues participating in loops or exposed to solvent. Regions of constraint on the tertiary structure of a protein often result in loose segmentation of its primary structure into stretches of slowly- and rapidly-evolving amino acids. This clustering can be exploited, and existing methods have done so by relying on local sequence conservation as a signature of selection to help identify functionally important regions within proteins. We invert this paradigm by leveraging the regional nature of protein structure and function to both illuminate and make use of genome-wide patterns of local sequence conservation. Results Our hypothesis is that the regional nature of structural and functional constraints will assert a positive autocorrelation on the evolutionary rates of neighboring sites, which, in a pairwise comparison of orthologous proteins, will manifest itself as the clustering of non-synonymous changes across the amino acid sequence. We introduce a dispersion ratio statistic to test this and related hypotheses. Using genome-wide interspecific comparisons of orthologous protein pairs, we reveal a strong log-linear relationship between the degree of clustering and the intensity of constraint. We further demonstrate how this relationship varies with the evolutionary distance between the species being compared. We provide some evidence that proteins with a history of positive selection deviate from genome-wide trends. Conclusions We find a significant association between the evolutionary rate of a protein and the degree to which non-synonymous changes cluster along its primary sequence. We show that clustering is a non-redundant predictor of evolutionary rate, and we speculate that conflicting signals of clustering and constraint may

  1. EST-derived SNP discovery and selective pressure analysis in Pacific white shrimp ( Litopenaeus vannamei)

    Science.gov (United States)

    Liu, Chengzhang; Wang, Xia; Xiang, Jianhai; Li, Fuhua

    2012-09-01

    Pacific white shrimp has become a major aquaculture and fishery species worldwide. Although a large scale EST resource has been publicly available since 2008, the data have not yet been widely used for SNP discovery or transcriptome-wide assessment of selective pressure. In this study, a set of 155 411 expressed sequence tags (ESTs) from the NCBI database were computationally analyzed and 17 225 single nucleotide polymorphisms (SNPs) were predicted, including 9 546 transitions, 5 124 transversions and 2 481 indels. Among the 7 298 SNP substitutions located in functionally annotated contigs, 58.4% (4 262) are non-synonymous SNPs capable of introducing amino acid mutations. Two hundred and fifty nonsynonymous SNPs in genes associated with economic traits have been identified as candidates for markers in selective breeding. Diversity estimates among the synonymous nucleotides were on average 3.49 times greater than those in non-synonymous, suggesting negative selection. Distribution of non-synonymous to synonymous substitutions (Ka/Ks) ratio ranges from 0 to 4.01, (average 0.42, median 0.26), suggesting that the majority of the affected genes are under purifying selection. Enrichment analysis identified multiple gene ontology categories under positive or negative selection. Categories involved in innate immune response and male gamete generation are rich in positively selected genes, which is similar to reports in Drosophila and primates. This work is the first transcriptome-wide assessment of selective pressure in a Penaeid shrimp species. The functionally annotated SNPs provide a valuable resource of potential molecular markers for selective breeding.

  2. Characterization of genomic variations in SNPs of PE_PGRS genes reveals deletions and insertions in extensively drug resistant (XDR) M. tuberculosis strains from Pakistan

    KAUST Repository

    Kanji, Akbar

    2015-01-21

    Background Mycobacterium tuberculosis (MTB) PE_PGRS genes belong to the PE multigene family. Although the function of PE_PGRS genes is unknown, it is hypothesized that the PE_PGRS genes may be associated with antigenic variability in MTB. Material and methods Whole genome sequencing analysis was performed on (n = 37) extensively drug-resistant (XDR) MTB strains from Pakistan, which included Lineage 1 (East African Indian, n = 2); Other lineage 1 (n = 3); Lineage 3 (Central Asian, n = 24); Other lineage 3 (n = 4); Lineage 4 (X3, n = 1) and T group (n = 3) MTB strains. Results There were 107 SNPs identified from the analysis of 42 PE_PGRS genes; of these, 13 were non-synonymous SNPs (nsSNPs). The nsSNPs identified in PE_PGRS genes – 6, 9 and 10 – were common in all EAI, CAS, Other lineages (1 and 3), T1 and X3. Deletions (DELs) in PE_PGRS genes – 3 and 19 – were observed in 17 (80.9%) CAS1 and 6 (85.7%) in Other lineages (1 and 3) XDR MTB strains, while DELs in the PE_PGRS49 were observed in all CAS1, CAS, CAS2 and Other lineages (1 and 3) XDR MTB strains. All CAS, EAI and Other lineages (1 and 3) strains showed insertions (INS) in PE_PGRS6 gene, while INS in the PE_PGRS genes 19 and 33 were observed in 20 (95.2%) CAS1, all CAS, CAS2, EAI and Other lineages (1 and 3) XDR MTB strains. Conclusion Genetic diversity in PE_PGRS genes contributes to antigenic variability and may result in increased immunogenicity of strains. This is the first study identifying variations in nsSNPs and INDELs in the PE_PGRS genes of XDR-TB strains from Pakistan. It highlights common genetic variations which may contribute to persistence.

  3. Characterization of genomic variations in SNPs of PE_PGRS genes reveals deletions and insertions in extensively drug resistant (XDR) M. tuberculosis strains from Pakistan

    KAUST Repository

    Kanji, Akbar

    2015-03-01

    Background: Mycobacterium tuberculosis (MTB) PE_PGRS genes belong to the PE multi-gene family. Although the function of the members of the PE_PGRS multi-gene family is not yet known, it is hypothesized that the PE_PGRS genes may be associated with genetic variability. Material and methods: Whole genome sequencing analysis was performed on (n= 37) extensively drug resistant (XDR) MTB strains from Pakistan which included Central Asian (n= 23), East African Indian (n= 2), X3 (n= 1), T group (n= 3) and Orphan (n= 8) MTB strains. Results: By analyzing 42 PE_PGRS genes, 111 SNPs were identified, of which 13 were non-synonymous SNPs (nsSNPs). The nsSNPs identified in the PE_PGRS genes were as follows: 6, 9, 10 and 55 present in each of the CAS, EAI, Orphan, T1 and X3 XDR MTB strains studied. Deletions in PE_PGRS genes: 19, 21 and 23 were observed in 7 (35.0%) CAS1 and 3 (37.5%) in Orphan XDR MTB strains, while deletions in the PE_PGRS genes: 49 and 50 were observed in 36 (95.0%) CAS1 and all CAS, CAS2 and Orphan XDR MTB strains. An insertion in PE_PGRS6 gene was observed in all CAS, EAI3 and Orphan, while insertions in the PE_PGRS genes 19 and 33 were observed in 19 (95%) CAS1 and all CAS, CAS2, EAI3 and Orphan XDR MTB strains. Conclusion: Genetic diversity in PE_PGRS genes contributes to antigenic variability and may result in increased immunogenicity of strains. This is the first study identifying variations in nsSNPs, Insertions and Deletions in the PE_PGRS genes of XDR-TB strains from Pakistan. It highlights common genetic variations which may contribute to persistence.

  4. Inter-individual variation in nucleotide excision repair pathway is modulated by non-synonymous polymorphisms in ERCC4 and MBD4 genes

    Energy Technology Data Exchange (ETDEWEB)

    Allione, Alessandra, E-mail: alessandra.allione@hugef-torino.org [Human Genetics Foundation (HuGeF), Via Nizza 52, 10126 Turin (Italy); Guarrera, Simonetta; Russo, Alessia [Human Genetics Foundation (HuGeF), Via Nizza 52, 10126 Turin (Italy); Ricceri, Fulvio [Human Genetics Foundation (HuGeF), Via Nizza 52, 10126 Turin (Italy); Department of Medical Sciences, University of Turin, Via Santena 19, 10126 Turin (Italy); Purohit, Rituraj [Human Genetics Foundation (HuGeF), Via Nizza 52, 10126 Turin (Italy); Bioinformatics Division, School of Bio Sciences and Technology, Vellore Institute of Technology University, Vellore 632014, Tamil Nadu (India); Pagnani, Andrea; Rosa, Fabio; Polidoro, Silvia; Voglino, Floriana [Human Genetics Foundation (HuGeF), Via Nizza 52, 10126 Turin (Italy); Matullo, Giuseppe [Human Genetics Foundation (HuGeF), Via Nizza 52, 10126 Turin (Italy); Department of Medical Sciences, University of Turin, Via Santena 19, 10126 Turin (Italy)

    2013-11-15

    Highlights: • We reported a large inter-individual variability of NER capacity. • ERCC4 rs1800124 and MBD4 rs10342 nsSNP variants were associated with DNA repair capacity. • DNA–protein interaction analyses showed alteration of binding for ERCC4 and MBD4 variants. • A new possible cross-talk between NER and BER pathways has been reported. - Abstract: Inter-individual differences in DNA repair capacity (DRC) may lead to genome instability and, consequently, modulate individual cancer risk. Among the different DNA repair pathways, nucleotide excision repair (NER) is one of the most versatile, as it can eliminate a wide range of helix-distorting DNA lesions caused by ultraviolet light irradiation and chemical mutagens. We performed a genotype–phenotype correlation study in 122 healthy subjects in order to assess if any associations exist between phenotypic profiles of NER and DNA repair gene single nucleotide polymorphisms (SNPs). Individuals were genotyped for 768 SNPs with a custom Illumina Golden Gate Assay, and peripheral blood mononuclear cells (PBMCs) of the same subjects were tested for a NER comet assay to measure DRC after challenging cells by benzo(a)pyrene diolepoxide (BPDE). We observed a large inter-individual variability of NER capacity, with women showing a statistically significant lower DRC (mean ± SD: 6.68 ± 4.76; p = 0.004) than men (mean ± SD: 8.89 ± 5.20). Moreover, DRC was significantly lower in individuals carrying a variant allele for the ERCC4 rs1800124 non-synonymous SNP (nsSNP) (p = 0.006) and significantly higher in subjects with the variant allele of MBD4 rs2005618 SNP (p = 0.008), in linkage disequilibrium (r{sup 2} = 0.908) with rs10342 nsSNP. Traditional in silico docking approaches on protein–DNA and protein–protein interaction showed that Gly875 variant in ERCC4 (rs1800124) decreases the DNA–protein interaction and that Ser273 and Thr273 variants in MBD4 (rs10342) indicate complete loss of protein

  5. Continuous Non-malleable Codes

    DEFF Research Database (Denmark)

    Faust, Sebastian; Mukherjee, Pratyay; Nielsen, Jesper Buus

    2014-01-01

    or modify it to the encoding of a completely unrelated value. This paper introduces an extension of the standard non-malleability security notion - so-called continuous non-malleability - where we allow the adversary to tamper continuously with an encoding. This is in contrast to the standard notion of non...... is necessary to achieve continuous non-malleability in the split-state model. Moreover, we illustrate that none of the existing constructions satisfies our uniqueness property and hence is not secure in the continuous setting. We construct a split-state code satisfying continuous non-malleability. Our scheme...... is based on the inner product function, collision-resistant hashing and non-interactive zero-knowledge proofs of knowledge and requires an untamperable common reference string. We apply continuous non-malleable codes to protect arbitrary cryptographic primitives against tampering attacks. Previous...

  6. Non-binary Entanglement-assisted Stabilizer Quantum Codes

    OpenAIRE

    Riguang, Leng; Zhi, Ma

    2011-01-01

    In this paper, we show how to construct non-binary entanglement-assisted stabilizer quantum codes by using pre-shared entanglement between the sender and receiver. We also give an algorithm to determine the circuit for non-binary entanglement-assisted stabilizer quantum codes and some illustrated examples. The codes we constructed do not require the dual-containing constraint, and many non-binary classical codes, like non-binary LDPC codes, which do not satisfy the condition, can be used to c...

  7. Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analysis analyses of pooled RNA-Seq samples in rainbow trout

    Science.gov (United States)

    Coding/functional SNPs change the biological function of a gene and, therefore, could serve as “large-effect” genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, mus...

  8. In Vitro vs In Silico Detected SNPs for the Development of a Genotyping Array: What Can We Learn from a Non-Model Species?

    Science.gov (United States)

    Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe

    2010-01-01

    Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species

  9. In vitro vs in silico detected SNPs for the development of a genotyping array: what can we learn from a non-model species?

    Directory of Open Access Journals (Sweden)

    Camille Lepoittevin

    2010-06-01

    Full Text Available There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait., a conifer characterized by a huge genome size ( approximately 23.8 Gb/C.A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs, chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively. The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates.This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome.

  10. Genome-wide characterization of genetic variants and putative regions under selection in meat and egg-type chicken lines.

    Science.gov (United States)

    Boschiero, Clarissa; Moreira, Gabriel Costa Monteiro; Gheyas, Almas Ara; Godoy, Thaís Fernanda; Gasparin, Gustavo; Mariani, Pilar Drummond Sampaio Corrêa; Paduan, Marcela; Cesar, Aline Silva Mello; Ledur, Mônica Corrêa; Coutinho, Luiz Lehmann

    2018-01-25

    Meat and egg-type chickens have been selected for several generations for different traits. Artificial and natural selection for different phenotypes can change frequency of genetic variants, leaving particular genomic footprints throghtout the genome. Thus, the aims of this study were to sequence 28 chickens from two Brazilian lines (meat and white egg-type) and use this information to characterize genome-wide genetic variations, identify putative regions under selection using Fst method, and find putative pathways under selection. A total of 13.93 million SNPs and 1.36 million INDELs were identified, with more variants detected from the broiler (meat-type) line. Although most were located in non-coding regions, we identified 7255 intolerant non-synonymous SNPs, 512 stopgain/loss SNPs, 1381 frameshift and 1094 non-frameshift INDELs that may alter protein functions. Genes harboring intolerant non-synonymous SNPs affected metabolic pathways related mainly to reproduction and endocrine systems in the white-egg layer line, and lipid metabolism and metabolic diseases in the broiler line. Fst analysis in sliding windows, using SNPs and INDELs separately, identified over 300 putative regions of selection overlapping with more than 250 genes. For the first time in chicken, INDEL variants were considered for selection signature analysis, showing high level of correlation in results between SNP and INDEL data. The putative regions of selection signatures revealed interesting candidate genes and pathways related to important phenotypic traits in chicken, such as lipid metabolism, growth, reproduction, and cardiac development. In this study, Fst method was applied to identify high confidence putative regions under selection, providing novel insights into selection footprints that can help elucidate the functional mechanisms underlying different phenotypic traits relevant to meat and egg-type chicken lines. In addition, we generated a large catalog of line-specific and common

  11. Bacterial Genetic Architecture of Ecological Interactions in Co-culture by GWAS-Taking Escherichia coli and Staphylococcus aureus as an Example.

    Science.gov (United States)

    He, Xiaoqing; Jin, Yi; Ye, Meixia; Chen, Nan; Zhu, Jing; Wang, Jingqi; Jiang, Libo; Wu, Rongling

    2017-01-01

    How a species responds to such a biotic environment in the community, ultimately leading to its evolution, has been a topic of intense interest to ecological evolutionary biologists. Until recently, limited knowledge was available regarding the genotypic changes that underlie phenotypic changes. Our study implemented GWAS (Genome-Wide Association Studies) to illustrate the genetic architecture of ecological interactions that take place in microbial populations. By choosing 45 such interspecific pairs of Escherichia coli and Staphylococcus aureus strains that were all genotyped throughout the entire genome, we employed Q-ROADTRIPS to analyze the association between single SNPs and microbial abundance measured at each time point for bacterial populations reared in monoculture and co-culture, respectively. We identified a large number of SNPs and indels across the genomes (35.69 G clean data of E. coli and 50.41 G of S. aureus ). We reported 66 and 111 SNPs that were associated with interaction in E. coli and S. aureus , respectively. 23 out of 66 polymorphic changes resulted in amino acid alterations.12 significant genes, such as murE, treA, argS , and relA , which were also identified in previous evolutionary studies. In S. aureus , 111 SNPs detected in coding sequences could be divided into 35 non-synonymous and 76 synonymous SNPs. Our study illustrated the potential of genome-wide association methods for studying rapidly evolving traits in bacteria. Genetic association study methods will facilitate the identification of genetic elements likely to cause phenotypes of interest and provide targets for further laboratory investigation.

  12. Are Synonymous Substitutions in Flowering Plant Mitochondria Neutral?

    Science.gov (United States)

    Wynn, Emily L; Christensen, Alan C

    2015-10-01

    Angiosperm mitochondrial genes appear to have very low mutation rates, while non-gene regions expand, diverge, and rearrange quickly. One possible explanation for this disparity is that synonymous substitutions in plant mitochondrial genes are not truly neutral and selection keeps their occurrence low. If this were true, the explanation for the disparity in mutation rates in genes and non-genes needs to consider selection as well as mechanisms of DNA repair. Rps14 is co-transcribed with cob and rpl5 in most plant mitochondrial genomes, but in some genomes, rps14 has been duplicated to the nucleus leaving a pseudogene in the mitochondria. This provides an opportunity to compare neutral substitution rates in pseudogenes with synonymous substitution rates in the orthologs. Genes and pseudogenes of rps14 have been aligned among different species and the mutation rates have been calculated. Neutral substitution rates in pseudogenes and synonymous substitution rates in genes are significantly different, providing evidence that synonymous substitutions in plant mitochondrial genes are not completely neutral. The non-neutrality is not sufficient to completely explain the exceptionally low mutation rates in land plant mitochondrial genomes, but selective forces appear to play a small role.

  13. Large-scale analyses of synonymous substitution rates can be sensitive to assumptions about the process of mutation.

    Science.gov (United States)

    Aris-Brosou, Stéphane; Bielawski, Joseph P

    2006-08-15

    A popular approach to examine the roles of mutation and selection in the evolution of genomes has been to consider the relationship between codon bias and synonymous rates of molecular evolution. A significant relationship between these two quantities is taken to indicate the action of weak selection on substitutions among synonymous codons. The neutral theory predicts that the rate of evolution is inversely related to the level of functional constraint. Therefore, selection against the use of non-preferred codons among those coding for the same amino acid should result in lower rates of synonymous substitution as compared with sites not subject to such selection pressures. However, reliably measuring the extent of such a relationship is problematic, as estimates of synonymous rates are sensitive to our assumptions about the process of molecular evolution. Previous studies showed the importance of accounting for unequal codon frequencies, in particular when synonymous codon usage is highly biased. Yet, unequal codon frequencies can be modeled in different ways, making different assumptions about the mutation process. Here we conduct a simulation study to evaluate two different ways of modeling uneven codon frequencies and show that both model parameterizations can have a dramatic impact on rate estimates and affect biological conclusions about genome evolution. We reanalyze three large data sets to demonstrate the relevance of our results to empirical data analysis.

  14. Identification of coding and non-coding mutational hotspots in cancer genomes.

    Science.gov (United States)

    Piraino, Scott W; Furney, Simon J

    2017-01-05

    The identification of mutations that play a causal role in tumour development, so called "driver" mutations, is of critical importance for understanding how cancers form and how they might be treated. Several large cancer sequencing projects have identified genes that are recurrently mutated in cancer patients, suggesting a role in tumourigenesis. While the landscape of coding drivers has been extensively studied and many of the most prominent driver genes are well characterised, comparatively less is known about the role of mutations in the non-coding regions of the genome in cancer development. The continuing fall in genome sequencing costs has resulted in a concomitant increase in the number of cancer whole genome sequences being produced, facilitating systematic interrogation of both the coding and non-coding regions of cancer genomes. To examine the mutational landscapes of tumour genomes we have developed a novel method to identify mutational hotspots in tumour genomes using both mutational data and information on evolutionary conservation. We have applied our methodology to over 1300 whole cancer genomes and show that it identifies prominent coding and non-coding regions that are known or highly suspected to play a role in cancer. Importantly, we applied our method to the entire genome, rather than relying on predefined annotations (e.g. promoter regions) and we highlight recurrently mutated regions that may have resulted from increased exposure to mutational processes rather than selection, some of which have been identified previously as targets of selection. Finally, we implicate several pan-cancer and cancer-specific candidate non-coding regions, which could be involved in tumourigenesis. We have developed a framework to identify mutational hotspots in cancer genomes, which is applicable to the entire genome. This framework identifies known and novel coding and non-coding mutional hotspots and can be used to differentiate candidate driver regions from

  15. A novel method for in silico identification of regulatory SNPs in human genome.

    Science.gov (United States)

    Li, Rong; Zhong, Dexing; Liu, Ruiling; Lv, Hongqiang; Zhang, Xinman; Liu, Jun; Han, Jiuqiang

    2017-02-21

    Regulatory single nucleotide polymorphisms (rSNPs), kind of functional noncoding genetic variants, can affect gene expression in a regulatory way, and they are thought to be associated with increased susceptibilities to complex diseases. Here a novel computational approach to identify potential rSNPs is presented. Different from most other rSNPs finding methods which based on hypothesis that SNPs causing large allele-specific changes in transcription factor binding affinities are more likely to play regulatory functions, we use a set of documented experimentally verified rSNPs and nonfunctional background SNPs to train classifiers, so the discriminating features are found. To characterize variants, an extensive range of characteristics, such as sequence context, DNA structure and evolutionary conservation etc. are analyzed. Support vector machine is adopted to build the classifier model together with an ensemble method to deal with unbalanced data. 10-fold cross-validation result shows that our method can achieve accuracy with sensitivity of ~78% and specificity of ~82%. Furthermore, our method performances better than some other algorithms based on aforementioned hypothesis in handling false positives. The original data and the source matlab codes involved are available at https://sourceforge.net/projects/rsnppredict/. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes

    DEFF Research Database (Denmark)

    Lin, Michael F; Kheradpour, Pouya; Washietl, Stefan

    2011-01-01

    conservation compared to typical protein-coding genes—especially at synonymous sites. In this study, we use genome alignments of 29 placental mammals to systematically locate short regions within human ORFs that show conspicuously low estimated rates of synonymous substitution across these species. The 29......-species alignment provides statistical power to locate more than 10,000 such regions with resolution down to nine-codon windows, which are found within more than a quarter of all human protein-coding genes and contain ~2% of their synonymous sites. We collect numerous lines of evidence that the observed...... synonymous constraint in these regions reflects selection on overlapping functional elements including splicing regulatory elements, dual-coding genes, RNA secondary structures, microRNA target sites, and developmental enhancers. Our results show that overlapping functional elements are common in mammalian...

  17. Patterns of genomic variation in the poplar rust fungus Melampsora larici-populina identify pathogenesis-related factors

    Directory of Open Access Journals (Sweden)

    Antoine ePersoons

    2014-09-01

    Full Text Available Melampsora larici-populina is a fungal pathogen responsible for foliar rust disease on poplar trees, which causes damage to forest plantations worldwide, particularly in Northern Europe. The reference genome of the isolate 98AG31 was previously sequenced using a whole genome shotgun strategy, revealing a large genome of 101 megabases containing 16,399 predicted genes, which included secreted protein genes representing poplar rust candidate effectors. In the present study, the genomes of 15 isolates collected over the past 20 years throughout the French territory, representing distinct virulence profiles, were characterized by massively parallel sequencing to assess genetic variation in the poplar rust fungus. Comparison to the reference genome revealed striking structural variations. Analysis of coverage and sequencing depth identified large missing regions between isolates related to the mating type loci. More than 611,824 single-nucleotide polymorphism (SNP positions were uncovered overall, indicating a remarkable level of polymorphism. Based on the accumulation of non-synonymous substitutions in coding sequences and the relative frequencies of synonymous and non-synonymous polymorphisms (i.e. PN/PS, we identify candidate genes that may be involved in fungal pathogenesis. Correlation between non-synonymous SNPs in genes encoding secreted proteins and pathotypes of the studied isolates revealed candidate genes potentially related to virulences 1, 6 and 8 of the poplar rust fungus.

  18. Analysis of Non-binary Hybrid LDPC Codes

    OpenAIRE

    Sassatelli, Lucile; Declercq, David

    2008-01-01

    In this paper, we analyse asymptotically a new class of LDPC codes called Non-binary Hybrid LDPC codes, which has been recently introduced. We use density evolution techniques to derive a stability condition for hybrid LDPC codes, and prove their threshold behavior. We study this stability condition to conclude on asymptotic advantages of hybrid LDPC codes compared to their non-hybrid counterparts.

  19. First Comprehensive In Silico Analysis of the Functional and Structural Consequences of SNPs in Human GalNAc-T1 Gene

    Directory of Open Access Journals (Sweden)

    Hussein Sheikh Ali Mohamoud

    2014-01-01

    Full Text Available GalNAc-T1, a key candidate of GalNac-transferases genes family that is involved in mucin-type O-linked glycosylation pathway, is expressed in most biological tissues and cell types. Despite the reported association of GalNAc-T1 gene mutations with human disease susceptibility, the comprehensive computational analysis of coding, noncoding and regulatory SNPs, and their functional impacts on protein level, still remains unknown. Therefore, sequence- and structure-based computational tools were employed to screen the entire listed coding SNPs of GalNAc-T1 gene in order to identify and characterize them. Our concordant in silico analysis by SIFT, PolyPhen-2, PANTHER-cSNP, and SNPeffect tools, identified the potential nsSNPs (S143P, G258V, and Y414D variants from 18 nsSNPs of GalNAc-T1. Additionally, 2 regulatory SNPs (rs72964406 and #x26; rs34304568 were also identified in GalNAc-T1 by using FastSNP tool. Using multiple computational approaches, we have systematically classified the functional mutations in regulatory and coding regions that can modify expression and function of GalNAc-T1 enzyme. These genetic variants can further assist in better understanding the wide range of disease susceptibility associated with the mucin-based cell signalling and pathogenic binding, and may help to develop novel therapeutic elements for associated diseases.

  20. Non-Coding RNAs in Hodgkin Lymphoma

    Directory of Open Access Journals (Sweden)

    Anna Cordeiro

    2017-05-01

    Full Text Available MicroRNAs (miRNAs, small non-coding RNAs that regulate gene expression by binding to the 3’-UTR of their target genes, can act as oncogenes or tumor suppressors. Recently, other types of non-coding RNAs—piwiRNAs and long non-coding RNAs—have also been identified. Hodgkin lymphoma (HL is a B cell origin disease characterized by the presence of only 1% of tumor cells, known as Hodgkin and Reed-Stenberg (HRS cells, which interact with the microenvironment to evade apoptosis. Several studies have reported specific miRNA signatures that can differentiate HL lymph nodes from reactive lymph nodes, identify histologic groups within classical HL, and distinguish HRS cells from germinal center B cells. Moreover, some signatures are associated with survival or response to chemotherapy. Most of the miRNAs in the signatures regulate genes related to apoptosis, cell cycle arrest, or signaling pathways. Here we review findings on miRNAs in HL, as well as on other non-coding RNAs.

  1. Adaptive synonymous mutations in an experimentally evolved Pseudomonas fluorescens population

    DEFF Research Database (Denmark)

    Bailey, Susan; Hinz, Aaron; Kassen, Rees

    2014-01-01

    Conventional wisdom holds that synonymous mutations, nucleotide changes that do not alter the encoded amino acid, have no detectable effect on phenotype or fitness. However, a growing body of evidence from both comparative and experimental studies suggests otherwise. Synonymous mutations have been...... shown to impact gene expression, protein folding and fitness, however, direct evidence that they can be positively selected, and so contribute to adaptation, is lacking. Here we report the recovery of two beneficial synonymous single base pair changes that arose spontaneously and independently...... in an experimentally evolved population of Pseudomonas fluorescens. We show experimentally that these mutations increase fitness by an amount comparable to non-synonymous mutations and that the fitness increases stem from increased gene expression. These results provide unequivocal evidence that synonymous mutations...

  2. Synonymous codon ordering: a subtle but prevalent strategy of bacteria to improve translational efficiency.

    Directory of Open Access Journals (Sweden)

    Zhu-Qing Shao

    Full Text Available BACKGROUND: In yeast coding sequences, once a particular codon has been used, subsequent occurrence of the same amino acid tends to use codons sharing the same tRNA. Such a phenomenon of co-tRNA codons pairing bias (CTCPB is also found in some other eukaryotes but it is not known whether it occurs in prokaryotes. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we focused on a total of 773 bacterial genomes to investigate their synonymous codon pairing preferences. After calculating the actual frequencies of synonymous codon pairs and comparing them with their expected values, we detected an obvious pairing bias towards identical codon pairs. This seems consistent with the previously reported CTCPB phenomenon, since identical codons are certainly read by the same tRNA. However, among co-tRNA but non-identical codon pairs, only 22 were often found overrepresented, suggesting that many co-tRNA codons actually do not preferentially pair together in prokaryotes. Therefore, the previously reported co-tRNA codons pairing rule needs to be more rigorously defined. The affinity differences between a tRNA anticodon and its readable codons should be taken into account. Moreover, both within-gene-shuffling tests and phylogenetic analyses support the idea that translational selection played an important role in shaping the observed synonymous codon pairing pattern in prokaryotes. CONCLUSIONS: Overall, a high level of synonymous codon pairing bias was detected in 73% investigated bacterial species, suggesting the synonymous codon ordering strategy has been prevalently adopted by prokaryotes to improve their translational efficiencies. The findings in this study also provide important clues to better understand the complex dynamics of translational process.

  3. Disparities in allele frequencies and population differentiation for 101 disease-associated single nucleotide polymorphisms between Puerto Ricans and non-Hispanic whites

    Directory of Open Access Journals (Sweden)

    Arnett Donna

    2009-08-01

    Full Text Available Abstract Background Variations in gene allele frequencies can contribute to differences in the prevalence of some common complex diseases among populations. Natural selection modulates the balance in allele frequencies across populations. Population differentiation (FST can evidence environmental selection pressures. Such genetic information is limited in Puerto Ricans, the second largest Hispanic ethnic group in the US, and a group with high prevalence of chronic disease. We determined allele frequencies and population differentiation for 101 single nucleotide polymorphisms (SNPs in 30 genes involved in major metabolic and disease-relevant pathways in Puerto Ricans (n = 969, ages 45–75 years and compared them to similarly aged non-Hispanic whites (NHW (n = 597. Results Minor allele frequency (MAF distributions for 45.5% of the SNPs assessed in Puerto Ricans were significantly different from those of NHW. Puerto Ricans carried risk alleles in higher frequency and protective alleles in lower frequency than NHW. Patterns of population differentiation showed that Puerto Ricans had SNPs with exceptional FST values in intronic, non-synonymous and promoter regions. NHW had exceptional FST values in intronic and promoter region SNPs only. Conclusion These observations may serve to explain and broaden studies on the impact of gene polymorphisms on chronic diseases affecting Puerto Ricans.

  4. Polymorphisms in the TLR4 and TLR5 gene are significantly associated with inflammatory bowel disease in German shepherd dogs.

    Science.gov (United States)

    Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin

    2010-12-23

    Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease.

  5. MiRNA-Related SNPs and Risk of Esophageal Adenocarcinoma and Barrett's Esophagus: Post Genome-Wide Association Analysis in the BEACON Consortium.

    Directory of Open Access Journals (Sweden)

    Matthew F Buas

    Full Text Available Incidence of esophageal adenocarcinoma (EA has increased substantially in recent decades. Multiple risk factors have been identified for EA and its precursor, Barrett's esophagus (BE, such as reflux, European ancestry, male sex, obesity, and tobacco smoking, and several germline genetic variants were recently associated with disease risk. Using data from the Barrett's and Esophageal Adenocarcinoma Consortium (BEACON genome-wide association study (GWAS of 2,515 EA cases, 3,295 BE cases, and 3,207 controls, we examined single nucleotide polymorphisms (SNPs that potentially affect the biogenesis or biological activity of microRNAs (miRNAs, small non-coding RNAs implicated in post-transcriptional gene regulation, and deregulated in many cancers, including EA. Polymorphisms in three classes of genes were examined for association with risk of EA or BE: miRNA biogenesis genes (157 SNPs, 21 genes; miRNA gene loci (234 SNPs, 210 genes; and miRNA-targeted mRNAs (177 SNPs, 158 genes. Nominal associations (P0.50, and we did not find evidence for interactions between variants analyzed and two risk factors for EA/BE (smoking and obesity. This analysis provides the most extensive assessment to date of miRNA-related SNPs in relation to risk of EA and BE. While common genetic variants within components of the miRNA biogenesis core pathway appear unlikely to modulate susceptibility to EA or BE, further studies may be warranted to examine potential associations between unassessed variants in miRNA genes and targets with disease risk.

  6. SNP-finding in pig mitochondrial ESTs

    DEFF Research Database (Denmark)

    Scheibye-Alsing, Karsten; Cirera Salicio, Susanna; Gilchrist, M.J.

    2008-01-01

    The Sino-Danish pig genome project produced 685 851 ESTs (Gorodkin et al. 2007), of which 41 499 originated from the mitochondrial genome. In this study, the mitochondrial ESTs were assembled, and 374 putative SNPs were found. Chromatograms for the ESTs containing SNPs were manually inspected, an......, and 112 total (52 non-synonymous) SNPs were found to be of high confidence (five of them are close to disease-causing SNPs in humans). Nine of the high-confidence SNPs were tested experimentally, and eight were confirmed. The SNPs can be accessed online at http://pigest.ku.dk/more.mito...

  7. Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana.

    Science.gov (United States)

    Hoffmann, Robert D; Palmgren, Michael

    2016-06-13

    Whole-genome duplications in the ancestors of many diverse species provided the genetic material for evolutionary novelty. Several models explain the retention of paralogous genes. However, how these models are reflected in the evolution of coding and non-coding sequences of paralogous genes is unknown. Here, we analyzed the coding and non-coding sequences of paralogous genes in Arabidopsis thaliana and compared these sequences with those of orthologous genes in Arabidopsis lyrata. Paralogs with lower expression than their duplicate had more nonsynonymous substitutions, were more likely to fractionate, and exhibited less similar expression patterns with their orthologs in the other species. Also, lower-expressed genes had greater tissue specificity. Orthologous conserved non-coding sequences in the promoters, introns, and 3' untranslated regions were less abundant at lower-expressed genes compared to their higher-expressed paralogs. A gene ontology (GO) term enrichment analysis showed that paralogs with similar expression levels were enriched in GO terms related to ribosomes, whereas paralogs with different expression levels were enriched in terms associated with stress responses. Loss of conserved non-coding sequences in one gene of a paralogous gene pair correlates with reduced expression levels that are more tissue specific. Together with increased mutation rates in the coding sequences, this suggests that similar forces of purifying selection act on coding and non-coding sequences. We propose that coding and non-coding sequences evolve concurrently following gene duplication.

  8. Evidence of Stage- and Age-Related Heterogeneity of Non-HLA SNPs and Risk of Islet Autoimmunity and Type 1 Diabetes: The Diabetes Autoimmunity Study in the Young

    Directory of Open Access Journals (Sweden)

    Brittni N. Frederiksen

    2013-01-01

    Full Text Available Previously, we examined 20 non-HLA SNPs for association with islet autoimmunity (IA and/or progression to type 1 diabetes (T1D. Our objective was to investigate fourteen additional non-HLA T1D candidate SNPs for stage- and age-related heterogeneity in the etiology of T1D. Of 1634 non-Hispanic white DAISY children genotyped, 132 developed IA (positive for GAD, insulin, or IA-2 autoantibodies at two or more consecutive visits; 50 IA positive children progressed to T1D. Cox regression was used to analyze risk of IA and progression to T1D in IA positive children. Restricted cubic splines were used to model SNPs when there was evidence that risk was not constant with age. C1QTNF6 (rs229541 predicted increased IA risk (HR: 1.57, CI: 1.20–2.05 but not progression to T1D (HR: 1.13, CI: 0.75–1.71. SNP (rs10517086 appears to exhibit an age-related effect on risk of IA, with increased risk before age 2 years (age 2 HR: 1.67, CI: 1.08–2.56 but not older ages (age 4 HR: 0.84, CI: 0.43–1.62. C1QTNF6 (rs229541, SNP (rs10517086, and UBASH3A (rs3788013 were associated with development of T1D. This prospective investigation of non-HLA T1D candidate loci shows that some SNPs may exhibit stage- and age-related heterogeneity in the etiology of T1D.

  9. Fiscal 1999 research achievement report on the development of SNPs related technologies; 1999 nendo SNPs kanren gijutsu kaihatsu seika hokokusho

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2001-03-01

    Efforts are made to develop specimen processing technologies for modifying and enabling various kinds of specimens to automatically undergo SNP (single nucleotide polymorphism) analysis for medicine development and clinical diagnostic activities and to develop technologies and apparatuses to enable rapid, inexpensive, and simple search and analysis of SNPs using DNA (deoxyribonucleic acid) chips and mass spectrometry. Activities are conducted in the four fields involving (1) the development of a practical clinical system for rapid detection and analysis of SNPs, (2) research and development of an SNP scoring system using bar-coded oligonucleotides and magnetic beads, (3) research and development of a high-speed SNP analysis system using a mass spectrometer, and (4) the development of a high throughput SNP analysis line. Efforts exerted in field (1) involve a protein fixation method using plasma polymerization and its application to DNA arrays, development of an SNP detection method using human genomes, construction of a rapid DNA detection device using an electric field, development of an SNP analysis system using the solid phase HPA (hybridization protection assay) method, and SNP analysis using solid phase ligation. (NEDO)

  10. Hominoid-specific de novo protein-coding genes originating from long non-coding RNAs.

    Directory of Open Access Journals (Sweden)

    Chen Xie

    2012-09-01

    Full Text Available Tinkering with pre-existing genes has long been known as a major way to create new genes. Recently, however, motherless protein-coding genes have been found to have emerged de novo from ancestral non-coding DNAs. How these genes originated is not well addressed to date. Here we identified 24 hominoid-specific de novo protein-coding genes with precise origination timing in vertebrate phylogeny. Strand-specific RNA-Seq analyses were performed in five rhesus macaque tissues (liver, prefrontal cortex, skeletal muscle, adipose, and testis, which were then integrated with public transcriptome data from human, chimpanzee, and rhesus macaque. On the basis of comparing the RNA expression profiles in the three species, we found that most of the hominoid-specific de novo protein-coding genes encoded polyadenylated non-coding RNAs in rhesus macaque or chimpanzee with a similar transcript structure and correlated tissue expression profile. According to the rule of parsimony, the majority of these hominoid-specific de novo protein-coding genes appear to have acquired a regulated transcript structure and expression profile before acquiring coding potential. Interestingly, although the expression profile was largely correlated, the coding genes in human often showed higher transcriptional abundance than their non-coding counterparts in rhesus macaque. The major findings we report in this manuscript are robust and insensitive to the parameters used in the identification and analysis of de novo genes. Our results suggest that at least a portion of long non-coding RNAs, especially those with active and regulated transcription, may serve as a birth pool for protein-coding genes, which are then further optimized at the transcriptional level.

  11. Function and Application Areas in Medicine of Non-Coding RNA

    Directory of Open Access Journals (Sweden)

    Figen Guzelgul

    2009-06-01

    Full Text Available RNA is the genetic material converting the genetic code that it gets from DNA into protein. While less than 2 % of RNA is converted into protein , more than 98 % of it can not be converted into protein and named as non-coding RNAs. 70 % of noncoding RNAs consists of introns , however, the rest part of them consists of exons. Non-coding RNAs are examined in two classes according to their size and functions. Whereas they are classified as long non-coding and small non-coding RNAs according to their size , they are grouped as housekeeping non-coding RNAs and regulating non-coding RNAs according to their function. For long years ,these non-coding RNAs have been considered as non-functional. However, today, it has been proved that these non-coding RNAs play role in regulating genes and in structural, functional and catalitic roles of RNAs converted into protein. Due to its taking a role in gene silencing mechanism, particularly in medical world , non-coding RNAs have led to significant developments. RNAi technolgy , which is used in designing drugs to be used in treatment of various diseases , is a ray of hope for medical world. [Archives Medical Review Journal 2009; 18(3.000: 141-155

  12. Two non-synonymous markers in PTPN21, identified by genome-wide association study data-mining and replication, are associated with schizophrenia.

    LENUS (Irish Health Repository)

    Chen, Jingchun

    2011-09-01

    We conducted data-mining analyses of genome wide association (GWA) studies of the CATIE and MGS-GAIN datasets, and found 13 markers in the two physically linked genes, PTPN21 and EML5, showing nominally significant association with schizophrenia. Linkage disequilibrium (LD) analysis indicated that all 7 markers from PTPN21 shared high LD (r(2)>0.8), including rs2274736 and rs2401751, the two non-synonymous markers with the most significant association signals (rs2401751, P=1.10 × 10(-3) and rs2274736, P=1.21 × 10(-3)). In a meta-analysis of all 13 replication datasets with a total of 13,940 subjects, we found that the two non-synonymous markers are significantly associated with schizophrenia (rs2274736, OR=0.92, 95% CI: 0.86-0.97, P=5.45 × 10(-3) and rs2401751, OR=0.92, 95% CI: 0.86-0.97, P=5.29 × 10(-3)). One SNP (rs7147796) in EML5 is also significantly associated with the disease (OR=1.08, 95% CI: 1.02-1.14, P=6.43 × 10(-3)). These 3 markers remain significant after Bonferroni correction. Furthermore, haplotype conditioned analyses indicated that the association signals observed between rs2274736\\/rs2401751 and rs7147796 are statistically independent. Given the results that 2 non-synonymous markers in PTPN21 are associated with schizophrenia, further investigation of this locus is warranted.

  13. Understanding Epistatic Interactions between Genes Targeted by Non-coding Regulatory Elements in Complex Diseases

    Directory of Open Access Journals (Sweden)

    Min Kyung Sung

    2014-12-01

    Full Text Available Genome-wide association studies have proven the highly polygenic architecture of complex diseases or traits; therefore, single-locus-based methods are usually unable to detect all involved loci, especially when individual loci exert small effects. Moreover, the majority of associated single-nucleotide polymorphisms resides in non-coding regions, making it difficult to understand their phenotypic contribution. In this work, we studied epistatic interactions associated with three common diseases using Korea Association Resource (KARE data: type 2 diabetes mellitus (DM, hypertension (HT, and coronary artery disease (CAD. We showed that epistatic single-nucleotide polymorphisms (SNPs were enriched in enhancers, as well as in DNase I footprints (the Encyclopedia of DNA Elements [ENCODE] Project Consortium 2012, which suggested that the disruption of the regulatory regions where transcription factors bind may be involved in the disease mechanism. Accordingly, to identify the genes affected by the SNPs, we employed whole-genome multiple-cell-type enhancer data which discovered using DNase I profiles and Cap Analysis Gene Expression (CAGE. Assigned genes were significantly enriched in known disease associated gene sets, which were explored based on the literature, suggesting that this approach is useful for detecting relevant affected genes. In our knowledge-based epistatic network, the three diseases share many associated genes and are also closely related with each other through many epistatic interactions. These findings elucidate the genetic basis of the close relationship between DM, HT, and CAD.

  14. Association of Common Polymorphisms in the Nicotinic Acetylcholine Receptor Alpha4 Subunit Gene with an Electrophysiological Endophenotype in a Large Population-Based Sample.

    Directory of Open Access Journals (Sweden)

    A Mobascher

    Full Text Available Variation in genes coding for nicotinic acetylcholine receptor (nAChR subunits affect cognitive processes and may contribute to the genetic architecture of neuropsychiatric disorders. Single nucleotide polymorphisms (SNPs in the CHRNA4 gene that codes for the alpha4 subunit of alpha4/beta2-containing receptors have previously been implicated in aspects of (mostly visual attention and smoking-related behavioral measures. Here we investigated the effects of six synonymous but functional CHRNA4 exon 5 SNPs on the N100 event-related potential (ERP, an electrophysiological endophenotype elicited by a standard auditory oddball. A total of N = 1,705 subjects randomly selected from the general population were studied with electroencephalography (EEG as part of the German Multicenter Study on nicotine addiction. Two of the six variants, rs1044396 and neighboring rs1044397, were significantly associated with N100 amplitude. This effect was pronounced in females where we also observed an effect on reaction time. Sequencing of the complete exon 5 region in the population sample excluded the existence of additional/functional variants that may be responsible for the observed effects. This is the first large-scale population-based study investigation the effects of CHRNA4 SNPs on brain activity measures related to stimulus processing and attention. Our results provide further evidence that common synonymous CHRNA4 exon 5 SNPs affect cognitive processes and suggest that they also play a role in the auditory system. As N100 amplitude reduction is considered a schizophrenia-related endophenotype the SNPs studied here may also be associated with schizophrenia outcome measures.

  15. SNPs in Multi-Species Conserved Sequences (MCS as useful markers in association studies: a practical approach

    Directory of Open Access Journals (Sweden)

    Pericak-Vance Margaret A

    2007-08-01

    Full Text Available Abstract Background Although genes play a key role in many complex diseases, the specific genes involved in most complex diseases remain largely unidentified. Their discovery will hinge on the identification of key sequence variants that are conclusively associated with disease. While much attention has been focused on variants in protein-coding DNA, variants in noncoding regions may also play many important roles in complex disease by altering gene regulation. Since the vast majority of noncoding genomic sequence is of unknown function, this increases the challenge of identifying "functional" variants that cause disease. However, evolutionary conservation can be used as a guide to indicate regions of noncoding or coding DNA that are likely to have biological function, and thus may be more likely to harbor SNP variants with functional consequences. To help bias marker selection in favor of such variants, we devised a process that prioritizes annotated SNPs for genotyping studies based on their location within Multi-species Conserved Sequences (MCSs and used this process to select SNPs in a region of linkage to a complex disease. This allowed us to evaluate the utility of the chosen SNPs for further association studies. Previously, a region of chromosome 1q43 was linked to Multiple Sclerosis (MS in a genome-wide screen. We chose annotated SNPs in the region based on location within MCSs (termed MCS-SNPs. We then obtained genotypes for 478 MCS-SNPs in 989 individuals from MS families. Results Analysis of our MCS-SNP genotypes from the 1q43 region and comparison to HapMap data confirmed that annotated SNPs in MCS regions are frequently polymorphic and show subtle signatures of selective pressure, consistent with previous reports of genome-wide variation in conserved regions. We also present an online tool that allows MCS data to be directly exported to the UCSC genome browser so that MCS-SNPs can be easily identified within genomic regions of

  16. Allelic variation of the Waxy gene in foxtail millet [Setaria italica (L.) P. Beauv.] by single nucleotide polymorphisms.

    Science.gov (United States)

    Van, K; Onoda, S; Kim, M Y; Kim, K D; Lee, S-H

    2008-03-01

    The Waxy (Wx) gene product controls the formation of a straight chain polymer of amylose in the starch pathway. Dominance/recessiveness of the Wx allele is associated with amylose content, leading to non-waxy/waxy phenotypes. For a total of 113 foxtail millet accessions, agronomic traits and the molecular differences of the Wx gene were surveyed to evaluate genetic diversities. Molecular types were associated with phenotypes determined by four specific primer sets (non-waxy, Type I; low amylose, Type VI; waxy, Type IV or V). Additionally, the insertion of transposable element in waxy was confirmed by ex1/TSI2R, TSI2F/ex2, ex2int2/TSI7R and TSI7F/ex4r. Seventeen single nucleotide polymorphims (SNPs) were observed from non-coding regions, while three SNPs from coding regions were non-synonymous. Interestingly, the phenotype of No. 88 was still non-waxy, although seven nucleotides (AATTGGT) insertion at 2,993 bp led to 78 amino acids shorter. The rapid decline of r (2) in the sequenced region (exon 1-intron 1-exon 2) suggested a low level of linkage disequilibrium and limited haplotype structure. K (s) values and estimation of evolutionary events indicate early divergence of S. italica among cereal crops. This study suggested the Wx gene was one of the targets in the selection process during domestication.

  17. SNPs altering ammonium transport activity of human Rhesus factors characterized by a yeast-based functional assay.

    Directory of Open Access Journals (Sweden)

    Aude Deschuyteneer

    Full Text Available Proteins of the conserved Mep-Amt-Rh family, including mammalian Rhesus factors, mediate transmembrane ammonium transport. Ammonium is an important nitrogen source for the biosynthesis of amino acids but is also a metabolic waste product. Its disposal in urine plays a critical role in the regulation of the acid/base homeostasis, especially with an acid diet, a trait of Western countries. Ammonium accumulation above a certain concentration is however pathologic, the cytotoxicity causing fatal cerebral paralysis in acute cases. Alteration in ammonium transport via human Rh proteins could have clinical outcomes. We used a yeast-based expression assay to characterize human Rh variants resulting from non synonymous single nucleotide polymorphisms (nsSNPs with known or unknown clinical phenotypes and assessed their ammonium transport efficiency, protein level, localization and potential trans-dominant impact. The HsRhAG variants (I61R, F65S associated to overhydrated hereditary stomatocytosis (OHSt, a disease affecting erythrocytes, proved affected in intrinsic bidirectional ammonium transport. Moreover, this study reveals that the R202C variant of HsRhCG, the orthologue of mouse MmRhcg required for optimal urinary ammonium excretion and blood pH control, shows an impaired inherent ammonium transport activity. Urinary ammonium excretion was RHcg gene-dose dependent in mouse, highlighting MmRhcg as a limiting factor. HsRhCG(R202C may confer susceptibility to disorders leading to metabolic acidosis for instance. Finally, the analogous R211C mutation in the yeast ScMep2 homologue also impaired intrinsic activity consistent with a conserved functional role of the preserved arginine residue. The yeast expression assay used here constitutes an inexpensive, fast and easy tool to screen nsSNPs reported by high throughput sequencing or individual cases for functional alterations in Rh factors revealing potential causal variants.

  18. Functional characterization of genetic enzyme variations in human lipoxygenases

    Directory of Open Access Journals (Sweden)

    Thomas Horn

    2013-01-01

    Full Text Available Mammalian lipoxygenases play a role in normal cell development and differentiation but they have also been implicated in the pathogenesis of cardiovascular, hyperproliferative and neurodegenerative diseases. As lipid peroxidizing enzymes they are involved in the regulation of cellular redox homeostasis since they produce lipid hydroperoxides, which serve as an efficient source for free radicals. There are various epidemiological correlation studies relating naturally occurring variations in the six human lipoxygenase genes (SNPs or rare mutations to the frequency for various diseases in these individuals, but for most of the described variations no functional data are available. Employing a combined bioinformatical and enzymological strategy, which included structural modeling and experimental site-directed mutagenesis, we systematically explored the structural and functional consequences of non-synonymous genetic variations in four different human lipoxygenase genes (ALOX5, ALOX12, ALOX15, and ALOX15B that have been identified in the human 1000 genome project. Due to a lack of a functional expression system we resigned to analyze the functionality of genetic variations in the hALOX12B and hALOXE3 gene. We found that most of the frequent non-synonymous coding SNPs are located at the enzyme surface and hardly alter the enzyme functionality. In contrast, genetic variations which affect functional important amino acid residues or lead to truncated enzyme variations (nonsense mutations are usually rare with a global allele frequency<0.1%. This data suggest that there appears to be an evolutionary pressure on the coding regions of the lipoxygenase genes preventing the accumulation of loss-of-function variations in the human population.

  19. Porcine lung surfactant protein B gene (SFTPB)

    DEFF Research Database (Denmark)

    Cirera Salicio, Susanna; Fredholm, Merete

    2008-01-01

    The porcine surfactant protein B (SFTPB) is a single copy gene on chromosome 3. Three different cDNAs for the SFTPB have been isolated and sequenced. Nucleotide sequence comparison revealed six nonsynonymous single nucleotide polymorphisms (SNPs), four synonymous SNPs and an in-frame deletion of 69...... bp in the region coding for the active protein. Northern analysis showed lung-specific expression of three different isoforms of the SFTPB transcript. The expression level for the SFTPB gene is low in 50 days-old fetus and it increases during lung development. Quantitative real-time polymerase chain...

  20. Triplet-Based Codon Organization Optimizes the Impact of Synonymous Mutation on Nucleic Acid Molecular Dynamics.

    Science.gov (United States)

    Babbitt, Gregory A; Coppola, Erin E; Mortensen, Jamie S; Ekeren, Patrick X; Viola, Cosmo; Goldblatt, Dallan; Hudson, André O

    2018-02-01

    Since the elucidation of the genetic code almost 50 years ago, many nonrandom aspects of its codon organization remain only partly resolved. Here, we investigate the recent hypothesis of 'dual-use' codons which proposes that in addition to allowing adjustment of codon optimization to tRNA abundance, the degeneracy in the triplet-based genetic code also multiplexes information regarding DNA's helical shape and protein-binding dynamics while avoiding interference with other protein-level characteristics determined by amino acid properties. How such structural optimization of the code within eukaryotic chromatin could have arisen from an RNA world is a mystery, but would imply some preadaptation in an RNA context. We analyzed synonymous (protein-silent) and nonsynonymous (protein-altering) mutational impacts on molecular dynamics in 13823 identically degenerate alternative codon reorganizations, defined by codon transitions in 7680 GPU-accelerated molecular dynamic simulations of implicitly and explicitly solvated double-stranded aRNA and bDNA structures. When compared to all possible alternative codon assignments, the standard genetic code minimized the impact of synonymous mutations on the random atomic fluctuations and correlations of carbon backbone vector trajectories while facilitating the specific movements that contribute to DNA polymer flexibility. This trend was notably stronger in the context of RNA supporting the idea that dual-use codon optimization and informational multiplexing in DNA resulted from the preadaptation of the RNA duplex to resist changes to thermostability. The nonrandom and divergent molecular dynamics of synonymous mutations also imply that the triplet-based code may have resulted from adaptive functional expansion enabling a primordial doublet code to multiplex gene regulatory information via the shape and charge of the minor groove.

  1. On Coding Non-Contiguous Letter Combinations

    Directory of Open Access Journals (Sweden)

    Frédéric eDandurand

    2011-06-01

    Full Text Available Starting from the hypothesis that printed word identification initially involves the parallel mapping of visual features onto location-specific letter identities, we analyze the type of information that would be involved in optimally mapping this location-specific orthographic code onto a location-invariant lexical code. We assume that some intermediate level of coding exists between individual letters and whole words, and that this involves the representation of letter combinations. We then investigate the nature of this intermediate level of coding given the constraints of optimality. This intermediate level of coding is expected to compress data while retaining as much information as possible about word identity. Information conveyed by letters is a function of how much they constrain word identity and how visible they are. Optimization of this coding is a combination of minimizing resources (using the most compact representations and maximizing information. We show that in a large proportion of cases, non-contiguous letter sequences contain more information than contiguous sequences, while at the same time requiring less precise coding. Moreover, we found that the best predictor of human performance in orthographic priming experiments was within-word ranking of conditional probabilities, rather than average conditional probabilities. We conclude that from an optimality perspective, readers learn to select certain contiguous and non-contiguous letter combinations as information that provides the best cue to word identity.

  2. Non-binary Hybrid LDPC Codes: Structure, Decoding and Optimization

    OpenAIRE

    Sassatelli, Lucile; Declercq, David

    2007-01-01

    In this paper, we propose to study and optimize a very general class of LDPC codes whose variable nodes belong to finite sets with different orders. We named this class of codes Hybrid LDPC codes. Although efficient optimization techniques exist for binary LDPC codes and more recently for non-binary LDPC codes, they both exhibit drawbacks due to different reasons. Our goal is to capitalize on the advantages of both families by building codes with binary (or small finite set order) and non-bin...

  3. Inheritance-mode specific pathogenicity prioritization (ISPP) for human protein coding genes.

    Science.gov (United States)

    Hsu, Jacob Shujui; Kwan, Johnny S H; Pan, Zhicheng; Garcia-Barcelo, Maria-Mercè; Sham, Pak Chung; Li, Miaoxin

    2016-10-15

    Exome sequencing studies have facilitated the detection of causal genetic variants in yet-unsolved Mendelian diseases. However, the identification of disease causal genes among a list of candidates in an exome sequencing study is still not fully settled, and it is often difficult to prioritize candidate genes for follow-up studies. The inheritance mode provides crucial information for understanding Mendelian diseases, but none of the existing gene prioritization tools fully utilize this information. We examined the characteristics of Mendelian disease genes under different inheritance modes. The results suggest that Mendelian disease genes with autosomal dominant (AD) inheritance mode are more haploinsufficiency and de novo mutation sensitive, whereas those autosomal recessive (AR) genes have significantly more non-synonymous variants and regulatory transcript isoforms. In addition, the X-linked (XL) Mendelian disease genes have fewer non-synonymous and synonymous variants. As a result, we derived a new scoring system for prioritizing candidate genes for Mendelian diseases according to the inheritance mode. Our scoring system assigned to each annotated protein-coding gene (N = 18 859) three pathogenic scores according to the inheritance mode (AD, AR and XL). This inheritance mode-specific framework achieved higher accuracy (area under curve  = 0.84) in XL mode. The inheritance-mode specific pathogenicity prioritization (ISPP) outperformed other well-known methods including Haploinsufficiency, Recessive, Network centrality, Genic Intolerance, Gene Damage Index and Gene Constraint scores. This systematic study suggests that genes manifesting disease inheritance modes tend to have unique characteristics. ISPP is included in KGGSeq v1.0 (http://grass.cgs.hku.hk/limx/kggseq/), and source code is available from (https://github.com/jacobhsu35/ISPP.git). mxli@hku.hkSupplementary information: Supplementary data are available at Bioinformatics online. © The Author

  4. Metformin-Induced Changes of the Coding Transcriptome and Non-Coding RNAs in the Livers of Non-Alcoholic Fatty Liver Disease Mice.

    Science.gov (United States)

    Guo, Jun; Zhou, Yuan; Cheng, Yafen; Fang, Weiwei; Hu, Gang; Wei, Jie; Lin, Yajun; Man, Yong; Guo, Lixin; Sun, Mingxiao; Cui, Qinghua; Li, Jian

    2018-01-01

    Recent studies have suggested that changes in non-coding mRNA play a key role in the progression of non-alcoholic fatty liver disease (NAFLD). Metformin is now recommended and effective for the treatment of NAFLD. We hope the current analyses of the non-coding mRNA transcriptome will provide a better presentation of the potential roles of mRNAs and long non-coding RNAs (lncRNAs) that underlie NAFLD and metformin intervention. The present study mainly analysed changes in the coding transcriptome and non-coding RNAs after the application of a five-week metformin intervention. Liver samples from three groups of mice were harvested for transcriptome profiling, which covered mRNA, lncRNA, microRNA (miRNA) and circular RNA (circRNA), using a microarray technique. A systematic alleviation of high-fat diet (HFD)-induced transcriptome alterations by metformin was observed. The metformin treatment largely reversed the correlations with diabetes-related pathways. Our analysis also suggested interaction networks between differentially expressed lncRNAs and known hepatic disease genes and interactions between circRNA and their disease-related miRNA partners. Eight HFD-responsive lncRNAs and three metformin-responsive lncRNAs were noted due to their widespread associations with disease genes. Moreover, seven miRNAs that interacted with multiple differentially expressed circRNAs were highlighted because they were likely to be associated with metabolic or liver diseases. The present study identified novel changes in the coding transcriptome and non-coding RNAs in the livers of NAFLD mice after metformin treatment that might shed light on the underlying mechanism by which metformin impedes the progression of NAFLD. © 2018 The Author(s). Published by S. Karger AG, Basel.

  5. Polymorphisms in the Tlr4 and Tlr5 Gene Are Significantly Associated with Inflammatory Bowel Disease in German Shepherd Dogs

    Science.gov (United States)

    Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin

    2010-01-01

    Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease. PMID:21203467

  6. Non-Binary Protograph-Based LDPC Codes: Analysis,Enumerators and Designs

    OpenAIRE

    Sun, Yizeng

    2013-01-01

    Non-binary LDPC codes can outperform binary LDPC codes using sum-product algorithm with higher computation complexity. Non-binary LDPC codes based on protographs have the advantage of simple hardware architecture. In the first part of this thesis, we will use EXIT chart analysis to compute the thresholds of different protographs over GF(q). Based on threshold computation, some non-binary protograph-based LDPC codes are designed and their frame error rates are compared with binary LDPC codes. ...

  7. Age-related macular degeneration-associated silent polymorphisms in HtrA1 impair its ability to antagonize insulin-like growth factor 1.

    Science.gov (United States)

    Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius

    2013-05-01

    Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.

  8. Long non-coding RNAs: Mechanism of action and functional utility

    OpenAIRE

    Bhat, Shakil Ahmad; Ahmad, Syed Mudasir; Mumtaz, Peerzada Tajamul; Malik, Abrar Ahad; Dar, Mashooq Ahmad; Urwat, Uneeb; Shah, Riaz Ahmad; Ganai, Nazir Ahmad

    2016-01-01

    Recent RNA sequencing studies have revealed that most of the human genome is transcribed, but very little of the total transcriptomes has the ability to encode proteins. Long non-coding RNAs (lncRNAs) are non-coding transcripts longer than 200 nucleotides. Members of the non-coding genome include microRNA (miRNA), small regulatory RNAs and other short RNAs. Most of long non-coding RNA (lncRNAs) are poorly annotated. Recent recognition about lncRNAs highlights their effects in many biological ...

  9. V-MitoSNP: visualization of human mitochondrial SNPs

    Directory of Open Access Journals (Sweden)

    Tsui Ke-Hung

    2006-08-01

    Full Text Available Abstract Background Mitochondrial single nucleotide polymorphisms (mtSNPs constitute important data when trying to shed some light on human diseases and cancers. Unfortunately, providing relevant mtSNP genotyping information in mtDNA databases in a neatly organized and transparent visual manner still remains a challenge. Amongst the many methods reported for SNP genotyping, determining the restriction fragment length polymorphisms (RFLPs is still one of the most convenient and cost-saving methods. In this study, we prepared the visualization of the mtDNA genome in a way, which integrates the RFLP genotyping information with mitochondria related cancers and diseases in a user-friendly, intuitive and interactive manner. The inherent problem associated with mtDNA sequences in BLAST of the NCBI database was also solved. Description V-MitoSNP provides complete mtSNP information for four different kinds of inputs: (1 color-coded visual input by selecting genes of interest on the genome graph, (2 keyword search by locus, disease and mtSNP rs# ID, (3 visualized input of nucleotide range by clicking the selected region of the mtDNA sequence, and (4 sequences mtBLAST. The V-MitoSNP output provides 500 bp (base pairs flanking sequences for each SNP coupled with the RFLP enzyme and the corresponding natural or mismatched primer sets. The output format enables users to see the SNP genotype pattern of the RFLP by virtual electrophoresis of each mtSNP. The rate of successful design of enzymes and primers for RFLPs in all mtSNPs was 99.1%. The RFLP information was validated by actual agarose electrophoresis and showed successful results for all mtSNPs tested. The mtBLAST function in V-MitoSNP provides the gene information within the input sequence rather than providing the complete mitochondrial chromosome as in the NCBI BLAST database. All mtSNPs with rs number entries in NCBI are integrated in the corresponding SNP in V-MitoSNP. Conclusion V-MitoSNP is a web

  10. Investigating the impact of missense mutations in hCES1 by in silico structure-based approaches

    DEFF Research Database (Denmark)

    Nzabonimpa, Grace Shema; Rasmussen, Henrik Berg; Brunak, Søren

    2016-01-01

    Genetic variations in drug-metabolizing enzymes have been reported to influence pharmacokinetics, drug dosage and other aspects that affect therapeutic outcomes. Most particularly, non-synonymous single-nucleotide polymorphisms (nsSNPs) resulting in amino acid changes disrupt potential functional...

  11. Bistability in self-activating genes regulated by non-coding RNAs

    International Nuclear Information System (INIS)

    Miro-Bueno, Jesus

    2015-01-01

    Non-coding RNA molecules are able to regulate gene expression and play an essential role in cells. On the other hand, bistability is an important behaviour of genetic networks. Here, we propose and study an ODE model in order to show how non-coding RNA can produce bistability in a simple way. The model comprises a single gene with positive feedback that is repressed by non-coding RNA molecules. We show how the values of all the reaction rates involved in the model are able to control the transitions between the high and low states. This new model can be interesting to clarify the role of non-coding RNA molecules in genetic networks. As well, these results can be interesting in synthetic biology for developing new genetic memories and biomolecular devices based on non-coding RNAs

  12. Comprehensive search for intra- and inter-specific sequence polymorphisms among coding envelope genes of retroviral origin found in the human genome: genes and pseudogenes

    Directory of Open Access Journals (Sweden)

    Vasilescu Alexandre

    2005-09-01

    Full Text Available Abstract Background The human genome carries a high load of proviral-like sequences, called Human Endogenous Retroviruses (HERVs, which are the genomic traces of ancient infections by active retroviruses. These elements are in most cases defective, but open reading frames can still be found for the retroviral envelope gene, with sixteen such genes identified so far. Several of them are conserved during primate evolution, having possibly been co-opted by their host for a physiological role. Results To characterize further their status, we presently sequenced 12 of these genes from a panel of 91 Caucasian individuals. Genomic analyses reveal strong sequence conservation (only two non synonymous Single Nucleotide Polymorphisms [SNPs] for the two HERV-W and HERV-FRD envelope genes, i.e. for the two genes specifically expressed in the placenta and possibly involved in syncytiotrophoblast formation. We further show – using an ex vivo fusion assay for each allelic form – that none of these SNPs impairs the fusogenic function. The other envelope proteins disclose variable polymorphisms, with the occurrence of a stop codon and/or frameshift for most – but not all – of them. Moreover, the sequence conservation analysis of the orthologous genes that can be found in primates shows that three env genes have been maintained in a fully coding state throughout evolution including envW and envFRD. Conclusion Altogether, the present study strongly suggests that some but not all envelope encoding sequences are bona fide genes. It also provides new tools to elucidate the possible role of endogenous envelope proteins as susceptibility factors in a number of pathologies where HERVs have been suspected to be involved.

  13. Weight Distribution for Non-binary Cluster LDPC Code Ensemble

    Science.gov (United States)

    Nozaki, Takayuki; Maehara, Masaki; Kasai, Kenta; Sakaniwa, Kohichi

    In this paper, we derive the average weight distributions for the irregular non-binary cluster low-density parity-check (LDPC) code ensembles. Moreover, we give the exponential growth rate of the average weight distribution in the limit of large code length. We show that there exist $(2,d_c)$-regular non-binary cluster LDPC code ensembles whose normalized typical minimum distances are strictly positive.

  14. Short-lived non-coding transcripts (SLiTs): Clues to regulatory long non-coding RNA.

    Science.gov (United States)

    Tani, Hidenori

    2017-03-22

    Whole transcriptome analyses have revealed a large number of novel long non-coding RNAs (lncRNAs). Although the importance of lncRNAs has been documented in previous reports, the biological and physiological functions of lncRNAs remain largely unknown. The role of lncRNAs seems an elusive problem. Here, I propose a clue to the identification of regulatory lncRNAs. The key point is RNA half-life. RNAs with a long half-life (t 1/2 > 4 h) contain a significant proportion of ncRNAs, as well as mRNAs involved in housekeeping functions, whereas RNAs with a short half-life (t 1/2 regulatory ncRNAs and regulatory mRNAs. This novel class of ncRNAs with a short half-life can be categorized as Short-Lived non-coding Transcripts (SLiTs). I consider that SLiTs are likely to be rich in functionally uncharacterized regulatory RNAs. This review describes recent progress in research into SLiTs.

  15. Binary Linear-Time Erasure Decoding for Non-Binary LDPC codes

    OpenAIRE

    Savin, Valentin

    2009-01-01

    In this paper, we first introduce the extended binary representation of non-binary codes, which corresponds to a covering graph of the bipartite graph associated with the non-binary code. Then we show that non-binary codewords correspond to binary codewords of the extended representation that further satisfy some simplex-constraint: that is, bits lying over the same symbol-node of the non-binary graph must form a codeword of a simplex code. Applied to the binary erasure channel, this descript...

  16. Single nucleotide polymorphism analysis of Korean native chickens using next generation sequencing data.

    Science.gov (United States)

    Seo, Dong-Won; Oh, Jae-Don; Jin, Shil; Song, Ki-Duk; Park, Hee-Bok; Heo, Kang-Nyeong; Shin, Younhee; Jung, Myunghee; Park, Junhyung; Jo, Cheorun; Lee, Hak-Kyo; Lee, Jun-Heon

    2015-02-01

    There are five native chicken lines in Korea, which are mainly classified by plumage colors (black, white, red, yellow, gray). These five lines are very important genetic resources in the Korean poultry industry. Based on a next generation sequencing technology, whole genome sequence and reference assemblies were performed using Gallus_gallus_4.0 (NCBI) with whole genome sequences from these lines to identify common and novel single nucleotide polymorphisms (SNPs). We obtained 36,660,731,136 ± 1,257,159,120 bp of raw sequence and average 26.6-fold of 25-29 billion reference assembly sequences representing 97.288 % coverage. Also, 4,006,068 ± 97,534 SNPs were observed from 29 autosomes and the Z chromosome and, of these, 752,309 SNPs are the common SNPs across lines. Among the identified SNPs, the number of novel- and known-location assigned SNPs was 1,047,951 ± 14,956 and 2,948,648 ± 81,414, respectively. The number of unassigned known SNPs was 1,181 ± 150 and unassigned novel SNPs was 8,238 ± 1,019. Synonymous SNPs, non-synonymous SNPs, and SNPs having character changes were 26,266 ± 1,456, 11,467 ± 604, 8,180 ± 458, respectively. Overall, 443,048 ± 26,389 SNPs in each bird were identified by comparing with dbSNP in NCBI. The presently obtained genome sequence and SNP information in Korean native chickens have wide applications for further genome studies such as genetic diversity studies to detect causative mutations for economic and disease related traits.

  17. Linear-Time Non-Malleable Codes in the Bit-Wise Independent Tampering Model

    DEFF Research Database (Denmark)

    Cramer, Ronald; Damgård, Ivan Bjerre; Döttling, Nico

    Non-malleable codes were introduced by Dziembowski et al. (ICS 2010) as coding schemes that protect a message against tampering attacks. Roughly speaking, a code is non-malleable if decoding an adversarially tampered encoding of a message m produces the original message m or a value m' (eventuall...... non-malleable codes of Agrawal et al. (TCC 2015) and of Cher- aghchi and Guruswami (TCC 2014) and improves the previous result in the bit-wise tampering model: it builds the first non-malleable codes with linear-time complexity and optimal-rate (i.e. rate 1 - o(1)).......Non-malleable codes were introduced by Dziembowski et al. (ICS 2010) as coding schemes that protect a message against tampering attacks. Roughly speaking, a code is non-malleable if decoding an adversarially tampered encoding of a message m produces the original message m or a value m' (eventually...... abort) completely unrelated with m. It is known that non-malleability is possible only for restricted classes of tampering functions. Since their introduction, a long line of works has established feasibility results of non-malleable codes against different families of tampering functions. However...

  18. Non-Coding RNAs in Arabidopsis

    DEFF Research Database (Denmark)

    van Wonterghem, Miranda

    This work evolves around elucidating the mechanisms of micro RNAs (miRNAs) in Arabidopsis thaliana. I identified a new class of nuclear non-coding RNAs derived from protein coding genes. The genes are miRNA targets with extensive gene body methylation. The RNA species are nuclear localized and de...

  19. Analysis of the CCR5 gene coding region diversity in five South American populations reveals two new non-synonymous alleles in Amerindians and high CCR5*D32 frequency in Euro-Brazilians

    Directory of Open Access Journals (Sweden)

    Angelica B.W. Boldt

    2009-01-01

    Full Text Available The CC chemokine receptor 5 (CCR5 molecule is an important co-receptor for HIV. The effect of the CCR5*D32 allele in susceptibility to HIV infection and AIDS disease is well known. Other alleles than CCR5*D32 have not been analysed before, neither in Amerindians nor in the majority of the populations all over the world. We investigated the distribution of the CCR5 coding region alleles in South Brazil and noticed a high CCR5*D32 frequency in the Euro-Brazilian population of the Paraná State (9.3%, which is the highest thus far reported for Latin America. The D32 frequency is even higher among the Euro-Brazilian Mennonites (14.2%. This allele is uncommon in Afro-Brazilians (2.0%, rare in the Guarani Amerindians (0.4% and absent in the Kaingang Amerindians and the Oriental-Brazilians. R223Q is common in the Oriental-Brazilians (7.7% and R60S in the Afro-Brazilians (5.0%. A29S and L55Q present an impaired response to b-chemokines and occurred in Afro- and Euro-Brazilians with cumulative frequencies of 4.4% and 2.7%, respectively. Two new non-synonymous alleles were found in Amerindians: C323F (g.3729G > T in Guarani (1.4% and Y68C (g.2964A > G in Kaingang (10.3%. The functional characteristics of these alleles should be defined and considered in epidemiological investigations about HIV-1 infection and AIDS incidence in Amerindian populations.

  20. Five new synonyms in Epimedium (Berberidaceae) from China.

    Science.gov (United States)

    Zhang, Yanjun; Dang, Haishan; Li, Shengyu; Li, Jianqiang; Wang, Ying

    2015-01-01

    Five new synonyms in Chinese Epimedium are designated in the present paper. Epimediumchlorandrum is treated as a synonym of Epimediumacuminatum; Epimediumrhizomatosum as a synonym of Epimediummembranaceum; Epimediumbrachyrrhizum as a synonym of Epimediumleptorrhizum; Epimediumdewuense as a synonym of Epimediumdolichostemon; and Epimediumsagittatumvar.oblongifoliolatum as a synonym of Epimediumborealiguizhouense.

  1. DEFLATE Compression Algorithm Corrects for Overestimation of Phylogenetic Diversity by Grantham Approach to Single-Nucleotide Polymorphism Classification

    Directory of Open Access Journals (Sweden)

    Arran Schlosberg

    2014-05-01

    Full Text Available Improvements in speed and cost of genome sequencing are resulting in increasing numbers of novel non-synonymous single nucleotide polymorphisms (nsSNPs in genes known to be associated with disease. The large number of nsSNPs makes laboratory-based classification infeasible and familial co-segregation with disease is not always possible. In-silico methods for classification or triage are thus utilised. A popular tool based on multiple-species sequence alignments (MSAs and work by Grantham, Align-GVGD, has been shown to underestimate deleterious effects, particularly as sequence numbers increase. We utilised the DEFLATE compression algorithm to account for expected variation across a number of species. With the adjusted Grantham measure we derived a means of quantitatively clustering known neutral and deleterious nsSNPs from the same gene; this was then used to assign novel variants to the most appropriate cluster as a means of binary classification. Scaling of clusters allows for inter-gene comparison of variants through a single pathogenicity score. The approach improves upon the classification accuracy of Align-GVGD while correcting for sensitivity to large MSAs. Open-source code and a web server are made available at https://github.com/aschlosberg/CompressGV.

  2. Five new synonyms in Epimedium ( Berberidaceae ) from China

    OpenAIRE

    Zhang, Yanjun; Dang, Haishan; Li, Shengyu; Li, Jianqiang; Wang, Ying

    2015-01-01

    Abstract Five new synonyms in Chinese Epimedium are designated in the present paper. Epimedium chlorandrum is treated as a synonym of Epimedium acuminatum ; Epimedium rhizomatosum as a synonym of Epimedium membranaceum ; Epimedium brachyrrhizum as a synonym of Epimedium leptorrhizum ; Epimedium dewuense as a synonym of Epimedium dolichostemon ; and Epimedium sagittatum var. oblongifoliolatum as a synonym of Epimedium borealiguizhouense .

  3. Five new synonyms in Epimedium (Berberidaceae from China

    Directory of Open Access Journals (Sweden)

    Yanjun Zhang

    2015-04-01

    Full Text Available Five new synonyms in Chinese Epimedium are designated in the present paper. Epimedium chlorandrum is treated as a synonym of E. acuminatum; Epimedium rhizomatosum as a synonym of E. membranaceum; Epimedium brachyrrhizum as a synonym of E. leptorrhizum; Epimedium dewuense as a synonym of E. dolichostemon; and Epimedium sagittatum var. oblongifoliolatum as a synonym of E. borealiguizhouense.

  4. Code of practice for safety in laboratory - non ionising radiation

    International Nuclear Information System (INIS)

    Ramli Jaya; Mohd Yusof Mohd Ali; Khoo Boo Huat; Khatijah Hashim

    1995-01-01

    The code identifies the non-ionizing radiation encountered in laboratories and the associated hazards. The code is intended as a laboratory standard reference document for general information on safety requirements relating to the usage of non-ionizing radiations in laboratories. The nonionizing radiations cover in this code, namely, are ultraviolet radiation, visible light, radio-frequency radiation, lasers, sound waves and ultrasonic radiation. (author)

  5. Single-nucleotide variations in the genes encoding the mitochondrial Hsp60/Hsp10 chaperone system and their disease-causing potential

    DEFF Research Database (Denmark)

    Bross, Peter; Li, Zhijie; Hansen, Jakob

    2007-01-01

    for variations in the HSPD1 and HSPE1 genes encoding the mitochondrial Hsp60/Hsp10 chaperone complex: two patients with multiple mitochondrial enzyme deficiency, 61 sudden infant death syndrome cases (MIM: #272120), and 60 patients presenting with ethylmalonic aciduria carrying non-synonymous susceptibility...... variations in the ACADS gene (MIM: *606885 and #201470). Besides previously reported variations we detected six novel variations: two in the bidirectional promoter region, and one synonymous and three non-synonymous variations in the HSPD1 coding region. One of the non-synonymous variations was polymorphic...... in patient and control samples, and the rare variations were each only found in single patients and absent in 100 control chromosomes. Functional investigation of the effects of the variations in the promoter region and the non-synonymous variations in the coding region indicated that none of them had...

  6. Exome sequencing in a breast canner family without BRCA mutation

    Energy Technology Data Exchange (ETDEWEB)

    Noh, Jae Myoung; Choi, Doo Ho; Park, Won; Huh, Seung Jae [Dept. of Radiation Oncology, amsung Medical Center, Sungkyunkwan University School of Medicine, Seoul (Korea, Republic of); Kim, Ji Hun; Cho, Dae Yeon [LabGenomics Clinical Research Institute, LabGenomics, Seongnam (Korea, Republic of)

    2015-06-15

    We performed exome sequencing in a breast cancer family without BRCA mutations. A family that three sisters have a history of breast cancer was selected for analysis. There were no family members with breast cancer in the previous generation. Genetic testing for BRCA mutation was negative, even by the multiplex ligation-dependent probe amplification method. Two sisters with breast cancer were selected as affected members, while the mother of the sisters was a non-affected member. Whole exome sequencing was performed on the HiSeq 2000 platform with paired-end reads of 101 bp in the three members. We identified 19,436, 19,468, and 19,345 single-nucleotide polymorphisms (SNPs) in the coding regions. Among them, 8,759, 8,789, and 8,772 were non-synonymous SNPs, respectively. After filtering out 12,843 synonymous variations and 12,105 known variations with indels found in the dbSNP135 or 1000 Genomes Project database, we selected 73 variations in the samples from the affected sisters that did not occur in the sample from the unaffected mother. Using the Sorting Intolerant From Tolerant (SIFT), PolyPhen-2, and MutationTaster algorithms to predict amino acid substitutions, the XCR1, DLL1, TH, ACCS, SPPL3, CCNF, and SRL genes were risky among all three algorithms, while definite candidate genes could not be conclusively determined. Using exome sequencing, we found 7 variants for a breast cancer family without BRCA mutations. Genetic evidence of disease association should be confirmed by future studies.

  7. SNP discovery in the transcriptome of white Pacific shrimp Litopenaeus vannamei by next generation sequencing.

    Directory of Open Access Journals (Sweden)

    Yang Yu

    Full Text Available The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies.

  8. Long non-coding RNAs: Mechanism of action and functional utility

    Directory of Open Access Journals (Sweden)

    Shakil Ahmad Bhat

    2016-10-01

    Full Text Available Recent RNA sequencing studies have revealed that most of the human genome is transcribed, but very little of the total transcriptomes has the ability to encode proteins. Long non-coding RNAs (lncRNAs are non-coding transcripts longer than 200 nucleotides. Members of the non-coding genome include microRNA (miRNA, small regulatory RNAs and other short RNAs. Most of long non-coding RNA (lncRNAs are poorly annotated. Recent recognition about lncRNAs highlights their effects in many biological and pathological processes. LncRNAs are dysfunctional in a variety of human diseases varying from cancerous to non-cancerous diseases. Characterization of these lncRNA genes and their modes of action may allow their use for diagnosis, monitoring of progression and targeted therapies in various diseases. In this review, we summarize the functional perspectives as well as the mechanism of action of lncRNAs. Keywords: LncRNA, X-chromosome inactivation, Genome imprinting, Transcription regulation, Cancer, Immunity

  9. Synonymy and synonyms in Lexicography

    DEFF Research Database (Denmark)

    Bergenholtz, Henning; Gouws, Rufus

    2012-01-01

    In this paper we work with the assumption that items giving synonyms in dictionaries are primarily of assistance in the case of text production problems. We assume furthermore that synonymy does not prevail between lexemes but rather between textual items in concrete texts. Accordingly a rich...... selection of synonyms in text production dictionaries will offer the possibility to select the appropriate item – but only for mother-tongue speakers. We are not discussing items giving synonyms in learners' dictionaries and school dictionaries. From a selection of existing dictionaries it shows, as could...... be expected, that there is no uniform lexicographic practice but also numerous ways of dealing with synonyms that offers very little assistance to the intended target users of a specific dictionary. This could be due to the fact that too often the inclusion and presentation of synonyms are done without taking...

  10. Development of non-linear vibration analysis code for CANDU fuelling machine

    International Nuclear Information System (INIS)

    Murakami, Hajime; Hirai, Takeshi; Horikoshi, Kiyomi; Mizukoshi, Kaoru; Takenaka, Yasuo; Suzuki, Norio.

    1988-01-01

    This paper describes the development of a non-linear, dynamic analysis code for the CANDU 600 fuelling machine (F-M), which includes a number of non-linearities such as gap with or without Coulomb friction, special multi-linear spring connections, etc. The capabilities and features of the code and the mathematical treatment for the non-linearities are explained. The modeling and numerical methodology for the non-linearities employed in the code are verified experimentally. Finally, the simulation analyses for the full-scale F-M vibration testing are carried out, and the applicability of the code to such multi-degree of freedom systems as F-M is demonstrated. (author)

  11. Non-Coding Transcript Heterogeneity in Mesothelioma: Insights from Asbestos-Exposed Mice.

    Science.gov (United States)

    Felley-Bosco, Emanuela; Rehrauer, Hubert

    2018-04-11

    Mesothelioma is an aggressive, rapidly fatal cancer and a better understanding of its molecular heterogeneity may help with making more efficient therapeutic strategies. Non-coding RNAs represent a larger part of the transcriptome but their contribution to diseases is not fully understood yet. We used recently obtained RNA-seq data from asbestos-exposed mice and performed data mining of publicly available datasets in order to evaluate how non-coding RNA contribute to mesothelioma heterogeneity. Nine non-coding RNAs are specifically elevated in mesothelioma tumors and contribute to human mesothelioma heterogeneity. Because some of them have known oncogenic properties, this study supports the concept of non-coding RNAs as cancer progenitor genes.

  12. Annotating non-coding regions of the genome.

    Science.gov (United States)

    Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B

    2010-08-01

    Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.

  13. Min-Max decoding for non binary LDPC codes

    OpenAIRE

    Savin, Valentin

    2008-01-01

    Iterative decoding of non-binary LDPC codes is currently performed using either the Sum-Product or the Min-Sum algorithms or slightly different versions of them. In this paper, several low-complexity quasi-optimal iterative algorithms are proposed for decoding non-binary codes. The Min-Max algorithm is one of them and it has the benefit of two possible LLR domain implementations: a standard implementation, whose complexity scales as the square of the Galois field's cardinality and a reduced c...

  14. Candidate chromosome 1 disease susceptibility genes for Sjogren’s syndrome xerostomia are narrowed by novel NOD.B10 congenic mice

    Science.gov (United States)

    Mongini, Patricia K. A.; Kramer, Jill M.; Ishikawa, Tomo-o; Herschman, Harvey; Esposito, Donna

    2014-01-01

    Sjogren’s syndrome (SS) is characterized by salivary gland leukocytic infiltrates and impaired salivation (xerostomia). Cox-2 (Ptgs2) is located on chromosome 1 within the span of the Aec2 region. In an attempt to demonstrate that COX-2 drives antibody-dependent hyposalivation, NOD.B10 congenic mice bearing a Cox-2flox gene were generated. A congenic line with non-NOD alleles in Cox-2-flanking genes failed manifest xerostomia. Further backcrossing yielded disease-susceptible NOD.B10 Cox-2flox lines; fine genetic mapping determined that critical Aec2 genes lie within a 1.56 to 2.17 Mb span of DNA downstream of Cox-2. Bioinformatics analysis revealed that susceptible and non-susceptible lines exhibit non-synonymous coding SNPs in 8 protein-encoding genes of this region, thereby better delineating candidate Aec2 alleles needed for SS xerostomia. PMID:24685748

  15. Practice of calculation of neutron-physical characteristics of reactors and radiating shielding in structure SNPS with program complex MCNP

    International Nuclear Information System (INIS)

    Krotov, A.D.; Son'ko, A.V.

    2009-01-01

    Calculation of neutron-physical properties and radiation protection of space power reactor was made by means of the MCNP code allowing simulation of neutron, γ- and electron transport by the Monte Carlo method in the systems with combined geometry. Universality of the MCNP code has been demonstrated both for the calculation of reactor-converter so for the optimization of radiation protection that allows to reserve a new level of complex simulation of SNPS [ru

  16. The Selective Advantage of Synonymous Codon Usage Bias in Salmonella.

    Directory of Open Access Journals (Sweden)

    Gerrit Brandis

    2016-03-01

    Full Text Available The genetic code in mRNA is redundant, with 61 sense codons translated into 20 different amino acids. Individual amino acids are encoded by up to six different codons but within codon families some are used more frequently than others. This phenomenon is referred to as synonymous codon usage bias. The genomes of free-living unicellular organisms such as bacteria have an extreme codon usage bias and the degree of bias differs between genes within the same genome. The strong positive correlation between codon usage bias and gene expression levels in many microorganisms is attributed to selection for translational efficiency. However, this putative selective advantage has never been measured in bacteria and theoretical estimates vary widely. By systematically exchanging optimal codons for synonymous codons in the tuf genes we quantified the selective advantage of biased codon usage in highly expressed genes to be in the range 0.2-4.2 x 10-4 per codon per generation. These data quantify for the first time the potential for selection on synonymous codon choice to drive genome-wide sequence evolution in bacteria, and in particular to optimize the sequences of highly expressed genes. This quantification may have predictive applications in the design of synthetic genes and for heterologous gene expression in biotechnology.

  17. Functional interrogation of non-coding DNA through CRISPR genome editing.

    Science.gov (United States)

    Canver, Matthew C; Bauer, Daniel E; Orkin, Stuart H

    2017-05-15

    Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. Copyright © 2017 Elsevier Inc. All rights reserved.

  18. Retrotransposons and non-protein coding RNAs

    DEFF Research Database (Denmark)

    Mourier, Tobias; Willerslev, Eske

    2009-01-01

    does not merely represent spurious transcription. We review examples of functional RNAs transcribed from retrotransposons, and address the collection of non-protein coding RNAs derived from transposable element sequences, including numerous human microRNAs and the neuronal BC RNAs. Finally, we review...

  19. Consortium analysis of 7 candidate SNPs for ovarian cancer

    DEFF Research Database (Denmark)

    Ramus, S.J.; Vierkant, R.A.; Johnatty, S.E.

    2008-01-01

    The Ovarian Cancer Association Consortium selected 7 candidate single nucleotide polymorphisms (SNPs), for which there is evidence from previous studies of an association with variation in ovarian cancer or breast cancer risks. The SNPs selected for analysis were F31I (rs2273535) in AURKA, N372H...... (rs144848) in BRCA2, rs2854344 in intron 17 of RB1, rs2811712 5' flanking CDKN2A, rs523349 in the 3' UTR of SRD5A2, D302H (rs1045485) in CASP8 and L10P (rs1982073) in TGFB1. Fourteen studies genotyped 4,624 invasive epithelial ovarian cancer cases and 8,113 controls of white non-Hispanic origin...... was suggestive although no longer statistically significant (ordinal OR 0.92, 95% CI 0.79-1.06). This SNP has also been shown to have an association with decreased risk in breast cancer. There was a suggestion of an association for AURKA, when one study that caused significant study heterogeneity was excluded...

  20. Analysis of the genome-wide variations among multiple strains of the plant pathogenic bacterium Xylella fastidiosa

    Directory of Open Access Journals (Sweden)

    Walker M Andrew

    2006-09-01

    Full Text Available Abstract Background The Gram-negative, xylem-limited phytopathogenic bacterium Xylella fastidiosa is responsible for causing economically important diseases in grapevine, citrus and many other plant species. Despite its economic impact, relatively little is known about the genomic variations among strains isolated from different hosts and their influence on the population genetics of this pathogen. With the availability of genome sequence information for four strains, it is now possible to perform genome-wide analyses to identify and categorize such DNA variations and to understand their influence on strain functional divergence. Results There are 1,579 genes and 194 non-coding homologous sequences present in the genomes of all four strains, representing a 76. 2% conservation of the sequenced genome. About 60% of the X. fastidiosa unique sequences exist as tandem gene clusters of 6 or more genes. Multiple alignments identified 12,754 SNPs and 14,449 INDELs in the 1528 common genes and 20,779 SNPs and 10,075 INDELs in the 194 non-coding sequences. The average SNP frequency was 1.08 × 10-2 per base pair of DNA and the average INDEL frequency was 2.06 × 10-2 per base pair of DNA. On an average, 60.33% of the SNPs were synonymous type while 39.67% were non-synonymous type. The mutation frequency, primarily in the form of external INDELs was the main type of sequence variation. The relative similarity between the strains was discussed according to the INDEL and SNP differences. The number of genes unique to each strain were 60 (9a5c, 54 (Dixon, 83 (Ann1 and 9 (Temecula-1. A sub-set of the strain specific genes showed significant differences in terms of their codon usage and GC composition from the native genes suggesting their xenologous origin. Tandem repeat analysis of the genomic sequences of the four strains identified associations of repeat sequences with hypothetical and phage related functions. Conclusion INDELs and strain specific genes

  1. Non-coding RNA in Deinococcus radiodurans

    International Nuclear Information System (INIS)

    Chen Zhongzhong; Wang Liangyan; Lin Jun; Tian Bing; Hua Yuejin

    2006-01-01

    Researches on DNA damage and repair pathways of Deinococcus radiodurans show its extreme resistance to ionizing radiation, ultraviolet radiation and reactive oxygen species. Non-coding (ncRNA) RNAs are involved in a variety of processes such as transcriptional regulations, RNA processing and modification, mRNA translation, protein transportation and stability. The conserved secondary structures of intergenic regions of Deinococcus radiodurans R1 were predicted using Stochastic Context Free Grammar (SCFG) scan strategy. Results showed that 28 ncRNA families were present in the non-coding regions of the genome of Deinococcus radiodurans R1. Among these families, IRE is the largest family, followed by Histone3, tRNA, SECIS. DicF, ctRNA-pGA1 and tmRNA are one discovered in bacteria. Results from the comparison with other organisms showed that these ncRNA can be applied to the study of biological function of Deinococcus radiodurans and supply reference for the further study of DNA damage and repair mechanisms of this bacterium. (authors)

  2. The systematic position of the Cladrastis Rafin. genus: history of research, synonyms, place in modern phylogenetic systems

    Directory of Open Access Journals (Sweden)

    Porokhniava Olga L.

    2015-12-01

    C. kentukea has many synonyms, which were relevant in different historical periods. The correct and current, according to the rules of the International Code of the botanical nomenclature, is the name Cladrastis kentukea (Dum. - Cours. Rudd, established by V. E. Rudd in 1972.

  3. Toward understanding non-coding RNA roles in intracranial aneurysms and subarachnoid hemorrhage

    Directory of Open Access Journals (Sweden)

    Huang Fengzhen

    2017-05-01

    Full Text Available Subarachnoid hemorrhage (SAH is a common and frequently life-threatening cerebrovascular disease, which is mostly related with a ruptured intracranial aneurysm. Its complications include rebleeding, early brain injury, cerebral vasospasm, delayed cerebral ischemia, chronic hydrocephalus, and also non neurological problems. Non-coding RNAs (ncRNAs, comprising of microRNAs (miRNAs, small interfering RNAs (siRNAs and long non-coding RNAs (lncRNAs, play an important role in intracranial aneurysms and SAH. Here, we review the non-coding RNAs expression profile and their related mechanisms in intracranial aneurysms and SAH. Moreover, we suggest that these non-coding RNAs function as novel molecular biomarkers to predict intracranial aneurysms and SAH, and may yield new therapies after SAH in the future.

  4. Investigation of SNPs in the porcine desmoglein 1 gene

    DEFF Research Database (Denmark)

    Daugaard, L.; Andresen, Lars Ole; Fredholm, M.

    2007-01-01

    epidermitis were diagnosed clinically as affected or unaffected. Two regions of the desmoglein I gene were sequenced and genotypes of the SNPs were established. Seven SNPs (823T>C, 828A>G, 829A>G, 830A>T, 831A>T, 838A>C and 1139C>T) were found in the analysed sequences and the allele frequencies were...... the location of single nucleotide polymorphisms (SNPs) in the porcine desmoglein I gene (PIG)DSGI in correlation to the cleavage site as well as if the genotype of the SNPs is correlated to susceptibility or resistance to the disease. Results: DNA from 32 affected and 32 unaffected piglets with exudative...... the genotypes of two out of seven SNPs found in the porcine desmoglein I gene and the susceptibility to exudative epidermitis....

  5. In Silico Analysis of FMR1 Gene Missense SNPs.

    Science.gov (United States)

    Tekcan, Akin

    2016-06-01

    The FMR1 gene, a member of the fragile X-related gene family, is responsible for fragile X syndrome (FXS). Missense single-nucleotide polymorphisms (SNPs) are responsible for many complex diseases. The effect of FMR1 gene missense SNPs is unknown. The aim of this study, using in silico techniques, was to analyze all known missense mutations that can affect the functionality of the FMR1 gene, leading to mental retardation (MR) and FXS. Data on the human FMR1 gene were collected from the Ensembl database (release 81), National Centre for Biological Information dbSNP Short Genetic Variations database, 1000 Genomes Browser, and NHLBI Exome Sequencing Project Exome Variant Server. In silico analysis was then performed. One hundred-twenty different missense SNPs of the FMR1 gene were determined. Of these, 11.66 % of the FMR1 gene missense SNPs were in highly conserved domains, and 83.33 % were in domains with high variety. The results of the in silico prediction analysis showed that 31.66 % of the FMR1 gene SNPs were disease related and that 50 % of SNPs had a pathogenic effect. The results of the structural and functional analysis revealed that although the R138Q mutation did not seem to have a damaging effect on the protein, the G266E and I304N SNPs appeared to disturb the interaction between the domains and affect the function of the protein. This is the first study to analyze all missense SNPs of the FMR1 gene. The results indicate the applicability of a bioinformatics approach to FXS and other FMR1-related diseases. I think that the analysis of FMR1 gene missense SNPs using bioinformatics methods would help diagnosis of FXS and other FMR1-related diseases.

  6. Identification of novel single nucleotide polymorphisms (SNPs in deer (Odocoileus spp. using the BovineSNP50 BeadChip.

    Directory of Open Access Journals (Sweden)

    Gwilym D Haynes

    Full Text Available Single nucleotide polymorphisms (SNPs are growing in popularity as a genetic marker for investigating evolutionary processes. A panel of SNPs is often developed by comparing large quantities of DNA sequence data across multiple individuals to identify polymorphic sites. For non-model species, this is particularly difficult, as performing the necessary large-scale genomic sequencing often exceeds the resources available for the project. In this study, we trial the Bovine SNP50 BeadChip developed in cattle (Bos taurus for identifying polymorphic SNPs in cervids Odocoileus hemionus (mule deer and black-tailed deer and O. virginianus (white-tailed deer in the Pacific Northwest. We found that 38.7% of loci could be genotyped, of which 5% (n = 1068 were polymorphic. Of these 1068 polymorphic SNPs, a mixture of putatively neutral loci (n = 878 and loci under selection (n = 190 were identified with the F(ST-outlier method. A range of population genetic analyses were implemented using these SNPs and a panel of 10 microsatellite loci. The three types of deer could readily be distinguished with both the SNP and microsatellite datasets. This study demonstrates that commercially developed SNP chips are a viable means of SNP discovery for non-model organisms, even when used between very distantly related species (the Bovidae and Cervidae families diverged some 25.1-30.1 million years before present.

  7. XAB2 tagSNPs contribute to non-small cell lung cancer susceptibility in Chinese population

    International Nuclear Information System (INIS)

    Pei, Na; Cao, Lei; Liu, Yingwen; Wu, Jing; Song, Qinqin; Zhang, Zhi; Yuan, Juxiang; Zhang, Xuemei

    2015-01-01

    XPA-binding protein 2 (XAB2) interacts with Cockayne syndrome complementation group A (CSA), group B (CSB) and RNA polymerase II to initiate nucleotide excision repair. This study aims to evaluate the association of XAB2 genetic variants with the risk of non-small cell lung cancer (NSCLC) using a tagging approach. A hospital-based case-control study was conducted in 470 patients with NSCLC and 470 controls in Chinese population. Totally, 5 tag single nucleotide polymorphisms (SNPs) in XAB2 gene were selected by Haploview software using Hapmap database. Genotyping was performed using iPlex Gold Genotyping Asssy and Sequenom MassArray. Unconditional logistic regression was conducted to estimate odd ratios (ORs) and 95 % confidence intervals (95 % CI). Unconditional logistic regression analysis showed that the XAB2 genotype with rs794078 AA or at least one rs4134816 C allele were associated with the decreased risk of NSCLC with OR (95 % CI) of 0.12 (0.03–0.54) and 0.46 (0.26–0.84). When stratified by gender, we found that the subjects carrying rs4134816 CC or CT genotype had a decreased risk for developing NSCLC among males with OR (95 % CI) of 0.39 (0.18–0.82), but not among females. In age stratification analysis, we found that younger subjects (age ≤ 60) with at least one C allele had a decreased risk of NSCLC with OR (95 % CI) of 0.35 (0.17–0.74), but older subjects didn’t. We didn’t find that XAB2 4134816 C > T variant effect on the risk of NSCLC when stratified by smoking status. The environmental factors, such as age, sex and smoking had no effect on the risk of NSCLC related to XAB2 genotypes at other polymorphic sites. The XAB2 tagSNPs (rs794078 and rs4134816) were significantly associated with the risk of NSCLC in Chinese population, which supports the XAB2 plays a significant role in the development of NSCLC

  8. Genetic variants in long non-coding RNA MIAT contribute to risk of paranoid schizophrenia in a Chinese Han population.

    Science.gov (United States)

    Rao, Shu-Quan; Hu, Hui-Ling; Ye, Ning; Shen, Yan; Xu, Qi

    2015-08-01

    The heritability of schizophrenia has been reported to be as high as ~80%, but the contribution of genetic variants identified to this heritability remains to be estimated. Long non-coding RNAs (LncRNAs) are involved in multiple processes critical to normal cellular function and dysfunction of lncRNA MIAT may contribute to the pathophysiology of schizophrenia. However, the genetic evidence of lncRNAs involved in schizophrenia has not been documented. Here, we conducted a two-stage association analysis on 8 tag SNPs that cover the whole MIAT locus in two independent Han Chinese schizophrenia case-control cohorts (discovery sample from Shanxi Province: 1093 patients with paranoid schizophrenia and 1180 control subjects; replication cohort from Jilin Province: 1255 cases and 1209 healthy controls). In discovery stage, significant genetic association with paranoid schizophrenia was observed for rs1894720 (χ(2)=74.20, P=7.1E-18), of which minor allele (T) had an OR of 1.70 (95% CI=1.50-1.91). This association was confirmed in the replication cohort (χ(2)=22.66, P=1.9E-06, OR=1.32, 95%CI 1.18-1.49). Besides, a weak genotypic association was detected for rs4274 (χ(2)=4.96, df=2, P=0.03); the AA carriers showed increased disease risk (OR=1.30, 95%CI=1.03-1.64). No significant association was found between any haplotype and paranoid schizophrenia. The present studies showed that lncRNA MIAT was a novel susceptibility gene for paranoid schizophrenia in the Chinese Han population. Considering that most lncRNAs locate in non-coding regions, our result may explain why most susceptibility loci for schizophrenia identified by genome wide association studies were out of coding regions. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. Non-coding RNAs in the Ovarian Follicle

    Directory of Open Access Journals (Sweden)

    Rosalia Battaglia

    2017-05-01

    Full Text Available The mammalian ovarian follicle is the complex reproductive unit comprising germ cell, somatic cells (Cumulus and Granulosa cells, and follicular fluid (FF: paracrine communication among the different cell types through FF ensures the development of a mature oocyte ready for fertilization. This paper is focused on non-coding RNAs in ovarian follicles and their predicted role in the pathways involved in oocyte growth and maturation. We determined the expression profiles of microRNAs in human oocytes and FF by high-throughput analysis and identified 267 microRNAs in FF and 176 in oocytes. Most of these were FF microRNAs, while 9 were oocyte specific. By bioinformatic analysis, independently performed on FF and oocyte microRNAs, we identified the most significant Biological Processes and the pathways regulated by their validated targets. We found many pathways shared between the two compartments and some specific for oocyte microRNAs. Moreover, we found 41 long non-coding RNAs able to interact with oocyte microRNAs and potentially involved in the regulation of folliculogenesis. These data are important in basic reproductive research and could also be useful for clinical applications. In fact, the characterization of non-coding RNAs in ovarian follicles could improve reproductive disease diagnosis, provide biomarkers of oocyte quality in Assisted Reproductive Treatment, and allow the development of therapies for infertility disorders.

  10. Decoding the function of nuclear long non-coding RNAs.

    Science.gov (United States)

    Chen, Ling-Ling; Carmichael, Gordon G

    2010-06-01

    Long non-coding RNAs (lncRNAs) are mRNA-like, non-protein-coding RNAs that are pervasively transcribed throughout eukaryotic genomes. Rather than silently accumulating in the nucleus, many of these are now known or suspected to play important roles in nuclear architecture or in the regulation of gene expression. In this review, we highlight some recent progress in how lncRNAs regulate these important nuclear processes at the molecular level. Copyright 2010 Elsevier Ltd. All rights reserved.

  11. Terminological synonyms in Czech and English sports terminologies

    Directory of Open Access Journals (Sweden)

    Michaela Cocca

    2016-11-01

    Full Text Available The following paper deals with the concept and typology of terminological synonyms in English and Czech, focusing on the official sport terms codified in English and/or Czech dictionaries. The analysis focuses on Anglicisms as terminological doublets, hyposynonyms, stylistic synonyms, and false friends. Results show that a high number of synonyms were generated by the process of transshaping or translating English terms into Czech. Our analysis suggests that there may be found three types of sports synonyms in English (real, quasi-, and pseudo- synonyms and four main types in Czech (terminological doublets, Anglicisms as hyposynonyms, false friends, and stylistic synonyms. The use of synonyms is even more evident in modern or newly created sports; mass media and the accessibility of data through the Internet playing an essential role as they mediate an immense input of information to the target population.

  12. Linear-time non-malleable codes in the bit-wise independent tampering model

    NARCIS (Netherlands)

    R.J.F. Cramer (Ronald); I.B. Damgård (Ivan); N.M. Döttling (Nico); I. Giacomelli (Irene); C. Xing (Chaoping)

    2017-01-01

    textabstractNon-malleable codes were introduced by Dziembowski et al. (ICS 2010) as coding schemes that protect a message against tampering attacks. Roughly speaking, a code is non-malleable if decoding an adversarially tampered encoding of a message m produces the original message m or a value m′

  13. Cyclic and constacyclic codes over a non-chain ring

    Directory of Open Access Journals (Sweden)

    Ayşegül Bayram

    2014-09-01

    Full Text Available In this study, we consider linear and especially cyclic codes over the non-chain ring $Z_{p}[v]/\\langle v^{p}-v\\rangle$ where $p$ is a prime. This is a generalization of the case $p=3.$ Further, in this work the structure of constacyclic codes are studied as well. This study takes advantage mainly from a Gray map which preserves the distance between codes over this ring and $p$-ary codes and moreover this map enlightens the structure of these codes. Furthermore, a MacWilliams type identity is presented together with some illustrative examples.

  14. Non-coding, mRNA-like RNAs database Y2K.

    Science.gov (United States)

    Erdmann, V A; Szymanski, M; Hochberg, A; Groot, N; Barciszewski, J

    2000-01-01

    In last few years much data has accumulated on various non-translatable RNA transcripts that are synthesised in different cells. They are lacking in protein coding capacity and it seems that they work mainly or exclusively at the RNA level. All known non-coding RNA transcripts are collected in the database: http://www. man.poznan.pl/5SData/ncRNA/index.html

  15. Patterns of population differentiation of candidate genes for cardiovascular disease

    Directory of Open Access Journals (Sweden)

    Ding Keyue

    2007-07-01

    Full Text Available Abstract Background The basis for ethnic differences in cardiovascular disease (CVD susceptibility is not fully understood. We investigated patterns of population differentiation (FST of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI, Utah residents with European ancestry (CEU, and Han Chinese (CHB + Japanese (JPT. We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism. Genotype data were obtained from the HapMap database. Results We calculated FST for 15,559 common SNPs (minor allele frequency ≥ 0.10 in at least one population in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions or non-functional (intronic and synonymous sites. Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. Conclusion These results suggest a possible basis for varying susceptibility to CVD among ethnic groups.

  16. A periodic pattern of SNPs in the human genome

    DEFF Research Database (Denmark)

    Madsen, Bo Eskerod; Villesen, Palle; Wiuf, Carsten

    2007-01-01

    By surveying a filtered, high-quality set of SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6, or 8 bp apart are more frequent than SNPs positioned 3, 5, 7, or 9 bp apart. The observed pattern is not restricted to genomic regions that are known to cause sequencing...... periodic DNA. Our results suggest that not all SNPs in the human genome are created by independent single nucleotide mutations, and that care should be taken in analysis of SNPs from periodic DNA. The latter may have important consequences for SNP and association studies....... or alignment errors, for example, transposable elements (SINE, LINE, and LTR), tandem repeats, and large duplicated regions. However, we found that the pattern is almost entirely confined to what we define as "periodic DNA." Periodic DNA is a genomic region with a high degree of periodicity in nucleotide usage...

  17. A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

    Science.gov (United States)

    Zhang, Ai-bing; Feng, Jie; Ward, Robert D; Wan, Ping; Gao, Qiang; Wu, Jun; Zhao, Wei-zhong

    2012-01-01

    Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI) region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS) genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF) to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish) and two representing non-coding ITS barcodes (rust fungi and brown algae). Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ) and Maximum likelihood (ML) methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI) of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40%) for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37%) for 1094 brown algae queries, both using ITS barcodes.

  18. A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

    Directory of Open Access Journals (Sweden)

    Ai-bing Zhang

    Full Text Available Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish and two representing non-coding ITS barcodes (rust fungi and brown algae. Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ and Maximum likelihood (ML methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40% for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37% for 1094 brown algae queries, both using ITS barcodes.

  19. SNPs of melanocortin 4 receptor (MC4R) associated with body weight in Beagle dogs.

    Science.gov (United States)

    Zeng, Ruixia; Zhang, Yibo; Du, Peng

    2014-01-01

    Melanocortin 4 receptor (MC4R), which is associated with inherited human obesity, is involoved in food intake and body weight of mammals. To study the relationships between MC4R gene polymorphism and body weight in Beagle dogs, we detected and compared the nucleotide sequence of the whole coding region and 3'- and 5'- flanking regions of the dog MC4R gene (1214 bp). In 120 Beagle dogs, two SNPs (A420C, C895T) were identified and their relation with body weight was analyzed with RFLP-PCR method. The results showed that the SNP at A420C was significantly associated with canine body weight trait when it changed amino acid 101 of the MC4R protein from asparagine to threonine, while canine body weight variations were significant in female dogs when MC4R nonsense mutation at C895T. It suggested that the two SNPs might affect the MC4R gene's function which was relative to body weight in Beagle dogs. Therefore, MC4R was a candidate gene for selecting different size dogs with the MC4R SNPs (A420C, C895T) being potentially valuable as a genetic marker.

  20. A SNP panel and online tool for checking genotype concordance through comparing QR codes.

    Directory of Open Access Journals (Sweden)

    Yonghong Du

    Full Text Available In the current precision medicine era, more and more samples get genotyped and sequenced. Both researchers and commercial companies expend significant time and resources to reduce the error rate. However, it has been reported that there is a sample mix-up rate of between 0.1% and 1%, not to mention the possibly higher mix-up rate during the down-stream genetic reporting processes. Even on the low end of this estimate, this translates to a significant number of mislabeled samples, especially over the projected one billion people that will be sequenced within the next decade. Here, we first describe a method to identify a small set of Single nucleotide polymorphisms (SNPs that can uniquely identify a personal genome, which utilizes allele frequencies of five major continental populations reported in the 1000 genomes project and the ExAC Consortium. To make this panel more informative, we added four SNPs that are commonly used to predict ABO blood type, and another two SNPs that are capable of predicting sex. We then implement a web interface (http://qrcme.tech, nicknamed QRC (for QR code based Concordance check, which is capable of extracting the relevant ID SNPs from a raw genetic data, coding its genotype as a quick response (QR code, and comparing QR codes to report the concordance of underlying genetic datasets. The resulting 80 fingerprinting SNPs represent a significant decrease in complexity and the number of markers used for genetic data labelling and tracking. Our method and web tool is easily accessible to both researchers and the general public who consider the accuracy of complex genetic data as a prerequisite towards precision medicine.

  1. A SNP panel and online tool for checking genotype concordance through comparing QR codes.

    Science.gov (United States)

    Du, Yonghong; Martin, Joshua S; McGee, John; Yang, Yuchen; Liu, Eric Yi; Sun, Yingrui; Geihs, Matthias; Kong, Xuejun; Zhou, Eric Lingfeng; Li, Yun; Huang, Jie

    2017-01-01

    In the current precision medicine era, more and more samples get genotyped and sequenced. Both researchers and commercial companies expend significant time and resources to reduce the error rate. However, it has been reported that there is a sample mix-up rate of between 0.1% and 1%, not to mention the possibly higher mix-up rate during the down-stream genetic reporting processes. Even on the low end of this estimate, this translates to a significant number of mislabeled samples, especially over the projected one billion people that will be sequenced within the next decade. Here, we first describe a method to identify a small set of Single nucleotide polymorphisms (SNPs) that can uniquely identify a personal genome, which utilizes allele frequencies of five major continental populations reported in the 1000 genomes project and the ExAC Consortium. To make this panel more informative, we added four SNPs that are commonly used to predict ABO blood type, and another two SNPs that are capable of predicting sex. We then implement a web interface (http://qrcme.tech), nicknamed QRC (for QR code based Concordance check), which is capable of extracting the relevant ID SNPs from a raw genetic data, coding its genotype as a quick response (QR) code, and comparing QR codes to report the concordance of underlying genetic datasets. The resulting 80 fingerprinting SNPs represent a significant decrease in complexity and the number of markers used for genetic data labelling and tracking. Our method and web tool is easily accessible to both researchers and the general public who consider the accuracy of complex genetic data as a prerequisite towards precision medicine.

  2. Synonyms for some species of Mexican anoles (Squamata: Dactyloidae).

    Science.gov (United States)

    De Oca, Adrián Nieto Montes; Poe, Steven; Scarpetta, Simon; Gray, Levi; Lieb, Carl S

    2013-01-01

    We studied type material and freshly collected topotypical specimens to assess the taxonomic status of five names associated with species of Mexican Anolis. We find A. schmidti to be a junior synonym of A. nebulosus, A. breedlovei to be a junior synonym of A. cuprinus, A. polyrhachis to be a junior synonym of A. rubiginosus, A. simmonsi to be a junior synonym of A. nebuloides, and A. adleri to be a junior synonym of A. liogaster.

  3. The Study of Synonymous Word "Mistake"

    OpenAIRE

    Suwardi, Albertus

    2016-01-01

    This article discusses the synonymous word "mistake*.The discussion will also cover the meaning of 'word' itself. Words can be considered as form whether spoken or written, or alternatively as composite expression, which combine and meaning. Synonymous are different phonological words which have the same or very similar meanings. The synonyms of mistake are error, fault, blunder, slip, slipup, gaffe and inaccuracy. The data is taken from a computer program. The procedure of data collection is...

  4. Extracellular vesicle associated long non-coding RNAs functionally enhance cell viability

    Directory of Open Access Journals (Sweden)

    Chris Hewson

    2016-10-01

    Full Text Available Cells communicate with one another to create microenvironments and share resources. One avenue by which cells communicate is through the action of exosomes. Exosomes are extracellular vesicles that are released by one cell and taken up by neighbouring cells. But how exosomes instigate communication between cells has remained largely unknown. We present evidence here that particular long non-coding RNA molecules are preferentially packaged into exosomes. We also find that a specific class of these exosome associated non-coding RNAs functionally modulate cell viability by direct interactions with l-lactate dehydrogenase B (LDHB, high-mobility group protein 17 (HMG-17, and CSF2RB, proteins involved in metabolism, nucleosomal architecture and cell signalling respectively. Knowledge of this endogenous cell to cell pathway, those proteins interacting with exosome associated non-coding transcripts and their interacting domains, could lead to a better understanding of not only cell to cell interactions but also the development of exosome targeted approaches in patient specific cell-based therapies. Keywords: Non-coding RNA, Extracellular RNA, Exosomes, Retroelement, Pseudogene

  5. Progressive changes in non-coding RNA profile in leucocytes with age

    Science.gov (United States)

    Muñoz-Culla, Maider; Irizar, Haritz; Gorostidi, Ana; Alberro, Ainhoa; Osorio-Querejeta, Iñaki; Ruiz-Martínez, Javier; Olascoaga, Javier; de Munain, Adolfo López; Otaegui, David

    2017-01-01

    It has been observed that immune cell deterioration occurs in the elderly, as well as a chronic low-grade inflammation called inflammaging. These cellular changes must be driven by numerous changes in gene expression and in fact, both protein-coding and non-coding RNA expression alterations have been observed in peripheral blood mononuclear cells from elder people. In the present work we have studied the expression of small non-coding RNA (microRNA and small nucleolar RNA -snoRNA-) from healthy individuals from 24 to 79 years old. We have observed that the expression of 69 non-coding RNAs (56 microRNAs and 13 snoRNAs) changes progressively with chronological age. According to our results, the age range from 47 to 54 is critical given that it is the period when the expression trend (increasing or decreasing) of age-related small non-coding RNAs is more pronounced. Furthermore, age-related miRNAs regulate genes that are involved in immune, cell cycle and cancer-related processes, which had already been associated to human aging. Therefore, human aging could be studied as a result of progressive molecular changes, and different age ranges should be analysed to cover the whole aging process. PMID:28448962

  6. BIRTH: a beam deposition code for non-circular tokamak plasmas

    International Nuclear Information System (INIS)

    Otsuka, Michio; Nagami, Masayuki; Matsuda, Toshiaki

    1982-09-01

    A new beam deposition code has been developed which is capable of calculating fast ion deposition profiles including the orbit correction. The code incorporates any injection geometry and a non-circular cross section plasma with a variable elongation and an outward shift of the magnetic flux surface. Typical cpu time on a DEC-10 computer is 10 - 20 seconds and 5 - 10 seconds with and without the orbit correction, respectively. This is shorter by an order of magnitude than that of other codes, e.g., Monte Carlo codes. The power deposition profile calculated by this code is in good agreement with that calculated by a Monte Carlo code. (author)

  7. Whole-gene positive selection, elevated synonymous substitution rates, duplication, and indel evolution of the chloroplast clpP1 gene.

    Directory of Open Access Journals (Sweden)

    Per Erixon

    Full Text Available BACKGROUND: Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. METHODOLOGY/PRINCIPLE FINDINGS: We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family. Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. CONCLUSIONS/SIGNIFICANCE: We found positive selection of the clpP1 gene in various plant lineages to correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increase in synonymous substitution rates. The present study sheds light on the

  8. Development of a SNP resource and a genetic linkage map for Atlantic cod (Gadus morhua

    Directory of Open Access Journals (Sweden)

    Higgins Brent

    2010-03-01

    Full Text Available Abstract Background Atlantic cod (Gadus morhua is a species with increasing economic significance for the aquaculture industry. The genetic improvement of cod will play a critical role in achieving successful large-scale aquaculture. While many microsatellite markers have been developed in cod, the number of single nucleotide polymorphisms (SNPs is currently limited. Here we report the identification of SNPs from sequence data generated by a large-scale expressed sequence tag (EST program, focusing on fish originating from Canadian waters. Results A total of 97976 ESTs were assembled to generate 13448 contigs. We detected 4753 SNPs that met our selection criteria (depth of coverage ≥ 4 reads; minor allele frequency > 25%. 3072 SNPs were selected for testing. The percentage of successful assays was 75%, with 2291 SNPs amplifying correctly. Of these, 607 (26% SNPs were monomorphic for all populations tested. In total, 64 (4% of SNPs are likely to represent duplicated genes or highly similar members of gene families, rather than alternative alleles of the same gene, since they showed a high frequency of heterozygosity. The remaining polymorphic SNPs (1620 were categorised as validated SNPs. The mean minor allele frequency of the validated loci was 0.258 (± 0.141. Of the 1514 contigs from which validated SNPs were selected, 31% have a significant blast hit. For the SNPs predicted to occur in coding regions (141, we determined that 36% (51 are non-synonymous. Many loci (1033 SNPs; 64% are polymorphic in all populations tested. However a small number of SNPs (184 that are polymorphic in the Western Atlantic were monomorphic in fish tested from three European populations. A preliminary linkage map has been constructed with 23 major linkage groups and 924 mapped SNPs. Conclusions These SNPs represent powerful tools to accelerate the genetic improvement of cod aquaculture. They have been used to build a genetic linkage map that can be applied to

  9. Development of a spreadsheet for SNPs typing using Microsoft EXCEL.

    Science.gov (United States)

    Hashiyada, Masaki; Itakura, Yukio; Takahashi, Shirushi; Sakai, Jun; Funayama, Masato

    2009-04-01

    Single-nucleotide polymorphisms (SNPs) have some characteristics that make them very appropriate for forensic studies and applications. In our institute, SNPs typings were performed by the TaqMan SNP Genotyping Assays using the ABI PRISM 7500 FAST Real-Time PCR System (AppliedBiosystems) and Sequence Detection Software ver.1.4 (AppliedBiosystem). The TaqMan method was desired two positive control (Allele1 and 2) and one negative control to analyze each SNP locus. Therefore, it can be analyzed up to 24 loci of a person on a 96-well-plate at the same time. If SNPs analysis is expected to apply to biometrics authentication, 48 and over loci are required to identify a person. In this study, we designed a spreadsheet package using Microsoft EXCEL, and population data were used from our 120 SNPs population studies. On the spreadsheet, we defined SNP types using 'template files' instead of positive and negative controls. "Template files" consisted of the results of 94 unknown samples and two negative controls of each of 120 SNPs loci we had previously studied. By the use of the files, the spreadsheet could analyze 96 SNPs on a 96-wells-plate simultaneously.

  10. Population differentiation in allele frequencies of obesity-associated SNPs.

    Science.gov (United States)

    Mao, Linyong; Fang, Yayin; Campbell, Michael; Southerland, William M

    2017-11-10

    Obesity is emerging as a global health problem, with more than one-third of the world's adult population being overweight or obese. In this study, we investigated worldwide population differentiation in allele frequencies of obesity-associated SNPs (single nucleotide polymorphisms). We collected a total of 225 obesity-associated SNPs from a public database. Their population-level allele frequencies were derived based on the genotype data from 1000 Genomes Project (phase 3). We used hypergeometric model to assess whether the effect allele at a given SNP is significantly enriched or depleted in each of the 26 populations surveyed in the 1000 Genomes Project with respect to the overall pooled population. Our results indicate that 195 out of 225 SNPs (86.7%) possess effect alleles significantly enriched or depleted in at least one of the 26 populations. Populations within the same continental group exhibit similar allele enrichment/depletion patterns whereas inter-continental populations show distinct patterns. Among the 225 SNPs, 15 SNPs cluster in the first intron region of the FTO gene, which is a major gene associated with body-mass index (BMI) and fat mass. African populations exhibit much smaller blocks of LD (linkage disequilibrium) among these15 SNPs while European and Asian populations have larger blocks. To estimate the cumulative effect of all variants associated with obesity, we developed the personal composite genetic risk score for obesity. Our results indicate that the East Asian populations have the lowest averages of the composite risk scores, whereas three European populations have the highest averages. In addition, the population-level average of composite genetic risk scores is significantly correlated (R 2 = 0.35, P = 0.0060) with obesity prevalence. We have detected substantial population differentiation in allele frequencies of obesity-associated SNPs. The results will help elucidate the genetic basis which may contribute to population

  11. Quantitative Profiling of Peptides from RNAs classified as non-coding

    Science.gov (United States)

    Prabakaran, Sudhakaran; Hemberg, Martin; Chauhan, Ruchi; Winter, Dominic; Tweedie-Cullen, Ry Y.; Dittrich, Christian; Hong, Elizabeth; Gunawardena, Jeremy; Steen, Hanno; Kreiman, Gabriel; Steen, Judith A.

    2014-01-01

    Only a small fraction of the mammalian genome codes for messenger RNAs destined to be translated into proteins, and it is generally assumed that a large portion of transcribed sequences - including introns and several classes of non-coding RNAs (ncRNAs) do not give rise to peptide products. A systematic examination of translation and physiological regulation of ncRNAs has not been conducted. Here, we use computational methods to identify the products of non-canonical translation in mouse neurons by analyzing unannotated transcripts in combination with proteomic data. This study supports the existence of non-canonical translation products from both intragenic and extragenic genomic regions, including peptides derived from anti-sense transcripts and introns. Moreover, the studied novel translation products exhibit temporal regulation similar to that of proteins known to be involved in neuronal activity processes. These observations highlight a potentially large and complex set of biologically regulated translational events from transcripts formerly thought to lack coding potential. PMID:25403355

  12. Effects of GWAS-associated genetic variants on lncRNAs within IBD and T1D candidate loci

    DEFF Research Database (Denmark)

    Mirza, Aashiq H; Kaur, Simranjeet; Brorsson, Caroline A

    2014-01-01

    -nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here......, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs...... within and in close proximity (+/-5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable...

  13. At the intersection of non-coding transcription, DNA repair, chromatin structure, and cellular senescence

    Directory of Open Access Journals (Sweden)

    Ryosuke eOhsawa

    2013-07-01

    Full Text Available It is well accepted that non-coding RNAs play a critical role in regulating gene expression. Recent paradigm-setting studies are now revealing that non-coding RNAs, other than microRNAs, also play intriguing roles in the maintenance of chromatin structure, in the DNA damage response, and in adult human stem cell aging. In this review, we will discuss the complex inter-dependent relationships among non-coding RNA transcription, maintenance of genomic stability, chromatin structure and adult stem cell senescence. DNA damage-induced non-coding RNAs transcribed in the vicinity of the DNA break regulate recruitment of the DNA damage machinery and DNA repair efficiency. We will discuss the correlation between non-coding RNAs and DNA damage repair efficiency and the potential role of changing chromatin structures around double-strand break sites. On the other hand, induction of non-coding RNA transcription from the repetitive Alu elements occurs during human stem cell aging and hinders efficient DNA repair causing entry into senescence. We will discuss how this fine balance between transcription and genomic instability may be regulated by the dramatic changes to chromatin structure that accompany cellular senescence.

  14. On the classification of long non-coding RNAs

    KAUST Repository

    Ma, Lina; Bajic, Vladimir B.; Zhang, Zhang

    2013-01-01

    Long non-coding RNAs (lncRNAs) have been found to perform various functions in a wide variety of important biological processes. To make easier interpretation of lncRNA functionality and conduct deep mining on these transcribed sequences

  15. Single-nucleotide polymorphisms (SNPs) of the IRF6 and TFAP2A in non-syndromic cleft lip with or without cleft palate (NSCLP) in a northern Chinese population

    International Nuclear Information System (INIS)

    Shi, Jinna; Song, Tao; Jiao, Xiaohui; Qin, Chunlin; Zhou, Jin

    2011-01-01

    Highlights: → IRF6 rs642961 polymorphism is intensively associated with NSCLP. → IRF6 rs2235371 polymorphism is not associated with NSCLP in the northern Chinese population. → This investigation failed to yield any evidence for the involvement of TFAP2A polymorphisms in NSCLP in the northern Chinese population. -- Abstract: Non-syndromic cleft lip with or without cleft palate (NSCLP) is a common birth defect that is presumably caused by genetic factors alone or gene alterations in combination with environmental changes. A number of studies have shown an association between NSCLP and single-nucleotide polymorphisms (SNPs) in the interferon regulatory factor 6 (IRF6) gene in several populations. The transcription factor AP-2a (TFAP2A), which is involved in regulating mid-face development and upper lip fusion, has also be considered a candidate gene contributing to the etiology of NSCLP. The potential importance of IRF6 and TFAP2A in the NSCLP is further highlighted by a study showing that the two molecules are in the same developmental pathway. To further assess the roles of the IRF6 and TFAP2A in NSCLP, we investigated two identified IRF6 SNPs (rs2235371, rs642961) and three TFAP2A tag SNPs (rs3798691, rs1675414, rs303050) selected from HapMap data in a northern Chinese population, a group with a high prevalence of NSCLP. These SNPs were examined for association with NSCLP in 175 patients and 160 healthy controls. We observed a significant correlation between IRF6 rs642961 and NSCLP, and a lack of association between IRF6 rs2235371 polymorphisms and NSCLP in this population. This investigation indicated that there is no association between the three SNPs in the TFAP2A and NSCLP, suggesting that TFAP2A may not be involved in the development of NSCLP in the northern Chinese population. Our study provides further evidence regarding the role of IRF6 variations in NSCLP development and finds no significant association between TFAP2A and NSCLP in this northern

  16. Single-nucleotide polymorphisms (SNPs) of the IRF6 and TFAP2A in non-syndromic cleft lip with or without cleft palate (NSCLP) in a northern Chinese population

    Energy Technology Data Exchange (ETDEWEB)

    Shi, Jinna, E-mail: kqkjk@yahoo.com.cn [Department of Periodontology, The First Affiliated Hospital, Harbin Medical University, Harbin (China); Song, Tao; Jiao, Xiaohui [Department of Oral Maxillofacial Surgery, The First Affiliated Hospital, Harbin Medical University, Harbin (China); Qin, Chunlin [Department of Biomedical Sciences, Texas A and M Health Science Center, Baylor College of Dentistry, Dallas, TX (United States); Zhou, Jin [Department of Hematology, The First Affiliated Hospital, Harbin Medical University, Harbin (China)

    2011-07-15

    Highlights: {yields} IRF6 rs642961 polymorphism is intensively associated with NSCLP. {yields} IRF6 rs2235371 polymorphism is not associated with NSCLP in the northern Chinese population. {yields} This investigation failed to yield any evidence for the involvement of TFAP2A polymorphisms in NSCLP in the northern Chinese population. -- Abstract: Non-syndromic cleft lip with or without cleft palate (NSCLP) is a common birth defect that is presumably caused by genetic factors alone or gene alterations in combination with environmental changes. A number of studies have shown an association between NSCLP and single-nucleotide polymorphisms (SNPs) in the interferon regulatory factor 6 (IRF6) gene in several populations. The transcription factor AP-2a (TFAP2A), which is involved in regulating mid-face development and upper lip fusion, has also be considered a candidate gene contributing to the etiology of NSCLP. The potential importance of IRF6 and TFAP2A in the NSCLP is further highlighted by a study showing that the two molecules are in the same developmental pathway. To further assess the roles of the IRF6 and TFAP2A in NSCLP, we investigated two identified IRF6 SNPs (rs2235371, rs642961) and three TFAP2A tag SNPs (rs3798691, rs1675414, rs303050) selected from HapMap data in a northern Chinese population, a group with a high prevalence of NSCLP. These SNPs were examined for association with NSCLP in 175 patients and 160 healthy controls. We observed a significant correlation between IRF6 rs642961 and NSCLP, and a lack of association between IRF6 rs2235371 polymorphisms and NSCLP in this population. This investigation indicated that there is no association between the three SNPs in the TFAP2A and NSCLP, suggesting that TFAP2A may not be involved in the development of NSCLP in the northern Chinese population. Our study provides further evidence regarding the role of IRF6 variations in NSCLP development and finds no significant association between TFAP2A and NSCLP in this

  17. Non-Coding RNAs and Endometrial Cancer

    Directory of Open Access Journals (Sweden)

    Cristina Vallone

    2018-03-01

    Full Text Available Non-coding RNAs (ncRNAs are involved in the regulation of cell metabolism and neoplastic transformation. Recent studies have tried to clarify the significance of these information carriers in the genesis and progression of various cancers and their use as biomarkers for the disease; possible targets for the inhibition of growth and invasion by the neoplastic cells have been suggested. The significance of ncRNAs in lung cancer, bladder cancer, kidney cancer, and melanoma has been amply investigated with important results. Recently, the role of long non-coding RNAs (lncRNAs has also been included in cancer studies. Studies on the relation between endometrial cancer (EC and ncRNAs, such as small ncRNAs or micro RNAs (miRNAs, transfer RNAs (tRNAs, ribosomal RNAs (rRNAs, antisense RNAs (asRNAs, small nuclear RNAs (snRNAs, Piwi-interacting RNAs (piRNAs, small nucleolar RNAs (snoRNAs, competing endogenous RNAs (ceRNAs, lncRNAs, and long intergenic ncRNAs (lincRNAs have been published. The recent literature produced in the last three years was extracted from PubMed by two independent readers, which was then selected for the possible relation between ncRNAs, oncogenesis in general, and EC in particular.

  18. Junk DNA and the long non-coding RNA twist in cancer genetics

    NARCIS (Netherlands)

    H. Ling (Hui); K. Vincent; M. Pichler; R. Fodde (Riccardo); I. Berindan-Neagoe (Ioana); F.J. Slack (Frank); G.A. Calin (George)

    2015-01-01

    textabstractThe central dogma of molecular biology states that the flow of genetic information moves from DNA to RNA to protein. However, in the last decade this dogma has been challenged by new findings on non-coding RNAs (ncRNAs) such as microRNAs (miRNAs). More recently, long non-coding RNAs

  19. Identification of functional SNPs in the 5-prime flanking sequences of human genes

    Directory of Open Access Journals (Sweden)

    Lenhard Boris

    2005-02-01

    Full Text Available Abstract Background Over 4 million single nucleotide polymorphisms (SNPs are currently reported to exist within the human genome. Only a small fraction of these SNPs alter gene function or expression, and therefore might be associated with a cell phenotype. These functional SNPs are consequently important in understanding human health. Information related to functional SNPs in candidate disease genes is critical for cost effective genetic association studies, which attempt to understand the genetics of complex diseases like diabetes, Alzheimer's, etc. Robust methods for the identification of functional SNPs are therefore crucial. We report one such experimental approach. Results Sequence conserved between mouse and human genomes, within 5 kilobases of the 5-prime end of 176 GPCR genes, were screened for SNPs. Sequences flanking these SNPs were scored for transcription factor binding sites. Allelic pairs resulting in a significant score difference were predicted to influence the binding of transcription factors (TFs. Ten such SNPs were selected for mobility shift assays (EMSA, resulting in 7 of them exhibiting a reproducible shift. The full-length promoter regions with 4 of the 7 SNPs were cloned in a Luciferase based plasmid reporter system. Two out of the 4 SNPs exhibited differential promoter activity in several human cell lines. Conclusions We propose a method for effective selection of functional, regulatory SNPs that are located in evolutionary conserved 5-prime flanking regions (5'-FR regions of human genes and influence the activity of the transcriptional regulatory region. Some SNPs behave differently in different cell types.

  20. Streptomyces ciscaucasicus Sveshnikova et al. 1983 is a later subjective synonym of Streptomyces canus Heinemann et al. 1953.

    Science.gov (United States)

    Kämpfer, Peter; Rückert, Christian; Blom, Jochen; Goesmann, Alexander; Wink, Joachim; Kalinowski, Jörn; Glaeser, Stefanie P

    2018-01-01

    Streptomyces canuswas described in 1953 and the name was listed in the Approved List of Bacterial Names in 1980. Three years later, Streptomyces ciscaucasicus was published and the name was subsequently validated in Validation List no. 22 in 1986. On the basis of genome comparison and multilocus sequence analysis of the type strains of Streptomyces canus and Streptomyces ciscaucasicus it can now be shown that these two species despite some phenotypic differences are subjective synonyms. In such a case Rule 24 of the Bacteriological Code applies, in which priority of names is determined by the date of the original publication. Hence, we propose that S. ciscaucasicus is a later subjective synonym of S. canus.

  1. Highly conserved non-coding sequences are associated with vertebrate development.

    Directory of Open Access Journals (Sweden)

    Adam Woolfe

    2005-01-01

    Full Text Available In addition to protein coding sequence, the human genome contains a significant amount of regulatory DNA, the identification of which is proving somewhat recalcitrant to both in silico and functional methods. An approach that has been used with some success is comparative sequence analysis, whereby equivalent genomic regions from different organisms are compared in order to identify both similarities and differences. In general, similarities in sequence between highly divergent organisms imply functional constraint. We have used a whole-genome comparison between humans and the pufferfish, Fugu rubripes, to identify nearly 1,400 highly conserved non-coding sequences. Given the evolutionary divergence between these species, it is likely that these sequences are found in, and furthermore are essential to, all vertebrates. Most, and possibly all, of these sequences are located in and around genes that act as developmental regulators. Some of these sequences are over 90% identical across more than 500 bases, being more highly conserved than coding sequence between these two species. Despite this, we cannot find any similar sequences in invertebrate genomes. In order to begin to functionally test this set of sequences, we have used a rapid in vivo assay system using zebrafish embryos that allows tissue-specific enhancer activity to be identified. Functional data is presented for highly conserved non-coding sequences associated with four unrelated developmental regulators (SOX21, PAX6, HLXB9, and SHH, in order to demonstrate the suitability of this screen to a wide range of genes and expression patterns. Of 25 sequence elements tested around these four genes, 23 show significant enhancer activity in one or more tissues. We have identified a set of non-coding sequences that are highly conserved throughout vertebrates. They are found in clusters across the human genome, principally around genes that are implicated in the regulation of development

  2. An expanding universe of the non-coding genome in cancer biology.

    Science.gov (United States)

    Xue, Bin; He, Lin

    2014-06-01

    Neoplastic transformation is caused by accumulation of genetic and epigenetic alterations that ultimately convert normal cells into tumor cells with uncontrolled proliferation and survival, unlimited replicative potential and invasive growth [Hanahan,D. et al. (2011) Hallmarks of cancer: the next generation. Cell, 144, 646-674]. Although the majority of the cancer studies have focused on the functions of protein-coding genes, emerging evidence has started to reveal the importance of the vast non-coding genome, which constitutes more than 98% of the human genome. A number of non-coding RNAs (ncRNAs) derived from the 'dark matter' of the human genome exhibit cancer-specific differential expression and/or genomic alterations, and it is increasingly clear that ncRNAs, including small ncRNAs and long ncRNAs (lncRNAs), play an important role in cancer development by regulating protein-coding gene expression through diverse mechanisms. In addition to ncRNAs, nearly half of the mammalian genomes consist of transposable elements, particularly retrotransposons. Once depicted as selfish genomic parasites that propagate at the expense of host fitness, retrotransposon elements could also confer regulatory complexity to the host genomes during development and disease. Reactivation of retrotransposons in cancer, while capable of causing insertional mutagenesis and genome rearrangements to promote oncogenesis, could also alter host gene expression networks to favor tumor development. Taken together, the functional significance of non-coding genome in tumorigenesis has been previously underestimated, and diverse transcripts derived from the non-coding genome could act as integral functional components of the oncogene and tumor suppressor network. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  3. Design of ACM system based on non-greedy punctured LDPC codes

    Science.gov (United States)

    Lu, Zijun; Jiang, Zihong; Zhou, Lin; He, Yucheng

    2017-08-01

    In this paper, an adaptive coded modulation (ACM) scheme based on rate-compatible LDPC (RC-LDPC) codes was designed. The RC-LDPC codes were constructed by a non-greedy puncturing method which showed good performance in high code rate region. Moreover, the incremental redundancy scheme of LDPC-based ACM system over AWGN channel was proposed. By this scheme, code rates vary from 2/3 to 5/6 and the complication of the ACM system is lowered. Simulations show that more and more obvious coding gain can be obtained by the proposed ACM system with higher throughput.

  4. Probable relationship between partitions of the set of codons and the origin of the genetic code.

    Science.gov (United States)

    Salinas, Dino G; Gallardo, Mauricio O; Osorio, Manuel I

    2014-03-01

    Here we study the distribution of randomly generated partitions of the set of amino acid-coding codons. Some results are an application from a previous work, about the Stirling numbers of the second kind and triplet codes, both to the cases of triplet codes having four stop codons, as in mammalian mitochondrial genetic code, and hypothetical doublet codes. Extending previous results, in this work it is found that the most probable number of blocks of synonymous codons, in a genetic code, is similar to the number of amino acids when there are four stop codons, as well as it could be for a primigenious doublet code. Also it is studied the integer partitions associated to patterns of synonymous codons and it is shown, for the canonical code, that the standard deviation inside an integer partition is one of the most probable. We think that, in some early epoch, the genetic code might have had a maximum of the disorder or entropy, independent of the assignment between codons and amino acids, reaching a state similar to "code freeze" proposed by Francis Crick. In later stages, maybe deterministic rules have reassigned codons to amino acids, forming the natural codes, such as the canonical code, but keeping the numerical features describing the set partitions and the integer partitions, like a "fossil numbers"; both kinds of partitions about the set of amino acid-coding codons. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  5. Geographic differences in allele frequencies of susceptibility SNPs for cardiovascular disease

    Directory of Open Access Journals (Sweden)

    Kullo Iftikhar J

    2011-04-01

    Full Text Available Abstract Background We hypothesized that the frequencies of risk alleles of SNPs mediating susceptibility to cardiovascular diseases differ among populations of varying geographic origin and that population-specific selection has operated on some of these variants. Methods From the database of genome-wide association studies (GWAS, we selected 36 cardiovascular phenotypes including coronary heart disease, hypertension, and stroke, as well as related quantitative traits (eg, body mass index and plasma lipid levels. We identified 292 SNPs in 270 genes associated with a disease or trait at P -8. As part of the Human Genome-Diversity Project (HGDP, 158 (54.1% of these SNPs have been genotyped in 938 individuals belonging to 52 populations from seven geographic areas. A measure of population differentiation, FST, was calculated to quantify differences in risk allele frequencies (RAFs among populations and geographic areas. Results Large differences in RAFs were noted in populations of Africa, East Asia, America and Oceania, when compared with other geographic regions. The mean global FST (0.1042 for 158 SNPs among the populations was not significantly higher than the mean global FST of 158 autosomal SNPs randomly sampled from the HGDP database. Significantly higher global FST (P FST of 2036 putatively neutral SNPs. For four of these SNPs, additional evidence of selection was noted based on the integrated Haplotype Score. Conclusion Large differences in RAFs for a set of common SNPs that influence risk of cardiovascular disease were noted between the major world populations. Pairwise comparisons revealed RAF differences for at least eight SNPs that might be due to population-specific selection or demographic factors. These findings are relevant to a better understanding of geographic variation in the prevalence of cardiovascular disease.

  6. Variability of the caprine whey protein genes and their association with milk yield, composition and renneting properties in the Sarda breed. 1. The LALBA gene.

    Science.gov (United States)

    Dettori, Maria Luisa; Pazzola, Michele; Paschino, Pietro; Pira, Maria Giovanna; Vacca, Giuseppe Massimo

    2015-11-01

    The 5' flanking region and 3' UTR of the caprine LALBA gene were analysed by SSCP and sequencing. A total of nine SNPs were detected: three in the promoter region, two were synonymous coding SNPs at exon-1, and four SNPs were in exon-4, within the 3'UTR. The nucleotide changes located in the promoter region (c.-358T>C, c.-163G>A, c.-121T>G) were genotyped by SSCP in 263 Sarda goats to evaluate their possible effect on milk yield, composition and renneting properties. We observed an effect of the three SNPs on milk yield and lactose content. Genotypes TT and CT at c.-358T>C (P A (P C and c.-121T>G were part of transcription factors binding sites, potentially involved in modulating the LALBA gene expression. The LALBA genotype affected renneting properties (P < 0.001), as heterozygotes c.-358CT and c.-163GA were characterised by delayed rennet coagulation time and curd firming time and the lowest value of curd firmness. The present investigation increases the panel of SNPs and adds new information about the effects of the caprine LALBA gene polymorphism.

  7. The importance of species name synonyms in literature searches

    Science.gov (United States)

    Guala, Gerald

    2016-01-01

    The synonyms of biological species names are shown to be an important component in comprehensive searches of electronic scientific literature databases but they are not well leveraged within the major literature databases examined. For accepted or valid species names in the Integrated Taxonomic Information System (ITIS) which have synonyms in the system, and which are found in citations within PLoS, PMC, PubMed or Scopus, both the percentage of species for which citations will not be found if synonyms are not used, and the percentage increase in number of citations found by including synonyms are very often substantial. However, there is no correlation between the number of synonyms per species and the magnitude of the effect. Further, the number of citations found does not generally increase proportionally to the number of synonyms available. Users looking for literature on specific species across all of the resources investigated here are often missing large numbers of citations if they are not manually augmenting their searches with synonyms. Of course, missing citations can have serious consequences by effectively hiding critical information. Literature searches should include synonym relationships and a new web service in ITIS, with examples of how to apply it to this issue, was developed as a result of this study, and is here announced, to aide in this.

  8. The Importance of Species Name Synonyms in Literature Searches.

    Science.gov (United States)

    Guala, Gerald F

    2016-01-01

    The synonyms of biological species names are shown to be an important component in comprehensive searches of electronic scientific literature databases but they are not well leveraged within the major literature databases examined. For accepted or valid species names in the Integrated Taxonomic Information System (ITIS) which have synonyms in the system, and which are found in citations within PLoS, PMC, PubMed or Scopus, both the percentage of species for which citations will not be found if synonyms are not used, and the percentage increase in number of citations found by including synonyms are very often substantial. However, there is no correlation between the number of synonyms per species and the magnitude of the effect. Further, the number of citations found does not generally increase proportionally to the number of synonyms available. Users looking for literature on specific species across all of the resources investigated here are often missing large numbers of citations if they are not manually augmenting their searches with synonyms. Of course, missing citations can have serious consequences by effectively hiding critical information. Literature searches should include synonym relationships and a new web service in ITIS, with examples of how to apply it to this issue, was developed as a result of this study, and is here announced, to aide in this.

  9. The Importance of Species Name Synonyms in Literature Searches.

    Directory of Open Access Journals (Sweden)

    Gerald F Guala

    Full Text Available The synonyms of biological species names are shown to be an important component in comprehensive searches of electronic scientific literature databases but they are not well leveraged within the major literature databases examined. For accepted or valid species names in the Integrated Taxonomic Information System (ITIS which have synonyms in the system, and which are found in citations within PLoS, PMC, PubMed or Scopus, both the percentage of species for which citations will not be found if synonyms are not used, and the percentage increase in number of citations found by including synonyms are very often substantial. However, there is no correlation between the number of synonyms per species and the magnitude of the effect. Further, the number of citations found does not generally increase proportionally to the number of synonyms available. Users looking for literature on specific species across all of the resources investigated here are often missing large numbers of citations if they are not manually augmenting their searches with synonyms. Of course, missing citations can have serious consequences by effectively hiding critical information. Literature searches should include synonym relationships and a new web service in ITIS, with examples of how to apply it to this issue, was developed as a result of this study, and is here announced, to aide in this.

  10. u-Constacyclic codes over F_p+u{F}_p and their applications of constructing new non-binary quantum codes

    Science.gov (United States)

    Gao, Jian; Wang, Yongkang

    2018-01-01

    Structural properties of u-constacyclic codes over the ring F_p+u{F}_p are given, where p is an odd prime and u^2=1. Under a special Gray map from F_p+u{F}_p to F_p^2, some new non-binary quantum codes are obtained by this class of constacyclic codes.

  11. Long non-coding RNA expression profile in cervical cancer tissues

    Science.gov (United States)

    Zhu, Hua; Chen, Xiangjian; Hu, Yan; Shi, Zhengzheng; Zhou, Qing; Zheng, Jingjie; Wang, Yifeng

    2017-01-01

    Cervical cancer (CC), one of the most common types of cancer of the female population, presents an enormous challenge in diagnosis and treatment. Long non-coding (lnc)RNAs, non-coding (nc)RNAs with length >200 nucleotides, have been identified to be associated with multiple types of cancer, including CC. This class of nc transcripts serves an important role in tumor suppression and oncogenic signaling pathways. In the present study, the microarray method was used to obtain the expression profile of lncRNAs and protein-coding mRNAs and to compare the expression of lncRNAs between CC tissues and corresponding adjacent non-cancerous tissues in order to screen potential lncRNAs for associations with CC. Overall, 3356 lncRNAs with significantly different expression pattern in CC tissues compared with adjacent non-cancerous tissues were identified, while 1,857 of them were upregulated. These differentially expressed lncRNAs were additionally classified into 5 subgroups. Reverse transcription quantitative polymerase chain reactions were performed to validate the expression pattern of 5 random selected lncRNAs, and 2lncRNAs were identified to have significantly different expression in CC samples compared with adjacent non-cancerous tissues. This finding suggests that those lncRNAs with different expression may serve important roles in the development of CC, and the expression data may provide information for additional study on the involvement of lncRNAs in CC. PMID:28789353

  12. RNA editing differently affects protein-coding genes in D. melanogaster and H. sapiens.

    Science.gov (United States)

    Grassi, Luigi; Leoni, Guido; Tramontano, Anna

    2015-07-14

    When an RNA editing event occurs within a coding sequence it can lead to a different encoded amino acid. The biological significance of these events remains an open question: they can modulate protein functionality, increase the complexity of transcriptomes or arise from a loose specificity of the involved enzymes. We analysed the editing events in coding regions that produce or not a change in the encoded amino acid (nonsynonymous and synonymous events, respectively) in D. melanogaster and in H. sapiens and compared them with the appropriate random models. Interestingly, our results show that the phenomenon has rather different characteristics in the two organisms. For example, we confirm the observation that editing events occur more frequently in non-coding than in coding regions, and report that this effect is much more evident in H. sapiens. Additionally, in this latter organism, editing events tend to affect less conserved residues. The less frequently occurring editing events in Drosophila tend to avoid drastic amino acid changes. Interestingly, we find that, in Drosophila, changes from less frequently used codons to more frequently used ones are favoured, while this is not the case in H. sapiens.

  13. Association of six CpG-SNPs in the inflammation-related genes with coronary heart disease.

    Science.gov (United States)

    Chen, Xiaomin; Chen, Xiaoying; Xu, Yan; Yang, William; Wu, Nan; Ye, Huadan; Yang, Jack Y; Hong, Qingxiao; Xin, Yanfei; Yang, Mary Qu; Deng, Youping; Duan, Shiwei

    2016-07-25

    Chronic inflammation has been widely considered to be the major risk factor of coronary heart disease (CHD). The goal of our study was to explore the possible association with CHD for inflammation-related single nucleotide polymorphisms (SNPs) involved in cytosine-phosphate-guanine (CpG) dinucleotides. A total of 784 CHD patients and 739 non-CHD controls were recruited from Zhejiang Province, China. Using the Sequenom MassARRAY platform, we measured the genotypes of six inflammation-related CpG-SNPs, including IL1B rs16944, IL1R2 rs2071008, PLA2G7 rs9395208, FAM5C rs12732361, CD40 rs1800686, and CD36 rs2065666). Allele and genotype frequencies were compared between CHD and non-CHD individuals using the CLUMP22 software with 10,000 Monte Carlo simulations. Allelic tests showed that PLA2G7 rs9395208 and CD40 rs1800686 were significantly associated with CHD. Moreover, IL1B rs16944, PLA2G7 rs9395208, and CD40 rs1800686 were shown to be associated with CHD under the dominant model. Further gender-based subgroup tests showed that one SNP (CD40 rs1800686) and two SNPs (FAM5C rs12732361 and CD36 rs2065666) were associated with CHD in females and males, respectively. And the age-based subgroup tests indicated that PLA2G7 rs9395208, IL1B rs16944, and CD40 rs1800686 were associated with CHD among individuals younger than 55, younger than 65, and over 65, respectively. In conclusion, all the six inflammation-related CpG-SNPs (rs16944, rs2071008, rs12732361, rs2065666, rs9395208, and rs1800686) were associated with CHD in the combined or subgroup tests, suggesting an important role of inflammation in the risk of CHD.

  14. Evolutionary forces shaping genomic islands of population differentiation in humans

    Directory of Open Access Journals (Sweden)

    Hofer Tamara

    2012-03-01

    Full Text Available Abstract Background Levels of differentiation among populations depend both on demographic and selective factors: genetic drift and local adaptation increase population differentiation, which is eroded by gene flow and balancing selection. We describe here the genomic distribution and the properties of genomic regions with unusually high and low levels of population differentiation in humans to assess the influence of selective and neutral processes on human genetic structure. Methods Individual SNPs of the Human Genome Diversity Panel (HGDP showing significantly high or low levels of population differentiation were detected under a hierarchical-island model (HIM. A Hidden Markov Model allowed us to detect genomic regions or islands of high or low population differentiation. Results Under the HIM, only 1.5% of all SNPs are significant at the 1% level, but their genomic spatial distribution is significantly non-random. We find evidence that local adaptation shaped high-differentiation islands, as they are enriched for non-synonymous SNPs and overlap with previously identified candidate regions for positive selection. Moreover there is a negative relationship between the size of islands and recombination rate, which is stronger for islands overlapping with genes. Gene ontology analysis supports the role of diet as a major selective pressure in those highly differentiated islands. Low-differentiation islands are also enriched for non-synonymous SNPs, and contain an overly high proportion of genes belonging to the 'Oncogenesis' biological process. Conclusions Even though selection seems to be acting in shaping islands of high population differentiation, neutral demographic processes might have promoted the appearance of some genomic islands since i as much as 20% of islands are in non-genic regions ii these non-genic islands are on average two times shorter than genic islands, suggesting a more rapid erosion by recombination, and iii most loci are

  15. Four new single nucleotide polymorphisms (SNPs) of toll-like ...

    African Journals Online (AJOL)

    In order to reveal the single nucleotide polymorphisms (SNPs), genotypes and allelic frequencies of each mutation site of TLR7 gene in Chinese native duck breeds, SNPs of duck TLR7 gene were detected by DNA sequencing. The genotypes of 465 native ducks from eight key protected duck breeds were determined by ...

  16. Genetic association of SNPs in the FTO gene and predisposition to obesity in Malaysian Malays

    International Nuclear Information System (INIS)

    Apalasamy, Y.D.; Ming, M.F.; Rampal, S.; Bulgiba, A.; Mohamed, Z.

    2012-01-01

    The common variants in the fat mass- and obesity-associated (FTO) gene have been previously found to be associated with obesity in various adult populations. The objective of the present study was to investigate whether the single nucleotide polymorphisms (SNPs) and linkage disequilibrium (LD) blocks in various regions of the FTO gene are associated with predisposition to obesity in Malaysian Malays. Thirty-one FTO SNPs were genotyped in 587 (158 obese and 429 non-obese) Malaysian Malay subjects. Obesity traits and lipid profiles were measured and single-marker association testing, LD testing, and haplotype association analysis were performed. LD analysis of the FTO SNPs revealed the presence of 57 regions with complete LD (D' = 1.0). In addition, we detected the association of rs17817288 with low-density lipoprotein cholesterol. The FTO gene may therefore be involved in lipid metabolism in Malaysian Malays. Two haplotype blocks were present in this region of the FTO gene, but no particular haplotype was found to be significantly associated with an increased risk of obesity in Malaysian Malays

  17. Genetic association of SNPs in the FTO gene and predisposition to obesity in Malaysian Malays

    Energy Technology Data Exchange (ETDEWEB)

    Apalasamy, Y.D. [Pharmacogenomics Laboratory, Department of Pharmacology, Faculty of Medicine, University of Malaya, Kuala Lumpur (Malaysia); Ming, M.F.; Rampal, S.; Bulgiba, A. [Julius Centre University of Malaya, Department of Social and Preventive Medicine, Faculty of Medicine, University of Malaya, Kuala Lumpur (Malaysia); Mohamed, Z. [Pharmacogenomics Laboratory, Department of Pharmacology, Faculty of Medicine, University of Malaya, Kuala Lumpur (Malaysia)

    2012-08-24

    The common variants in the fat mass- and obesity-associated (FTO) gene have been previously found to be associated with obesity in various adult populations. The objective of the present study was to investigate whether the single nucleotide polymorphisms (SNPs) and linkage disequilibrium (LD) blocks in various regions of the FTO gene are associated with predisposition to obesity in Malaysian Malays. Thirty-one FTO SNPs were genotyped in 587 (158 obese and 429 non-obese) Malaysian Malay subjects. Obesity traits and lipid profiles were measured and single-marker association testing, LD testing, and haplotype association analysis were performed. LD analysis of the FTO SNPs revealed the presence of 57 regions with complete LD (D' = 1.0). In addition, we detected the association of rs17817288 with low-density lipoprotein cholesterol. The FTO gene may therefore be involved in lipid metabolism in Malaysian Malays. Two haplotype blocks were present in this region of the FTO gene, but no particular haplotype was found to be significantly associated with an increased risk of obesity in Malaysian Malays.

  18. Genetic association of SNPs in the FTO gene and predisposition to obesity in Malaysian Malays

    Directory of Open Access Journals (Sweden)

    Y.D. Apalasamy

    2012-12-01

    Full Text Available The common variants in the fat mass- and obesity-associated (FTO gene have been previously found to be associated with obesity in various adult populations. The objective of the present study was to investigate whether the single nucleotide polymorphisms (SNPs and linkage disequilibrium (LD blocks in various regions of the FTO gene are associated with predisposition to obesity in Malaysian Malays. Thirty-one FTO SNPs were genotyped in 587 (158 obese and 429 non-obese Malaysian Malay subjects. Obesity traits and lipid profiles were measured and single-marker association testing, LD testing, and haplotype association analysis were performed. LD analysis of the FTO SNPs revealed the presence of 57 regions with complete LD (D’ = 1.0. In addition, we detected the association of rs17817288 with low-density lipoprotein cholesterol. The FTO gene may therefore be involved in lipid metabolism in Malaysian Malays. Two haplotype blocks were present in this region of the FTO gene, but no particular haplotype was found to be significantly associated with an increased risk of obesity in Malaysian Malays.

  19. Genotyping of 75 SNPs using arrays for individual identification in five population groups.

    Science.gov (United States)

    Hwa, Hsiao-Lin; Wu, Lawrence Shih Hsin; Lin, Chun-Yen; Huang, Tsun-Ying; Yin, Hsiang-I; Tseng, Li-Hui; Lee, James Chun-I

    2016-01-01

    Single nucleotide polymorphism (SNP) typing offers promise to forensic genetics. Various strategies and panels for analyzing SNP markers for individual identification have been published. However, the best panels with fewer identity SNPs for all major population groups are still under discussion. This study aimed to find more autosomal SNPs with high heterozygosity for individual identification among Asian populations. Ninety-six autosomal SNPs of 502 DNA samples from unrelated individuals of five population groups (208 Taiwanese Han, 83 Filipinos, 62 Thais, 69 Indonesians, and 80 individuals with European, Near Eastern, or South Asian ancestry) were analyzed using arrays in an initial screening, and 75 SNPs (group A, 46 newly selected SNPs; groups B, 29 SNPs based on a previous SNP panel) were selected for further statistical analyses. Some SNPs with high heterozygosity from Asian populations were identified. The combined random match probability of the best 40 and 45 SNPs was between 3.16 × 10(-17) and 7.75 × 10(-17) and between 2.33 × 10(-19) and 7.00 × 10(-19), respectively, in all five populations. These loci offer comparable power to short tandem repeats (STRs) for routine forensic profiling. In this study, we demonstrated the population genetic characteristics and forensic parameters of 75 SNPs with high heterozygosity from five population groups. This SNPs panel can provide valuable genotypic information and can be helpful in forensic casework for individual identification among these populations.

  20. An RNA-Seq strategy to detect the complete coding and non-coding transcriptome including full-length imprinted macro ncRNAs.

    Directory of Open Access Journals (Sweden)

    Ru Huang

    Full Text Available Imprinted macro non-protein-coding (nc RNAs are cis-repressor transcripts that silence multiple genes in at least three imprinted gene clusters in the mouse genome. Similar macro or long ncRNAs are abundant in the mammalian genome. Here we present the full coding and non-coding transcriptome of two mouse tissues: differentiated ES cells and fetal head using an optimized RNA-Seq strategy. The data produced is highly reproducible in different sequencing locations and is able to detect the full length of imprinted macro ncRNAs such as Airn and Kcnq1ot1, whose length ranges between 80-118 kb. Transcripts show a more uniform read coverage when RNA is fragmented with RNA hydrolysis compared with cDNA fragmentation by shearing. Irrespective of the fragmentation method, all coding and non-coding transcripts longer than 8 kb show a gradual loss of sequencing tags towards the 3' end. Comparisons to published RNA-Seq datasets show that the strategy presented here is more efficient in detecting known functional imprinted macro ncRNAs and also indicate that standardization of RNA preparation protocols would increase the comparability of the transcriptome between different RNA-Seq datasets.

  1. Identification of SNPs in chemerin gene and association with ...

    African Journals Online (AJOL)

    Chemerin is a novel adipokine that regulates adipogenesis and adipocyte metabolism via its own receptor. In this study, two novel SNPs (868A>G in exon 2 and 2692C>T in exon 5) of chemerin gene were identified by PCR-SSCP and DNA sequencing technology. The allele frequencies of the novel SNPs were determined ...

  2. Genome-wide identification of coding and non-coding conserved sequence tags in human and mouse genomes

    Directory of Open Access Journals (Sweden)

    Maggi Giorgio P

    2008-06-01

    Full Text Available Abstract Background The accurate detection of genes and the identification of functional regions is still an open issue in the annotation of genomic sequences. This problem affects new genomes but also those of very well studied organisms such as human and mouse where, despite the great efforts, the inventory of genes and regulatory regions is far from complete. Comparative genomics is an effective approach to address this problem. Unfortunately it is limited by the computational requirements needed to perform genome-wide comparisons and by the problem of discriminating between conserved coding and non-coding sequences. This discrimination is often based (thus dependent on the availability of annotated proteins. Results In this paper we present the results of a comprehensive comparison of human and mouse genomes performed with a new high throughput grid-based system which allows the rapid detection of conserved sequences and accurate assessment of their coding potential. By detecting clusters of coding conserved sequences the system is also suitable to accurately identify potential gene loci. Following this analysis we created a collection of human-mouse conserved sequence tags and carefully compared our results to reliable annotations in order to benchmark the reliability of our classifications. Strikingly we were able to detect several potential gene loci supported by EST sequences but not corresponding to as yet annotated genes. Conclusion Here we present a new system which allows comprehensive comparison of genomes to detect conserved coding and non-coding sequences and the identification of potential gene loci. Our system does not require the availability of any annotated sequence thus is suitable for the analysis of new or poorly annotated genomes.

  3. TFIIS-Dependent Non-coding Transcription Regulates Developmental Genome Rearrangements.

    Directory of Open Access Journals (Sweden)

    Kamila Maliszewska-Olejniczak

    2015-07-01

    Full Text Available Because of their nuclear dimorphism, ciliates provide a unique opportunity to study the role of non-coding RNAs (ncRNAs in the communication between germline and somatic lineages. In these unicellular eukaryotes, a new somatic nucleus develops at each sexual cycle from a copy of the zygotic (germline nucleus, while the old somatic nucleus degenerates. In the ciliate Paramecium tetraurelia, the genome is massively rearranged during this process through the reproducible elimination of repeated sequences and the precise excision of over 45,000 short, single-copy Internal Eliminated Sequences (IESs. Different types of ncRNAs resulting from genome-wide transcription were shown to be involved in the epigenetic regulation of genome rearrangements. To understand how ncRNAs are produced from the entire genome, we have focused on a homolog of the TFIIS elongation factor, which regulates RNA polymerase II transcriptional pausing. Six TFIIS-paralogs, representing four distinct families, can be found in P. tetraurelia genome. Using RNA interference, we showed that TFIIS4, which encodes a development-specific TFIIS protein, is essential for the formation of a functional somatic genome. Molecular analyses and high-throughput DNA sequencing upon TFIIS4 RNAi demonstrated that TFIIS4 is involved in all kinds of genome rearrangements, including excision of ~48% of IESs. Localization of a GFP-TFIIS4 fusion revealed that TFIIS4 appears specifically in the new somatic nucleus at an early developmental stage, before IES excision. RT-PCR experiments showed that TFIIS4 is necessary for the synthesis of IES-containing non-coding transcripts. We propose that these IES+ transcripts originate from the developing somatic nucleus and serve as pairing substrates for germline-specific short RNAs that target elimination of their homologous sequences. Our study, therefore, connects the onset of zygotic non coding transcription to the control of genome plasticity in Paramecium

  4. SNPsnap: a Web-based tool for identification and annotation of matched SNPs

    DEFF Research Database (Denmark)

    Pers, Tune Hannes; Timshel, Pascal; Hirschhorn, Joel N.

    2015-01-01

    -localization of GWAS signals to gene-dense and high linkage disequilibrium (LD) regions, and correlations of gene size, location and function. The SNPsnap Web server enables SNP-based enrichment analysis by providing matched sets of SNPs that can be used to calibrate background expectations. Specifically, SNPsnap...... efficiently identifies sets of randomly drawn SNPs that are matched to a set of query SNPs based on allele frequency, number of SNPs in LD, distance to nearest gene and gene density. Availability and implementation : SNPsnap server is available at http://www.broadinstitute.org/mpg/snpsnap/. Contact: joelh...

  5. nRC: non-coding RNA Classifier based on structural features.

    Science.gov (United States)

    Fiannaca, Antonino; La Rosa, Massimo; La Paglia, Laura; Rizzo, Riccardo; Urso, Alfonso

    2017-01-01

    Non-coding RNA (ncRNA) are small non-coding sequences involved in gene expression regulation of many biological processes and diseases. The recent discovery of a large set of different ncRNAs with biologically relevant roles has opened the way to develop methods able to discriminate between the different ncRNA classes. Moreover, the lack of knowledge about the complete mechanisms in regulative processes, together with the development of high-throughput technologies, has required the help of bioinformatics tools in addressing biologists and clinicians with a deeper comprehension of the functional roles of ncRNAs. In this work, we introduce a new ncRNA classification tool, nRC (non-coding RNA Classifier). Our approach is based on features extraction from the ncRNA secondary structure together with a supervised classification algorithm implementing a deep learning architecture based on convolutional neural networks. We tested our approach for the classification of 13 different ncRNA classes. We obtained classification scores, using the most common statistical measures. In particular, we reach an accuracy and sensitivity score of about 74%. The proposed method outperforms other similar classification methods based on secondary structure features and machine learning algorithms, including the RNAcon tool that, to date, is the reference classifier. nRC tool is freely available as a docker image at https://hub.docker.com/r/tblab/nrc/. The source code of nRC tool is also available at https://github.com/IcarPA-TBlab/nrc.

  6. Long Non-Coding RNA in Cancer

    Directory of Open Access Journals (Sweden)

    Damjan Glavač

    2013-02-01

    Full Text Available Long non-coding RNAs (lncRNAs are pervasively transcribed in the genome and are emerging as new players in tumorigenesis due to their various functions in transcriptional, posttranscriptional and epigenetic mechanisms of gene regulation. LncRNAs are deregulated in a number of cancers, demonstrating both oncogenic and tumor suppressive roles, thus suggesting their aberrant expression may be a substantial contributor in cancer development. In this review, we will summarize their emerging role in human cancer and discuss their perspectives in diagnostics as potential biomarkers.

  7. A Case for Dynamic Reverse-code Generation to Debug Non-deterministic Programs

    Directory of Open Access Journals (Sweden)

    Jooyong Yi

    2013-09-01

    Full Text Available Backtracking (i.e., reverse execution helps the user of a debugger to naturally think backwards along the execution path of a program, and thinking backwards makes it easy to locate the origin of a bug. So far backtracking has been implemented mostly by state saving or by checkpointing. These implementations, however, inherently do not scale. Meanwhile, a more recent backtracking method based on reverse-code generation seems promising because executing reverse code can restore the previous states of a program without state saving. In the literature, there can be found two methods that generate reverse code: (a static reverse-code generation that pre-generates reverse code through static analysis before starting a debugging session, and (b dynamic reverse-code generation that generates reverse code by applying dynamic analysis on the fly during a debugging session. In particular, we espoused the latter one in our previous work to accommodate non-determinism of a program caused by e.g., multi-threading. To demonstrate the usefulness of our dynamic reverse-code generation, this article presents a case study of various backtracking methods including ours. We compare the memory usage of various backtracking methods in a simple but nontrivial example, a bounded-buffer program. In the case of non-deterministic programs such as this bounded-buffer program, our dynamic reverse-code generation outperforms the existing backtracking methods in terms of memory efficiency.

  8. Identification of Novel Long Non-coding and Circular RNAs in Human Papillomavirus-Mediated Cervical Cancer

    Directory of Open Access Journals (Sweden)

    Hongbo Wang

    2017-09-01

    Full Text Available Cervical cancer is the third most common cancer worldwide and the fourth leading cause of cancer-associated mortality in women. Accumulating evidence indicates that long non-coding RNAs (lncRNAs and circular RNAs (circRNAs may play key roles in the carcinogenesis of different cancers; however, little is known about the mechanisms of lncRNAs and circRNAs in the progression and metastasis of cervical cancer. In this study, we explored the expression profiles of lncRNAs, circRNAs, miRNAs, and mRNAs in HPV16 (human papillomavirus genotype 16 mediated cervical squamous cell carcinoma and matched adjacent non-tumor (ATN tissues from three patients with high-throughput RNA sequencing (RNA-seq. In total, we identified 19 lncRNAs, 99 circRNAs, 28 miRNAs, and 304 mRNAs that were commonly differentially expressed (DE in different patients. Among the non-coding RNAs, 3 lncRNAs and 44 circRNAs are novel to our knowledge. Functional enrichment analysis showed that DE lncRNAs, miRNAs, and mRNAs were enriched in pathways crucial to cancer as well as other gene ontology (GO terms. Furthermore, the co-expression network and function prediction suggested that all 19 DE lncRNAs could play different roles in the carcinogenesis and development of cervical cancer. The competing endogenous RNA (ceRNA network based on DE coding and non-coding RNAs showed that each miRNA targeted a number of lncRNAs and circRNAs. The link between part of the miRNAs in the network and cervical cancer has been validated in previous studies, and these miRNAs targeted the majority of the novel non-coding RNAs, thus suggesting that these novel non-coding RNAs may be involved in cervical cancer. Taken together, our study shows that DE non-coding RNAs could be further developed as diagnostic and therapeutic biomarkers of cervical cancer. The complex ceRNA network also lays the foundation for future research of the roles of coding and non-coding RNAs in cervical cancer.

  9. Identification of Novel Long Non-coding and Circular RNAs in Human Papillomavirus-Mediated Cervical Cancer

    Science.gov (United States)

    Wang, Hongbo; Zhao, Yingchao; Chen, Mingyue; Cui, Jie

    2017-01-01

    Cervical cancer is the third most common cancer worldwide and the fourth leading cause of cancer-associated mortality in women. Accumulating evidence indicates that long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs) may play key roles in the carcinogenesis of different cancers; however, little is known about the mechanisms of lncRNAs and circRNAs in the progression and metastasis of cervical cancer. In this study, we explored the expression profiles of lncRNAs, circRNAs, miRNAs, and mRNAs in HPV16 (human papillomavirus genotype 16) mediated cervical squamous cell carcinoma and matched adjacent non-tumor (ATN) tissues from three patients with high-throughput RNA sequencing (RNA-seq). In total, we identified 19 lncRNAs, 99 circRNAs, 28 miRNAs, and 304 mRNAs that were commonly differentially expressed (DE) in different patients. Among the non-coding RNAs, 3 lncRNAs and 44 circRNAs are novel to our knowledge. Functional enrichment analysis showed that DE lncRNAs, miRNAs, and mRNAs were enriched in pathways crucial to cancer as well as other gene ontology (GO) terms. Furthermore, the co-expression network and function prediction suggested that all 19 DE lncRNAs could play different roles in the carcinogenesis and development of cervical cancer. The competing endogenous RNA (ceRNA) network based on DE coding and non-coding RNAs showed that each miRNA targeted a number of lncRNAs and circRNAs. The link between part of the miRNAs in the network and cervical cancer has been validated in previous studies, and these miRNAs targeted the majority of the novel non-coding RNAs, thus suggesting that these novel non-coding RNAs may be involved in cervical cancer. Taken together, our study shows that DE non-coding RNAs could be further developed as diagnostic and therapeutic biomarkers of cervical cancer. The complex ceRNA network also lays the foundation for future research of the roles of coding and non-coding RNAs in cervical cancer. PMID:28970820

  10. Screening and Evaluation of Deleterious SNPs in APOE Gene of Alzheimer’s Disease

    Directory of Open Access Journals (Sweden)

    Tariq Ahmad Masoodi

    2012-01-01

    Full Text Available Introduction. Apolipoprotein E (APOE is an important risk factor for Alzheimer’s disease (AD and is present in 30–50% of patients who develop late-onset AD. Several single-nucleotide polymorphisms (SNPs are present in APOE gene which act as the biomarkers for exploring the genetic basis of this disease. The objective of this study is to identify deleterious nsSNPs associated with APOE gene. Methods. The SNPs were retrieved from dbSNP. Using I-Mutant, protein stability change was calculated. The potentially functional nonsynonymous (ns SNPs and their effect on protein was predicted by PolyPhen and SIFT, respectively. FASTSNP was used for functional analysis and estimation of risk score. The functional impact on the APOE protein was evaluated by using Swiss PDB viewer and NOMAD-Ref server. Results. Six nsSNPs were found to be least stable by I-Mutant 2.0 with DDG value of >−1.0. Four nsSNPs showed a highly deleterious tolerance index score of 0.00. Nine nsSNPs were found to be probably damaging with position-specific independent counts (PSICs score of ≥2.0. Seven nsSNPs were found to be highly polymorphic with a risk score of 3-4. The total energies and root-mean-square deviation (RMSD values were higher for three mutant-type structures compared to the native modeled structure. Conclusion. We concluded that three nsSNPs, namely, rs11542041, rs11542040, and rs11542034, to be potentially functional polymorphic.

  11. Estradiol-Induced Transcriptional Regulation of Long Non-Coding RNA, HOTAIR.

    Science.gov (United States)

    Bhan, Arunoday; Mandal, Subhrangsu S

    2016-01-01

    HOTAIR (HOX antisense intergenic RNA) is a 2.2 kb long non-coding RNA (lncRNA), transcribed from the antisense strand of homeobox C (HOXC) gene locus in chromosome 12. HOTAIR acts as a scaffolding lncRNA. It interacts and guides various chromatin-modifying complexes such as PRC2 (polycomb-repressive complex 2) and LSD1 (lysine-specific demethylase 1) to the target gene promoters leading to their gene silencing. Various studies have demonstrated that HOTAIR overexpression is associated with breast cancer. Recent studies from our laboratory demonstrate that HOTAIR is required for viability of breast cancer cells and is transcriptionally regulated by estradiol (E2) in vitro and in vivo. This chapter describes protocols for analysis of the HOTAIR promoter, cloning, transfection and dual luciferase assays, knockdown of protein synthesis by antisense oligonucleotides, and chromatin immunoprecipitation (ChIP) assay. These protocols are useful for studying the estrogen-mediated transcriptional regulation of lncRNA HOTAIR, as well as other protein coding genes and non-coding RNAs.

  12. Complete mitochondrial genome of Concholepas concholepas inferred by 454 pyrosequencing and mtDNA expression in two mollusc populations.

    Science.gov (United States)

    Núñez-Acuña, Gustavo; Aguilar-Espinoza, Andrea; Gallardo-Escárate, Cristian

    2013-03-01

    Despite the great relevance of mitochondrial genome analysis in evolutionary studies, there is scarce information on how the transcripts associated with the mitogenome are expressed and their role in the genetic structuring of populations. This work reports the complete mitochondrial genome of the marine gastropod Concholepas concholepas, obtained by 454 pryosequencing, and an analysis of mitochondrial transcripts of two populations 1000 km apart along the Chilean coast. The mitochondrion of C. concholepas is 15,495 base pairs (bp) in size and contains the 37 subunits characteristic of metazoans, as well as a non-coding region of 330 bp. In silico analysis of mitochondrial gene variability showed significant differences among populations. In terms of levels of relative abundance of transcripts associated with mitochondrion in the two populations (assessed by qPCR), the genes associated with complexes III and IV of the mitochondrial genome had the highest levels of expression in the northern population while transcripts associated with the ATP synthase complex had the highest levels of expression in the southern population. Moreover, fifteen polymorphic SNPs were identified in silico between the mitogenomes of the two populations. Four of these markers implied different amino acid substitutions (non-synonymous SNPs). This work contributes novel information regarding the mitochondrial genome structure and mRNA expression levels of C. concholepas. Copyright © 2012 Elsevier Inc. All rights reserved.

  13. Advanced GF(32) nonbinary LDPC coded modulation with non-uniform 9-QAM outperforming star 8-QAM.

    Science.gov (United States)

    Liu, Tao; Lin, Changyu; Djordjevic, Ivan B

    2016-06-27

    In this paper, we first describe a 9-symbol non-uniform signaling scheme based on Huffman code, in which different symbols are transmitted with different probabilities. By using the Huffman procedure, prefix code is designed to approach the optimal performance. Then, we introduce an algorithm to determine the optimal signal constellation sets for our proposed non-uniform scheme with the criterion of maximizing constellation figure of merit (CFM). The proposed nonuniform polarization multiplexed signaling 9-QAM scheme has the same spectral efficiency as the conventional 8-QAM. Additionally, we propose a specially designed GF(32) nonbinary quasi-cyclic LDPC code for the coded modulation system based on the 9-QAM non-uniform scheme. Further, we study the efficiency of our proposed non-uniform 9-QAM, combined with nonbinary LDPC coding, and demonstrate by Monte Carlo simulation that the proposed GF(23) nonbinary LDPC coded 9-QAM scheme outperforms nonbinary LDPC coded uniform 8-QAM by at least 0.8dB.

  14. Semantic Modeling for SNPs Associated with Ethnic Disparities in HapMap Samples

    Directory of Open Access Journals (Sweden)

    HyoYoung Kim

    2014-03-01

    Full Text Available Single-nucleotide polymorphisms (SNPs have been emerging out of the efforts to research human diseases and ethnic disparities. A semantic network is needed for in-depth understanding of the impacts of SNPs, because phenotypes are modulated by complex networks, including biochemical and physiological pathways. We identified ethnicity-specific SNPs by eliminating overlapped SNPs from HapMap samples, and the ethnicity-specific SNPs were mapped to the UCSC RefGene lists. Ethnicity-specific genes were identified as follows: 22 genes in the USA (CEU individuals, 25 genes in the Japanese (JPT individuals, and 332 genes in the African (YRI individuals. To analyze the biologically functional implications for ethnicity-specific SNPs, we focused on constructing a semantic network model. Entities for the network represented by "Gene," "Pathway," "Disease," "Chemical," "Drug," "ClinicalTrials," "SNP," and relationships between entity-entity were obtained through curation. Our semantic modeling for ethnicity-specific SNPs showed interesting results in the three categories, including three diseases ("AIDS-associated nephropathy," "Hypertension," and "Pelvic infection", one drug ("Methylphenidate", and five pathways ("Hemostasis," "Systemic lupus erythematosus," "Prostate cancer," "Hepatitis C virus," and "Rheumatoid arthritis". We found ethnicity-specific genes using the semantic modeling, and the majority of our findings was consistent with the previous studies - that an understanding of genetic variability explained ethnicity-specific disparities.

  15. Effective selection of informative SNPs and classification on the HapMap genotype data

    Directory of Open Access Journals (Sweden)

    Wang Lipo

    2007-12-01

    Full Text Available Abstract Background Since the single nucleotide polymorphisms (SNPs are genetic variations which determine the difference between any two unrelated individuals, the SNPs can be used to identify the correct source population of an individual. For efficient population identification with the HapMap genotype data, as few informative SNPs as possible are required from the original 4 million SNPs. Recently, Park et al. (2006 adopted the nearest shrunken centroid method to classify the three populations, i.e., Utah residents with ancestry from Northern and Western Europe (CEU, Yoruba in Ibadan, Nigeria in West Africa (YRI, and Han Chinese in Beijing together with Japanese in Tokyo (CHB+JPT, from which 100,736 SNPs were obtained and the top 82 SNPs could completely classify the three populations. Results In this paper, we propose to first rank each feature (SNP using a ranking measure, i.e., a modified t-test or F-statistics. Then from the ranking list, we form different feature subsets by sequentially choosing different numbers of features (e.g., 1, 2, 3, ..., 100. with top ranking values, train and test them by a classifier, e.g., the support vector machine (SVM, thereby finding one subset which has the highest classification accuracy. Compared to the classification method of Park et al., we obtain a better result, i.e., good classification of the 3 populations using on average 64 SNPs. Conclusion Experimental results show that the both of the modified t-test and F-statistics method are very effective in ranking SNPs about their classification capabilities. Combined with the SVM classifier, a desirable feature subset (with the minimum size and most informativeness can be quickly found in the greedy manner after ranking all SNPs. Our method is able to identify a very small number of important SNPs that can determine the populations of individuals.

  16. THE LEXICOGRAFIC PRINCIPLES OF WORD MEANINGS IN THE MULTILINGUAL SYNONYMIC DICTIONARIES

    Directory of Open Access Journals (Sweden)

    Siddikova, I.A.

    2018-03-01

    Full Text Available This article is dedicated to study of principles of description of the meaning of words in the multilingual synonymic dictionaries. The dictionary of synonyms must have full enough and absolutely explicit description of their semantic similarity and distinctions. The description can be full if it includes all existing features of the words, adequately denote every meaning and help the language learners and speakers in choosing possible meanings of synonyms owing to situation. The synonymic dictionary must include all synonyms, their meanings, lexico-semantic combination, distribution, grammatical constructions and stylistic features showing their usage in certain contexts and situations. In some cases according to their contextual meanings synonyms may be substituted depending on situation. The article is based on examples of English, Uzbek and Russian languages.

  17. Domain altering SNPs in the human proteome and their impact on signaling pathways.

    Directory of Open Access Journals (Sweden)

    Yichuan Liu

    Full Text Available Single nucleotide polymorphisms (SNPs constitute an important mode of genetic variations observed in the human genome. A small fraction of SNPs, about four thousand out of the ten million, has been associated with genetic disorders and complex diseases. The present study focuses on SNPs that fall on protein domains, 3D structures that facilitate connectivity of proteins in cell signaling and metabolic pathways. We scanned the human proteome using the PROSITE web tool and identified proteins with SNP containing domains. We showed that SNPs that fall on protein domains are highly statistically enriched among SNPs linked to hereditary disorders and complex diseases. Proteins whose domains are dramatically altered by the presence of an SNP are even more likely to be present among proteins linked to hereditary disorders. Proteins with domain-altering SNPs comprise highly connected nodes in cellular pathways such as the focal adhesion, the axon guidance pathway and the autoimmune disease pathways. Statistical enrichment of domain/motif signatures in interacting protein pairs indicates extensive loss of connectivity of cell signaling pathways due to domain-altering SNPs, potentially leading to hereditary disorders.

  18. Python Radiative Transfer Emission code (PyRaTE): non-LTE spectral lines simulations

    Science.gov (United States)

    Tritsis, A.; Yorke, H.; Tassis, K.

    2018-05-01

    We describe PyRaTE, a new, non-local thermodynamic equilibrium (non-LTE) line radiative transfer code developed specifically for post-processing astrochemical simulations. Population densities are estimated using the escape probability method. When computing the escape probability, the optical depth is calculated towards all directions with density, molecular abundance, temperature and velocity variations all taken into account. A very easy-to-use interface, capable of importing data from simulations outputs performed with all major astrophysical codes, is also developed. The code is written in PYTHON using an "embarrassingly parallel" strategy and can handle all geometries and projection angles. We benchmark the code by comparing our results with those from RADEX (van der Tak et al. 2007) and against analytical solutions and present case studies using hydrochemical simulations. The code will be released for public use.

  19. Genome-wide re-sequencing of multidrug-resistant Mycobacterium leprae Airaku-3.

    Science.gov (United States)

    Singh, P; Benjak, A; Carat, S; Kai, M; Busso, P; Avanzi, C; Paniz-Mondolfi, A; Peter, C; Harshman, K; Rougemont, J; Matsuoka, M; Cole, S T

    2014-10-01

    Genotyping and molecular characterization of drug resistance mechanisms in Mycobacterium leprae enables disease transmission and drug resistance trends to be monitored. In the present study, we performed genome-wide analysis of Airaku-3, a multidrug-resistant strain with an unknown mechanism of resistance to rifampicin. We identified 12 unique non-synonymous single-nucleotide polymorphisms (SNPs) including two in the transporter-encoding ctpC and ctpI genes. In addition, two SNPs were found that improve the resolution of SNP-based genotyping, particularly for Venezuelan and South East Asian strains of M. leprae. © 2014 The Authors Clinical Microbiology and Infection © 2014 European Society of Clinical Microbiology and Infectious Diseases.

  20. Multiple origins of resistance-conferring mutations in Plasmodium vivax dihydrofolate reductase

    Directory of Open Access Journals (Sweden)

    O'Neil Michael T

    2008-04-01

    Full Text Available Abstract Background In order to maximize the useful therapeutic life of antimalarial drugs, it is crucial to understand the mechanisms by which parasites resistant to antimalarial drugs are selected and spread in natural populations. Recent work has demonstrated that pyrimethamine-resistance conferring mutations in Plasmodium falciparum dihydrofolate reductase (dhfr have arisen rarely de novo, but spread widely in Asia and Africa. The origin and spread of mutations in Plasmodium vivax dhfr were assessed by constructing haplotypes based on sequencing dhfr and its flanking regions. Methods The P. vivax dhfr coding region, 792 bp upstream and 683 bp downstream were amplified and sequenced from 137 contemporary patient isolates from Colombia, India, Indonesia, Papua New Guinea, Sri Lanka, Thailand, and Vanuatu. A repeat motif located 2.6 kb upstream of dhfr was also sequenced from 75 of 137 patient isolates, and mutational relationships among the haplotypes were visualized using the programme Network. Results Synonymous and non-synonymous single nucleotide polymorphisms (SNPs within the dhfr coding region were identified, as was the well-documented in-frame insertion/deletion (indel. SNPs were also identified upstream and downstream of dhfr, with an indel and a highly polymorphic repeat region identified upstream of dhfr. The regions flanking dhfr were highly variable. The double mutant (58R/117N dhfr allele has evolved from several origins, because the 58R is encoded by at least 3 different codons. The triple (58R/61M/117T and quadruple (57L/61M/117T/173F, 57I/58R/61M/117T and 57L/58R/61M/117T mutant alleles had at least three independent origins in Thailand, Indonesia, and Papua New Guinea/Vanuatu. Conclusion It was found that the P. vivax dhfr coding region and its flanking intergenic regions are highly polymorphic and that mutations in P. vivax dhfr that confer antifolate resistance have arisen several times in the Asian region. This contrasts

  1. SNPs in PPARG associate with type 2 diabetes and interact with physical activity

    DEFF Research Database (Denmark)

    Oskari Kilpeläinen, Tuomas; Lakka, Timo A; Laaksonen, David E

    2008-01-01

    To study the associations of seven single-nucleotide polymorphisms (SNPs) in the peroxisome proliferator-activated receptor gamma (PPARG) gene with the conversion from impaired glucose tolerance (IGT) to type 2 diabetes (T2D), and the interactions of the SNPs with physical activity (PA).......To study the associations of seven single-nucleotide polymorphisms (SNPs) in the peroxisome proliferator-activated receptor gamma (PPARG) gene with the conversion from impaired glucose tolerance (IGT) to type 2 diabetes (T2D), and the interactions of the SNPs with physical activity (PA)....

  2. Genomic characterization of the porcine CRTC3 and the effects of a non-synonymous mutation p.V515F on lean meat production and belly fat.

    Science.gov (United States)

    Lee, S H; Hur, M H; Lee, E A; Hong, K C; Kim, J M

    2018-03-01

    cAMP-responsive element-binding protein (CREB)-regulated transcriptional coactivator 3 (CRTC3) is well known to be related to obesity in humans and mice. However, the effects of CRTC3 have not been studied in pigs. Here, we characterized the structure of the porcine CRTC3 gene and identified single nucleotide polymorphisms (SNPs) in its coding region. Moreover, mRNA expression profiles of CRTC3 in muscle and fat tissues were examined. Of the 40 identified SNPs, the p.V515F mutation, located on exon 16, was genotyped in 368 Yorkshire pigs. The p.V515F mutation was significantly associated with lean meat production ability, including reduced back fat thickness (P=0.0317) and loin eye area (P=0.0174). Moreover, the SNP was significantly associated with differences in intermuscular fat (P=0.0092), total muscle area in the belly (P=0.0108), and total fat percentage in the belly (P=0.0298). Taken together, our results suggest that the p.V515F mutation affects to lean meat production ability and amount of belly fat. Copyright © 2017. Published by Elsevier Ltd.

  3. Rapid multiplex high resolution melting method to analyze inflammatory related SNPs in preterm birth

    Directory of Open Access Journals (Sweden)

    Pereyra Silvana

    2012-01-01

    Full Text Available Abstract Background Complex traits like cancer, diabetes, obesity or schizophrenia arise from an intricate interaction between genetic and environmental factors. Complex disorders often cluster in families without a clear-cut pattern of inheritance. Genomic wide association studies focus on the detection of tens or hundreds individual markers contributing to complex diseases. In order to test if a subset of single nucleotide polymorphisms (SNPs from candidate genes are associated to a condition of interest in a particular individual or group of people, new techniques are needed. High-resolution melting (HRM analysis is a new method in which polymerase chain reaction (PCR and mutations scanning are carried out simultaneously in a closed tube, making the procedure fast, inexpensive and easy. Preterm birth (PTB is considered a complex disease, where genetic and environmental factors interact to carry out the delivery of a newborn before 37 weeks of gestation. It is accepted that inflammation plays an important role in pregnancy and PTB. Methods Here, we used real time-PCR followed by HRM analysis to simultaneously identify several gene variations involved in inflammatory pathways on preterm labor. SNPs from TLR4, IL6, IL1 beta and IL12RB genes were analyzed in a case-control study. The results were confirmed either by sequencing or by PCR followed by restriction fragment length polymorphism. Results We were able to simultaneously recognize the variations of four genes with similar accuracy than other methods. In order to obtain non-overlapping melting temperatures, the key step in this strategy was primer design. Genotypic frequencies found for each SNP are in concordance with those previously described in similar populations. None of the studied SNPs were associated with PTB. Conclusions Several gene variations related to the same inflammatory pathway were screened through a new flexible, fast and non expensive method with the purpose of analyzing

  4. The rhesus macaque is three times as diverse but more closely equivalent in damaging coding variation as compared to the human

    Directory of Open Access Journals (Sweden)

    Yuan Qiaoping

    2012-06-01

    Full Text Available Abstract Background As a model organism in biomedicine, the rhesus macaque (Macaca mulatta is the most widely used nonhuman primate. Although a draft genome sequence was completed in 2007, there has been no systematic genome-wide comparison of genetic variation of this species to humans. Comparative analysis of functional and nonfunctional diversity in this highly abundant and adaptable non-human primate could inform its use as a model for human biology, and could reveal how variation in population history and size alters patterns and levels of sequence variation in primates. Results We sequenced the mRNA transcriptome and H3K4me3-marked DNA regions in hippocampus from 14 humans and 14 rhesus macaques. Using equivalent methodology and sampling spaces, we identified 462,802 macaque SNPs, most of which were novel and disproportionately located in the functionally important genomic regions we had targeted in the sequencing. At least one SNP was identified in each of 16,797 annotated macaque genes. Accuracy of macaque SNP identification was conservatively estimated to be >90%. Comparative analyses using SNPs equivalently identified in the two species revealed that rhesus macaque has approximately three times higher SNP density and average nucleotide diversity as compared to the human. Based on this level of diversity, the effective population size of the rhesus macaque is approximately 80,000 which contrasts with an effective population size of less than 10,000 for humans. Across five categories of genomic regions, intergenic regions had the highest SNP density and average nucleotide diversity and CDS (coding sequences the lowest, in both humans and macaques. Although there are more coding SNPs (cSNPs per individual in macaques than in humans, the ratio of dN/dS is significantly lower in the macaque. Furthermore, the number of damaging nonsynonymous cSNPs (have damaging effects on protein functions from PolyPhen-2 prediction in the macaque is more

  5. The Non-Coding RNA Ontology (NCRO): a comprehensive resource for the unification of non-coding RNA biology.

    Science.gov (United States)

    Huang, Jingshan; Eilbeck, Karen; Smith, Barry; Blake, Judith A; Dou, Dejing; Huang, Weili; Natale, Darren A; Ruttenberg, Alan; Huan, Jun; Zimmermann, Michael T; Jiang, Guoqian; Lin, Yu; Wu, Bin; Strachan, Harrison J; He, Yongqun; Zhang, Shaojie; Wang, Xiaowei; Liu, Zixing; Borchert, Glen M; Tan, Ming

    2016-01-01

    In recent years, sequencing technologies have enabled the identification of a wide range of non-coding RNAs (ncRNAs). Unfortunately, annotation and integration of ncRNA data has lagged behind their identification. Given the large quantity of information being obtained in this area, there emerges an urgent need to integrate what is being discovered by a broad range of relevant communities. To this end, the Non-Coding RNA Ontology (NCRO) is being developed to provide a systematically structured and precisely defined controlled vocabulary for the domain of ncRNAs, thereby facilitating the discovery, curation, analysis, exchange, and reasoning of data about structures of ncRNAs, their molecular and cellular functions, and their impacts upon phenotypes. The goal of NCRO is to serve as a common resource for annotations of diverse research in a way that will significantly enhance integrative and comparative analysis of the myriad resources currently housed in disparate sources. It is our belief that the NCRO ontology can perform an important role in the comprehensive unification of ncRNA biology and, indeed, fill a critical gap in both the Open Biological and Biomedical Ontologies (OBO) Library and the National Center for Biomedical Ontology (NCBO) BioPortal. Our initial focus is on the ontological representation of small regulatory ncRNAs, which we see as the first step in providing a resource for the annotation of data about all forms of ncRNAs. The NCRO ontology is free and open to all users, accessible at: http://purl.obolibrary.org/obo/ncro.owl.

  6. Comprehensive reconstruction andvisualization of non-coding regulatorynetworks in human

    Directory of Open Access Journals (Sweden)

    Vincenzo eBonnici

    2014-12-01

    Full Text Available Research attention has been powered to understand the functional roles of non-coding RNAs (ncRNAs. Many studies have demonstrated their deregulation in cancer and other human disorders. ncRNAs are also present in extracellular human body fluids such as serum and plasma, giving them a great potential as non-invasive biomarkers. However, non-coding RNAs have been relatively recently discovered and a comprehensive database including all of them is still missing. Reconstructing and visualizing the network of ncRNAs interactions are important steps to understand their regulatory mechanism in complex systems. This work presents ncRNA-DB, a NoSQL database that integrates ncRNAs data interactions from a large number of well established online repositories. The interactions involve RNA, DNA, proteins and diseases. ncRNA-DB is available at http://ncrnadb.scienze.univr.it/ncrnadb/. It is equipped with three interfaces: web based, command line and a Cytoscape app called ncINetView. By accessing only one resource, users can search for ncRNAs and their interactions, build a network annotated with all known ncRNAs and associated diseases, and use all visual and mining features available in Cytoscape.

  7. RNA-Seq of human neurons derived from iPS cells reveals candidate long non-coding RNAs involved in neurogenesis and neuropsychiatric disorders.

    Directory of Open Access Journals (Sweden)

    Mingyan Lin

    Full Text Available Genome-wide expression analysis using next generation sequencing (RNA-Seq provides an opportunity for in-depth molecular profiling of fundamental biological processes, such as cellular differentiation and malignant transformation. Differentiating human neurons derived from induced pluripotent stem cells (iPSCs provide an ideal system for RNA-Seq since defective neurogenesis caused by abnormalities in transcription factors, DNA methylation, and chromatin modifiers lie at the heart of some neuropsychiatric disorders. As a preliminary step towards applying next generation sequencing using neurons derived from patient-specific iPSCs, we have carried out an RNA-Seq analysis on control human neurons. Dramatic changes in the expression of coding genes, long non-coding RNAs (lncRNAs, pseudogenes, and splice isoforms were seen during the transition from pluripotent stem cells to early differentiating neurons. A number of genes that undergo radical changes in expression during this transition include candidates for schizophrenia (SZ, bipolar disorder (BD and autism spectrum disorders (ASD that function as transcription factors and chromatin modifiers, such as POU3F2 and ZNF804A, and genes coding for cell adhesion proteins implicated in these conditions including NRXN1 and NLGN1. In addition, a number of novel lncRNAs were found to undergo dramatic changes in expression, one of which is HOTAIRM1, a regulator of several HOXA genes during myelopoiesis. The increase we observed in differentiating neurons suggests a role in neurogenesis as well. Finally, several lncRNAs that map near SNPs associated with SZ in genome wide association studies also increase during neuronal differentiation, suggesting that these novel transcripts may be abnormally regulated in a subgroup of patients.

  8. Therapeutic targeting of non-coding RNAs in cancer

    Czech Academy of Sciences Publication Activity Database

    Slabý, O.; Laga, Richard; Sedláček, Ondřej

    2017-01-01

    Roč. 474, č. 24 (2017), s. 4219-4251 ISSN 0264-6021 R&D Projects: GA MŠk(CZ) LQ1604; GA MŠk(CZ) ED1.1.00/02.0109 Institutional support: RVO:61389013 Keywords : non-coding RNA * RNA delivery * polymer carriers Subject RIV: EB - Genetics ; Molecular Biology OBOR OECD: Biochemical research methods Impact factor: 3.797, year: 2016

  9. Constacyclic codes over the ring F_q+v{F}_q+v2F_q and their applications of constructing new non-binary quantum codes

    Science.gov (United States)

    Ma, Fanghui; Gao, Jian; Fu, Fang-Wei

    2018-06-01

    Let R={F}_q+v{F}_q+v2{F}_q be a finite non-chain ring, where q is an odd prime power and v^3=v. In this paper, we propose two methods of constructing quantum codes from (α +β v+γ v2)-constacyclic codes over R. The first one is obtained via the Gray map and the Calderbank-Shor-Steane construction from Euclidean dual-containing (α +β v+γ v2)-constacyclic codes over R. The second one is obtained via the Gray map and the Hermitian construction from Hermitian dual-containing (α +β v+γ v2)-constacyclic codes over R. As an application, some new non-binary quantum codes are obtained.

  10. Comprehensive exploration of the effects of miRNA SNPs on monocyte gene expression.

    Directory of Open Access Journals (Sweden)

    Nicolas Greliche

    Full Text Available We aimed to assess whether pri-miRNA SNPs (miSNPs could influence monocyte gene expression, either through marginal association or by interacting with polymorphisms located in 3'UTR regions (3utrSNPs. We then conducted a genome-wide search for marginal miSNPs effects and pairwise miSNPs × 3utrSNPs interactions in a sample of 1,467 individuals for which genome-wide monocyte expression and genotype data were available. Statistical associations that survived multiple testing correction were tested for replication in an independent sample of 758 individuals with both monocyte gene expression and genotype data. In both studies, the hsa-mir-1279 rs1463335 was found to modulate in cis the expression of LYZ and in trans the expression of CNTN6, CTRC, COPZ2, KRT9, LRRFIP1, NOD1, PCDHA6, ST5 and TRAF3IP2 genes, supporting the role of hsa-mir-1279 as a regulator of several genes in monocytes. In addition, we identified two robust miSNPs × 3utrSNPs interactions, one involving HLA-DPB1 rs1042448 and hsa-mir-219-1 rs107822, the second the H1F0 rs1894644 and hsa-mir-659 rs5750504, modulating the expression of the associated genes.As some of the aforementioned genes have previously been reported to reside at disease-associated loci, our findings provide novel arguments supporting the hypothesis that the genetic variability of miRNAs could also contribute to the susceptibility to human diseases.

  11. Genome-wide sequence variations among Mycobacterium avium subspecies paratuberculosis.

    Directory of Open Access Journals (Sweden)

    Chung-Yi eHsu

    2011-12-01

    Full Text Available Mycobacterium avium subspecies paratuberculosis (M. ap, the causative agent of Johne’s disease (JD, infects many farmed ruminants, wildlife animals and humans. To better understand the molecular pathogenesis of these infections, we analyzed the whole genome sequences of several M. ap and M. avium subspecies avium (M. avium strains isolated from various hosts and environments. Using Next-generation sequencing technology, all 6 M. ap isolates showed a high percentage of homology (98% to the reference genome sequence of M. ap K-10 isolated from cattle. However, 2 M. avium isolates (DT 78 and Env 77 showed significant sequence diversity from the reference strain M. avium 104. The genomes of M. avium isolates DT 78 and Env 77 exhibited only 87% and 40% homology, respectively, to the M. avium 104 reference genome. Within the M. ap isolates, genomic rearrangements (insertions/deletions, Indels were not detected, and only unique single nucleotide polymorphisms (SNPs were observed among the 6 M. ap strains. While most of the SNPs (~100 in M. ap genomes were non-synonymous, a total of ~ 6000 SNPs were detected among M. avium genomes, most of them were synonymous suggesting a differential selective pressure between M. ap and M. avium isolates. In addition, SNPs-based phylo-genomic analysis showed that isolates from goat and Oryx are closely related to the cattle (K-10 strain while the human isolate (M. ap 4B is closely related to the environmental strains, indicating environmental source to human infections. Overall, SNPs were the most common variations among M. ap isolates while SNPs in addition to Indels were prevalent among M. avium isolates. Genomic variations will be useful in designing host-specific markers for the analysis of mycobacterial evolution and for developing novel diagnostics directed against Johne’s disease in animals.

  12. An Analysis of Countries which have Integrated Coding into their Curricula and the Content Analysis of Academic Studies on Coding Training in Turkey

    Directory of Open Access Journals (Sweden)

    Hüseyin Uzunboylu

    2017-11-01

    Full Text Available The first aim is to conduct a general analysis of countries which have integrated coding training into their curricula, and the second aim is to conduct a content analysis of studies on coding training in Turkey. It was identified that there are only a few academic studies on coding training in Turkey, and that the majority of them were published in 2016, the intended population was mainly “undergraduate students” and that the majority of these students were Computer Education and Instructional Technology undergraduates. It was determined that the studies mainly focused on the subjects of “programming” and “Scratch”, the terms programming and coding were used as synonyms, most of the studies were carried out using quantitative methods and data was obtained mostly by literature review and scale/survey interval techniques.

  13. Construction of Fixed Rate Non-Binary WOM Codes Based on Integer Programming

    Science.gov (United States)

    Fujino, Yoju; Wadayama, Tadashi

    In this paper, we propose a construction of non-binary WOM (Write-Once-Memory) codes for WOM storages such as flash memories. The WOM codes discussed in this paper are fixed rate WOM codes where messages in a fixed alphabet of size $M$ can be sequentially written in the WOM storage at least $t^*$-times. In this paper, a WOM storage is modeled by a state transition graph. The proposed construction has the following two features. First, it includes a systematic method to determine the encoding regions in the state transition graph. Second, the proposed construction includes a labeling method for states by using integer programming. Several novel WOM codes for $q$ level flash memories with 2 cells are constructed by the proposed construction. They achieve the worst numbers of writes $t^*$ that meet the known upper bound in many cases. In addition, we constructed fixed rate non-binary WOM codes with the capability to reduce ICI (inter cell interference) of flash cells. One of the advantages of the proposed construction is its flexibility. It can be applied to various storage devices, to various dimensions (i.e, number of cells), and various kind of additional constraints.

  14. Evidence for the effect of serotonin receptor 1A gene (HTR1A) polymorphism on tractability in Thoroughbred horses.

    Science.gov (United States)

    Hori, Y; Tozaki, T; Nambo, Y; Sato, F; Ishimaru, M; Inoue-Murayama, M; Fujita, K

    2016-02-01

    Tractability, or how easily animals can be trained and controlled, is an important behavioural trait for the management and training of domestic animals, but its genetic basis remains unclear. Polymorphisms in the serotonin receptor 1A gene (HTR1A) have been associated with individual variability in anxiety-related traits in several species. In this study, we examined the association between HTR1A polymorphisms and tractability in Thoroughbred horses. We assessed the tractability of 167 one-year-old horses reared at a training centre for racehorses using a questionnaire consisting of 17 items. A principal components analysis of answers contracted the data to five principal component (PC) scores. We genotyped two non-synonymous single nucleotide polymorphisms (SNPs) in the horse HTR1A coding region. We found that one of the two SNPs, c.709G>A, which causes an amino acid change at the intracellular region of the receptor, was significantly associated with scores of four of five PCs in fillies (all Ps Horses carrying an A allele at c.709G>A showed lower tractability. This result provides the first evidence that a polymorphism in a serotonin-related gene may affect tractability in horses with the effect partially different depending on sex. © 2015 Stichting International Foundation for Animal Genetics.

  15. Clinical code set engineering for reusing EHR data for research: A review.

    Science.gov (United States)

    Williams, Richard; Kontopantelis, Evangelos; Buchan, Iain; Peek, Niels

    2017-06-01

    The construction of reliable, reusable clinical code sets is essential when re-using Electronic Health Record (EHR) data for research. Yet code set definitions are rarely transparent and their sharing is almost non-existent. There is a lack of methodological standards for the management (construction, sharing, revision and reuse) of clinical code sets which needs to be addressed to ensure the reliability and credibility of studies which use code sets. To review methodological literature on the management of sets of clinical codes used in research on clinical databases and to provide a list of best practice recommendations for future studies and software tools. We performed an exhaustive search for methodological papers about clinical code set engineering for re-using EHR data in research. This was supplemented with papers identified by snowball sampling. In addition, a list of e-phenotyping systems was constructed by merging references from several systematic reviews on this topic, and the processes adopted by those systems for code set management was reviewed. Thirty methodological papers were reviewed. Common approaches included: creating an initial list of synonyms for the condition of interest (n=20); making use of the hierarchical nature of coding terminologies during searching (n=23); reviewing sets with clinician input (n=20); and reusing and updating an existing code set (n=20). Several open source software tools (n=3) were discovered. There is a need for software tools that enable users to easily and quickly create, revise, extend, review and share code sets and we provide a list of recommendations for their design and implementation. Research re-using EHR data could be improved through the further development, more widespread use and routine reporting of the methods by which clinical codes were selected. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  16. The Non-Coding Regulatory RNA Revolution in Archaea

    Directory of Open Access Journals (Sweden)

    Diego Rivera Gelsinger

    2018-03-01

    Full Text Available Small non-coding RNAs (sRNAs are ubiquitously found in the three domains of life playing large-scale roles in gene regulation, transposable element silencing and defense against foreign elements. While a substantial body of experimental work has been done to uncover function of sRNAs in Bacteria and Eukarya, the functional roles of sRNAs in Archaea are still poorly understood. Recently, high throughput studies using RNA-sequencing revealed that sRNAs are broadly expressed in the Archaea, comprising thousands of transcripts within the transcriptome during non-challenged and stressed conditions. Antisense sRNAs, which overlap a portion of a gene on the opposite strand (cis-acting, are the most abundantly expressed non-coding RNAs and they can be classified based on their binding patterns to mRNAs (3′ untranslated region (UTR, 5′ UTR, CDS-binding. These antisense sRNAs target many genes and pathways, suggesting extensive roles in gene regulation. Intergenic sRNAs are less abundantly expressed and their targets are difficult to find because of a lack of complete overlap between sRNAs and target mRNAs (trans-acting. While many sRNAs have been validated experimentally, a regulatory role has only been reported for very few of them. Further work is needed to elucidate sRNA-RNA binding mechanisms, the molecular determinants of sRNA-mediated regulation, whether protein components are involved and how sRNAs integrate with complex regulatory networks.

  17. Fine Mapping and Functional Analysis of the Multiple Sclerosis Risk Gene CD6

    Science.gov (United States)

    Swaminathan, Bhairavi; Cuapio, Angélica; Alloza, Iraide; Matesanz, Fuencisla; Alcina, Antonio; García-Barcina, Maria; Fedetz, Maria; Fernández, Óscar; Lucas, Miguel; Órpez, Teresa; Pinto-Medel, Mª Jesus; Otaegui, David; Olascoaga, Javier; Urcelay, Elena; Ortiz, Miguel A.; Arroyo, Rafael; Oksenberg, Jorge R.; Antigüedad, Alfredo; Tolosa, Eva; Vandenbroeck, Koen

    2013-01-01

    CD6 has recently been identified and validated as risk gene for multiple sclerosis (MS), based on the association of a single nucleotide polymorphism (SNP), rs17824933, located in intron 1. CD6 is a cell surface scavenger receptor involved in T-cell activation and proliferation, as well as in thymocyte differentiation. In this study, we performed a haptag SNP screen of the CD6 gene locus using a total of thirteen tagging SNPs, of which three were non-synonymous SNPs, and replicated the recently reported GWAS SNP rs650258 in a Spanish-Basque collection of 814 controls and 823 cases. Validation of the six most strongly associated SNPs was performed in an independent collection of 2265 MS patients and 2600 healthy controls. We identified association of haplotypes composed of two non-synonymous SNPs [rs11230563 (R225W) and rs2074225 (A257V)] in the 2nd SRCR domain with susceptibility to MS (P max(T) permutation = 1×10−4). The effect of these haplotypes on CD6 surface expression and cytokine secretion was also tested. The analysis showed significantly different CD6 expression patterns in the distinct cell subsets, i.e. – CD4+ naïve cells, P = 0.0001; CD8+ naïve cells, P<0.0001; CD4+ and CD8+ central memory cells, P = 0.01 and 0.05, respectively; and natural killer T (NKT) cells, P = 0.02; with the protective haplotype (RA) showing higher expression of CD6. However, no significant changes were observed in natural killer (NK) cells, effector memory and terminally differentiated effector memory T cells. Our findings reveal that this new MS-associated CD6 risk haplotype significantly modifies expression of CD6 on CD4+ and CD8+ T cells. PMID:23638056

  18. Female-biased expression of long non-coding RNAs in domains that escape X-inactivation in mouse

    Directory of Open Access Journals (Sweden)

    Lu Lu

    2010-11-01

    Full Text Available Abstract Background Sexual dimorphism in brain gene expression has been recognized in several animal species. However, the relevant regulatory mechanisms remain poorly understood. To investigate whether sex-biased gene expression in mammalian brain is globally regulated or locally regulated in diverse brain structures, and to study the genomic organisation of brain-expressed sex-biased genes, we performed a large scale gene expression analysis of distinct brain regions in adult male and female mice. Results This study revealed spatial specificity in sex-biased transcription in the mouse brain, and identified 173 sex-biased genes in the striatum; 19 in the neocortex; 12 in the hippocampus and 31 in the eye. Genes located on sex chromosomes were consistently over-represented in all brain regions. Analysis on a subset of genes with sex-bias in more than one tissue revealed Y-encoded male-biased transcripts and X-encoded female-biased transcripts known to escape X-inactivation. In addition, we identified novel coding and non-coding X-linked genes with female-biased expression in multiple tissues. Interestingly, the chromosomal positions of all of the female-biased non-coding genes are in close proximity to protein-coding genes that escape X-inactivation. This defines X-chromosome domains each of which contains a coding and a non-coding female-biased gene. Lack of repressive chromatin marks in non-coding transcribed loci supports the possibility that they escape X-inactivation. Moreover, RNA-DNA combined FISH experiments confirmed the biallelic expression of one such novel domain. Conclusion This study demonstrated that the amount of genes with sex-biased expression varies between individual brain regions in mouse. The sex-biased genes identified are localized on many chromosomes. At the same time, sexually dimorphic gene expression that is common to several parts of the brain is mostly restricted to the sex chromosomes. Moreover, the study uncovered

  19. SNP-VISTA: An Interactive SNPs Visualization Tool

    Energy Technology Data Exchange (ETDEWEB)

    Shah, Nameeta; Teplitsky, Michael V.; Pennacchio, Len A.; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L.

    2005-07-05

    Recent advances in sequencing technologies promise better diagnostics for many diseases as well as better understanding of evolution of microbial populations. Single Nucleotide Polymorphisms(SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it is possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease and then screen for causative mutations.In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples makes possible more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at http://genome.lbl.gov/vista/snpvista.

  20. Elixir, Alchemy and the Metamorphoses of Two Synonyms

    Directory of Open Access Journals (Sweden)

    Gotthard Strohmaier

    2017-03-01

    Full Text Available The history of the terms ‘elixir’ and ‘alchemy’ seems paradoxical; derived from Greek, the Arabic al-iksīr signified a dry powder capable of transforming base metals into gold or silver. Evolving through the European languages, elixir has come to mean a magic liquid that can be ingested to cure illness. The second term, al-kīmiyāʼ, which was in its Arabic beginnings almost synonymous with elixir, took a different turn and changed its meaning from a miraculous substance into an abstract noun connoting the art of alchemy. This article intends to show that these changes of meaning are linked to inevitable interrelations between the two synonyms and, consequently, the generally assumed etymology of the Arabic alkīmiyāʼ from the seemingly corresponding Greek expression χυμεία must be questioned. Of particular interest is the hitherto overlooked fact that al-kīmiyāʼ ends in a glottal stop, indicated by the hamza and being a consonant in its own right, which ultimately points to a non-Greek origin.

  1. Long Non-coding RNAs in Response to Genotoxic Stress

    Institute of Scientific and Technical Information of China (English)

    Xiaoman Li; Dong Pan; Baoquan Zhao; Burong Hu

    2016-01-01

    Long non-coding RNAs(lncRNAs) are increasingly involved in diverse biological processes.Upon DNA damage,the DNA damage response(DDR) elicits a complex signaling cascade,which includes the induction of lncRNAs.LncRNA-mediated DDR is involved in non-canonical and canonical manners.DNA-damage induced lncRNAs contribute to the regulation of cell cycle,apoptosis,and DNA repair,thereby playing a key role in maintaining genome stability.This review summarizes the emerging role of lncRNAs in DNA damage and repair.

  2. Assessment of single nucleotide polymorphisms in screening 52 DNA repair and cell cycle control genes in Fanconi anemia patients

    Directory of Open Access Journals (Sweden)

    Petrović Sandra

    2015-01-01

    Full Text Available Fanconi anemia (FA is a rare genetically heterogeneous disorder associated with bone marrow failure, birth defects and cancer susceptibility. Apart from the disease- causing mutations in FANC genes, the identification of specific DNA variations, such as single nucleotide polymorphisms (SNPs, in other candidate genes may lead to a better clinical description of this condition enabling individualized treatment with improvement of the prognosis. In this study, we have assessed 95 SNPs located in 52 key genes involved in base excision repair (BER, nucleotide excision repair (NER, mismatch repair (MMR, double strand break (DSB repair and cell cycle control using a DNA repair chip (Asper Biotech, Estonia which includes most of the common variants for the candidate genes. The SNP genotyping was performed in five FA-D2 patients and in one FA-A patient. The polymorphisms studied were synonymous (n=10, nonsynonymous (missense (n=52 and in non-coding regions of the genome (introns and 5 ‘and 3’ untranslated regions (UTR (n=33. Polymorphisms found at the homozygous state are selected for further analysis. Our results have shown a significant inter-individual variability among patients in the type and the frequency of SNPs and also elucidate the need for further studies of polymorphisms located in ATM, APEX APE 1, XRCC1, ERCC2, MSH3, PARP4, NBS1, BARD1, CDKN1B, TP53 and TP53BP1 which may be of great importance for better clinical description of FA. In addition, the present report recommends the use of SNPs as predictive and prognostic genetic markers to individualize therapy of FA patients. [Projekat Ministarstva nauke Republike Srbije, br. 173046

  3. Development of a 3D non-linear implicit MHD code

    International Nuclear Information System (INIS)

    Nicolas, T.; Ichiguchi, K.

    2016-06-01

    This paper details the on-going development of a 3D non-linear implicit MHD code, which aims at making possible large scale simulations of the non-linear phase of the interchange mode. The goal of the paper is to explain the rationale behind the choices made along the development, and the technical difficulties encountered. At the present stage, the development of the code has not been completed yet. Most of the discussion is concerned with the first approach, which utilizes cartesian coordinates in the poloidal plane. This approach shows serious difficulties in writing the preconditioner, closely related to the choice of coordinates. A second approach, based on curvilinear coordinates, also faced significant difficulties, which are detailed. The third and last approach explored involves unstructured tetrahedral grids, and indicates the possibility to solve the problem. The issue to domain meshing is addressed. (author)

  4. Complex analysis of urate transporters SLC2A9, SLC22A12 and functional characterization of non-synonymous allelic variants of GLUT9 in the Czech population: no evidence of effect on hyperuricemia and gout.

    Science.gov (United States)

    Hurba, Olha; Mancikova, Andrea; Krylov, Vladimir; Pavlikova, Marketa; Pavelka, Karel; Stibůrková, Blanka

    2014-01-01

    Using European descent Czech populations, we performed a study of SLC2A9 and SLC22A12 genes previously identified as being associated with serum uric acid concentrations and gout. This is the first study of the impact of non-synonymous allelic variants on the function of GLUT9 except for patients suffering from renal hypouricemia type 2. The cohort consisted of 250 individuals (150 controls, 54 nonspecific hyperuricemics and 46 primary gout and/or hyperuricemia subjects). We analyzed 13 exons of SLC2A9 (GLUT9 variant 1 and GLUT9 variant 2) and 10 exons of SLC22A12 by PCR amplification and sequenced directly. Allelic variants were prepared and their urate uptake and subcellular localization were studied by Xenopus oocytes expression system. The functional studies were analyzed using the non-parametric Wilcoxon and Kruskall-Wallis tests; the association study used the Fisher exact test and linear regression approach. We identified a total of 52 sequence variants (12 unpublished). Eight non-synonymous allelic variants were found only in SLC2A9: rs6820230, rs2276961, rs144196049, rs112404957, rs73225891, rs16890979, rs3733591 and rs2280205. None of these variants showed any significant difference in the expression of GLUT9 and in urate transport. In the association study, eight variants showed a possible association with hyperuricemia. However, seven of these were in introns and the one exon located variant, rs7932775, did not show a statistically significant association with serum uric acid concentration. Our results did not confirm any effect of SLC22A12 and SLC2A9 variants on serum uric acid concentration. Our complex approach using association analysis together with functional and immunohistochemical characterization of non-synonymous allelic variants did not show any influence on expression, subcellular localization and urate uptake of GLUT9.

  5. Quick, “Imputation-free” meta-analysis with proxy-SNPs

    Directory of Open Access Journals (Sweden)

    Meesters Christian

    2012-09-01

    Full Text Available Abstract Background Meta-analysis (MA is widely used to pool genome-wide association studies (GWASes in order to a increase the power to detect strong or weak genotype effects or b as a result verification method. As a consequence of differing SNP panels among genotyping chips, imputation is the method of choice within GWAS consortia to avoid losing too many SNPs in a MA. YAMAS (Yet Another Meta Analysis Software, however, enables cross-GWAS conclusions prior to finished and polished imputation runs, which eventually are time-consuming. Results Here we present a fast method to avoid forfeiting SNPs present in only a subset of studies, without relying on imputation. This is accomplished by using reference linkage disequilibrium data from 1,000 Genomes/HapMap projects to find proxy-SNPs together with in-phase alleles for SNPs missing in at least one study. MA is conducted by combining association effect estimates of a SNP and those of its proxy-SNPs. Our algorithm is implemented in the MA software YAMAS. Association results from GWAS analysis applications can be used as input files for MA, tremendously speeding up MA compared to the conventional imputation approach. We show that our proxy algorithm is well-powered and yields valuable ad hoc results, possibly providing an incentive for follow-up studies. We propose our method as a quick screening step prior to imputation-based MA, as well as an additional main approach for studies without available reference data matching the ethnicities of study participants. As a proof of principle, we analyzed six dbGaP Type II Diabetes GWAS and found that the proxy algorithm clearly outperforms naïve MA on the p-value level: for 17 out of 23 we observe an improvement on the p-value level by a factor of more than two, and a maximum improvement by a factor of 2127. Conclusions YAMAS is an efficient and fast meta-analysis program which offers various methods, including conventional MA as well as inserting proxy-SNPs

  6. Sub-grouping of Plasmodium falciparum 3D7 var genes based on sequence analysis of coding and non-coding regions

    DEFF Research Database (Denmark)

    Lavstsen, Thomas; Salanti, Ali; Jensen, Anja T R

    2003-01-01

    and organization of the 3D7 PfEMP1 repertoire was investigated on the basis of the complete genome sequence. METHODS: Using two tree-building methods we analysed the coding and non-coding sequences of 3D7 var and rif genes as well as var genes of other parasite strains. RESULTS: var genes can be sub...

  7. ncRNA-class Web Tool: Non-coding RNA feature extraction and pre-miRNA classification web tool

    KAUST Repository

    Kleftogiannis, Dimitrios A.; Theofilatos, Konstantinos A.; Papadimitriou, Stergios; Tsakalidis, Athanasios K.; Likothanassis, Spiridon D.; Mavroudi, Seferina P.

    2012-01-01

    Until recently, it was commonly accepted that most genetic information is transacted by proteins. Recent evidence suggests that the majority of the genomes of mammals and other complex organisms are in fact transcribed into non-coding RNAs (ncRNAs), many of which are alternatively spliced and/or processed into smaller products. Non coding RNA genes analysis requires the calculation of several sequential, thermodynamical and structural features. Many independent tools have already been developed for the efficient calculation of such features but to the best of our knowledge there does not exist any integrative approach for this task. The most significant amount of existing work is related to the miRNA class of non-coding RNAs. MicroRNAs (miRNAs) are small non-coding RNAs that play a significant role in gene regulation and their prediction is a challenging bioinformatics problem. Non-coding RNA feature extraction and pre-miRNA classification Web Tool (ncRNA-class Web Tool) is a publicly available web tool ( http://150.140.142.24:82/Default.aspx ) which provides a user friendly and efficient environment for the effective calculation of a set of 58 sequential, thermodynamical and structural features of non-coding RNAs, plus a tool for the accurate prediction of miRNAs. © 2012 IFIP International Federation for Information Processing.

  8. The origins and evolutionary history of human non-coding RNA regulatory networks.

    Science.gov (United States)

    Sherafatian, Masih; Mowla, Seyed Javad

    2017-04-01

    The evolutionary history and origin of the regulatory function of animal non-coding RNAs are not well understood. Lack of conservation of long non-coding RNAs and small sizes of microRNAs has been major obstacles in their phylogenetic analysis. In this study, we tried to shed more light on the evolution of ncRNA regulatory networks by changing our phylogenetic strategy to focus on the evolutionary pattern of their protein coding targets. We used available target databases of miRNAs and lncRNAs to find their protein coding targets in human. We were able to recognize evolutionary hallmarks of ncRNA targets by phylostratigraphic analysis. We found the conventional 3'-UTR and lesser known 5'-UTR targets of miRNAs to be enriched at three consecutive phylostrata. Firstly, in eukaryata phylostratum corresponding to the emergence of miRNAs, our study revealed that miRNA targets function primarily in cell cycle processes. Moreover, the same overrepresentation of the targets observed in the next two consecutive phylostrata, opisthokonta and eumetazoa, corresponded to the expansion periods of miRNAs in animals evolution. Coding sequence targets of miRNAs showed a delayed rise at opisthokonta phylostratum, compared to the 3' and 5' UTR targets of miRNAs. LncRNA regulatory network was the latest to evolve at eumetazoa.

  9. Confinement of Reinforced-Concrete Columns with Non-Code Compliant Confining Reinforcement plus Supplemental Pen-Binder

    Directory of Open Access Journals (Sweden)

    Anang Kristianto

    2012-11-01

    Full Text Available One of the important requirements for earthquake resistant building related to confinement is the use of seismic hooks in the hoop or confining reinforcement of reinforced-concrete column elements. However, installation of a confining reinforcement with a 135-degree hook is not easy. Therefore, in practice, many construction workers apply a confining reinforcement with a 90-degreehook (non-code compliant. Based on research and records of recent earthquakes in Indonesia, the use of a non-code compliant confining reinforcement for concrete columns produces structures with poor seismic performance. This paper presents a study that introduces an additional element that is expected to improve the effectiveness of concrete columns confined with a non-code compliant confining reinforcement. The additional element, named a pen-binder, is used to keep the non-code compliant confining reinforcement in place. The effectiveness of this element under pure axial concentric loading was investigatedcomprehensively.The specimens tested in this study were 18 concrete columns,with a cross-section of 170 mm x 170 mm and a height of 480 mm. The main test variables were the material type of the pen-binder, the angle of the hook, and the confining reinforcement configuration.The test results indicate that adding pen-binders can effectively improve the strength and ductility of the column specimens confined with a non-code compliant confining reinforcement

  10. Transcriptome Analysis of an Insecticide Resistant Housefly Strain: Insights about SNPs and Regulatory Elements in Cytochrome P450 Genes.

    Science.gov (United States)

    Mahmood, Khalid; Højland, Dorte H; Asp, Torben; Kristensen, Michael

    2016-01-01

    Insecticide resistance in the housefly, Musca domestica, has been investigated for more than 60 years. It will enter a new era after the recent publication of the housefly genome and the development of multiple next generation sequencing technologies. The genetic background of the xenobiotic response can now be investigated in greater detail. Here, we investigate the 454-pyrosequencing transcriptome of the spinosad-resistant 791spin strain in relation to the housefly genome with focus on P450 genes. The de novo assembly of clean reads gave 35,834 contigs consisting of 21,780 sequences of the spinosad resistant strain. The 3,648 sequences were annotated with an enzyme code EC number and were mapped to 124 KEGG pathways with metabolic processes as most highly represented pathway. One hundred and twenty contigs were annotated as P450s covering 44 different P450 genes of housefly. Eight differentially expressed P450s genes were identified and investigated for SNPs, CpG islands and common regulatory motifs in promoter and coding regions. Functional annotation clustering of metabolic related genes and motif analysis of P450s revealed their association with epigenetic, transcription and gene expression related functions. The sequence variation analysis resulted in 12 SNPs and eight of them found in cyp6d1. There is variation in location, size and frequency of CpG islands and specific motifs were also identified in these P450s. Moreover, identified motifs were associated to GO terms and transcription factors using bioinformatic tools. Transcriptome data of a spinosad resistant strain provide together with genome data fundamental support for future research to understand evolution of resistance in houseflies. Here, we report for the first time the SNPs, CpG islands and common regulatory motifs in differentially expressed P450s. Taken together our findings will serve as a stepping stone to advance understanding of the mechanism and role of P450s in xenobiotic detoxification.

  11. The human pregnane X receptor: genomic structure and identification and functional characterization of natural allelic variants.

    Science.gov (United States)

    Zhang, J; Kuehl, P; Green, E D; Touchman, J W; Watkins, P B; Daly, A; Hall, S D; Maurel, P; Relling, M; Brimer, C; Yasuda, K; Wrighton, S A; Hancock, M; Kim, R B; Strom, S; Thummel, K; Russell, C G; Hudson, J R; Schuetz, E G; Boguski, M S

    2001-10-01

    The pregnane X receptor (PXR)/steroid and xenobiotic receptor (SXR) transcriptionally activates cytochrome P4503A4 (CYP3A4) when ligand activated by endobiotics and xenobiotics. We cloned the human PXR gene and analysed the sequence in DNAs of individuals whose CYP3A phenotype was known. The PXR gene spans 35 kb, contains nine exons, and mapped to chromosome 13q11-13. Thirty-eight single nucleotide polymorphisms (SNPs) were identified including six SNPs in the coding region. Three of the coding SNPs are non-synonymous creating new PXR alleles [PXR*2, P27S (79C to T); PXR*3, G36R (106G to A); and PXR*4, R122Q (4321G to A)]. The frequency of PXR*2 was 0.20 in African Americans and was never found in Caucasians. Hepatic expression of CYP3A4 protein was not significantly different between African Americans homozygous for PXR*1 compared to those with one PXR*2 allele. PXR*4 was a rare variant found in only one Caucasian person. Homology modelling suggested that R122Q, (PXR*4) is a direct DNA contact site variation in the third alpha-helix in the DNA binding domain. Compared with PXR*1, and variants PXR*2 and PXR*3, only the variant PXR*4 protein had significantly decreased affinity for the PXR binding sequence in electromobility shift assays and attenuated ligand activation of the CYP3A4 reporter plasmids in transient transfection assays. However, the person heterozygous for PXR*4 is normal for CYP3A4 metabolism phenotype. The relevance of each of the 38 PXR SNPs identified in DNA of individuals whose CYP3A basal and rifampin-inducible CYP3A4 expression was determined in vivo and/or in vitro was demonstrated by univariate statistical analysis. Because ligand activation of PXR and upregulation of a system of drug detoxification genes are major determinants of drug interactions, it will now be useful to extend this work to determine the association of these common PXR SNPs to human variation in induction of other drug detoxification gene targets.

  12. Exp2 polymorphisms associated with variation for fiber quality properties in cotton (Gossypium spp.

    Directory of Open Access Journals (Sweden)

    Daohua He

    2014-10-01

    Full Text Available Plant expansins are a group of extracellular proteins thought to affect the quality of cotton fibers. Previous expression profile analysis revealed that six Expansin A genes are present in cotton, of which two (GhExp1 and GhExp2 produce transcripts that are specific to the developing cotton fiber. To identify the phenotypic function of Exp2, and to determine whether nucleotide variation among alleles of Exp2 affects fiber quality, candidate gene association mapping was conducted. Gene-specific primers were designed to amplify the Exp2 gene. By amplicon sequencing, the nucleotide diversity of Exp2 was investigated across 92 accessions (including 7 Gossypium arboreum, 74 Gossypium hirsutum, and 11 Gossypium barbadense accessions with different fiber qualities. Twenty-six SNPs and seven InDels including 14 from the coding region of Exp2 were detected, forming twelve distinct haplotypes in the cotton collection. Among the 14 SNPs in the coding region, five were missense mutations and nine were synonymous nucleotide changes. The average SNP/InDel per nucleotide ratio was 2.61% (one SNP per 39 bp, with 1.81 and 3.87% occurring in coding and non-coding regions, respectively. Nucleotide and haplotype diversity across the entire Exp2 region was 0.00603 (π and 0.844, respectively, and diversity in non-coding regions was higher than that in coding regions. For linkage disequilibrium (LD, the mean r2 value for all polymorphism loci pairs was 0.48, and LD did not decay over 748 bp. Based on 132 simple sequence repeat (SSR loci evenly covering 26 chromosomes, the population structure was estimated, and the accessions were divided into seven groups that agreed well with their genomic origin and evolutionary history. A general linear model was used to calculate the Exp2-wide diversity–trait associations of 5 fiber quality traits, considering population structure (Q. Four SNPs in Exp2 were associated with at least one of the fiber quality traits, but not with

  13. Characterization of PRLR and PPARGC1A genes in buffalo (Bubalus bubalis

    Directory of Open Access Journals (Sweden)

    Ruheena Javed

    2011-01-01

    Full Text Available More than 40 million households in India depend at least partially on livestock production. Buffaloes are one of the major milk producers in India. The prolactin receptor (PRLR gene and peroxisome proliferators activated receptor-γ coactivator 1-alpha (PPARGC1A gene are reportedly associated with milk protein and milk fat yields in Bos taurus. In this study, we sequenced the PRLR and PPARGC1A genes in the water buffalo Bubalus bubalis. The PRLR and PPARGC1A genes coded for 581 and 819 amino acids, respectively. The B. bubalis PRLR gene differed from the corresponding Bos taurus at 21 positions and four differences with an additional arginine at position 620 in the PPARGC1A gene were found in the amino acid sequence. All of the changes were confirmed by cDNA sequencing. Twelve buffalo-specific single nucleotide polymorphisms (SNPs were identified in both genes, with five of them being non-synonymous.

  14. Exome sequencing and genetic testing for MODY.

    Directory of Open Access Journals (Sweden)

    Stefan Johansson

    Full Text Available Genetic testing for monogenic diabetes is important for patient care. Given the extensive genetic and clinical heterogeneity of diabetes, exome sequencing might provide additional diagnostic potential when standard Sanger sequencing-based diagnostics is inconclusive.The aim of the study was to examine the performance of exome sequencing for a molecular diagnosis of MODY in patients who have undergone conventional diagnostic sequencing of candidate genes with negative results.We performed exome enrichment followed by high-throughput sequencing in nine patients with suspected MODY. They were Sanger sequencing-negative for mutations in the HNF1A, HNF4A, GCK, HNF1B and INS genes. We excluded common, non-coding and synonymous gene variants, and performed in-depth analysis on filtered sequence variants in a pre-defined set of 111 genes implicated in glucose metabolism.On average, we obtained 45 X median coverage of the entire targeted exome and found 199 rare coding variants per individual. We identified 0-4 rare non-synonymous and nonsense variants per individual in our a priori list of 111 candidate genes. Three of the variants were considered pathogenic (in ABCC8, HNF4A and PPARG, respectively, thus exome sequencing led to a genetic diagnosis in at least three of the nine patients. Approximately 91% of known heterozygous SNPs in the target exomes were detected, but we also found low coverage in some key diabetes genes using our current exome sequencing approach. Novel variants in the genes ARAP1, GLIS3, MADD, NOTCH2 and WFS1 need further investigation to reveal their possible role in diabetes.Our results demonstrate that exome sequencing can improve molecular diagnostics of MODY when used as a complement to Sanger sequencing. However, improvements will be needed, especially concerning coverage, before the full potential of exome sequencing can be realized.

  15. Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.

    Science.gov (United States)

    Nguyen, Thanh-Tung; Huang, Joshua; Wu, Qingyao; Nguyen, Thuy; Li, Mark

    2015-01-01

    Single-nucleotide polymorphisms (SNPs) selection and identification are the most important tasks in Genome-wide association data analysis. The problem is difficult because genome-wide association data is very high dimensional and a large portion of SNPs in the data is irrelevant to the disease. Advanced machine learning methods have been successfully used in Genome-wide association studies (GWAS) for identification of genetic variants that have relatively big effects in some common, complex diseases. Among them, the most successful one is Random Forests (RF). Despite of performing well in terms of prediction accuracy in some data sets with moderate size, RF still suffers from working in GWAS for selecting informative SNPs and building accurate prediction models. In this paper, we propose to use a new two-stage quality-based sampling method in random forests, named ts-RF, for SNP subspace selection for GWAS. The method first applies p-value assessment to find a cut-off point that separates informative and irrelevant SNPs in two groups. The informative SNPs group is further divided into two sub-groups: highly informative and weak informative SNPs. When sampling the SNP subspace for building trees for the forest, only those SNPs from the two sub-groups are taken into account. The feature subspaces always contain highly informative SNPs when used to split a node at a tree. This approach enables one to generate more accurate trees with a lower prediction error, meanwhile possibly avoiding overfitting. It allows one to detect interactions of multiple SNPs with the diseases, and to reduce the dimensionality and the amount of Genome-wide association data needed for learning the RF model. Extensive experiments on two genome-wide SNP data sets (Parkinson case-control data comprised of 408,803 SNPs and Alzheimer case-control data comprised of 380,157 SNPs) and 10 gene data sets have demonstrated that the proposed model significantly reduced prediction errors and outperformed

  16. Mutation rate switch inside Eurasian mitochondrial haplogroups: impact of selection and consequences for dating settlement in Europe.

    Directory of Open Access Journals (Sweden)

    Denis Pierron

    Full Text Available R-lineage mitochondrial DNA represents over 90% of the European population and is significantly present all around the planet (North Africa, Asia, Oceania, and America. This lineage played a major role in migration "out of Africa" and colonization in Europe. In order to determine an accurate dating of the R lineage and its sublineages, we analyzed 1173 individuals and complete mtDNA sequences from Mitomap. This analysis revealed a new coalescence age for R at 54.500 years, as well as several limitations of standard dating methods, likely to lead to false interpretations. These findings highlight the association of a striking under-accumulation of synonymous mutations, an over-accumulation of non-synonymous mutations, and the phenotypic effect on haplogroup J. Consequently, haplogroup J is apparently not a Neolithic group but an older haplogroup (Paleolithic that was subjected to an underestimated selective force. These findings also indicated an under-accumulation of synonymous and non-synonymous mutations localized on coding and non-coding (HVS1 sequences for haplogroup R0, which contains the major haplogroups H and V. These new dates are likely to impact the present colonization model for Europe and confirm the late glacial resettlement scenario.

  17. Do prion protein gene polymorphisms induce apoptosis in non ...

    Indian Academy of Sciences (India)

    2016-08-26

    Aug 26, 2016 ... Genetic variations such as single nucleotide polymorphisms (SNPs) in prion protein coding gene, Prnp, greatly affect susceptibility to prion diseases in mammals. Here, the coding region of Prnp was screened for polymorphisms in redeared turtle, Trachemys scripta. Four polymorphisms, L203V, N205I, ...

  18. Genome-wide screen for universal individual identification SNPs based on the HapMap and 1000 Genomes databases.

    Science.gov (United States)

    Huang, Erwen; Liu, Changhui; Zheng, Jingjing; Han, Xiaolong; Du, Weian; Huang, Yuanjian; Li, Chengshi; Wang, Xiaoguang; Tong, Dayue; Ou, Xueling; Sun, Hongyu; Zeng, Zhaoshu; Liu, Chao

    2018-04-03

    Differences among SNP panels for individual identification in SNP-selecting and populations led to few common SNPs, compromising their universal applicability. To screen all universal SNPs, we performed a genome-wide SNP mining in multiple populations based on HapMap and 1000Genomes databases. SNPs with high minor allele frequencies (MAF) in 37 populations were selected. With MAF from ≥0.35 to ≥0.43, the number of selected SNPs decreased from 2769 to 0. A total of 117 SNPs with MAF ≥0.39 have no linkage disequilibrium with each other in every population. For 116 of the 117 SNPs, cumulative match probability (CMP) ranged from 2.01 × 10-48 to 1.93 × 10-50 and cumulative exclusion probability (CEP) ranged from 0.9999999996653 to 0.9999999999945. In 134 tested Han samples, 110 of the 117 SNPs remained within high MAF and conformed to Hardy-Weinberg equilibrium, with CMP = 4.70 × 10-47 and CEP = 0.999999999862. By analyzing the same number of autosomal SNPs as in the HID-Ion AmpliSeq Identity Panel, i.e. 90 randomized out of the 110 SNPs, our panel yielded preferable CMP and CEP. Taken together, the 110-SNPs panel is advantageous for forensic test, and this study provided plenty of highly informative SNPs for compiling final universal panels.

  19. Long Non-Coding RNAs in Metabolic Organs and Energy Homeostasis

    Directory of Open Access Journals (Sweden)

    Maude Giroud

    2017-11-01

    Full Text Available Single cell organisms can surprisingly exceed the number of human protein-coding genes, which are thus not at the origin of the complexity of an organism. In contrast, the relative amount of non-protein-coding sequences increases consistently with organismal complexity. Moreover, the mammalian transcriptome predominantly comprises non-(protein-coding RNAs (ncRNA, of which the long ncRNAs (lncRNAs constitute the most abundant part. lncRNAs are highly species- and tissue-specific with very versatile modes of action in accordance with their binding to a large spectrum of molecules and their diverse localization. lncRNAs are transcriptional regulators adding an additional regulatory layer in biological processes and pathophysiological conditions. Here, we review lncRNAs affecting metabolic organs with a focus on the liver, pancreas, skeletal muscle, cardiac muscle, brain, and adipose organ. In addition, we will discuss the impact of lncRNAs on metabolic diseases such as obesity and diabetes. In contrast to the substantial number of lncRNA loci in the human genome, the functionally characterized lncRNAs are just the tip of the iceberg. So far, our knowledge concerning lncRNAs in energy homeostasis is still in its infancy, meaning that the rest of the iceberg is a treasure chest yet to be discovered.

  20. Coding chaotic billiards: I-Non-Compact billiards on a negative curvature manifold

    International Nuclear Information System (INIS)

    Giannoni, M.J.; Ullmo, D.

    1989-03-01

    This paper presents a method for coding billiards. The main device is to use a proper surface of section, the bounce mapping, and foliate the reduced phase space into regions associated with a given code. The alphabet is merely the ensemble of the labels of the sides of the billiard. The procedure is applied here to non-compact polygonal billiard defined on a manifold of constant negative curvature, with all vertices at infinity. A simple grammar rule is necessary and sufficient to insure existence and uniqueness of the coding

  1. Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data

    Directory of Open Access Journals (Sweden)

    Ni Yunyun

    2012-09-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs have been associated with many aspects of human development and disease, and many non-coding SNPs associated with disease risk are presumed to affect gene regulation. We have previously shown that SNPs within transcription factor binding sites can affect transcription factor binding in an allele-specific and heritable manner. However, such analysis has relied on prior whole-genome genotypes provided by large external projects such as HapMap and the 1000 Genomes Project. This requirement limits the study of allele-specific effects of SNPs in primary patient samples from diseases of interest, where complete genotypes are not readily available. Results In this study, we show that we are able to identify SNPs de novo and accurately from ChIP-seq data generated in the ENCODE Project. Our de novo identified SNPs from ChIP-seq data are highly concordant with published genotypes. Independent experimental verification of more than 100 sites estimates our false discovery rate at less than 5%. Analysis of transcription factor binding at de novo identified SNPs revealed widespread heritable allele-specific binding, confirming previous observations. SNPs identified from ChIP-seq datasets were significantly enriched for disease-associated variants, and we identified dozens of allele-specific binding events in non-coding regions that could distinguish between disease and normal haplotypes. Conclusions Our approach combines SNP discovery, genotyping and allele-specific analysis, but is selectively focused on functional regulatory elements occupied by transcription factors or epigenetic marks, and will therefore be valuable for identifying the functional regulatory consequences of non-coding SNPs in primary disease samples.

  2. Non-coding RNAs and epigenome: de novo DNA methylation, allelic exclusion and X-inactivation

    Directory of Open Access Journals (Sweden)

    V. A. Halytskiy

    2013-12-01

    Full Text Available Non-coding RNAs are widespread class of cell RNAs. They participate in many important processes in cells – signaling, posttranscriptional silencing, protein biosynthesis, splicing, maintenance of genome stability, telomere lengthening, X-inactivation. Nevertheless, activity of these RNAs is not restricted to posttranscriptional sphere, but cover also processes that change or maintain the epigenetic information. Non-coding RNAs can directly bind to the DNA targets and cause their repression through recruitment of DNA methyltransferases as well as chromatin modifying enzymes. Such events constitute molecular mechanism of the RNA-dependent DNA methylation. It is possible, that the RNA-DNA interaction is universal mechanism triggering DNA methylation de novo. Allelic exclusion can be also based on described mechanism. This phenomenon takes place, when non-coding RNA, which precursor is transcribed from one allele, triggers DNA methylation in all other alleles present in the cell. Note, that miRNA-mediated transcriptional silencing resembles allelic exclusion, because both miRNA gene and genes, which can be targeted by this miRNA, contain elements with the same sequences. It can be assumed that RNA-dependent DNA methylation and allelic exclusion originated with the purpose of counteracting the activity of mobile genetic elements. Probably, thinning and deregulation of the cellular non-coding RNA pattern allows reactivation of silent mobile genetic elements resulting in genome instability that leads to ageing and carcinogenesis. In the course of X-inactivation, DNA methylation and subsequent hete­rochromatinization of X chromosome can be triggered by direct hybridization of 5′-end of large non-coding RNA Xist with DNA targets in remote regions of the X chromosome.

  3. Color differences among feral pigeons (Columba livia) are not attributable to sequence variation in the coding region of the melanocortin-1 receptor gene (MC1R)

    Science.gov (United States)

    2013-01-01

    Background Genetic variation at the melanocortin-1 receptor (MC1R) gene is correlated with melanin color variation in many birds. Feral pigeons (Columba livia) show two major melanin-based colorations: a red coloration due to pheomelanic pigment and a black coloration due to eumelanic pigment. Furthermore, within each color type, feral pigeons display continuous variation in the amount of melanin pigment present in the feathers, with individuals varying from pure white to a full dark melanic color. Coloration is highly heritable and it has been suggested that it is under natural or sexual selection, or both. Our objective was to investigate whether MC1R allelic variants are associated with plumage color in feral pigeons. Findings We sequenced 888 bp of the coding sequence of MC1R among pigeons varying both in the type, eumelanin or pheomelanin, and the amount of melanin in their feathers. We detected 10 non-synonymous substitutions and 2 synonymous substitution but none of them were associated with a plumage type. It remains possible that non-synonymous substitutions that influence coloration are present in the short MC1R fragment that we did not sequence but this seems unlikely because we analyzed the entire functionally important region of the gene. Conclusions Our results show that color differences among feral pigeons are probably not attributable to amino acid variation at the MC1R locus. Therefore, variation in regulatory regions of MC1R or variation in other genes may be responsible for the color polymorphism of feral pigeons. PMID:23915680

  4. Portability of tag SNPs across isolated population groups: an example from India.

    Science.gov (United States)

    Sarkar Roy, N; Farheen, S; Roy, N; Sengupta, S; Majumder, P P

    2008-01-01

    Isolated population groups are useful in conducting association studies of complex diseases to avoid various pitfalls, including those arising from population stratification. Since DNA resequencing is expensive, it is recommended that genotyping be carried out at tagSNP (tSNP) loci. For this, tSNPs identified in one isolated population need to be used in another. Unless tSNPs are highly portable across populations this strategy may result in loss of information in association studies. We examined the issue of tSNP portability by sampling individuals from 10 isolated ethnic groups from India. We generated DNA resequencing data pertaining to 3 genomic regions and identified tSNPs in each population. We defined an index of tSNP portability and showed that portability is low across isolated Indian ethnic groups. The extent of portability did not significantly correlate with genetic similarity among the populations studied here. We also analyzed our data with sequence data from individuals of African and European descent. Our results indicated that it may be necessary to carry out resequencing in a small number of individuals to discover SNPs and identify tSNPs in the specific isolated population in which a disease association study is to be conducted.

  5. Identification of evolutionarily conserved non-AUG-initiated N-terminal extensions in human coding sequences.

    LENUS (Irish Health Repository)

    Ivanov, Ivaylo P

    2011-05-01

    In eukaryotes, it is generally assumed that translation initiation occurs at the AUG codon closest to the messenger RNA 5\\' cap. However, in certain cases, initiation can occur at codons differing from AUG by a single nucleotide, especially the codons CUG, UUG, GUG, ACG, AUA and AUU. While non-AUG initiation has been experimentally verified for a handful of human genes, the full extent to which this phenomenon is utilized--both for increased coding capacity and potentially also for novel regulatory mechanisms--remains unclear. To address this issue, and hence to improve the quality of existing coding sequence annotations, we developed a methodology based on phylogenetic analysis of predicted 5\\' untranslated regions from orthologous genes. We use evolutionary signatures of protein-coding sequences as an indicator of translation initiation upstream of annotated coding sequences. Our search identified novel conserved potential non-AUG-initiated N-terminal extensions in 42 human genes including VANGL2, FGFR1, KCNN4, TRPV6, HDGF, CITED2, EIF4G3 and NTF3, and also affirmed the conservation of known non-AUG-initiated extensions in 17 other genes. In several instances, we have been able to obtain independent experimental evidence of the expression of non-AUG-initiated products from the previously published literature and ribosome profiling data.

  6. CodonShuffle: a tool for generating and analyzing synonymously mutated sequences

    OpenAIRE

    Jorge, Daniel Macedo de Melo; Mills, Ryan E.; Lauring, Adam S.

    2015-01-01

    Because synonymous mutations do not change the amino acid sequence of a protein, they are generally considered to be selectively neutral. Empiric data suggest, however, that a significant fraction of viral mutational fitness effects may be attributable to synonymous mutation. Bias in synonymous codon usage in viruses may result from selection for translational efficiency, mutational bias, base pairing requirements in RNA structures, or even selection against specific dinucleotides by innate i...

  7. Genome-Wide Analysis of the Synonymous Codon Usage Patterns in Riemerella anatipestifer

    Directory of Open Access Journals (Sweden)

    Jibin Liu

    2016-08-01

    Full Text Available Riemerella anatipestifer (RA belongs to the Flavobacteriaceae family and can cause a septicemia disease in poultry. The synonymous codon usage patterns of bacteria reflect a series of evolutionary changes that enable bacteria to improve tolerance of the various environments. We detailed the codon usage patterns of RA isolates from the available 12 sequenced genomes by multiple codon and statistical analysis. Nucleotide compositions and relative synonymous codon usage (RSCU analysis revealed that A or U ending codons are predominant in RA. Neutrality analysis found no significant correlation between GC12 and GC3 (p > 0.05. Correspondence analysis and ENc-plot results showed that natural selection dominated over mutation in the codon usage bias. The tree of cluster analysis based on RSCU was concordant with dendrogram based on genomic BLAST by neighbor-joining method. By comparative analysis, about 50 highly expressed genes that were orthologs across all 12 strains were found in the top 5% of high CAI value. Based on these CAI values, we infer that RA contains a number of predicted highly expressed coding sequences, involved in transcriptional regulation and metabolism, reflecting their requirement for dealing with diverse environmental conditions. These results provide some useful information on the mechanisms that contribute to codon usage bias and evolution of RA.

  8. Facial Expression Recognition via Non-Negative Least-Squares Sparse Coding

    Directory of Open Access Journals (Sweden)

    Ying Chen

    2014-05-01

    Full Text Available Sparse coding is an active research subject in signal processing, computer vision, and pattern recognition. A novel method of facial expression recognition via non-negative least squares (NNLS sparse coding is presented in this paper. The NNLS sparse coding is used to form a facial expression classifier. To testify the performance of the presented method, local binary patterns (LBP and the raw pixels are extracted for facial feature representation. Facial expression recognition experiments are conducted on the Japanese Female Facial Expression (JAFFE database. Compared with other widely used methods such as linear support vector machines (SVM, sparse representation-based classifier (SRC, nearest subspace classifier (NSC, K-nearest neighbor (KNN and radial basis function neural networks (RBFNN, the experiment results indicate that the presented NNLS method performs better than other used methods on facial expression recognition tasks.

  9. All SNPs are not created equal: genome-wide association studies reveal a consistent pattern of enrichment among functionally annotated SNPs

    DEFF Research Database (Denmark)

    Schork, Andrew J; Thompson, Wesley K; Pham, Phillip

    2013-01-01

    Recent results indicate that genome-wide association studies (GWAS) have the potential to explain much of the heritability of common complex phenotypes, but methods are lacking to reliably identify the remaining associated single nucleotide polymorphisms (SNPs). We applied stratified False...... Discovery Rate (sFDR) methods to leverage genic enrichment in GWAS summary statistics data to uncover new loci likely to replicate in independent samples. Specifically, we use linkage disequilibrium-weighted annotations for each SNP in combination with nominal p-values to estimate the True Discovery Rate...... in introns, and negative enrichment for intergenic SNPs. Stratified enrichment directly leads to increased TDR for a given p-value, mirrored by increased replication rates in independent samples. We show this in independent Crohn's disease GWAS, where we find a hundredfold variation in replication rate...

  10. nocoRNAc: Characterization of non-coding RNAs in prokaryotes

    Directory of Open Access Journals (Sweden)

    Nieselt Kay

    2011-01-01

    Full Text Available Abstract Background The interest in non-coding RNAs (ncRNAs constantly rose during the past few years because of the wide spectrum of biological processes in which they are involved. This led to the discovery of numerous ncRNA genes across many species. However, for most organisms the non-coding transcriptome still remains unexplored to a great extent. Various experimental techniques for the identification of ncRNA transcripts are available, but as these methods are costly and time-consuming, there is a need for computational methods that allow the detection of functional RNAs in complete genomes in order to suggest elements for further experiments. Several programs for the genome-wide prediction of functional RNAs have been developed but most of them predict a genomic locus with no indication whether the element is transcribed or not. Results We present NOCORNAc, a program for the genome-wide prediction of ncRNA transcripts in bacteria. NOCORNAc incorporates various procedures for the detection of transcriptional features which are then integrated with functional ncRNA loci to determine the transcript coordinates. We applied RNAz and NOCORNAc to the genome of Streptomyces coelicolor and detected more than 800 putative ncRNA transcripts most of them located antisense to protein-coding regions. Using a custom design microarray we profiled the expression of about 400 of these elements and found more than 300 to be transcribed, 38 of them are predicted novel ncRNA genes in intergenic regions. The expression patterns of many ncRNAs are similarly complex as those of the protein-coding genes, in particular many antisense ncRNAs show a high expression correlation with their protein-coding partner. Conclusions We have developed NOCORNAc, a framework that facilitates the automated characterization of functional ncRNAs. NOCORNAc increases the confidence of predicted ncRNA loci, especially if they contain transcribed ncRNAs. NOCORNAc is not restricted to

  11. Developmental programming of long non-coding RNAs during postnatal liver maturation in mice.

    Directory of Open Access Journals (Sweden)

    Lai Peng

    Full Text Available The liver is a vital organ with critical functions in metabolism, protein synthesis, and immune defense. Most of the liver functions are not mature at birth and many changes happen during postnatal liver development. However, it is unclear what changes occur in liver after birth, at what developmental stages they occur, and how the developmental processes are regulated. Long non-coding RNAs (lncRNAs are involved in organ development and cell differentiation. Here, we analyzed the transcriptome of lncRNAs in mouse liver from perinatal (day -2 to adult (day 60 by RNA-Sequencing, with an attempt to understand the role of lncRNAs in liver maturation. We found around 15,000 genes expressed, including about 2,000 lncRNAs. Most lncRNAs were expressed at a lower level than coding RNAs. Both coding RNAs and lncRNAs displayed three major ontogenic patterns: enriched at neonatal, adolescent, or adult stages. Neighboring coding and non-coding RNAs showed the trend to exhibit highly correlated ontogenic expression patterns. Gene ontology (GO analysis revealed that some lncRNAs enriched at neonatal ages have their neighbor protein coding genes also enriched at neonatal ages and associated with cell proliferation, immune activation related processes, tissue organization pathways, and hematopoiesis; other lncRNAs enriched at adolescent ages have their neighbor protein coding genes associated with different metabolic processes. These data reveal significant functional transition during postnatal liver development and imply the potential importance of lncRNAs in liver maturation.

  12. Novel polymorphisms in UTR and coding region of inducible heat shock protein 70.1 gene in tropically adapted Indian zebu cattle (Bos indicus) and riverine buffalo (Bubalus bubalis).

    Science.gov (United States)

    Sodhi, M; Mukesh, M; Kishore, A; Mishra, B P; Kataria, R S; Joshi, B K

    2013-09-25

    Due to evolutionary divergence, cattle (taurine, and indicine) and buffalo are speculated to have different responses to heat stress condition. Variation in candidate genes associated with a heat-shock response may provide an insight into the dissimilarity and suggest targets for intervention. The present work was undertaken to characterize one of the inducible heat shock protein genes promoter and coding regions in diverse breeds of Indian zebu cattle and buffaloes. The genomic DNA from a panel of 117 unrelated animals representing 14 diversified native cattle breeds and 6 buffalo breeds were utilized to determine the complete sequence and gene diversity of HSP70.1 gene. The coding region of HSP70.1 gene in Indian zebu cattle, Bos taurus and buffalo was similar in length (1,926 bp) encoding a HSP70 protein of 641 amino acids with a calculated molecular weight (Mw) of 70.26 kDa. However buffalo had a longer 5' and 3' untranslated region (UTR) of 204 and 293 nucleotides respectively, in comparison to Indian zebu cattle and Bos taurus wherein length of 5' and 3'-UTR was 172 and 286 nucleotides, respectively. The increased length of buffalo HSP70.1 gene compared to indicine and taurine gene was due to two insertions each in 5' and 3'-UTR. Comparative sequence analysis of cattle (taurine and indicine) and buffalo HSP70.1 gene revealed a total of 54 gene variations (50 SNPs and 4 INDELs) among the three species in the HSP70.1 gene. The minor allele frequencies of these nucleotide variations varied from 0.03 to 0.5 with an average of 0.26. Among the 14 B. indicus cattle breeds studied, a total of 19 polymorphic sites were identified: 4 in the 5'-UTR and 15 in the coding region (of these 2 were non-synonymous). Analysis among buffalo breeds revealed 15 SNPs throughout the gene: 6 at the 5' flanking region and 9 in the coding region. In bubaline 5'-UTR, 2 additional putative transcription factor binding sites (Elk-1 and C-Re1) were identified, other than three common sites

  13. Analysis of EFL Students' Ability in Reading Vocabulary of Synonyms and Antonyms

    Directory of Open Access Journals (Sweden)

    Vina Fathira

    2017-02-01

    Full Text Available Reading is an important thing for academic level. Every student must have many vocabularies to encourage her/his reading skill. The aim of this research is to analyze the students' understanding of reading vocabularies of synonyms and antonyms in the higher education level. Synonyms and antonyms are two important things should be mastered to get better reading comprehension. The method used in this research was quantitative with survey design. The population same as the sample of this research was from fifth semester students of STIBA Persada Bunda Pekanbaru. The procedures of the research were divided into 3 parts. First, students were asked to choose the best choice in the multiple choice for synonyms and anton, number and the wrong number, and grouped the wrong number into difficulties level. Last, the researcher analyzed the students' ability in reading vocabulary of synonyms and antonyms and concluded the result of students' ability in reading vocabulary of synonyms and antonyms in elementary, intermediate, and advanced level. The result of this research showed that the students' ability in reading vocabulary of synonyms and antonyms was categorized into "excellent" level with mean score 85. From the three difficulties level of question, the findings of this research were explained every level of question. In synonyms, the mean score of students' ability were 89, 85, and 84 for elementary, intermediate, and advanced level of question. Whereas, in antonyms, the mean score of students' ability were 97, 85, and 69 for elementary, intermediate, and advanced level of question.Keywords: students' ability, reading vocabulary, synonyms and antonyms

  14. Non-coding RNA networks in cancer.

    Science.gov (United States)

    Anastasiadou, Eleni; Jacob, Leni S; Slack, Frank J

    2018-01-01

    Thousands of unique non-coding RNA (ncRNA) sequences exist within cells. Work from the past decade has altered our perception of ncRNAs from 'junk' transcriptional products to functional regulatory molecules that mediate cellular processes including chromatin remodelling, transcription, post-transcriptional modifications and signal transduction. The networks in which ncRNAs engage can influence numerous molecular targets to drive specific cell biological responses and fates. Consequently, ncRNAs act as key regulators of physiological programmes in developmental and disease contexts. Particularly relevant in cancer, ncRNAs have been identified as oncogenic drivers and tumour suppressors in every major cancer type. Thus, a deeper understanding of the complex networks of interactions that ncRNAs coordinate would provide a unique opportunity to design better therapeutic interventions.

  15. Evidence of translation efficiency adaptation of the coding regions of the bacteriophage lambda.

    Science.gov (United States)

    Goz, Eli; Mioduser, Oriah; Diament, Alon; Tuller, Tamir

    2017-08-01

    Deciphering the way gene expression regulatory aspects are encoded in viral genomes is a challenging mission with ramifications related to all biomedical disciplines. Here, we aimed to understand how the evolution shapes the bacteriophage lambda genes by performing a high resolution analysis of ribosomal profiling data and gene expression related synonymous/silent information encoded in bacteriophage coding regions.We demonstrated evidence of selection for distinct compositions of synonymous codons in early and late viral genes related to the adaptation of translation efficiency to different bacteriophage developmental stages. Specifically, we showed that evolution of viral coding regions is driven, among others, by selection for codons with higher decoding rates; during the initial/progressive stages of infection the decoding rates in early/late genes were found to be superior to those in late/early genes, respectively. Moreover, we argued that selection for translation efficiency could be partially explained by adaptation to Escherichia coli tRNA pool and the fact that it can change during the bacteriophage life cycle.An analysis of additional aspects related to the expression of viral genes, such as mRNA folding and more complex/longer regulatory signals in the coding regions, is also reported. The reported conclusions are likely to be relevant also to additional viruses. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  16. Development of 3D CFD code based on structured non-orthogonal grids

    International Nuclear Information System (INIS)

    Vaidya, Abhijeet Mohan; Maheshwari, Naresh Kumar; Rama Rao, A.

    2016-01-01

    Most of the nuclear industry problems involve complex geometries. Solution of flow and heat transfer over complex geometries is a very important requirement for designing new reactor systems. Hence development of a general purpose three dimensional (3D) CFD code is undertaken. For handling complex shape of computational domain, implementation on structured non-orthogonal coordinates is being done. The code is validated by comparing its results for 3D inclined lid driven cavity at different inclination angles and Reynolds numbers with OpenFOAM results. This paper contains formulation and validation of the new code developed. (author)

  17. Characteristics of forming of synonymic rows within lexical phraseological field

    Directory of Open Access Journals (Sweden)

    Мария Валерьевна Волнакова

    2011-03-01

    Full Text Available The article deals with the characteristics of forming of phraseological synonymic rows with a lexical identifier as a dominant of a row. Revealed synonymic rows mirror the deepness of systematic language relationships between lexis and phraseology.

  18. Computational Approaches Reveal New Insights into Regulation and Function of Non; coding RNAs and their Targets

    KAUST Repository

    Alam, Tanvir

    2016-01-01

    Regulation and function of protein-coding genes are increasingly well-understood, but no comparable evidence exists for non-coding RNA (ncRNA) genes, which appear to be more numerous than protein-coding genes. We developed a novel machine

  19. Computational Profiling of Deleterious Non-Synonymous SNP’s in HFE

    Directory of Open Access Journals (Sweden)

    S. Sarath Raj

    2017-12-01

    Full Text Available Liver cirrhosis describes a condition where scar tissue gradually replaces healthy cells in liver. The main causes are sustained, excessive alcohol consumption, viral hepatitis B and C, and fatty liver disease - however, there are other possible causes. Hemochromatosis is most common form of iron overload disease. Three types of hemochromatosis are primary hemochromatosis, also called hereditary hemochromatosis; secondary hemochromatosis; and neonatal hemochromatosis. The HFE gene helps regulate the amount of iron absorbed from food and inherited genetic defects or mutation in HFE [C282Y] cause primary hemochromatosis. Computational approach is sought to determine other similar mutations in this gene. In-silico tools such as SIFT, Polyphen 2.0, and PROVEAN were employed to determine the various deleterious ns-SNPs of HFE that may influence cystic fibrosis.

  20. Development of non-orthogonal and 2-dimensional numerical code TFC2D-BFC for fluid flow

    International Nuclear Information System (INIS)

    Park, Ju Yeop; In, Wang Kee; Chun, Tae Hyun; Oh, Dong Seok

    2000-09-01

    The development of algorithm for three dimensional non-orthogonal coordinate system has been made. The algorithm adopts a non-staggered grid system, Cartesian velocity components for independent variables of momentum equations and a SIMPLER algorithm for a pressure correction equation. Except the pressure correction method, the selected grid system and the selected independent variables for momentum equations have been widely used in a commercial code. It is well known that the SIMPLER is superior to the SIMPLE algorithm in the view of convergence rate. Using this algorithm, a two dimensional non-orthogonal numerical code has been completed. The code adopts a structured single square block in a computational domain with a uniform mesh interval. Consequently, any solid body existing in a flow field can be implemented in the numerical code through a blocked-off method which was devised by Patankar

  1. Evaluation of temporary non-code repairs in safety class 3 piping systems

    International Nuclear Information System (INIS)

    Godha, P.C.; Kupinski, M.; Azevedo, N.F.

    1996-01-01

    Temporary non-ASME Code repairs in safety class 3 pipe and piping components are permissible during plant operation in accordance with Nuclear Regulatory Commission Generic Letter 90-05. However, regulatory acceptance of such repairs requires the licensee to undertake several timely actions. Consistent with the requirements of GL 90-05, this paper presents an overview of the detailed evaluation and relief request process. The technical criteria encompasses both ductile and brittle piping materials. It also lists appropriate evaluation methods that a utility engineer can select to perform a structural integrity assessment for design basis loading conditions to support the use of temporary non-Code repair for degraded piping components. Most use of temporary non-code repairs at a nuclear generating station is in the service water system which is an essential safety related system providing the ultimate heat sink for various plant systems. Depending on the plant siting, the service water system may use fresh water or salt water as the cooling medium. Various degradation mechanisms including general corrosion, erosion/corrosion, pitting, microbiological corrosion, galvanic corrosion, under-deposit corrosion or a combination thereof continually challenge the pressure boundary structural integrity. A good source for description of corrosion degradation in cooling water systems is provided in a cited reference

  2. Genomic variation in myeloma: design, content, and initial application of the Bank On A Cure SNP Panel to detect associations with progression-free survival

    Directory of Open Access Journals (Sweden)

    Fang Gang

    2008-09-01

    Full Text Available Abstract Background We have engaged in an international program designated the Bank On A Cure, which has established DNA banks from multiple cooperative and institutional clinical trials, and a platform for examining the association of genetic variations with disease risk and outcomes in multiple myeloma. We describe the development and content of a novel custom SNP panel that contains 3404 SNPs in 983 genes, representing cellular functions and pathways that may influence disease severity at diagnosis, toxicity, progression or other treatment outcomes. A systematic search of national databases was used to identify non-synonymous coding SNPs and SNPs within transcriptional regulatory regions. To explore SNP associations with PFS we compared SNP profiles of short term (less than 1 year, n = 70 versus long term progression-free survivors (greater than 3 years, n = 73 in two phase III clinical trials. Results Quality controls were established, demonstrating an accurate and robust screening panel for genetic variations, and some initial racial comparisons of allelic variation were done. A variety of analytical approaches, including machine learning tools for data mining and recursive partitioning analyses, demonstrated predictive value of the SNP panel in survival. While the entire SNP panel showed genotype predictive association with PFS, some SNP subsets were identified within drug response, cellular signaling and cell cycle genes. Conclusion A targeted gene approach was undertaken to develop an SNP panel that can test for associations with clinical outcomes in myeloma. The initial analysis provided some predictive power, demonstrating that genetic variations in the myeloma patient population may influence PFS.

  3. Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies.

    Directory of Open Access Journals (Sweden)

    Clive J Hoggart

    2008-07-01

    Full Text Available Testing one SNP at a time does not fully realise the potential of genome-wide association studies to identify multiple causal variants, which is a plausible scenario for many complex diseases. We show that simultaneous analysis of the entire set of SNPs from a genome-wide study to identify the subset that best predicts disease outcome is now feasible, thanks to developments in stochastic search methods. We used a Bayesian-inspired penalised maximum likelihood approach in which every SNP can be considered for additive, dominant, and recessive contributions to disease risk. Posterior mode estimates were obtained for regression coefficients that were each assigned a prior with a sharp mode at zero. A non-zero coefficient estimate was interpreted as corresponding to a significant SNP. We investigated two prior distributions and show that the normal-exponential-gamma prior leads to improved SNP selection in comparison with single-SNP tests. We also derived an explicit approximation for type-I error that avoids the need to use permutation procedures. As well as genome-wide analyses, our method is well-suited to fine mapping with very dense SNP sets obtained from re-sequencing and/or imputation. It can accommodate quantitative as well as case-control phenotypes, covariate adjustment, and can be extended to search for interactions. Here, we demonstrate the power and empirical type-I error of our approach using simulated case-control data sets of up to 500 K SNPs, a real genome-wide data set of 300 K SNPs, and a sequence-based dataset, each of which can be analysed in a few hours on a desktop workstation.

  4. Decoding the non-coding RNAs in Alzheimer's disease.

    Science.gov (United States)

    Schonrock, Nicole; Götz, Jürgen

    2012-11-01

    Non-coding RNAs (ncRNAs) are integral components of biological networks with fundamental roles in regulating gene expression. They can integrate sequence information from the DNA code, epigenetic regulation and functions of multimeric protein complexes to potentially determine the epigenetic status and transcriptional network in any given cell. Humans potentially contain more ncRNAs than any other species, especially in the brain, where they may well play a significant role in human development and cognitive ability. This review discusses their emerging role in Alzheimer's disease (AD), a human pathological condition characterized by the progressive impairment of cognitive functions. We discuss the complexity of the ncRNA world and how this is reflected in the regulation of the amyloid precursor protein and Tau, two proteins with central functions in AD. By understanding this intricate regulatory network, there is hope for a better understanding of disease mechanisms and ultimately developing diagnostic and therapeutic tools.

  5. Promoter Analysis Reveals Globally Differential Regulation of Human Long Non-Coding RNA and Protein-Coding Genes

    KAUST Repository

    Alam, Tanvir

    2014-10-02

    Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptional regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.

  6. Genomic Selection for Drought Tolerance Using Genome-Wide SNPs in Maize

    Directory of Open Access Journals (Sweden)

    Thirunavukkarasu Nepolean

    2017-04-01

    Full Text Available Traditional breeding strategies for selecting superior genotypes depending on phenotypic traits have proven to be of limited success, as this direct selection is hindered by low heritability, genetic interactions such as epistasis, environmental-genotype interactions, and polygenic effects. With the advent of new genomic tools, breeders have paved a way for selecting superior breeds. Genomic selection (GS has emerged as one of the most important approaches for predicting genotype performance. Here, we tested the breeding values of 240 maize subtropical lines phenotyped for drought at different environments using 29,619 cured SNPs. Prediction accuracies of seven genomic selection models (ridge regression, LASSO, elastic net, random forest, reproducing kernel Hilbert space, Bayes A and Bayes B were tested for their agronomic traits. Though prediction accuracies of Bayes B, Bayes A and RKHS were comparable, Bayes B outperformed the other models by predicting highest Pearson correlation coefficient in all three environments. From Bayes B, a set of the top 1053 significant SNPs with higher marker effects was selected across all datasets to validate the genes and QTLs. Out of these 1053 SNPs, 77 SNPs associated with 10 drought-responsive transcription factors. These transcription factors were associated with different physiological and molecular functions (stomatal closure, root development, hormonal signaling and photosynthesis. Of several models, Bayes B has been shown to have the highest level of prediction accuracy for our data sets. Our experiments also highlighted several SNPs based on their performance and relative importance to drought tolerance. The result of our experiments is important for the selection of superior genotypes and candidate genes for breeding drought-tolerant maize hybrids.

  7. Application of SNPs for population genetics of nonmodel organisms: new opportunities and challenges

    DEFF Research Database (Denmark)

    Helyar, S.J.; Hansen, Jakob Hemmer; Bekkevold, Dorte

    2011-01-01

    Recent improvements in the speed, cost and accuracy of next generation sequencing are revolutionizing the discovery of single nucleotide polymorphisms (SNPs). SNPs are increasingly being used as an addition to the molecular ecology toolkit in nonmodel organisms, but their efficient use remains...

  8. Biomarker Detection in Association Studies: Modeling SNPs Simultaneously via Logistic ANOVA

    KAUST Repository

    Jung, Yoonsuh

    2014-10-02

    In genome-wide association studies, the primary task is to detect biomarkers in the form of Single Nucleotide Polymorphisms (SNPs) that have nontrivial associations with a disease phenotype and some other important clinical/environmental factors. However, the extremely large number of SNPs comparing to the sample size inhibits application of classical methods such as the multiple logistic regression. Currently the most commonly used approach is still to analyze one SNP at a time. In this paper, we propose to consider the genotypes of the SNPs simultaneously via a logistic analysis of variance (ANOVA) model, which expresses the logit transformed mean of SNP genotypes as the summation of the SNP effects, effects of the disease phenotype and/or other clinical variables, and the interaction effects. We use a reduced-rank representation of the interaction-effect matrix for dimensionality reduction, and employ the L 1-penalty in a penalized likelihood framework to filter out the SNPs that have no associations. We develop a Majorization-Minimization algorithm for computational implementation. In addition, we propose a modified BIC criterion to select the penalty parameters and determine the rank number. The proposed method is applied to a Multiple Sclerosis data set and simulated data sets and shows promise in biomarker detection.

  9. Biomarker Detection in Association Studies: Modeling SNPs Simultaneously via Logistic ANOVA

    KAUST Repository

    Jung, Yoonsuh; Huang, Jianhua Z.; Hu, Jianhua

    2014-01-01

    In genome-wide association studies, the primary task is to detect biomarkers in the form of Single Nucleotide Polymorphisms (SNPs) that have nontrivial associations with a disease phenotype and some other important clinical/environmental factors. However, the extremely large number of SNPs comparing to the sample size inhibits application of classical methods such as the multiple logistic regression. Currently the most commonly used approach is still to analyze one SNP at a time. In this paper, we propose to consider the genotypes of the SNPs simultaneously via a logistic analysis of variance (ANOVA) model, which expresses the logit transformed mean of SNP genotypes as the summation of the SNP effects, effects of the disease phenotype and/or other clinical variables, and the interaction effects. We use a reduced-rank representation of the interaction-effect matrix for dimensionality reduction, and employ the L 1-penalty in a penalized likelihood framework to filter out the SNPs that have no associations. We develop a Majorization-Minimization algorithm for computational implementation. In addition, we propose a modified BIC criterion to select the penalty parameters and determine the rank number. The proposed method is applied to a Multiple Sclerosis data set and simulated data sets and shows promise in biomarker detection.

  10. All SNPs Are Not Created Equal: Genome-Wide Association Studies Reveal a Consistent Pattern of Enrichment among Functionally Annotated SNPs

    NARCIS (Netherlands)

    Schork, Andrew J.; Thompson, Wesley K.; Pham, Phillip; Torkamani, Ali; Roddey, J. Cooper; Sullivan, Patrick F.; Kelsoe, John R.; O'Donovan, Michael C.; Furberg, Helena; Schork, Nicholas J.; Andreassen, Ole A.; Dale, Anders M.; Absher, Devin; Agudo, Antonio; Almgren, Peter; Ardissino, Diego; Assimes, Themistocles L.; Bandinelli, Stephania; Barzan, Luigi; Bencko, Vladimir; Benhamou, Simone; Benjamin, Emelia J.; Bernardinelli, Luisa; Bis, Joshua; Boehnke, Michael; Boerwinkle, Eric; Boomsma, Dorret I.; Brennan, Paul; Canova, Cristina; Castellsagué, Xavier; Chanock, Stephen; Chasman, Daniel; Conway, David I.; Dackor, Jennifer; de Geus, Eco J. C.; Duan, Jubao; Elosua, Roberto; Everett, Brendan; Fabianova, Eleonora; Ferrucci, Luigi; Foretova, Lenka; Fortmann, Stephen P.; Franceschini, Nora; Frayling, Timothy; Furberg, Curt; Gejman, Pablo V.; Groop, Leif; Gu, Fangyi; de Haan, Lieuwe; Linszen, Don H.

    2013-01-01

    Recent results indicate that genome-wide association studies (GWAS) have the potential to explain much of the heritability of common complex phenotypes, but methods are lacking to reliably identify the remaining associated single nucleotide polymorphisms (SNPs). We applied stratified False Discovery

  11. All SNPs are not created equal: Genome-wide association studies reveal a consistent pattern of enrichment among functionally annotated SNPs

    NARCIS (Netherlands)

    Schork, A.J.; Thompson, W.K.; Pham, P.; Torkamani, A.; Roddey, J.C.; Sullivan, P.F.; Kelsoe, J.; O'Donovan, M.C.; Furberg, H.; Absher, D.; Agudo, A.; Almgren, P.; Ardissino, D.; Assimes, T.L.; Bandinelli, S.; Barzan, L.; Bencko, V.; Benhamou, S.; Benjamin, E.J.; Bernardinelli, L.; Bis, J.; Boehnke, M.; Boerwinkle, E.; Boomsma, D.I.; Brennan, P.; Canova, C.; Castellsagué, X.; Chanock, S.; Chasman, D.I.; Conway, D.I.; Dackor, J.; de Geus, E.J.C.; Duan, J.; Elosua, R.; Everett, B.; Fabianova, E.; Ferrucci, L.; Foretova, L.; Fortmann, S.P.; Franceschini, N.; Frayling, T.M.; Furberg, C.; Gejman, P.V.; Groop, L.; Gu, F.; Guralnik, J.; Hankinson, S.E.; Haritunians, T.; Healy, C.; Hofman, A.; Holcátová, I.; Hunter, D.J.; Hwang, S.J.; Ioannidis, J.P.A.; Iribarren, C.; Jackson, A.U.; Janout, V.; Kaprio, J.; Kim, Y.; Kjaerheim, K.; Knowles, J.W.; Kraft, P.; Ladenvall, C.; Lagiou, P.; Lanthrop, M.; Lerman, C.; Levinson, D.F.; Levy, D.; Li, M.D.; Lin, D.Y.; Lips, E.H.; Lissowska, J.; Lowry, R.B.; Lucas, G.; Macfarlane, T.V.; Maes, H.H.M.; Mannucci, P.M.; Mates, D.; Mauri, F.; McGovern, J.A.; McKay, J.D.; McKnight, B.; Melander, O.; Merlini, P.A.; Milaneschi, Y.; Mohlke, K.L.; O'Donnell, C.J.; Pare, G.; Penninx, B.W.J.H.; Perry, J.R.B.; Posthuma, D.; Preis, S.R.; Psaty, B.; Quertermous, T.; Ramachandran, V.S.; Richiardi, L.; Ridker, P.M.; Rose, J.; Rudnai, P.; Salomaa, V.; Sanders, A.R.; Schwartz, S.M.; Shi, J.; Smit, J.H.; Stringham, H.M.; Szeszenia-Dabrowska, N.; Tanaka, T.; Taylor, K.; Thacker, E.E.; Thornton, L.; Tiemeier, H.; Tuomilehto, J.; Uitterlinden, A.G.; van Duijn, C.M.; Vink, J.M.; Vogelzangs, N.; Voight, B.F.; Walter, S.; Willemsen, G.; Zaridze, D.; Znaor, A.; Akil, H.; Anjorin, A.; Backlund, L.; Badner, J.A.; Barchas, J.D.; Barrett, T.; Bass, N.; Bauer, M.; Bellivier, F.; Bergen, S.E.; Berrettini, W.; Blackwood, D.; Bloss, C.S.; Breen, G.; Breuer, R.; Bunner, W.E.; Burmeister, M.; Byerley, W. F.; Caesar, S.; Chambert, K.; Cichon, S.; St Clair, D.; Collier, D.A.; Corvin, A.; Coryell, W.H.; Craddock, N.; Craig, D.W.; Daly, M.; Day, R.; Degenhardt, F.; Djurovic, S.; Dudbridge, F.; Edenberg, H.J.; Elkin, A.; Etain, B.; Farmer, A.E.; Ferreira, M.A.; Ferrier, I.; Flickinger, M.; Foroud, T.; Frank, J.; Fraser, C.; Frisén, L.; Gershon, E.S.; Gill, M.; Gordon-Smith, K.; Green, E.K.; Greenwood, T.A.; Grozeva, D.; Guan, W.; Gurling, H.; Gustafsson, O.; Hamshere, M.L.; Hautzinger, M.; Herms, S.; Hipolito, M.; Holmans, P.A.; Hultman, C. M.; Jamain, S.; Jones, E.G.; Jones, I.; Jones, L.; Kandaswamy, R.; Kennedy, J.L.; Kirov, G. K.; Koller, D.L.; Kwan, P.; Landén, M.; Langstrom, N.; Lathrop, M.; Lawrence, J.; Lawson, W.B.; Leboyer, M.; Lee, P.H.; Li, J.; Lichtenstein, P.; Lin, D.; Liu, C.; Lohoff, F.W.; Lucae, S.; Mahon, P.B.; Maier, W.; Martin, N.G.; Mattheisen, M.; Matthews, K.; Mattingsdal, M.; McGhee, K.A.; McGuffin, P.; McInnis, M.G.; McIntosh, A.; McKinney, R.; McLean, A.W.; McMahon, F.J.; McQuillin, A.; Meier, S.; Melle, I.; Meng, F.; Mitchell, P.B.; Montgomery, G.W.; Moran, J.; Morken, G.; Morris, D.W.; Moskvina, V.; Muglia, P.; Mühleisen, T.W.; Muir, W.J.; Müller-Myhsok, B.; Myers, R.M.; Nievergelt, C.M.; Nikolov, I.; Nimgaonkar, V.L.; Nöthen, M.M.; Nurnberger, J.I.; Nwulia, E.A.; O'Dushlaine, C.; Osby, U.; Óskarsson, H.; Owen, M.J.; Petursson, H.; Pickard, B.S.; Porgeirsson, P.; Potash, J.B.; Propping, P.; Purcell, S.M.; Quinn, E.; Raychaudhuri, S.; Rice, J.; Rietschel, M.; Ruderfer, D.; Schalling, M.; Schatzberg, A.F.; Scheftner, W.A.; Schofield, P.R.; Schulze, T.G.; Schumacher, J.; Schwarz, M.M.; Scolnick, E.; Scott, L.J.; Shilling, P.D.; Sigurdsson, E.; Sklar, P.; Smith, E.N.; Stefansson, H.; Stefansson, K.; Steffens, M; Steinberg, S.; Strauss, J.; Strohmaier, J.; Szelinger, S.; Thompson, R.C.; Tozzi, F.; Treutlein, J.; Vincent, J.B.; Watson, S.J.; Wienker, T.F.; Williamson, R.; Witt, S.H.; Wright, A.; Xu, W.; Young, A.H.; Zandi, P.P.; Zhang, P.; Zöllner, S.; Agartz, I.; Albus, M.; Alexander, M.; Amdur, R. L.; Amin, F.; Bitter, I.; Black, D.W.; Børglum, A.D.; Brown, M.A.; Bruggeman, R.; Buccola, N.G.; Cahn, W.; Cantor, R.M.; Carr, V.J.; Catts, S. V.; Choudhury, K.; Cloninger, C. R.; Cormican, P.; Danoy, P. A.; Datta, S.; DeHert, M.; Demontis, D.; Dikeos, D.; Donnelly, P.; Donohoe, G.; Duong, L.; Dwyer, S.; Fanous, A.; Fink-Jensen, A.; Freedman, R.; Freimer, N.B.; Friedl, M.; Georgieva, L.; Giegling, I.; Glenthoj, B.; Godard, S.; Golimbet, V.; de Haan, L.; Hansen, M.; Hansen, T.; Hartmann, A.M.; Henskens, F. A.; Hougaard, D. M.; Ingason, A.; Jablensky, A. V.; Jakobsen, K.D.; Jay, M.; Jönsson, E.G.; Jürgens, G.; Kahn, R.S.; Keller, M.C.; Kendler, K.S.; Kenis, G.; Kenny, E.; Konnerth, H.; Konte, B.; Krabbendam, L.; Krasucki, R.; Lasseter, V. K.; Laurent, C.; Lencz, T.; Lerer, F. B.; Liang, K. Y.; Lieberman, J. A.; Linszen, D.H.; Lönnqvist, J.; Loughland, C. M.; Maclean, A. W.; Maher, B.S.; Malhotra, A.K.; Mallet, J.; Malloy, P.; McGrath, J. J.; McLean, D. E.; Michie, P. T.; Milanova, V.; Mors, O.; Mortensen, P.B.; Mowry, B. J.; Myin-Germeys, I.; Neale, B.; Nertney, D. A.; Nestadt, G.; Nielsen, J.; Nordentoft, M.; Norton, N.; O'Neill, F.; Olincy, A.; Olsen, L.; Ophoff, R.A.; Orntoft, T. F.; van Os, J.; Pantelis, C.; Papadimitriou, G.; Pato, C.N.; Peltonen, L.; Pickard, B.; Pietilainen, O.P.; Pimm, J.; Pulver, A. E.; Puri, V.; Quested, D.; Rasmussen, H.B.; Rethelyi, J.M.; Ribble, R.; Riley, B.P.; Rossin, L.; Ruggeri, M.; Rujescu, D.; Schall, U.; Schwab, S. G.; Scott, R.J.; Silverman, J.M.; Spencer, C. C.; Strange, A.; Strengman, E.; Stroup, T.S.; Suvisaari, J.; Terenius, L.; Thirumalai, S.; Timm, S.; Toncheva, D.; Tosato, S.; van den Oord, E.J.; Veldink, J.; Visscher, P.M.; Walsh, D.; Wang, A. G.; Werge, T.; Wiersma, D.; Wildenauer, D. B.; Williams, H.J.; Williams, N.M.; van Winkel, R.; Wormley, B.; Zammit, S.; Schork, N.J.; Andreassen, O.A.; Dale, A.M.

    2013-01-01

    Recent results indicate that genome-wide association studies (GWAS) have the potential to explain much of the heritability of common complex phenotypes, but methods are lacking to reliably identify the remaining associated single nucleotide polymorphisms (SNPs). We applied stratified False Discovery

  12. Characterization of novel and uncharacterized p53 SNPs in the Chinese population--intron 2 SNP co-segregates with the common codon 72 polymorphism.

    Directory of Open Access Journals (Sweden)

    Beng Hooi Phang

    Full Text Available Multiple single nucleotide polymorphisms (SNPs have been identified in the tumor suppressor gene p53, though the relevance of many of them is unclear. Some of them are also differentially distributed in various ethnic populations, suggesting selective functionality. We have therefore sequenced all exons and flanking regions of p53 from the Singaporean Chinese population and report here the characterization of some novel and uncharacterized SNPs - four in intron 1 (nucleotide positions 8759/10361/10506/11130, three in intron 3 (11968/11969/11974 and two in the 3'UTR (19168/19514. Allelic frequencies were determined for all these and some known SNPs, and were compared in a limited scale to leukemia and lung cancer patient samples. Intron 2 (11827 and 7 (14181/14201 SNPs were found to have a high minor allele frequency of between 26-47%, in contrast to the lower frequencies found in the US population, but similar in trend to the codon 72 polymorphism (SNP12139 that shows a distribution pattern correlative with latitude. Several of the SNPs were linked, such as those in introns 1, 3 and 7. Most interestingly, we noticed the co-segregation of the intron 2 and the codon 72 SNPs, the latter which has been shown to be expressed in an allele-specific manner, suggesting possible regulatory cross-talk. Association analysis indicated that the T/G alleles in both the co-segregating intron 7 SNPs and a 4tagSNP haplotype was strongly associated increased susceptibility to lung cancer in non-smoker females [OR: 1.97 (1.32, 3.394]. These data together demonstrate high SNP diversity in p53 gene between different populations, highlighting ethnicity-based differences, and their association with cancer risk.

  13. Novel SNPs polymorphism of bovine CACNA2D1 gene and their ...

    African Journals Online (AJOL)

    In this study, the bovine CACNA2D1 gene was taken as a candidate gene for mastitis resistance. The objective of this study was to identify single nucleotide polymorphisms (SNPs) in the bovine CACNA2D1 gene and evaluate the association of these SNPs with mastitis in cattle. Through DNA sequencing and PCR-RFLP ...

  14. Association of six CpG-SNPs in the inflammation-related genes with coronary heart disease

    OpenAIRE

    Chen, Xiaomin; Chen, Xiaoying; Xu, Yan; Yang, William; Wu, Nan; Ye, Huadan; Yang, Jack Y.; Hong, Qingxiao; Xin, Yanfei; Yang, Mary Qu; Deng, Youping; Duan, Shiwei

    2016-01-01

    Background Chronic inflammation has been widely considered to be the major risk factor of coronary heart disease (CHD). The goal of our study was to explore the possible association with CHD for inflammation-related single nucleotide polymorphisms (SNPs) involved in cytosine-phosphate-guanine (CpG) dinucleotides. A total of 784 CHD patients and 739 non-CHD controls were recruited from Zhejiang Province, China. Using the Sequenom MassARRAY platform, we measured the genotypes of six inflammatio...

  15. Forensic genetic informativeness of an SNP panel consisting of 19 multi-allelic SNPs.

    Science.gov (United States)

    Gao, Zehua; Chen, Xiaogang; Zhao, Yuancun; Zhao, Xiaohong; Zhang, Shu; Yang, Yiwen; Wang, Yufang; Zhang, Ji

    2018-05-01

    Current research focusing on forensic personal identification, phenotype inference and ancestry information on single-nucleotide polymorphisms (SNPs) has been widely reported. In the present study, we focused on tetra-allelic SNPs in the Chinese Han population. A total of 48 tetra-allelic SNPs were screened out from the Chinese Han population of the 1000 Genomes Database, including Chinese Han in Beijing (CHB) and Chinese Han South (CHS). Considering the forensic genetic requirement for the polymorphisms, only 11 tetra-allelic SNPs with a heterozygosity >0.06 were selected for further multiplex panel construction. In order to meet the demands of personal identification and parentage identification, an additional 8 tri-allelic SNPs were combined into the final multiplex panel. To ensure application in the degraded DNA analysis, all the PCR products were designed to be 87-188 bp. Employing multiple PCR reactions and SNaPshot minisequencing, 511 unrelated Chinese Han individuals from Sichuan were genotyped. The combined match probability (CMP), combined discrimination power (CDP), and cumulative probability of exclusion (CPE) of the panel were 6.07 × 10 -11 , 0.9999999999393 and 0.996764, respectively. Based on the population data retrieved from the 1000 Genomes Project, Fst values between Chinese Han in Sichuan (SCH) and all the populations included in the 1000 Genomes Project were calculated. The results indicated that two SNPs in this panel may contain ancestry information and may be used as markers of forensic biogeographical ancestry inference. Copyright © 2018 Elsevier B.V. All rights reserved.

  16. Complex analysis of urate transporters SLC2A9, SLC22A12 and functional characterization of non-synonymous allelic variants of GLUT9 in the Czech population: no evidence of effect on hyperuricemia and gout.

    Directory of Open Access Journals (Sweden)

    Olha Hurba

    Full Text Available OBJECTIVE: Using European descent Czech populations, we performed a study of SLC2A9 and SLC22A12 genes previously identified as being associated with serum uric acid concentrations and gout. This is the first study of the impact of non-synonymous allelic variants on the function of GLUT9 except for patients suffering from renal hypouricemia type 2. METHODS: The cohort consisted of 250 individuals (150 controls, 54 nonspecific hyperuricemics and 46 primary gout and/or hyperuricemia subjects. We analyzed 13 exons of SLC2A9 (GLUT9 variant 1 and GLUT9 variant 2 and 10 exons of SLC22A12 by PCR amplification and sequenced directly. Allelic variants were prepared and their urate uptake and subcellular localization were studied by Xenopus oocytes expression system. The functional studies were analyzed using the non-parametric Wilcoxon and Kruskall-Wallis tests; the association study used the Fisher exact test and linear regression approach. RESULTS: We identified a total of 52 sequence variants (12 unpublished. Eight non-synonymous allelic variants were found only in SLC2A9: rs6820230, rs2276961, rs144196049, rs112404957, rs73225891, rs16890979, rs3733591 and rs2280205. None of these variants showed any significant difference in the expression of GLUT9 and in urate transport. In the association study, eight variants showed a possible association with hyperuricemia. However, seven of these were in introns and the one exon located variant, rs7932775, did not show a statistically significant association with serum uric acid concentration. CONCLUSION: Our results did not confirm any effect of SLC22A12 and SLC2A9 variants on serum uric acid concentration. Our complex approach using association analysis together with functional and immunohistochemical characterization of non-synonymous allelic variants did not show any influence on expression, subcellular localization and urate uptake of GLUT9.

  17. Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

    Science.gov (United States)

    Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

    2017-10-03

    Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.

  18. A reduced number of mtSNPs saturates mitochondrial DNA haplotype diversity of worldwide population groups.

    Science.gov (United States)

    Salas, Antonio; Amigo, Jorge

    2010-05-03

    The high levels of variation characterising the mitochondrial DNA (mtDNA) molecule are due ultimately to its high average mutation rate; moreover, mtDNA variation is deeply structured in different populations and ethnic groups. There is growing interest in selecting a reduced number of mtDNA single nucleotide polymorphisms (mtSNPs) that account for the maximum level of discrimination power in a given population. Applications of the selected mtSNP panel range from anthropologic and medical studies to forensic genetic casework. This study proposes a new simulation-based method that explores the ability of different mtSNP panels to yield the maximum levels of discrimination power. The method explores subsets of mtSNPs of different sizes randomly chosen from a preselected panel of mtSNPs based on frequency. More than 2,000 complete genomes representing three main continental human population groups (Africa, Europe, and Asia) and two admixed populations ("African-Americans" and "Hispanics") were collected from GenBank and the literature, and were used as training sets. Haplotype diversity was measured for each combination of mtSNP and compared with existing mtSNP panels available in the literature. The data indicates that only a reduced number of mtSNPs ranging from six to 22 are needed to account for 95% of the maximum haplotype diversity of a given population sample. However, only a small proportion of the best mtSNPs are shared between populations, indicating that there is not a perfect set of "universal" mtSNPs suitable for all population contexts. The discrimination power provided by these mtSNPs is much higher than the power of the mtSNP panels proposed in the literature to date. Some mtSNP combinations also yield high diversity values in admixed populations. The proposed computational approach for exploring combinations of mtSNPs that optimise the discrimination power of a given set of mtSNPs is more efficient than previous empirical approaches. In contrast to

  19. Studies on interaction of colloidal silver nanoparticles (SNPs) with five different bacterial species.

    Science.gov (United States)

    Khan, S Sudheer; Mukherjee, Amitava; Chandrasekaran, N

    2011-10-01

    Silver nanoparticles (SNPs) are being increasingly used in many consumer products like textile fabrics, cosmetics, washing machines, food and drug products owing to its excellent antimicrobial properties. Here we have studied the adsorption and toxicity of SNPs on bacterial species such as Pseudomonas aeruginosa, Micrococcus luteus, Bacillus subtilis, Bacillus barbaricus and Klebsiella pneumoniae. The influence of zeta potential on the adsorption of SNPs on bacterial cell surface was investigated at acidic, neutral and alkaline pH and with varying salt (NaCl) concentrations (0.05, 0.1, 0.5, 1 and 1.5 M). The survival rate of bacterial species decreased with increase in adsorption of SNPs. Maximum adsorption and toxicity was observed at pH 5, and NaCl concentration of 0.5 M, there by resulting in less toxicity. The zeta potential study suggests that, the adsorption of SNPs on the cell surface was related to electrostatic force of attraction. The equilibrium and kinetics of the adsorption process were also studied. The adsorption equilibrium isotherms fitted well to the Langmuir model. The kinetics of adsorption fitted best to pseudo-first-order. These findings form a basis for interpreting the interaction of nanoparticles with environmental bacterial species. Copyright © 2011 Elsevier B.V. All rights reserved.

  20. Who's behind that mask and cape? The Asian leopard cat's Agouti (ASIP) allele likely affects coat colour phenotype in the Bengal cat breed.

    Science.gov (United States)

    Gershony, L C; Penedo, M C T; Davis, B W; Murphy, W J; Helps, C R; Lyons, L A

    2014-12-01

    Coat colours and patterns are highly variable in cats and are determined mainly by several genes with Mendelian inheritance. A 2-bp deletion in agouti signalling protein (ASIP) is associated with melanism in domestic cats. Bengal cats are hybrids between domestic cats and Asian leopard cats (Prionailurus bengalensis), and the charcoal coat colouration/pattern in Bengals presents as a possible incomplete melanism. The complete coding region of ASIP was directly sequenced in Asian leopard, domestic and Bengal cats. Twenty-seven variants were identified between domestic and leopard cats and were investigated in Bengals and Savannahs, a hybrid with servals (Leptailurus serval). The leopard cat ASIP haplotype was distinguished from domestic cat by four synonymous and four non-synonymous exonic SNPs, as well as 19 intronic variants, including a 42-bp deletion in intron 4. Fifty-six of 64 reported charcoal cats were compound heterozygotes at ASIP, with leopard cat agouti (A(P) (be) ) and domestic cat non-agouti (a) haplotypes. Twenty-four Bengals had an additional unique haplotype (A2) for exon 2 that was not identified in leopard cats, servals or jungle cats (Felis chaus). The compound heterozygote state suggests the leopard cat allele, in combination with the recessive non-agouti allele, influences Bengal markings, producing a darker, yet not completely melanistic coat. This is the first validation of a leopard cat allele segregating in the Bengal breed and likely affecting their overall pelage phenotype. Genetic testing services need to be aware of the possible segregation of wild felid alleles in all assays performed on hybrid cats. © 2014 The Authors. Animal Genetics published by John Wiley & Sons Ltd on behalf of Stichting International Foundation for Animal Genetics.

  1. The emerging role of non-coding RNAs in drug addiction

    Directory of Open Access Journals (Sweden)

    Gregory Charles Sartor

    2012-06-01

    Full Text Available Prolonged drug use causes long-lasting neuroadaptations in reward-related brain areas that contribute to addiction. Despite significant amount of research dedicated to understanding the underlying mechanisms of addiction, the molecular underpinnings remain unclear. At the same time, much of the pervasive transcription that encompasses the human genome occurs in the nervous system and contributes to its heterogeneity and complexity. Recent evidence suggests that non-coding RNAs (ncRNAs play an important and dynamic role in transcriptional regulation, epigenetic signaling, stress response, and plasticity in the nervous system. Dysregulation of ncRNAs are thought to contribute to many, and perhaps all, neurological disorders, including addiction. Here, we review recent insights in the functional relevance of ncRNAs, including both microRNAs (miRNAs and long non-coding RNAs (lncRNAs, and then illustrate specific examples of ncRNA regulation in the context of drug addiction. We conclude that ncRNAs are importantly involved in the persistent neuroadaptations associated with addiction-related behaviors, and that therapies that target specific ncRNAs may represent new avenues for the treatment of drug addiction.

  2. The roles of non-coding RNAs in cardiac regenerative medicine

    Directory of Open Access Journals (Sweden)

    Oi Kuan Choong

    2017-06-01

    Full Text Available The emergence of non-coding RNAs (ncRNAs has challenged the central dogma of molecular biology that dictates that the decryption of genetic information starts from transcription of DNA to RNA, with subsequent translation into a protein. Large numbers of ncRNAs with biological significance have now been identified, suggesting that ncRNAs are important in their own right and their roles extend far beyond what was originally envisaged. ncRNAs do not only regulate gene expression, but are also involved in chromatin architecture and structural conformation. Several studies have pointed out that ncRNAs participate in heart disease; however, the functions of ncRNAs still remain unclear. ncRNAs are involved in cellular fate, differentiation, proliferation and tissue regeneration, hinting at their potential therapeutic applications. Here, we review the current understanding of both the biological functions and molecular mechanisms of ncRNAs in heart disease and describe some of the ncRNAs that have potential heart regeneration effects. Keywords: Non-coding RNAs, Cardiac regeneration, Cardiac fate, Proliferation, Differentiation, Reprograming

  3. Improvements to Busquet's Non LTE algorithm in NRL's Hydro code

    Science.gov (United States)

    Klapisch, M.; Colombant, D.

    1996-11-01

    Implementation of the Non LTE model RADIOM (M. Busquet, Phys. Fluids B, 5, 4191 (1993)) in NRL's RAD2D Hydro code in conservative form was reported previously(M. Klapisch et al., Bull. Am. Phys. Soc., 40, 1806 (1995)).While the results were satisfactory, the algorithm was slow and not always converging. We describe here modifications that address the latter two shortcomings. This method is quicker and more stable than the original. It also gives information about the validity of the fitting. It turns out that the number and distribution of groups in the multigroup diffusion opacity tables - a basis for the computation of radiation effects in the ionization balance in RADIOM- has a large influence on the robustness of the algorithm. These modifications give insight about the algorithm, and allow to check that the obtained average charge state is the true average. In addition, code optimization resulted in greatly reduced computing time: The ratio of Non LTE to LTE computing times being now between 1.5 and 2.

  4. A survey of single nucleotide polymorphisms identified from whole-genome sequencing and their functional effect in the porcine genome.

    Science.gov (United States)

    Keel, B N; Nonneman, D J; Rohrer, G A

    2017-08-01

    Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a more significant effect on phenotypic variation than do other types of genetic variants. Hence, a comprehensive list of these functional variants would be of considerable interest in swine genomic studies, particularly those targeting fertility and production traits. Whole-genome sequence was obtained from 72 of the founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC). These animals included all 24 of the founding boars (12 Duroc and 12 Landrace) and 48 Yorkshire-Landrace composite sows. Sequence reads were mapped to the Sscrofa10.2 genome build, resulting in a mean of 6.1 fold (×) coverage per genome. A total of 22 342 915 high confidence SNPs were identified from the sequenced genomes. These included 21 million previously reported SNPs and 79% of the 62 163 SNPs on the PorcineSNP60 BeadChip assay. Variation was detected in the coding sequence or untranslated regions (UTRs) of 87.8% of the genes in the porcine genome: loss-of-function variants were predicted in 504 genes, 10 202 genes contained nonsynonymous variants, 10 773 had variation in UTRs and 13 010 genes contained synonymous variants. Approximately 139 000 SNPs were classified as loss-of-function, nonsynonymous or regulatory, which suggests that over 99% of the variation detected in our pigs could potentially be ignored, allowing us to focus on a much smaller number of functional SNPs during future analyses. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.

  5. Cannabis-dependence risk relates to synergism between neuroticism and proenkephalin SNPs associated with amygdala gene expression: case-control study.

    Directory of Open Access Journals (Sweden)

    Didier Jutras-Aswad

    Full Text Available Many young people experiment with cannabis, yet only a subgroup progress to dependence suggesting individual differences that could relate to factors such as genetics and behavioral traits. Dopamine receptor D2 (DRD2 and proenkephalin (PENK genes have been implicated in animal studies with cannabis exposure. Whether polymorphisms of these genes are associated with cannabis dependence and related behavioral traits is unknown.Healthy young adults (18-27 years with cannabis dependence and without a dependence diagnosis were studied (N = 50/group in relation to a priori-determined single nucleotide polymorphisms (SNPs of the DRD2 and PENK genes. Negative affect, Impulsive Risk Taking and Neuroticism-Anxiety temperamental traits, positive and negative reward-learning performance and stop-signal reaction times were examined. The findings replicated the known association between the rs6277 DRD2 SNP and decisions associated with negative reinforcement outcomes. Moreover, PENK variants (rs2576573 and rs2609997 significantly related to Neuroticism and cannabis dependence. Cigarette smoking is common in cannabis users, but it was not associated to PENK SNPs as also validated in another cohort (N = 247 smokers, N = 312 non-smokers. Neuroticism mediated (15.3%-19.5% the genetic risk to cannabis dependence and interacted with risk SNPs, resulting in a 9-fold increase risk for cannabis dependence. Molecular characterization of the postmortem human brain in a different population revealed an association between PENK SNPs and PENK mRNA expression in the central amygdala nucleus emphasizing the functional relevance of the SNPs in a brain region strongly linked to negative affect.Overall, the findings suggest an important role for Neuroticism as an endophenotype linking PENK polymorphisms to cannabis-dependence vulnerability synergistically amplifying the apparent genetic risk.

  6. RTEL1 tagging SNPs and haplotypes were associated with glioma development.

    Science.gov (United States)

    Li, Gang; Jin, Tianbo; Liang, Hongjuan; Zhang, Zhiguo; He, Shiming; Tu, Yanyang; Yang, Haixia; Geng, Tingting; Cui, Guangbin; Chen, Chao; Gao, Guodong

    2013-05-17

    As glioma ranks as the first most prevalent solid tumors in primary central nervous system, certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk, and have implications in carcinogenesis. The present case-control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been proven by classical literatures and reliable databases to be tended to relate with gliomas, and with the minor allele frequency (MAF)>5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay calibrated. Two significant tSNPs in RTEL1 gene were observed to be associated with glioma risk (rs6010620, P=0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P=0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. It was identified the genotype "GG" of rs6010620 acted as the protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P=0.0002), while the genotype "CC" of rs2297440 as the protective genotype in glioma (OR, 0.47; 95% CI, 0.31-0.71; P=0.0003). Furthermore, haplotype "GCT" in RTEL1 gene was found to be associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher's P=0.0005; Pearson's P=0.0005), and haplotype "ATT" was detected to be associated with risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher's P=0.0013; Pearson's P=0.0013). Two single variants, the genotypes of "GG" of rs6010620 and "CC" of rs2297440 (rs6010620 and rs2297440) in the RTEL1 gene, together with two haplotypes of GCT and ATT, were identified to be associated with glioma development. And it might be used to evaluate the glioma development risks to screen the above RTEL1 tagging SNPs and haplotypes. The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1993021136961998.

  7. Wind Noise Reduction using Non-negative Sparse Coding

    DEFF Research Database (Denmark)

    Schmidt, Mikkel N.; Larsen, Jan; Hsiao, Fu-Tien

    2007-01-01

    We introduce a new speaker independent method for reducing wind noise in single-channel recordings of noisy speech. The method is based on non-negative sparse coding and relies on a wind noise dictionary which is estimated from an isolated noise recording. We estimate the parameters of the model ...... and discuss their sensitivity. We then compare the algorithm with the classical spectral subtraction method and the Qualcomm-ICSI-OGI noise reduction method. We optimize the sound quality in terms of signal-to-noise ratio and provide results on a noisy speech recognition task....

  8. A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region.

    Science.gov (United States)

    Kress, W John; Erickson, David L

    2007-06-06

    A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination.

  9. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing.

    Science.gov (United States)

    Aflitos, Saulo; Schijlen, Elio; de Jong, Hans; de Ridder, Dick; Smit, Sandra; Finkers, Richard; Wang, Jun; Zhang, Gengyun; Li, Ning; Mao, Likai; Bakker, Freek; Dirks, Rob; Breit, Timo; Gravendeel, Barbara; Huits, Henk; Struss, Darush; Swanson-Wagner, Ruth; van Leeuwen, Hans; van Ham, Roeland C H J; Fito, Laia; Guignier, Laëtitia; Sevilla, Myrna; Ellul, Philippe; Ganko, Eric; Kapur, Arvind; Reclus, Emannuel; de Geus, Bernard; van de Geest, Henri; Te Lintel Hekkert, Bas; van Haarst, Jan; Smits, Lars; Koops, Andries; Sanchez-Perez, Gabino; van Heusden, Adriaan W; Visser, Richard; Quan, Zhiwu; Min, Jiumeng; Liao, Li; Wang, Xiaoli; Wang, Guangbiao; Yue, Zhen; Yang, Xinhua; Xu, Na; Schranz, Eric; Smets, Erik; Vos, Rutger; Rauwerda, Johan; Ursem, Remco; Schuit, Cees; Kerns, Mike; van den Berg, Jan; Vriezen, Wim; Janssen, Antoine; Datema, Erwin; Jahrman, Torben; Moquet, Frederic; Bonnet, Julien; Peters, Sander

    2014-10-01

    We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars. Using gene models from the annotated Heinz 1706 reference genome, we observed differences in the ratio between non-synonymous and synonymous SNPs (dN/dS) in fruit diversification and plant growth genes compared to a random set of genes, indicating positive selection and differences in selection pressure between crop accessions and wild species. In wild species, the number of single-nucleotide polymorphisms (SNPs) exceeds 10 million, i.e. 20-fold higher than found in most of the crop accessions, indicating dramatic genetic erosion of crop and heirloom tomatoes. In addition, the highest levels of heterozygosity were found for allogamous self-incompatible wild species, while facultative and autogamous self-compatible species display a lower heterozygosity level. Using whole-genome SNP information for maximum-likelihood analysis, we achieved complete tree resolution, whereas maximum-likelihood trees based on SNPs from ten fruit and growth genes show incomplete resolution for the crop accessions, partly due to the effect of heterozygous SNPs. Finally, results suggest that phylogenetic relationships are correlated with habitat, indicating the occurrence of geographical races within these groups, which is of practical importance for Solanum genome evolution studies. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

  10. Identification and characterization of single nucleotide polymorphisms in 6 growth-correlated genes in porcine by denaturing high performance liquid chromatography.

    Science.gov (United States)

    Liu, Dewu; Zhang, Yushan; Du, Yinjun; Yang, Guanfu; Zhang, Xiquan

    2007-06-01

    The growth-correlated genes that are part of the neuroendocrine growth axis play crucial roles in the regulation of growth and development of pig. The identification of genetic polymorphisms in these genes will enable the scientist to evaluate the biological relevance of such polymorphisms and to gain a better understanding of quantitative traits like growth. In the present study, seven pairs of primers were designed to obtain unknown sequences of growth-correlated genes, and other 25 pairs of primers were designed to identify single nucleotide polymorphisms (SNP) using the denaturing high-performance liquid chromatography (DHPLC) technology in four pig breeds (Duroc, Landrace, Lantang and Wuzhishan), significantly differing in growth and development characteristics. A total of 101 polymorphisms were discovered in 10,707 base pairs (bp) from six genes of the ghrelin (GHRL), leptin (LEP), insulin-like growth factor II (IGF-II), insulin-like growth factor binding protein 2 (IGFBP-2), insulin-like growth factor binding protein 3 (IGFBP-3), and somatostatin (SS). The observed average distances between the SNP in the 5'UTR, coding regions, introns and 3'UTR were 134, 521, 81 and 92 bp, respectively. Four SNPs were found in the coding regions of IGF-II, IGFBP-2 and LEP, respectively. Two synonymous mutations were obtained in IGF-II and LEP genes respectively, and two non-synonymous were found in IGFBP-2 and LEP genes, respectively. Seven other mutations were also observed. Thirty-two PCR-RFLP markers were found among 101 polymorphisms of the six genes. The SNP discovered in this study would provide suitable markers for association studies of candidate genes with growth related traits in pig.

  11. A method of non-contact reading code based on computer vision

    Science.gov (United States)

    Zhang, Chunsen; Zong, Xiaoyu; Guo, Bingxuan

    2018-03-01

    With the purpose of guarantee the computer information exchange security between internal and external network (trusted network and un-trusted network), A non-contact Reading code method based on machine vision has been proposed. Which is different from the existing network physical isolation method. By using the computer monitors, camera and other equipment. Deal with the information which will be on exchanged, Include image coding ,Generate the standard image , Display and get the actual image , Calculate homography matrix, Image distort correction and decoding in calibration, To achieve the computer information security, Non-contact, One-way transmission between the internal and external network , The effectiveness of the proposed method is verified by experiments on real computer text data, The speed of data transfer can be achieved 24kb/s. The experiment shows that this algorithm has the characteristics of high security, fast velocity and less loss of information. Which can meet the daily needs of the confidentiality department to update the data effectively and reliably, Solved the difficulty of computer information exchange between Secret network and non-secret network, With distinctive originality, practicability, and practical research value.

  12. Comprehensive Reconstruction and Visualization of Non-Coding Regulatory Networks in Human

    Science.gov (United States)

    Bonnici, Vincenzo; Russo, Francesco; Bombieri, Nicola; Pulvirenti, Alfredo; Giugno, Rosalba

    2014-01-01

    Research attention has been powered to understand the functional roles of non-coding RNAs (ncRNAs). Many studies have demonstrated their deregulation in cancer and other human disorders. ncRNAs are also present in extracellular human body fluids such as serum and plasma, giving them a great potential as non-invasive biomarkers. However, non-coding RNAs have been relatively recently discovered and a comprehensive database including all of them is still missing. Reconstructing and visualizing the network of ncRNAs interactions are important steps to understand their regulatory mechanism in complex systems. This work presents ncRNA-DB, a NoSQL database that integrates ncRNAs data interactions from a large number of well established on-line repositories. The interactions involve RNA, DNA, proteins, and diseases. ncRNA-DB is available at http://ncrnadb.scienze.univr.it/ncrnadb/. It is equipped with three interfaces: web based, command-line, and a Cytoscape app called ncINetView. By accessing only one resource, users can search for ncRNAs and their interactions, build a network annotated with all known ncRNAs and associated diseases, and use all visual and mining features available in Cytoscape. PMID:25540777

  13. Comprehensive reconstruction and visualization of non-coding regulatory networks in human.

    Science.gov (United States)

    Bonnici, Vincenzo; Russo, Francesco; Bombieri, Nicola; Pulvirenti, Alfredo; Giugno, Rosalba

    2014-01-01

    Research attention has been powered to understand the functional roles of non-coding RNAs (ncRNAs). Many studies have demonstrated their deregulation in cancer and other human disorders. ncRNAs are also present in extracellular human body fluids such as serum and plasma, giving them a great potential as non-invasive biomarkers. However, non-coding RNAs have been relatively recently discovered and a comprehensive database including all of them is still missing. Reconstructing and visualizing the network of ncRNAs interactions are important steps to understand their regulatory mechanism in complex systems. This work presents ncRNA-DB, a NoSQL database that integrates ncRNAs data interactions from a large number of well established on-line repositories. The interactions involve RNA, DNA, proteins, and diseases. ncRNA-DB is available at http://ncrnadb.scienze.univr.it/ncrnadb/. It is equipped with three interfaces: web based, command-line, and a Cytoscape app called ncINetView. By accessing only one resource, users can search for ncRNAs and their interactions, build a network annotated with all known ncRNAs and associated diseases, and use all visual and mining features available in Cytoscape.

  14. Antonyms and Synonyms: Cognitive Aspects of Negation in Positive Sentences

    OpenAIRE

    Arimitsu, Nami

    2017-01-01

    This paper investigates the cognitive orientation of the negative meaning in antonyms and synonyms. While the negative meaning in antonyms is a reflection of the cognitive mapping of our mental contiguity, the negative images in synonymous words are more closely associated with aspects of subjective semantics and factors related to politeness

  15. Non-Coding RNAs in Castration-Resistant Prostate Cancer: Regulation of Androgen Receptor Signaling and Cancer Metabolism.

    Science.gov (United States)

    Shih, Jing-Wen; Wang, Ling-Yu; Hung, Chiu-Lien; Kung, Hsing-Jien; Hsieh, Chia-Ling

    2015-12-04

    Hormone-refractory prostate cancer frequently relapses from therapy and inevitably progresses to a bone-metastatic status with no cure. Understanding of the molecular mechanisms conferring resistance to androgen deprivation therapy has the potential to lead to the discovery of novel therapeutic targets for type of prostate cancer with poor prognosis. Progression to castration-resistant prostate cancer (CRPC) is characterized by aberrant androgen receptor (AR) expression and persistent AR signaling activity. Alterations in metabolic activity regulated by oncogenic pathways, such as c-Myc, were found to promote prostate cancer growth during the development of CRPC. Non-coding RNAs represent a diverse family of regulatory transcripts that drive tumorigenesis of prostate cancer and various other cancers by their hyperactivity or diminished function. A number of studies have examined differentially expressed non-coding RNAs in each stage of prostate cancer. Herein, we highlight the emerging impacts of microRNAs and long non-coding RNAs linked to reactivation of the AR signaling axis and reprogramming of the cellular metabolism in prostate cancer. The translational implications of non-coding RNA research for developing new biomarkers and therapeutic strategies for CRPC are also discussed.

  16. Orion: Detecting regions of the human non-coding genome that are intolerant to variation using population genetics.

    Science.gov (United States)

    Gussow, Ayal B; Copeland, Brett R; Dhindsa, Ryan S; Wang, Quanli; Petrovski, Slavé; Majoros, William H; Allen, Andrew S; Goldstein, David B

    2017-01-01

    There is broad agreement that genetic mutations occurring outside of the protein-coding regions play a key role in human disease. Despite this consensus, we are not yet capable of discerning which portions of non-coding sequence are important in the context of human disease. Here, we present Orion, an approach that detects regions of the non-coding genome that are depleted of variation, suggesting that the regions are intolerant of mutations and subject to purifying selection in the human lineage. We show that Orion is highly correlated with known intolerant regions as well as regions that harbor putatively pathogenic variation. This approach provides a mechanism to identify pathogenic variation in the human non-coding genome and will have immediate utility in the diagnostic interpretation of patient genomes and in large case control studies using whole-genome sequences.

  17. Association between SNPs within candidate genes and compounds related to boar taint and reproduction

    DEFF Research Database (Denmark)

    Moe, Maren; Lien, Sigbjørn; Aasmundstad, Torunn

    2009-01-01

    BACKGROUND: Boar taint is an unpleasant odour and flavour of the meat from some uncastrated male pigs primarily caused by elevated levels of androstenone and skatole in adipose tissue. Androstenone is produced in the same biochemical pathway as testosterone and estrogens, which represents...... of this study was to detect SNPs in boar taint candidate genes and to perform association studies for both single SNPs and haplotypes with levels of boar taint compounds and phenotypes related to reproduction. RESULTS: An association study involving 275 SNPs in 121 genes and compounds related to boar taint...... and reproduction were carried out in Duroc and Norwegian Landrace boars. Phenotypes investigated were levels of androstenone, skatole and indole in adipose tissue, levels of androstenone, testosterone, estrone sulphate and 17beta-estradiol in plasma, and length of bulbo urethralis gland. The SNPs were genotyped...

  18. Non-coding RNAs and heme oxygenase-1 in vaccinia virus infection

    International Nuclear Information System (INIS)

    Meseda, Clement A.; Srinivasan, Kumar; Wise, Jasen; Catalano, Jennifer; Yamada, Kenneth M.; Dhawan, Subhash

    2014-01-01

    Highlights: • Heme oxygenase-1 (HO-1) induction inhibited vaccinia virus infection of macrophages. • Reduced infectivity inversely correlated with increased expression of non-coding RNAs. • The regulation of HO-1 and ncRNAs suggests a novel host defense response against vaccinia virus infection. - Abstract: Small nuclear RNAs (snRNAs) are <200 nucleotide non-coding uridylate-rich RNAs. Although the functions of many snRNAs remain undetermined, a population of snRNAs is produced during the early phase of infection of cells by vaccinia virus. In the present study, we demonstrate a direct correlation between expression of the cytoprotective enzyme heme oxygenase-1 (HO-1), suppression of selective snRNA expression, and inhibition of vaccinia virus infection of macrophages. Hemin induced HO-1 expression, completely reversed virus-induced host snRNA expression, and suppressed vaccinia virus infection. This involvement of specific virus-induced snRNAs and associated gene clusters suggests a novel HO-1-dependent host-defense pathway in poxvirus infection

  19. Low-Frequency Synonymous Coding Variation in CYP2R1 Has Large Effects on Vitamin D Levels and Risk of Multiple Sclerosis.

    Science.gov (United States)

    Manousaki, Despoina; Dudding, Tom; Haworth, Simon; Hsu, Yi-Hsiang; Liu, Ching-Ti; Medina-Gómez, Carolina; Voortman, Trudy; van der Velde, Nathalie; Melhus, Håkan; Robinson-Cohen, Cassianne; Cousminer, Diana L; Nethander, Maria; Vandenput, Liesbeth; Noordam, Raymond; Forgetta, Vincenzo; Greenwood, Celia M T; Biggs, Mary L; Psaty, Bruce M; Rotter, Jerome I; Zemel, Babette S; Mitchell, Jonathan A; Taylor, Bruce; Lorentzon, Mattias; Karlsson, Magnus; Jaddoe, Vincent V W; Tiemeier, Henning; Campos-Obando, Natalia; Franco, Oscar H; Utterlinden, Andre G; Broer, Linda; van Schoor, Natasja M; Ham, Annelies C; Ikram, M Arfan; Karasik, David; de Mutsert, Renée; Rosendaal, Frits R; den Heijer, Martin; Wang, Thomas J; Lind, Lars; Orwoll, Eric S; Mook-Kanamori, Dennis O; Michaëlsson, Karl; Kestenbaum, Bryan; Ohlsson, Claes; Mellström, Dan; de Groot, Lisette C P G M; Grant, Struan F A; Kiel, Douglas P; Zillikens, M Carola; Rivadeneira, Fernando; Sawcer, Stephen; Timpson, Nicholas J; Richards, J Brent

    2017-08-03

    Vitamin D insufficiency is common, correctable, and influenced by genetic factors, and it has been associated with risk of several diseases. We sought to identify low-frequency genetic variants that strongly increase the risk of vitamin D insufficiency and tested their effect on risk of multiple sclerosis, a disease influenced by low vitamin D concentrations. We used whole-genome sequencing data from 2,619 individuals through the UK10K program and deep-imputation data from 39,655 individuals genotyped genome-wide. Meta-analysis of the summary statistics from 19 cohorts identified in CYP2R1 the low-frequency (minor allele frequency = 2.5%) synonymous coding variant g.14900931G>A (p.Asp120Asp) (rs117913124[A]), which conferred a large effect on 25-hydroxyvitamin D (25OHD) levels (-0.43 SD of standardized natural log-transformed 25OHD per A allele; p value = 1.5 × 10 -88 ). The effect on 25OHD was four times larger and independent of the effect of a previously described common variant near CYP2R1. By analyzing 8,711 individuals, we showed that heterozygote carriers of this low-frequency variant have an increased risk of vitamin D insufficiency (odds ratio [OR] = 2.2, 95% confidence interval [CI] = 1.78-2.78, p = 1.26 × 10 -12 ). Individuals carrying one copy of this variant also had increased odds of multiple sclerosis (OR = 1.4, 95% CI = 1.19-1.64, p = 2.63 × 10 -5 ) in a sample of 5,927 case and 5,599 control subjects. In conclusion, we describe a low-frequency CYP2R1 coding variant that exerts the largest effect upon 25OHD levels identified to date in the general European population and implicates vitamin D in the etiology of multiple sclerosis. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  20. TGFβ1 SNPs and radio-induced toxicity in prostate cancer patients

    International Nuclear Information System (INIS)

    Fachal, Laura; Gómez-Caamaño, Antonio; Sánchez-García, Manuel; Carballo, Ana; Peleteiro, Paula; Lobato-Busto, Ramón; Carracedo, Ángel; Vega, Ana

    2012-01-01

    Background and purpose: We have performed a case-control study in 413 prostate cancer patients to test for association between TGFβ1 and the development of late normal-tissue toxicity among prostate cancer patients treated with three-dimensional conformational radiotherapy (3D-CRT) Materials and methods: Late gastrointestinal and genitourinary toxicities were assessed for at least two years after radiotherapy in 413 patients according to CTCAEvs3 scores. Codominant genotypic tests and haplotypic analyses were undertaken to evaluate the correlation between TGFβ1 SNPs rs1800469, rs1800470 and rs1800472 and radio-induced toxicity. Results: Neither the SNPs nor the haplotypes were found to be associated with the risk of late toxicity. Conclusions: We were able to exclude up to a 2-fold increase in the risk of developing late gastrointestinal and genitourinary radio-induced toxicity due to the TGFβ1 SNPs rs1800469 and rs1800470, as well as the two most frequent TGFβ1 haplotypes.

  1. Securing maximum diversity of Non Pollen Palynomorphs in palynological samples

    DEFF Research Database (Denmark)

    Enevold, Renée; Odgaard, Bent Vad

    2015-01-01

    Palynology is no longer synonymous with analysis of pollen with the addition of a few fern spores. A wide range of Non Pollen Palynomorphs are now described and are potential palaeoenvironmental proxies in the palynological surveys. The contribution of NPP’s has proven important to the interpreta......Palynology is no longer synonymous with analysis of pollen with the addition of a few fern spores. A wide range of Non Pollen Palynomorphs are now described and are potential palaeoenvironmental proxies in the palynological surveys. The contribution of NPP’s has proven important...

  2. The more from East-Asian, the better: risk prediction of colorectal cancer risk by GWAS-identified SNPs among Japanese.

    Science.gov (United States)

    Abe, Makiko; Ito, Hidemi; Oze, Isao; Nomura, Masatoshi; Ogawa, Yoshihiro; Matsuo, Keitaro

    2017-12-01

    Little is known about the difference of genetic predisposition for CRC between ethnicities; however, many genetic traits common to colorectal cancer have been identified. This study investigated whether more SNPs identified in GWAS in East Asian population could improve the risk prediction of Japanese and explored possible application of genetic risk groups as an instrument of the risk communication. 558 Patients histologically verified colorectal cancer and 1116 first-visit outpatients were included for derivation study, and 547 cases and 547 controls were for replication study. Among each population, we evaluated prediction models for the risk of CRC that combined the genetic risk group based on SNPs from GWASs in European-population and a similarly developed model adding SNPs from GWASs in East Asian-population. We examined whether adding East Asian-specific SNPs would improve the discrimination. Six SNPs (rs6983267, rs4779584, rs4444235, rs9929218, rs10936599, rs16969681) from 23 SNPs by European-based GWAS and five SNPs (rs704017, rs11196172, rs10774214, rs647161, rs2423279) among ten SNPs by Asian-based GWAS were selected in CRC risk prediction model. Compared with a 6-SNP-based model, an 11-SNP model including Asian GWAS-SNPs showed improved discrimination capacity in Receiver operator characteristic analysis. A model with 11 SNPs resulted in statistically significant improvement in both derivation (P = 0.0039) and replication studies (P = 0.0018) compared with six SNP model. We estimated cumulative risk of CRC by using genetic risk group based on 11 SNPs and found that the cumulative risk at age 80 is approximately 13% in the high-risk group while 6% in the low-risk group. We constructed a more efficient CRC risk prediction model with 11 SNPs including newly identified East Asian-based GWAS SNPs (rs704017, rs11196172, rs10774214, rs647161, rs2423279). Risk grouping based on 11 SNPs depicted lifetime difference of CRC risk. This might be useful for

  3. Biocomputational prediction of small non-coding RNAs in Streptomyces

    Czech Academy of Sciences Publication Activity Database

    Pánek, Josef; Bobek, Jan; Mikulík, Karel; Basler, Marek; Vohradský, Jiří

    2008-01-01

    Roč. 9, č. 217 (2008), s. 1-14 ISSN 1471-2164 R&D Projects: GA ČR GP204/07/P361; GA ČR GA203/05/0106; GA ČR GA310/07/1009 Grant - others:XE(XE) EC Integrated Project ActinoGEN, LSHM-CT-2004-005224. Institutional research plan: CEZ:AV0Z50200510 Keywords : non-coding RNA * streptomyces * biocomputational prediction Subject RIV: IN - Informatics, Computer Science Impact factor: 3.926, year: 2008

  4. Changes in the Coding and Non-coding Transcriptome and DNA Methylome that Define the Schwann Cell Repair Phenotype after Nerve Injury.

    Science.gov (United States)

    Arthur-Farraj, Peter J; Morgan, Claire C; Adamowicz, Martyna; Gomez-Sanchez, Jose A; Fazal, Shaline V; Beucher, Anthony; Razzaghi, Bonnie; Mirsky, Rhona; Jessen, Kristjan R; Aitman, Timothy J

    2017-09-12

    Repair Schwann cells play a critical role in orchestrating nerve repair after injury, but the cellular and molecular processes that generate them are poorly understood. Here, we perform a combined whole-genome, coding and non-coding RNA and CpG methylation study following nerve injury. We show that genes involved in the epithelial-mesenchymal transition are enriched in repair cells, and we identify several long non-coding RNAs in Schwann cells. We demonstrate that the AP-1 transcription factor C-JUN regulates the expression of certain micro RNAs in repair Schwann cells, in particular miR-21 and miR-34. Surprisingly, unlike during development, changes in CpG methylation are limited in injury, restricted to specific locations, such as enhancer regions of Schwann cell-specific genes (e.g., Nedd4l), and close to local enrichment of AP-1 motifs. These genetic and epigenomic changes broaden our mechanistic understanding of the formation of repair Schwann cell during peripheral nervous system tissue repair. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  5. The role of polarity in antonym and synonym conceptual knowledge: evidence from stroke aphasia and multidimensional ratings of abstract words.

    Science.gov (United States)

    Crutch, Sebastian J; Williams, Paul; Ridgway, Gerard R; Borgenicht, Laura

    2012-09-01

    This study describes an investigation of different types of semantic relationship among abstract words: antonyms (e.g. good-bad), synonyms (e.g. good-great), non-antonymous, non-synonymous associates (NANSAs; e.g. good-fun) and unrelated words (e.g. good-late). The comprehension and semantic properties of these words were examined using two distinct methodologies. Experiment 1 tested the comprehension of pairs of abstract words in three patients with global aphasia using a spoken word to written word matching paradigm. Contrary to expectations, all three patients showed superior antonym comprehension compared with synonyms or NANSAs, discriminating antonyms with a similar level of accuracy as unrelated words. Experiment 2 aimed to explore the content or semantic attributes of the abstract words used in Experiment 1 through the generation of control ratings across nine cognitive dimensions (sensation, action, thought, emotion, social interaction, space, time, quantity and polarity). Discrepancy analyses revealed that antonyms were as or more similar to one another than synonyms on all but one measure: polarity. The results of Experiment 2 provide a possible explanation for the novel pattern of neuropsychological data observed in Experiment 1, namely that polarity information is more important than other semantic attributes when discriminating the meaning of abstract words. It is argued that polarity is a critical semantic attribute of abstract words, and that simple 'dissimilarity' metrics mask fundamental consistencies in the semantic representation of antonyms. It is also suggested that mapping abstract semantic space requires the identification and quantification of the contribution made to abstract concepts by not only sensorimotor and emotional information but also a host of other cognitive dimensions. Copyright © 2012 Elsevier Ltd. All rights reserved.

  6. Exploring the deleterious SNPs in XRCC4 gene using computational approach and studying their association with breast cancer in the population of West India.

    Science.gov (United States)

    Singh, Preety K; Mistry, Kinnari N; Chiramana, Haritha; Rank, Dharamshi N; Joshi, Chaitanya G

    2018-05-20

    Non-homologous end joining (NHEJ) pathway has pivotal role in repair of double-strand DNA breaks that may lead to carcinogenesis. XRCC4 is one of the essential proteins of this pathway and single-nucleotide polymorphisms (SNPs) of this gene are reported to be associated with cancer risks. In our study, we first used computational approaches to predict the damaging variants of XRCC4 gene. Tools predicted rs79561451 (S110P) nsSNP as the most deleterious SNP. Along with this SNP, we analysed other two SNPs (rs3734091 and rs6869366) to study their association with breast cancer in population of West India. Variant rs3734091 was found to be significantly associated with breast cancer while rs6869366 variant did not show any association. These SNPs may influence the susceptibility of individuals to breast cancer in this population. Copyright © 2018 Elsevier B.V. All rights reserved.

  7. Low multiple electrode aggregometry platelet responses are not associated with non-synonymous variants in G-protein coupled receptor genes.

    Science.gov (United States)

    Norman, Jane E; Lee, Kurtis R; Walker, Mary E; Murden, Sherina L; Harris, Jessica; Mundell, Stuart; J Murphy, Gavin; Mumford, Andrew D

    2015-10-01

    Multiple electrode aggregometry (MEA) improves prediction of thrombosis and bleeding in cardiac patients. However, the causes of inter-individual variation in MEA results are incompletely understood. We explore whether low MEA results are associated with platelet G-protein coupled receptor (GPCR) gene variants. The effects of P2Y12 receptor (P2Y12), thromboxane A2 receptor (TPα) and protease-activated receptor 1 (PAR1) dysfunction on the MEA ADP-test, ASPI-test and TRAP-test were determined using receptor antagonists. Cardiac surgery patients with pre-operative MEA results suggesting GPCR dysfunction were selected for P2Y12 (P2RY12), TPα (TBXA2R) and PAR1 (F2R) sequencing. In control blood samples, P2Y12, TPα or PAR1 antagonists markedly reduced ADP-test, ASPI-test and TRAP-test results respectively. In the 636 patients from a cohort of 2388 cardiac surgery patients who were not receiving aspirin or a P2Y12 blocker, the median ADP-test result was 75.1 U (range 4.8-153.2), ASPI-test 83.7 U (1.4-157.3) and TRAP-test 117.7 U (2.4-194.1), indicating a broad range of results unexplained by anti-platelet drugs. In 238 consenting patients with unexplained low MEA results, three P2RY12 variants occurred in 70/107 (65%) with suspected P2Y12 dysfunction and four TBXA2R variants occurred in 19/22 (86%) with suspected TPα dysfunction although the later group was too small to draw meaningful conclusions about variant frequency. All the variants were synonymous and unlikely to cause GPCR dysfunction. There were no F2R variants in the 109 cases with suspected PAR1 dysfunction. MEA results suggesting isolated platelet GPCR dysfunction were common in cardiac surgery patients, but were not associated with non-synonymous variants in P2RY12 or F2R. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. In silico SNP analysis of the breast cancer antigen NY-BR-1.

    Science.gov (United States)

    Kosaloglu, Zeynep; Bitzer, Julia; Halama, Niels; Huang, Zhiqin; Zapatka, Marc; Schneeweiss, Andreas; Jäger, Dirk; Zörnig, Inka

    2016-11-18

    Breast cancer is one of the most common malignancies with increasing incidences every year and a leading cause of death among women. Although early stage breast cancer can be effectively treated, there are limited numbers of treatment options available for patients with advanced and metastatic disease. The novel breast cancer associated antigen NY-BR-1 was identified by SEREX analysis and is expressed in the majority (>70%) of breast tumors as well as metastases, in normal breast tissue, in testis and occasionally in prostate tissue. The biological function and regulation of NY-BR-1 is up to date unknown. We performed an in silico analysis on the genetic variations of the NY-BR-1 gene using data available in public SNP databases and the tools SIFT, Polyphen and Provean to find possible functional SNPs. Additionally, we considered the allele frequency of the found damaging SNPs and also analyzed data from an in-house sequencing project of 55 breast cancer samples for recurring SNPs, recorded in dbSNP. Over 2800 SNPs are recorded in the dbSNP and NHLBI ESP databases for the NY-BR-1 gene. Of these, 65 (2.07%) are synonymous SNPs, 191 (6.09%) are non-synoymous SNPs, and 2430 (77.48%) are noncoding intronic SNPs. As a result, 69 non-synoymous SNPs were predicted to be damaging by at least two, and 16 SNPs were predicted as damaging by all three of the used tools. The SNPs rs200639888, rs367841401 and rs377750885 were categorized as highly damaging by all three tools. Eight damaging SNPs are located in the ankyrin repeat domain (ANK), a domain known for its frequent involvement in protein-protein interactions. No distinctive features could be observed in the allele frequency of the analyzed SNPs. Considering these results we expect to gain more insights into the variations of the NY-BR-1 gene and their possible impact on giving rise to splice variants and therefore influence the function of NY-BR-1 in healthy tissue as well as in breast cancer.

  9. p53-dependent non-coding RNA networks in chronic lymphocytic leukemia

    NARCIS (Netherlands)

    Blume, C. J.; Hotz-Wagenblatt, A.; Hüllein, J.; Sellner, L.; Jethwa, A.; Stolz, T.; Slabicki, M.; Lee, K.; Sharathchandra, A.; Benner, A.; Dietrich, S.; Oakes, C. C.; Dreger, P.; te Raa, D.; Kater, A. P.; Jauch, A.; Merkel, O.; Oren, M.; Hielscher, T.; Zenz, T.

    2015-01-01

    Mutations of the tumor suppressor p53 lead to chemotherapy resistance and a dismal prognosis in chronic lymphocytic leukemia (CLL). Whereas p53 targets are used to identify patient subgroups with impaired p53 function, a comprehensive assessment of non-coding RNA targets of p53 in CLL is missing. We

  10. Sequencing of the needle transcriptome from Norway spruce (Picea abies Karst L. reveals lower substitution rates, but similar selective constraints in gymnosperms and angiosperms

    Directory of Open Access Journals (Sweden)

    Chen Jun

    2012-11-01

    Full Text Available Abstract Background A detailed knowledge about spatial and temporal gene expression is important for understanding both the function of genes and their evolution. For the vast majority of species, transcriptomes are still largely uncharacterized and even in those where substantial information is available it is often in the form of partially sequenced transcriptomes. With the development of next generation sequencing, a single experiment can now simultaneously identify the transcribed part of a species genome and estimate levels of gene expression. Results mRNA from actively growing needles of Norway spruce (Picea abies was sequenced using next generation sequencing technology. In total, close to 70 million fragments with a length of 76 bp were sequenced resulting in 5 Gbp of raw data. A de novo assembly of these reads, together with publicly available expressed sequence tag (EST data from Norway spruce, was used to create a reference transcriptome. Of the 38,419 PUTs (putative unique transcripts longer than 150 bp in this reference assembly, 83.5% show similarity to ESTs from other spruce species and of the remaining PUTs, 3,704 show similarity to protein sequences from other plant species, leaving 4,167 PUTs with limited similarity to currently available plant proteins. By predicting coding frames and comparing not only the Norway spruce PUTs, but also PUTs from the close relatives Picea glauca and Picea sitchensis to both Pinus taeda and Taxus mairei, we obtained estimates of synonymous and non-synonymous divergence among conifer species. In addition, we detected close to 15,000 SNPs of high quality and estimated gene expression differences between samples collected under dark and light conditions. Conclusions Our study yielded a large number of single nucleotide polymorphisms as well as estimates of gene expression on transcriptome scale. In agreement with a recent study we find that the synonymous substitution rate per year (0.6 × 10

  11. A comparison between genotyping-by-sequencing and array-based scoring of SNPs for genomic prediction accuracy in winter wheat.

    Science.gov (United States)

    Elbasyoni, Ibrahim S; Lorenz, A J; Guttieri, M; Frels, K; Baenziger, P S; Poland, J; Akhunov, E

    2018-05-01

    The utilization of DNA molecular markers in plant breeding to maximize selection response via marker-assisted selection (MAS) and genomic selection (GS) has revolutionized plant breeding. A key factor affecting GS applicability is the choice of molecular marker platform. Genotyping-by-sequencing scored SNPs (GBS-scored SNPs) provides a large number of markers, albeit with high rates of missing data. Array scored SNPs are of high quality, but the cost per sample is substantially higher. The objectives of this study were 1) compare GBS-scored SNPs, and array scored SNPs for genomic selection applications, and 2) compare estimates of genomic kinship and population structure calculated using the two marker platforms. SNPs were compared in a diversity panel consisting of 299 hard winter wheat (Triticum aestivum L.) accessions that were part of a multi-year, multi-environments association mapping study. The panel was phenotyped in Ithaca, Nebraska for heading date, plant height, days to physiological maturity and grain yield in 2012 and 2013. The panel was genotyped using GBS-scored SNPs, and array scored SNPs. Results indicate that GBS-scored SNPs is comparable to or better than Array-scored SNPs for genomic prediction application. Both platforms identified the same genetic patterns in the panel where 90% of the lines were classified to common genetic groups. Overall, we concluded that GBS-scored SNPs have the potential to be the marker platform of choice for genetic diversity and genomic selection in winter wheat. Copyright © 2018 Elsevier B.V. All rights reserved.

  12. Mutations that Cause Human Disease: A Computational/Experimental Approach

    Energy Technology Data Exchange (ETDEWEB)

    Beernink, P; Barsky, D; Pesavento, B

    2006-01-11

    International genome sequencing projects have produced billions of nucleotides (letters) of DNA sequence data, including the complete genome sequences of 74 organisms. These genome sequences have created many new scientific opportunities, including the ability to identify sequence variations among individuals within a species. These genetic differences, which are known as single nucleotide polymorphisms (SNPs), are particularly important in understanding the genetic basis for disease susceptibility. Since the report of the complete human genome sequence, over two million human SNPs have been identified, including a large-scale comparison of an entire chromosome from twenty individuals. Of the protein coding SNPs (cSNPs), approximately half leads to a single amino acid change in the encoded protein (non-synonymous coding SNPs). Most of these changes are functionally silent, while the remainder negatively impact the protein and sometimes cause human disease. To date, over 550 SNPs have been found to cause single locus (monogenic) diseases and many others have been associated with polygenic diseases. SNPs have been linked to specific human diseases, including late-onset Parkinson disease, autism, rheumatoid arthritis and cancer. The ability to predict accurately the effects of these SNPs on protein function would represent a major advance toward understanding these diseases. To date several attempts have been made toward predicting the effects of such mutations. The most successful of these is a computational approach called ''Sorting Intolerant From Tolerant'' (SIFT). This method uses sequence conservation among many similar proteins to predict which residues in a protein are functionally important. However, this method suffers from several limitations. First, a query sequence must have a sufficient number of relatives to infer sequence conservation. Second, this method does not make use of or provide any information on protein structure, which

  13. Inference of purifying and positive selection in three subspecies of chimpanzees (Pan troglodytes) from exome sequencing

    DEFF Research Database (Denmark)

    Bataillon, Thomas; Duan, Jinjie; Hvilsom, Christina

    2015-01-01

    of recent gene flow from Western into Eastern chimpanzees. The striking contrast in X-linked vs. autosomal polymorphism and divergence previously reported in Central chimpanzees is also found in Eastern and Western chimpanzees. We show that the direction of selection (DoS) statistic exhibits a strong non......-monotonic relationship with the strength of purifying selection S, making it inappropriate for estimating S. We instead use counts in synonymous vs. non-synonymous frequency classes to infer the distribution of S coefficients acting on non-synonymous mutations in each subspecies. The strength of purifying selection we...... infer is congruent with the differences in effective sizes of each subspecies: Central chimpanzees are undergoing the strongest purifying selection followed by Eastern and Western chimpanzees. Coding indels show stronger selection against indels changing the reading frame than observed in human...

  14. An evaluation of the performance of tag SNPs derived from HapMap in a Caucasian population.

    Directory of Open Access Journals (Sweden)

    Alexandre Montpetit

    2006-03-01

    Full Text Available The Haplotype Map (HapMap project recently generated genotype data for more than 1 million single-nucleotide polymorphisms (SNPs in four population samples. The main application of the data is in the selection of tag single-nucleotide polymorphisms (tSNPs to use in association studies. The usefulness of this selection process needs to be verified in populations outside those used for the HapMap project. In addition, it is not known how well the data represent the general population, as only 90-120 chromosomes were used for each population and since the genotyped SNPs were selected so as to have high frequencies. In this study, we analyzed more than 1,000 individuals from Estonia. The population of this northern European country has been influenced by many different waves of migrations from Europe and Russia. We genotyped 1,536 randomly selected SNPs from two 500-kbp ENCODE regions on Chromosome 2. We observed that the tSNPs selected from the CEPH (Centre d'Etude du Polymorphisme Humain from Utah (CEU HapMap samples (derived from US residents with northern and western European ancestry captured most of the variation in the Estonia sample. (Between 90% and 95% of the SNPs with a minor allele frequency of more than 5% have an r2 of at least 0.8 with one of the CEU tSNPs. Using the reverse approach, tags selected from the Estonia sample could almost equally well describe the CEU sample. Finally, we observed that the sample size, the allelic frequency, and the SNP density in the dataset used to select the tags each have important effects on the tagging performance. Overall, our study supports the use of HapMap data in other Caucasian populations, but the SNP density and the bias towards high-frequency SNPs have to be taken into account when designing association studies.

  15. DNA watermarks in non-coding regulatory sequences

    Directory of Open Access Journals (Sweden)

    Pyka Martin

    2009-07-01

    Full Text Available Abstract Background DNA watermarks can be applied to identify the unauthorized use of genetically modified organisms. It has been shown that coding regions can be used to encrypt information into living organisms by using the DNA-Crypt algorithm. Yet, if the sequence of interest presents a non-coding DNA sequence, either the function of a resulting functional RNA molecule or a regulatory sequence, such as a promoter, could be affected. For our studies we used the small cytoplasmic RNA 1 in yeast and the lac promoter region of Escherichia coli. Findings The lac promoter was deactivated by the integrated watermark. In addition, the RNA molecules displayed altered configurations after introducing a watermark, but surprisingly were functionally intact, which has been verified by analyzing the growth characteristics of both wild type and watermarked scR1 transformed yeast cells. In a third approach we introduced a second overlapping watermark into the lac promoter, which did not affect the promoter activity. Conclusion Even though the watermarked RNA and one of the watermarked promoters did not show any significant differences compared to the wild type RNA and wild type promoter region, respectively, it cannot be generalized that other RNA molecules or regulatory sequences behave accordingly. Therefore, we do not recommend integrating watermark sequences into regulatory regions.

  16. Long non-coding RNA expression profiling of mouse testis during postnatal development.

    Directory of Open Access Journals (Sweden)

    Jin Sun

    Full Text Available Mammalian testis development and spermatogenesis play critical roles in male fertility and continuation of a species. Previous research into the molecular mechanisms of testis development and spermatogenesis has largely focused on the role of protein-coding genes and small non-coding RNAs, such as microRNAs and piRNAs. Recently, it has become apparent that large numbers of long (>200 nt non-coding RNAs (lncRNAs are transcribed from mammalian genomes and that lncRNAs perform important regulatory functions in various developmental processes. However, the expression of lncRNAs and their biological functions in post-natal testis development remain unknown. In this study, we employed microarray technology to examine lncRNA expression profiles of neonatal (6-day-old and adult (8-week-old mouse testes. We found that 8,265 lncRNAs were expressed above background levels during post-natal testis development, of which 3,025 were differentially expressed. Candidate lncRNAs were identified for further characterization by an integrated examination of genomic context, gene ontology (GO enrichment of their associated protein-coding genes, promoter analysis for epigenetic modification, and evolutionary conservation of elements. Many lncRNAs overlapped or were adjacent to key transcription factors and other genes involved in spermatogenesis, such as Ovol1, Ovol2, Lhx1, Sox3, Sox9, Plzf, c-Kit, Wt1, Sycp2, Prm1 and Prm2. Most differentially expressed lncRNAs exhibited epigenetic modification marks similar to protein-coding genes and tend to be expressed in a tissue-specific manner. In addition, the majority of differentially expressed lncRNAs harbored evolutionary conserved elements. Taken together, our findings represent the first systematic investigation of lncRNA expression in the mammalian testis and provide a solid foundation for further research into the molecular mechanisms of lncRNAs function in mammalian testis development and spermatogenesis.

  17. Genotyping of Brucella species using clade specific SNPs

    Directory of Open Access Journals (Sweden)

    Foster Jeffrey T

    2012-06-01

    Full Text Available Abstract Background Brucellosis is a worldwide disease of mammals caused by Alphaproteobacteria in the genus Brucella. The genus is genetically monomorphic, requiring extensive genotyping to differentiate isolates. We utilized two different genotyping strategies to characterize isolates. First, we developed a microarray-based assay based on 1000 single nucleotide polymorphisms (SNPs that were identified from whole genome comparisons of two B. abortus isolates , one B. melitensis, and one B. suis. We then genotyped a diverse collection of 85 Brucella strains at these SNP loci and generated a phylogenetic tree of relationships. Second, we developed a selective primer-extension assay system using capillary electrophoresis that targeted 17 high value SNPs across 8 major branches of the phylogeny and determined their genotypes in a large collection ( n = 340 of diverse isolates. Results Our 1000 SNP microarray readily distinguished B. abortus, B. melitensis, and B. suis, differentiating B. melitensis and B. suis into two clades each. Brucella abortus was divided into four major clades. Our capillary-based SNP genotyping confirmed all major branches from the microarray assay and assigned all samples to defined lineages. Isolates from these lineages and closely related isolates, among the most commonly encountered lineages worldwide, can now be quickly and easily identified and genetically characterized. Conclusions We have identified clade-specific SNPs in Brucella that can be used for rapid assignment into major groups below the species level in the three main Brucella species. Our assays represent SNP genotyping approaches that can reliably determine the evolutionary relationships of bacterial isolates without the need for whole genome sequencing of all isolates.

  18. Novel classes of non-coding RNAs and cancer

    Directory of Open Access Journals (Sweden)

    Sana Jiri

    2012-05-01

    Full Text Available Abstract For the many years, the central dogma of molecular biology has been that RNA functions mainly as an informational intermediate between a DNA sequence and its encoded protein. But one of the great surprises of modern biology was the discovery that protein-coding genes represent less than 2% of the total genome sequence, and subsequently the fact that at least 90% of the human genome is actively transcribed. Thus, the human transcriptome was found to be more complex than a collection of protein-coding genes and their splice variants. Although initially argued to be spurious transcriptional noise or accumulated evolutionary debris arising from the early assembly of genes and/or the insertion of mobile genetic elements, recent evidence suggests that the non-coding RNAs (ncRNAs may play major biological roles in cellular development, physiology and pathologies. NcRNAs could be grouped into two major classes based on the transcript size; small ncRNAs and long ncRNAs. Each of these classes can be further divided, whereas novel subclasses are still being discovered and characterized. Although, in the last years, small ncRNAs called microRNAs were studied most frequently with more than ten thousand hits at PubMed database, recently, evidence has begun to accumulate describing the molecular mechanisms by which a wide range of novel RNA species function, providing insight into their functional roles in cellular biology and in human disease. In this review, we summarize newly discovered classes of ncRNAs, and highlight their functioning in cancer biology and potential usage as biomarkers or therapeutic targets.

  19. Synonymous Codon Usage Analysis of Thirty Two Mycobacteriophage Genomes

    Directory of Open Access Journals (Sweden)

    Sameer Hassan

    2009-01-01

    Full Text Available Synonymous codon usage of protein coding genes of thirty two completely sequenced mycobacteriophage genomes was studied using multivariate statistical analysis. One of the major factors influencing codon usage is identified to be compositional bias. Codons ending with either C or G are preferred in highly expressed genes among which C ending codons are highly preferred over G ending codons. A strong negative correlation between effective number of codons (Nc and GC3s content was also observed, showing that the codon usage was effected by gene nucleotide composition. Translational selection is also identified to play a role in shaping the codon usage operative at the level of translational accuracy. High level of heterogeneity is seen among and between the genomes. Length of genes is also identified to influence the codon usage in 11 out of 32 phage genomes. Mycobacteriophage Cooper is identified to be the highly biased genome with better translation efficiency comparing well with the host specific tRNA genes.

  20. Optimisation and validation of methods to assess single nucleotide polymorphisms (SNPs) in archival histological material

    DEFF Research Database (Denmark)

    Andreassen, C N; Sørensen, Flemming Brandt; Overgaard

    2004-01-01

    only archival specimens are available. This study was conducted to validate protocols optimised for assessment of SNPs based on paraffin embedded, formalin fixed tissue samples.PATIENTS AND METHODS: In 137 breast cancer patients, three TGFB1 SNPs were assessed based on archival histological specimens...... precipitation).RESULTS: Assessment of SNPs based on archival histological material is encumbered by a number of obstacles and pitfalls. However, these can be widely overcome by careful optimisation of the methods used for sample selection, DNA extraction and PCR. Within 130 samples that fulfil the criteria...

  1. Mining the 30UTR of Autism-implicated Genes for SNPs Perturbing MicroRNA Regulation

    Institute of Scientific and Technical Information of China (English)

    Varadharajan Vaishnavi; Mayakannan Manikandan; Arasambattu Kannan Munirajan

    2014-01-01

    Autism spectrum disorder (ASD) refers to a group of childhood neurodevelopmental dis-orders with polygenic etiology. The expression of many genes implicated in ASD is tightly regulated by various factors including microRNAs (miRNAs), a class of noncoding RNAs 22 nucleotides in length that function to suppress translation by pairing with‘miRNA recognition elements’ (MREs) present in the 30untranslated region (30UTR) of target mRNAs. This emphasizes the role played by miRNAs in regulating neurogenesis, brain development and differentiation and hence any perturba-tions in this regulatory mechanism might affect these processes as well. Recently, single nucleotide polymorphisms (SNPs) present within 30UTRs of mRNAs have been shown to modulate existing MREs or even create new MREs. Therefore, we hypothesized that SNPs perturbing miRNA-medi-ated gene regulation might lead to aberrant expression of autism-implicated genes, thus resulting in disease predisposition or pathogenesis in at least a subpopulation of ASD individuals. We developed a systematic computational pipeline that integrates data from well-established databases. By following a stringent selection criterion, we identified 9 MRE-modulating SNPs and another 12 MRE-creating SNPs in the 30UTR of autism-implicated genes. These high-confidence candidate SNPs may play roles in ASD and hence would be valuable for further functional validation.

  2. TASS code topical report. V.1 TASS code technical manual

    International Nuclear Information System (INIS)

    Sim, Suk K.; Chang, W. P.; Kim, K. D.; Kim, H. C.; Yoon, H. Y.

    1997-02-01

    TASS 1.0 code has been developed at KAERI for the initial and reload non-LOCA safety analysis for the operating PWRs as well as the PWRs under construction in Korea. TASS code will replace various vendor's non-LOCA safety analysis codes currently used for the Westinghouse and ABB-CE type PWRs in Korea. This can be achieved through TASS code input modifications specific to each reactor type. The TASS code can be run interactively through the keyboard operation. A simimodular configuration used in developing the TASS code enables the user easily implement new models. TASS code has been programmed using FORTRAN77 which makes it easy to install and port for different computer environments. The TASS code can be utilized for the steady state simulation as well as the non-LOCA transient simulations such as power excursions, reactor coolant pump trips, load rejections, loss of feedwater, steam line breaks, steam generator tube ruptures, rod withdrawal and drop, and anticipated transients without scram (ATWS). The malfunctions of the control systems, components, operator actions and the transients caused by the malfunctions can be easily simulated using the TASS code. This technical report describes the TASS 1.0 code models including reactor thermal hydraulic, reactor core and control models. This TASS code models including reactor thermal hydraulic, reactor core and control models. This TASS code technical manual has been prepared as a part of the TASS code manual which includes TASS code user's manual and TASS code validation report, and will be submitted to the regulatory body as a TASS code topical report for a licensing non-LOCA safety analysis for the Westinghouse and ABB-CE type PWRs operating and under construction in Korea. (author). 42 refs., 29 tabs., 32 figs

  3. Methods for Using Small Non-Coding RNAs to Improve Recombinant Protein Expression in Mammalian Cells

    Directory of Open Access Journals (Sweden)

    Sarah Inwood

    2018-01-01

    Full Text Available The ability to produce recombinant proteins by utilizing different “cell factories” revolutionized the biotherapeutic and pharmaceutical industry. Chinese hamster ovary (CHO cells are the dominant industrial producer, especially for antibodies. Human embryonic kidney cells (HEK, while not being as widely used as CHO cells, are used where CHO cells are unable to meet the needs for expression, such as growth factors. Therefore, improving recombinant protein expression from mammalian cells is a priority, and continuing effort is being devoted to this topic. Non-coding RNAs are RNA segments that are not translated into a protein and often have a regulatory role. Since their discovery, major progress has been made towards understanding their functions. Non-coding RNA has been investigated extensively in relation to disease, especially cancer, and recently they have also been used as a method for engineering cells to improve their protein expression capability. In this review, we provide information about methods used to identify non-coding RNAs with the potential of improving recombinant protein expression in mammalian cell lines.

  4. Genome-wide DNA polymorphisms in Kavuni, a traditional rice cultivar with nutritional and therapeutic properties.

    Science.gov (United States)

    Rathinasabapathi, Pasupathi; Purushothaman, Natarajan; Parani, Madasamy

    2016-05-01

    Although rice genome was sequenced in the year 2002, efforts in resequencing the large number of available accessions, landraces, traditional cultivars, and improved varieties of this important food crop are limited. We have initiated resequencing of the traditional cultivars from India. Kavuni is an important traditional rice cultivar from South India that attracts premium price for its nutritional and therapeutic properties. Whole-genome sequencing of Kavuni using Illumina platform and SNPs analysis using Nipponbare reference genome identified 1 150 711 SNPs of which 377 381 SNPs were located in the genic regions. Non-synonymous SNPs (62 708) were distributed in 19 251 genes, and their number varied between 1 and 115 per gene. Large-effect DNA polymorphisms (7769) were present in 3475 genes. Pathway mapping of these polymorphisms revealed the involvement of genes related to carbohydrate metabolism, translation, protein-folding, and cell death. Analysis of the starch biosynthesis related genes revealed that the granule-bound starch synthase I gene had T/G SNPs at the first intron/exon junction and a two-nucleotide combination, which were reported to favour high amylose content and low glycemic index. The present study provided a valuable genomics resource to study the rice varieties with nutritional and medicinal properties.

  5. Mutations in MC1R Gene Determine Black Coat Color Phenotype in Chinese Sheep

    Directory of Open Access Journals (Sweden)

    Guang-Li Yang

    2013-01-01

    Full Text Available The melanocortin receptor 1 (MC1R plays a central role in regulation of animal coat color formation. In this study, we sequenced the complete coding region and parts of the 5′- and 3′-untranslated regions of the MC1R gene in Chinese sheep with completely white (Large-tailed Han sheep, black (Minxian Black-fur sheep, and brown coat colors (Kazakh Fat-Rumped sheep. The results showed five single nucleotide polymorphisms (SNPs: two non-synonymous mutations previously associated with coat color (c.218 T>A, p.73 Met>Lys. c.361 G>A, p.121 Asp>Asn and three synonymous mutations (c.429 C>T, p.143 Tyr>Tyr; c.600 T>G, p.200 Leu>Leu. c.735 C>T, p.245 Ile>Ile. Meanwhile, all mutations were detected in Minxian Black-fur sheep. However, the two nonsynonymous mutation sites were not in all studied breeds (Large-tailed Han, Small-tailed Han, Gansu Alpine Merino, and China Merino breeds, all of which are in white coat. A single haplotype AATGT (haplotype3 was uniquely associated with black coat color in Minxian Black-fur breed (P=9.72E-72, chi-square test. The first and second A alleles in this haplotype 3 represent location at 218 and 361 positions, respectively. Our results suggest that the mutations of MC1R gene are associated with black coat color phenotype in Chinese sheep.

  6. Sequencing illustrates the transcriptional response of Legionella pneumophila during infection and identifies seventy novel small non-coding RNAs.

    LENUS (Irish Health Repository)

    Weissenmayer, Barbara A

    2011-01-01

    Second generation sequencing has prompted a number of groups to re-interrogate the transcriptomes of several bacterial and archaeal species. One of the central findings has been the identification of complex networks of small non-coding RNAs that play central roles in transcriptional regulation in all growth conditions and for the pathogen\\'s interaction with and survival within host cells. Legionella pneumophila is a gram-negative facultative intracellular human pathogen with a distinct biphasic lifestyle. One of its primary environmental hosts in the free-living amoeba Acanthamoeba castellanii and its infection by L. pneumophila mimics that seen in human macrophages. Here we present analysis of strand specific sequencing of the transcriptional response of L. pneumophila during exponential and post-exponential broth growth and during the replicative and transmissive phase of infection inside A. castellanii. We extend previous microarray based studies as well as uncovering evidence of a complex regulatory architecture underpinned by numerous non-coding RNAs. Over seventy new non-coding RNAs could be identified; many of them appear to be strain specific and in configurations not previously reported. We discover a family of non-coding RNAs preferentially expressed during infection conditions and identify a second copy of 6S RNA in L. pneumophila. We show that the newly discovered putative 6S RNA as well as a number of other non-coding RNAs show evidence for antisense transcription. The nature and extent of the non-coding RNAs and their expression patterns suggests that these may well play central roles in the regulation of Legionella spp. specific traits and offer clues as to how L. pneumophila adapts to its intracellular niche. The expression profiles outlined in the study have been deposited into Genbank\\'s Gene Expression Omnibus (GEO) database under the series accession GSE27232.

  7. Microevolution and History of the Plague Bacillus, Yersinia pestis

    National Research Council Canada - National Science Library

    Achtman, Mark

    2004-01-01

    .... The microevolution of Y. pestis was therefore investigated by three different multilocus molecular methods, targeting genomewide synonymous SNPs, variation in number of tandem repeats, and insertion of IS100 insertion elements...

  8. Flavivirus RNAi suppression: decoding non-coding RNA.

    Science.gov (United States)

    Pijlman, Gorben P

    2014-08-01

    Flaviviruses are important human pathogens that are transmitted by invertebrate vectors, mostly mosquitoes and ticks. During replication in their vector, flaviviruses are subject to a potent innate immune response known as antiviral RNA interference (RNAi). This defense mechanism is associated with the production of small interfering (si)RNA that lead to degradation of viral RNA. To what extent flaviviruses would benefit from counteracting antiviral RNAi is subject of debate. Here, the experimental evidence to suggest the existence of flavivirus RNAi suppressors is discussed. I will highlight the putative role of non-coding, subgenomic flavivirus RNA in suppression of RNAi in insect and mammalian cells. Novel insights from ongoing research will reveal how arthropod-borne viruses modulate innate immunity including antiviral RNAi. Copyright © 2014 Elsevier B.V. All rights reserved.

  9. Roles of Non-Coding RNA in Sugarcane-Microbe Interaction.

    Science.gov (United States)

    Thiebaut, Flávia; Rojas, Cristian A; Grativol, Clícia; Calixto, Edmundo P da R; Motta, Mariana R; Ballesteros, Helkin G F; Peixoto, Barbara; de Lima, Berenice N S; Vieira, Lucas M; Walter, Maria Emilia; de Armas, Elvismary M; Entenza, Júlio O P; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S; Ferreira, Paulo C G

    2017-12-20

    Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae . Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae , while the siRNAs were repressed in the presence of A. avenae . Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408-a copper-microRNA-was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5'RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly.

  10. Roles of Non-Coding RNA in Sugarcane-Microbe Interaction

    Science.gov (United States)

    Grativol, Clícia; Motta, Mariana R.; Ballesteros, Helkin G. F.; Peixoto, Barbara; Vieira, Lucas M.; Walter, Maria Emilia; de Armas, Elvismary M.; Entenza, Júlio O. P.; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S.

    2017-01-01

    Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae. Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae, while the siRNAs were repressed in the presence of A. avenae. Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408—a copper-microRNA—was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5′RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly. PMID:29657296

  11. Roles of Non-Coding RNA in Sugarcane-Microbe Interaction

    Directory of Open Access Journals (Sweden)

    Flávia Thiebaut

    2017-12-01

    Full Text Available Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae. Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae, while the siRNAs were repressed in the presence of A. avenae. Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR. Among these miRNAs, miR408—a copper-microRNA—was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5′RACE (rapid amplification of cDNA ends assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly.

  12. Definition of novel GP6 polymorphisms and major difference in haplotype frequencies between populations by a combination of in-depth exon resequencing and genotyping with tag single nucleotide polymorphisms.

    Science.gov (United States)

    Watkins, N A; O'Connor, M N; Rankin, A; Jennings, N; Wilson, E; Harmer, I J; Davies, L; Smethurst, P A; Dudbridge, F; Farndale, R W; Ouwehand, W H

    2006-06-01

    Common genetic variants of cell surface receptors contribute to differences in functional responses and disease susceptibility. We have previously shown that single nucleotide polymorphisms (SNPs) in platelet glycoprotein VI (GP6) determine the extent of response to agonist. In addition, SNPs in the GP6 gene have been proposed as risk factors for coronary artery disease. To completely characterize genetic variation in the GP6 gene we generated a high-resolution SNP map by sequencing the promoter, exons and consensus splice sequences in 94 non-related Caucasoids. In addition, we sequenced DNA encoding the ligand-binding domains of GP6 from non-human primates to determine the level of evolutionary conservation. Eighteen SNPs were identified, six of which encoded amino acid substitutions in the mature form of the protein. The single non-synonymous SNP identified in the exons encoding the ligand-binding domains, encoding for a 103Leu > Val substitution, resulted in reduced ligand binding. Two common protein isoforms were confirmed in Caucasoid with frequencies of 0.82 and 0.15. Variation at the GP6 locus was characterized further by determining SNP frequency in over 2000 individuals from different ethnic backgrounds. The SNPs were polymorphic in all populations studied although significant differences in allele frequencies were observed. Twelve additional GP6 protein isoforms were identified from the genotyping results and, despite extensive variation in GP6, the sequence of the ligand-binding domains is conserved. Sequences from non-human primates confirmed this observation. These data provide valuable information for the optimal selection of genetic variants for use in future association studies.

  13. Replication and Characterization of Association between ABO SNPs and Red Blood Cell Traits by Meta-Analysis in Europeans.

    Directory of Open Access Journals (Sweden)

    Stela McLachlan

    Full Text Available Red blood cell (RBC traits are routinely measured in clinical practice as important markers of health. Deviations from the physiological ranges are usually a sign of disease, although variation between healthy individuals also occurs, at least partly due to genetic factors. Recent large scale genetic studies identified loci associated with one or more of these traits; further characterization of known loci and identification of new loci is necessary to better understand their role in health and disease and to identify potential molecular mechanisms. We performed meta-analysis of Metabochip association results for six RBC traits-hemoglobin concentration (Hb, hematocrit (Hct, mean corpuscular hemoglobin (MCH, mean corpuscular hemoglobin concentration (MCHC, mean corpuscular volume (MCV and red blood cell count (RCC-in 11 093 Europeans from seven studies of the UCL-LSHTM-Edinburgh-Bristol (UCLEB Consortium. We identified 394 non-overlapping SNPs in five loci at genome-wide significance: 6p22.1-6p21.33 (with HFE among others, 6q23.2 (with HBS1L among others, 6q23.3 (contains no genes, 9q34.3 (only ABO gene and 22q13.1 (with TMPRSS6 among others, replicating previous findings of association with RBC traits at these loci and extending them by imputation to 1000 Genomes. We further characterized associations between ABO SNPs and three traits: hemoglobin, hematocrit and red blood cell count, replicating them in an independent cohort. Conditional analyses indicated the independent association of each of these traits with ABO SNPs and a role for blood group O in mediating the association. The 15 most significant RBC-associated ABO SNPs were also associated with five cardiometabolic traits, with discordance in the direction of effect between groups of traits, suggesting that ABO may act through more than one mechanism to influence cardiometabolic risk.

  14. Joint beam design and user selection over non-binary coded MIMO interference channel

    Science.gov (United States)

    Li, Haitao; Yuan, Haiying

    2013-03-01

    In this paper, we discuss the problem of sum rate improvement for coded MIMO interference system, and propose joint beam design and user selection over interference channel. Firstly, we have formulated non-binary LDPC coded MIMO interference networks model. Then, the least square beam design for MIMO interference system is derived, and the low complexity user selection is presented. Simulation results confirm that the sum rate can be improved by the joint user selection and beam design comparing with single interference aligning beamformer.

  15. Long non-coding RNA expression profiles in hereditary haemorrhagic telangiectasia

    DEFF Research Database (Denmark)

    Tørring, Pernille M; Larsen, Martin Jakob; Kjeldsen, Anette D

    2014-01-01

    transcriptome, we wanted to assess whether lncRNAs play a role in the molecular pathogenesis of HHT manifestations. By microarray technology, we profiled lncRNA transcripts from HHT nasal telangiectasial and non-telangiectasial tissue using a paired design. The microarray probes were annotated using the GENCODE...... v.16 dataset, identifying 4,810 probes mapping to 2,811 lncRNAs. Comparing HHT telangiectasial tissue with HHT non-telangiectasial tissue, we identified 42 lncRNAs that are differentially expressed (qUsing GREAT, a tool that assumes cis-regulation, we showed that differently expressed lncRNAs...... to the TGF-β signalling pathway. The exact mechanism of how haploinsufficiency of ENG and ACVRL1 leads to HHT manifestations remains to be identified. As long non-coding RNAs (lncRNAs) are increasingly recognized as key regulators of gene expression and constitute a sizable fraction of the human...

  16. Predicting cell types and genetic variations contributing to disease by combining GWAS and epigenetic data.

    Directory of Open Access Journals (Sweden)

    Anna Gerasimova

    Full Text Available Genome-wide association studies (GWASs identify single nucleotide polymorphisms (SNPs that are enriched in individuals suffering from a given disease. Most disease-associated SNPs fall into non-coding regions, so that it is not straightforward to infer phenotype or function; moreover, many SNPs are in tight genetic linkage, so that a SNP identified as associated with a particular disease may not itself be causal, but rather signify the presence of a linked SNP that is functionally relevant to disease pathogenesis. Here, we present an analysis method that takes advantage of the recent rapid accumulation of epigenomics data to address these problems for some SNPs. Using asthma as a prototypic example; we show that non-coding disease-associated SNPs are enriched in genomic regions that function as regulators of transcription, such as enhancers and promoters. Identifying enhancers based on the presence of the histone modification marks such as H3K4me1 in different cell types, we show that the location of enhancers is highly cell-type specific. We use these findings to predict which SNPs are likely to be directly contributing to disease based on their presence in regulatory regions, and in which cell types their effect is expected to be detectable. Moreover, we can also predict which cell types contribute to a disease based on overlap of the disease-associated SNPs with the locations of enhancers present in a given cell type. Finally, we suggest that it will be possible to re-analyze GWAS studies with much higher power by limiting the SNPs considered to those in coding or regulatory regions of cell types relevant to a given disease.

  17. Code Cactus; Code Cactus

    Energy Technology Data Exchange (ETDEWEB)

    Fajeau, M; Nguyen, L T; Saunier, J [Commissariat a l' Energie Atomique, Centre d' Etudes Nucleaires de Saclay, 91 - Gif-sur-Yvette (France)

    1966-09-01

    This code handles the following problems: -1) Analysis of thermal experiments on a water loop at high or low pressure; steady state or transient behavior; -2) Analysis of thermal and hydrodynamic behavior of water-cooled and moderated reactors, at either high or low pressure, with boiling permitted; fuel elements are assumed to be flat plates: - Flowrate in parallel channels coupled or not by conduction across plates, with conditions of pressure drops or flowrate, variable or not with respect to time is given; the power can be coupled to reactor kinetics calculation or supplied by the code user. The code, containing a schematic representation of safety rod behavior, is a one dimensional, multi-channel code, and has as its complement (FLID), a one-channel, two-dimensional code. (authors) [French] Ce code permet de traiter les problemes ci-dessous: 1. Depouillement d'essais thermiques sur boucle a eau, haute ou basse pression, en regime permanent ou transitoire; 2. Etudes thermiques et hydrauliques de reacteurs a eau, a plaques, a haute ou basse pression, ebullition permise: - repartition entre canaux paralleles, couples on non par conduction a travers plaques, pour des conditions de debit ou de pertes de charge imposees, variables ou non dans le temps; - la puissance peut etre couplee a la neutronique et une representation schematique des actions de securite est prevue. Ce code (Cactus) a une dimension d'espace et plusieurs canaux, a pour complement Flid qui traite l'etude d'un seul canal a deux dimensions. (auteurs)

  18. Exome sequencing generates high quality data in non-target regions

    Directory of Open Access Journals (Sweden)

    Guo Yan

    2012-05-01

    Full Text Available Abstract Background Exome sequencing using next-generation sequencing technologies is a cost efficient approach to selectively sequencing coding regions of human genome for detection of disease variants. A significant amount of DNA fragments from the capture process fall outside target regions, and sequence data for positions outside target regions have been mostly ignored after alignment. Result We performed whole exome sequencing on 22 subjects using Agilent SureSelect capture reagent and 6 subjects using Illumina TrueSeq capture reagent. We also downloaded sequencing data for 6 subjects from the 1000 Genomes Project Pilot 3 study. Using these data, we examined the quality of SNPs detected outside target regions by computing consistency rate with genotypes obtained from SNP chips or the Hapmap database, transition-transversion (Ti/Tv ratio, and percentage of SNPs inside dbSNP. For all three platforms, we obtained high-quality SNPs outside target regions, and some far from target regions. In our Agilent SureSelect data, we obtained 84,049 high-quality SNPs outside target regions compared to 65,231 SNPs inside target regions (a 129% increase. For our Illumina TrueSeq data, we obtained 222,171 high-quality SNPs outside target regions compared to 95,818 SNPs inside target regions (a 232% increase. For the data from the 1000 Genomes Project, we obtained 7,139 high-quality SNPs outside target regions compared to 1,548 SNPs inside target regions (a 461% increase. Conclusions These results demonstrate that a significant amount of high quality genotypes outside target regions can be obtained from exome sequencing data. These data should not be ignored in genetic epidemiology studies.

  19. The Aster code; Code Aster

    Energy Technology Data Exchange (ETDEWEB)

    Delbecq, J.M

    1999-07-01

    The Aster code is a 2D or 3D finite-element calculation code for structures developed by the R and D direction of Electricite de France (EdF). This dossier presents a complete overview of the characteristics and uses of the Aster code: introduction of version 4; the context of Aster (organisation of the code development, versions, systems and interfaces, development tools, quality assurance, independent validation); static mechanics (linear thermo-elasticity, Euler buckling, cables, Zarka-Casier method); non-linear mechanics (materials behaviour, big deformations, specific loads, unloading and loss of load proportionality indicators, global algorithm, contact and friction); rupture mechanics (G energy restitution level, restitution level in thermo-elasto-plasticity, 3D local energy restitution level, KI and KII stress intensity factors, calculation of limit loads for structures), specific treatments (fatigue, rupture, wear, error estimation); meshes and models (mesh generation, modeling, loads and boundary conditions, links between different modeling processes, resolution of linear systems, display of results etc..); vibration mechanics (modal and harmonic analysis, dynamics with shocks, direct transient dynamics, seismic analysis and aleatory dynamics, non-linear dynamics, dynamical sub-structuring); fluid-structure interactions (internal acoustics, mass, rigidity and damping); linear and non-linear thermal analysis; steels and metal industry (structure transformations); coupled problems (internal chaining, internal thermo-hydro-mechanical coupling, chaining with other codes); products and services. (J.S.)

  20. No variation and low synonymous substitution rates in coral mtDNA despite high nuclear variation

    Directory of Open Access Journals (Sweden)

    Hellberg Michael E

    2006-03-01

    Full Text Available Abstract Background The mitochondrial DNA (mtDNA of most animals evolves more rapidly than nuclear DNA, and often shows higher levels of intraspecific polymorphism and population subdivision. The mtDNA of anthozoans (corals, sea fans, and their kin, by contrast, appears to evolve slowly. Slow mtDNA evolution has been reported for several anthozoans, however this slow pace has been difficult to put in phylogenetic context without parallel surveys of nuclear variation or calibrated rates of synonymous substitution that could permit quantitative rate comparisons across taxa. Here, I survey variation in the coding region of a mitochondrial gene from a coral species (Balanophyllia elegans known to possess high levels of nuclear gene variation, and estimate synonymous rates of mtDNA substitution by comparison to another coral (Tubastrea coccinea. Results The mtDNA surveyed (630 bp of cytochrome oxidase subunit I was invariant among individuals sampled from 18 populations spanning 3000 km of the range of B. elegans, despite high levels of variation and population subdivision for allozymes over these same populations. The synonymous substitution rate between B. elegans and T. coccinea (0.05%/site/106 years is similar to that in most plants, but 50–100 times lower than rates typical for most animals. In addition, while substitutions to mtDNA in most animals exhibit a strong bias toward transitions, mtDNA from these corals does not. Conclusion Slow rates of mitochondrial nucleotide substitution result in low levels of intraspecific mtDNA variation in corals, even when nuclear loci vary. Slow mtDNA evolution appears to be the basal condition among eukaryotes. mtDNA substitution rates switch from slow to fast abruptly and unidirectionally. This switch may stem from the loss of just one or a few mitochondrion-specific DNA repair or replication genes.

  1. The interplay of long non-coding RNAs and MYC in cancer

    Directory of Open Access Journals (Sweden)

    Michael J. Hamilton

    2015-12-01

    Full Text Available Long non-coding RNAs (lncRNAs are a class of RNA molecules that are changing how researchers view eukaryotic gene regulation. Once considered to be non-functional products of low-level aberrant transcription from non-coding regions of the genome, lncRNAs are now viewed as important epigenetic regulators and several lncRNAs have now been demonstrated to be critical players in the development and/or maintenance of cancer. Similarly, the emerging variety of interactions between lncRNAs and MYC, a well-known oncogenic transcription factor linked to most types of cancer, have caught the attention of many biomedical researchers. Investigations exploring the dynamic interactions between lncRNAs and MYC, referred to as the lncRNA-MYC network, have proven to be especially complex. Genome-wide studies have shown that MYC transcriptionally regulates many lncRNA genes. Conversely, recent reports identified lncRNAs that regulate MYC expression both at the transcriptional and post-transcriptional levels. These findings are of particular interest because they suggest roles of lncRNAs as regulators of MYC oncogenic functions and the possibility that targeting lncRNAs could represent a novel avenue to cancer treatment. Here, we briefly review the current understanding of how lncRNAs regulate chromatin structure and gene transcription, and then focus on the new developments in the emerging field exploring the lncRNA-MYC network in cancer.

  2. The AutoProof Verifier: Usability by Non-Experts and on Standard Code

    Directory of Open Access Journals (Sweden)

    Carlo A. Furia

    2015-08-01

    Full Text Available Formal verification tools are often developed by experts for experts; as a result, their usability by programmers with little formal methods experience may be severely limited. In this paper, we discuss this general phenomenon with reference to AutoProof: a tool that can verify the full functional correctness of object-oriented software. In particular, we present our experiences of using AutoProof in two contrasting contexts representative of non-expert usage. First, we discuss its usability by students in a graduate course on software verification, who were tasked with verifying implementations of various sorting algorithms. Second, we evaluate its usability in verifying code developed for programming assignments of an undergraduate course. The first scenario represents usability by serious non-experts; the second represents usability on "standard code", developed without full functional verification in mind. We report our experiences and lessons learnt, from which we derive some general suggestions for furthering the development of verification tools with respect to improving their usability.

  3. An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome.

    Science.gov (United States)

    Ferlaino, Michael; Rogers, Mark F; Shihab, Hashem A; Mort, Matthew; Cooper, David N; Gaunt, Tom R; Campbell, Colin

    2017-10-06

    Small insertions and deletions (indels) have a significant influence in human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state of the art models in assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. FATHMM-indel can accurately predict the functional impact and prioritise small indels throughout the whole non-coding genome.

  4. Screening for SNPs with Allele-Specific Methylation based on Next-Generation Sequencing Data.

    Science.gov (United States)

    Hu, Bo; Ji, Yuan; Xu, Yaomin; Ting, Angela H

    2013-05-01

    Allele-specific methylation (ASM) has long been studied but mainly documented in the context of genomic imprinting and X chromosome inactivation. Taking advantage of the next-generation sequencing technology, we conduct a high-throughput sequencing experiment with four prostate cell lines to survey the whole genome and identify single nucleotide polymorphisms (SNPs) with ASM. A Bayesian approach is proposed to model the counts of short reads for each SNP conditional on its genotypes of multiple subjects, leading to a posterior probability of ASM. We flag SNPs with high posterior probabilities of ASM by accounting for multiple comparisons based on posterior false discovery rates. Applying the Bayesian approach to the in-house prostate cell line data, we identify 269 SNPs as candidates of ASM. A simulation study is carried out to demonstrate the quantitative performance of the proposed approach.

  5. Quantification of non-coding RNA target localization diversity and its application in cancers.

    Science.gov (United States)

    Cheng, Lixin; Leung, Kwong-Sak

    2018-04-01

    Subcellular localization is pivotal for RNAs and proteins to implement biological functions. The localization diversity of protein interactions has been studied as a crucial feature of proteins, considering that the protein-protein interactions take place in various subcellular locations. Nevertheless, the localization diversity of non-coding RNA (ncRNA) target proteins has not been systematically studied, especially its characteristics in cancers. In this study, we provide a new algorithm, non-coding RNA target localization coefficient (ncTALENT), to quantify the target localization diversity of ncRNAs based on the ncRNA-protein interaction and protein subcellular localization data. ncTALENT can be used to calculate the target localization coefficient of ncRNAs and measure how diversely their targets are distributed among the subcellular locations in various scenarios. We focus our study on long non-coding RNAs (lncRNAs), and our observations reveal that the target localization diversity is a primary characteristic of lncRNAs in different biotypes. Moreover, we found that lncRNAs in multiple cancers, differentially expressed cancer lncRNAs, and lncRNAs with multiple cancer target proteins are prone to have high target localization diversity. Furthermore, the analysis of gastric cancer helps us to obtain a better understanding that the target localization diversity of lncRNAs is an important feature closely related to clinical prognosis. Overall, we systematically studied the target localization diversity of the lncRNAs and uncovered its association with cancer.

  6. Non-coding RNAs: New therapeutic targets and opportunities for hepatocellular carcinoma

    Directory of Open Access Journals (Sweden)

    Yu Cuiyun

    2016-02-01

    Full Text Available Non-coding RNAs (ncRNA are RNA molecules without protein coding functions owing to the lack of an open reading frame (ORF. Based on the length, ncRNAs can be divided into long and short ncRNAs; short ncRNAs include miRNAs and piRNAs. Hepatocellular carcinoma (HCC is among the most frequent forms of cancer worldwide and its incidence is increasing rapidly. Studies have found that ncRNAs are likely to play a crucial role in a variety of biological processes including the pathogenesis and progression of HCC. In this review, we summarized the regulation mechanism and biological functions of ncRNAs in HCC with respect to its application in HCC diagnosis, therapy and prognosis.

  7. Beamforming-Based Physical Layer Network Coding for Non-Regenerative Multi-Way Relaying

    Directory of Open Access Journals (Sweden)

    Klein Anja

    2010-01-01

    Full Text Available We propose non-regenerative multi-way relaying where a half-duplex multi-antenna relay station (RS assists multiple single-antenna nodes to communicate with each other. The required number of communication phases is equal to the number of the nodes, N. There are only one multiple-access phase, where the nodes transmit simultaneously to the RS, and broadcast (BC phases. Two transmission methods for the BC phases are proposed, namely, multiplexing transmission and analog network coded transmission. The latter is a cooperation method between the RS and the nodes to manage the interference in the network. Assuming that perfect channel state information is available, the RS performs transceive beamforming to the received signals and transmits simultaneously to all nodes in each BC phase. We address the optimum transceive beamforming maximising the sum rate of non-regenerative multi-way relaying. Due to the nonconvexity of the optimization problem, we propose suboptimum but practical signal processing schemes. For multiplexing transmission, we propose suboptimum schemes based on zero forcing, minimising the mean square error, and maximising the signal to noise ratio. For analog network coded transmission, we propose suboptimum schemes based on matched filtering and semidefinite relaxation of maximising the minimum signal to noise ratio. It is shown that analog network coded transmission outperforms multiplexing transmission.

  8. FEAST: a two-dimensional non-linear finite element code for calculating stresses

    International Nuclear Information System (INIS)

    Tayal, M.

    1986-06-01

    The computer code FEAST calculates stresses, strains, and displacements. The code is two-dimensional. That is, either plane or axisymmetric calculations can be done. The code models elastic, plastic, creep, and thermal strains and stresses. Cracking can also be simulated. The finite element method is used to solve equations describing the following fundamental laws of mechanics: equilibrium; compatibility; constitutive relations; yield criterion; and flow rule. FEAST combines several unique features that permit large time-steps in even severely non-linear situations. The features include a special formulation for permitting many finite elements to simultaneously cross the boundary from elastic to plastic behaviour; accomodation of large drops in yield-strength due to changes in local temperature and a three-step predictor-corrector method for plastic analyses. These features reduce computing costs. Comparisons against twenty analytical solutions and against experimental measurements show that predictions of FEAST are generally accurate to ± 5%

  9. Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.

    Science.gov (United States)

    Sharma, Rekha; Ahlawat, Sonika; Maitra, A; Roy, Manoranjan; Mandakmale, S; Tantia, M S

    2013-12-10

    Bone morphogenetic proteins (BMPs) are members of the TGF-β (transforming growth factor-beta) superfamily, of which BMP4 is the most important due to its crucial role in follicular growth and differentiation, cumulus expansion and ovulation. Reproduction is a crucial trait in goat breeding and based on the important role of BMP4 gene in reproduction it was considered as a possible candidate gene for the prolificacy of goats. The objective of the present study was to detect polymorphism in intronic, exonic and 3' un-translated regions of BMP4 gene in Indian goats. Nine different goat breeds (Barbari, Beetal, Black Bengal, Malabari, Jakhrana (Twinning>40%), Osmanabadi, Sangamneri (Twinning 20-30%), Sirohi and Ganjam (Twinning<10%)) differing in prolificacy and geographic distribution were employed for polymorphism scanning. Cattle sequence (AC_000167.1) was used to design primers for the amplification of a targeted region followed by direct DNA sequencing to identify the genetic variations. Single nucleotide polymorphisms (SNPs) were not detected in exon 3, the intronic region and the 3' flanking region. A SNP (G1534A) was identified in exon 2. It was a non-synonymous mutation resulting in an arginine to lysine change in a corresponding protein sequence. G to A transition at the 1534 locus revealed two genotypes GG and GA in the nine investigated goat breeds. The GG genotype was predominant with a genotype frequency of 0.98. The GA genotype was present in the Black Bengal as well as Jakhrana breed with a genotype frequency of 0.02. A microsatellite was identified in the 3' flanking region, only 20 nucleotides downstream from the termination site of the coding region, as a short sequence with more than nineteen continuous and repeated CA dinucleotides. Since the gene is highly evolutionarily conserved, identification of a non-synonymous SNP (G1534A) in the coding region gains further importance. To our knowledge, this is the first report of a mutation in the coding

  10. ICSNPathway: identify candidate causal SNPs and pathways from genome-wide association study by one analytical framework.

    Science.gov (United States)

    Zhang, Kunlin; Chang, Suhua; Cui, Sijia; Guo, Liyuan; Zhang, Liuyan; Wang, Jing

    2011-07-01

    Genome-wide association study (GWAS) is widely utilized to identify genes involved in human complex disease or some other trait. One key challenge for GWAS data interpretation is to identify causal SNPs and provide profound evidence on how they affect the trait. Currently, researches are focusing on identification of candidate causal variants from the most significant SNPs of GWAS, while there is lack of support on biological mechanisms as represented by pathways. Although pathway-based analysis (PBA) has been designed to identify disease-related pathways by analyzing the full list of SNPs from GWAS, it does not emphasize on interpreting causal SNPs. To our knowledge, so far there is no web server available to solve the challenge for GWAS data interpretation within one analytical framework. ICSNPathway is developed to identify candidate causal SNPs and their corresponding candidate causal pathways from GWAS by integrating linkage disequilibrium (LD) analysis, functional SNP annotation and PBA. ICSNPathway provides a feasible solution to bridge the gap between GWAS and disease mechanism study by generating hypothesis of SNP → gene → pathway(s). The ICSNPathway server is freely available at http://icsnpathway.psych.ac.cn/.

  11. Association of p21 SNPs and risk of cervical cancer among Chinese women

    International Nuclear Information System (INIS)

    Wang, Ning; Wang, Shizhuo; Zhang, Qiao; Lu, Yanming; Wei, Heng; Li, Wei; Zhang, Shulan; Yin, Duo; Ou, Yangling

    2012-01-01

    The p21 codon 31 single nucleotide polymorphism (SNP), rs1801270, has been linked to cervical cancer but with controversial results. The aims of this study were to investigate the role of p21 SNP-rs1801270 and other untested p21 SNPs in the risk of cervical cancer in a Chinese population. We genotyped five p21 SNPs (rs762623, rs2395655, rs1801270, rs3176352, and rs1059234) using peripheral blood DNA from 393 cervical cancer patients and 434 controls. The frequency of the rs1801270 A allele in patients (0.421) was significantly lower than that in controls (0.494, p = 0.003). The frequency of the rs3176352 C allele in cases (0.319) was significantly lower than that in controls (0.417, p < 0.001).The allele frequency of other three p21 SNPs showed not statistically significantly different between patients and controls. The rs1801270 AA genotype was associated with a decreased risk for the development of cervical cancer (OR = 0.583, 95%CI: 0.399 - 0.853, P = 0.005). We observed that the three p21 SNPs (rs1801270, rs3176352, and rs1059234) was in linkage disequilibrium (LD) and thus haplotype analysis was performed. The AGT haplotype (which includes the rs1801270A allele) was the most frequent haplotype among all subjects, and both homozygosity and heterozygosity for the AGT haplotype provided a protective effect from development of cervical cancer. We show an association between the p21 SNP rs1801270A allele and a decreased risk for cervical cancer in a population of Chinese women. The AGT haplotype formed by three p21 SNPs in LD (rs1801270, rs3176352 and rs1059234) also provided a protective effect in development of cervical cancer in this population

  12. No association of the Arg51Gln and Leu72Met polymorphisms of the ghrelin gene and polycystic ovary syndrome.

    Science.gov (United States)

    Wang, Kehua; Wang, Leiguang; Zhao, Yueran; Shi, Yuhua; Wang, Laicheng; Chen, Zi-Jiang

    2009-02-01

    Ghrelin plays a role in regulating glucose metabolism and energy balance. Polymorphisms in preproghrelin and ghrelin gene could be responsible for obesity, insulin resistance and low ghrelin levels observed in some individuals. The objective of this study was to evaluate the influence of two single-nucleotide polymorphisms (SNPs) of ghrelin gene on the clinical, the hormonal and metabolic features in women with polycystic ovary syndrome (PCOS) in a Chinese population. A large sample of Chinese PCOS (n = 271) women and a control group (n = 296) of healthy women matched for age were studied. Hormone and metabolic profiles were measured and blood samples were collected for genotype and allelic frequency analysis. Non-synonymous SNPs in the coding region (exon 2) of the preproghrelin gene (Arg51Gln (346 G>A) and Leu72Met (408 C>A) were studied using PCR and restriction fragment length polymorphism analysis. The polymorphism Arg51Gln was not found in the cohorts studied. The distribution of Leu72Met was similar in PCOS group and in healthy controls. There was no significant difference in age, BMI, waist-hip-ratio and levels of FSH, LH, estradiol, testosterone and prolactin between PCOS patients with different genotypes, and the level of plasma glucose and insulin was also similar. No association was found between Leu72Met and Arg51Gln polymorphisms in the ghrelin gene and PCOS in Chinese population.

  13. Multiple Evolutionary Selections Involved in Synonymous Codon Usages in the Streptococcus agalactiae Genome.

    Science.gov (United States)

    Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu

    2016-02-24

    Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts.

  14. List Decoding of Matrix-Product Codes from nested codes: an application to Quasi-Cyclic codes

    DEFF Research Database (Denmark)

    Hernando, Fernando; Høholdt, Tom; Ruano, Diego

    2012-01-01

    A list decoding algorithm for matrix-product codes is provided when $C_1,..., C_s$ are nested linear codes and $A$ is a non-singular by columns matrix. We estimate the probability of getting more than one codeword as output when the constituent codes are Reed-Solomon codes. We extend this list...... decoding algorithm for matrix-product codes with polynomial units, which are quasi-cyclic codes. Furthermore, it allows us to consider unique decoding for matrix-product codes with polynomial units....

  15. Genome-wide single nucleotide polymorphisms (SNPs) for a model invasive ascidian Botryllus schlosseri.

    Science.gov (United States)

    Gao, Yangchun; Li, Shiguo; Zhan, Aibin

    2018-04-01

    Invasive species cause huge damages to ecology, environment and economy globally. The comprehensive understanding of invasion mechanisms, particularly genetic bases of micro-evolutionary processes responsible for invasion success, is essential for reducing potential damages caused by invasive species. The golden star tunicate, Botryllus schlosseri, has become a model species in invasion biology, mainly owing to its high invasiveness nature and small well-sequenced genome. However, the genome-wide genetic markers have not been well developed in this highly invasive species, thus limiting the comprehensive understanding of genetic mechanisms of invasion success. Using restriction site-associated DNA (RAD) tag sequencing, here we developed a high-quality resource of 14,119 out of 158,821 SNPs for B. schlosseri. These SNPs were relatively evenly distributed at each chromosome. SNP annotations showed that the majority of SNPs (63.20%) were located at intergenic regions, and 21.51% and 14.58% were located at introns and exons, respectively. In addition, the potential use of the developed SNPs for population genomics studies was primarily assessed, such as the estimate of observed heterozygosity (H O ), expected heterozygosity (H E ), nucleotide diversity (π), Wright's inbreeding coefficient (F IS ) and effective population size (Ne). Our developed SNP resource would provide future studies the genome-wide genetic markers for genetic and genomic investigations, such as genetic bases of micro-evolutionary processes responsible for invasion success.

  16. microRNA-9 targets the long non-coding RNA MALAT1 for degradation in the nucleus

    DEFF Research Database (Denmark)

    Leucci, Eleonora; Patella, Francesca; Waage, Johannes

    2013-01-01

    -coding RNAs. Here we report that microRNA-9 (miR-9) regulates the expression of the Metastasis Associated Lung Adenocarcinoma Transcript 1 (MALAT-1), one of the most abundant and conserved long non-coding RNAs. Intriguingly, we find that miR-9 targets AGO2-mediated regulation of MALAT1 in the nucleus. Our...

  17. Overexpression of long non-coding RNAs following exposure to xenobiotics in the aquatic midge Chironomus riparius

    International Nuclear Information System (INIS)

    Martínez-Guitarte, José-Luis; Planelló, Rosario; Morcillo, Gloria

    2012-01-01

    Non-coding RNAs (ncRNAs) represent an important transcriptional output of eukaryotic genomes. In addition to their functional relevance as housekeeping and regulatory elements, recent studies have suggested their involvement in rather unexpected cellular functions. The aim of this work was to analyse the transcriptional behaviour of non-coding RNAs in the toxic response to pollutants in Chironomus riparius, a reference organism in aquatic toxicology. Three well-characterized long non-coding sequences were studied: telomeric repeats, Cla repetitive elements and the SINE CTRT1. Transcription levels were evaluated by RT-PCR after 24-h exposures to three current aquatic contaminants: bisphenol A (BPA), benzyl butyl phthalate (BBP) and the heavy metal cadmium (Cd). Upregulation of telomeric transcripts was found after BPA treatments. Moreover, BPA significantly activated Cla transcription, which also appeared to be increased by cadmium, whereas BBP did not affect the transcription levels of these sequences. Transcription of SINE CTRT1 was not altered by any of the chemicals tested. These data are discussed in the light of previous studies that have shown a response by long ncRNAS (lncRNAs) to cellular stressors, indicating a relationship with environmental stimuli. Our results demonstrated for the first time the ability of bisphenol A to activate non-coding sequences mainly located at telomeres and centromeres. Overall, this study provides evidence that xenobiotics can induce specific responses in ncRNAs derived from repetitive sequences that could be relevant in the toxic response, and also suggests that ncRNAs could represent a novel class of potential biomarkers in toxicological assessment.

  18. Overexpression of long non-coding RNAs following exposure to xenobiotics in the aquatic midge Chironomus riparius

    Energy Technology Data Exchange (ETDEWEB)

    Martinez-Guitarte, Jose-Luis, E-mail: jlmartinez@ccia.uned.es [Grupo de Biologia y Toxicologia Ambiental, Facultad de Ciencias, Universidad Nacional de Educacion a Distancia, UNED, Senda del Rey 9, 28040 Madrid (Spain); Planello, Rosario; Morcillo, Gloria [Grupo de Biologia y Toxicologia Ambiental, Facultad de Ciencias, Universidad Nacional de Educacion a Distancia, UNED, Senda del Rey 9, 28040 Madrid (Spain)

    2012-04-15

    Non-coding RNAs (ncRNAs) represent an important transcriptional output of eukaryotic genomes. In addition to their functional relevance as housekeeping and regulatory elements, recent studies have suggested their involvement in rather unexpected cellular functions. The aim of this work was to analyse the transcriptional behaviour of non-coding RNAs in the toxic response to pollutants in Chironomus riparius, a reference organism in aquatic toxicology. Three well-characterized long non-coding sequences were studied: telomeric repeats, Cla repetitive elements and the SINE CTRT1. Transcription levels were evaluated by RT-PCR after 24-h exposures to three current aquatic contaminants: bisphenol A (BPA), benzyl butyl phthalate (BBP) and the heavy metal cadmium (Cd). Upregulation of telomeric transcripts was found after BPA treatments. Moreover, BPA significantly activated Cla transcription, which also appeared to be increased by cadmium, whereas BBP did not affect the transcription levels of these sequences. Transcription of SINE CTRT1 was not altered by any of the chemicals tested. These data are discussed in the light of previous studies that have shown a response by long ncRNAS (lncRNAs) to cellular stressors, indicating a relationship with environmental stimuli. Our results demonstrated for the first time the ability of bisphenol A to activate non-coding sequences mainly located at telomeres and centromeres. Overall, this study provides evidence that xenobiotics can induce specific responses in ncRNAs derived from repetitive sequences that could be relevant in the toxic response, and also suggests that ncRNAs could represent a novel class of potential biomarkers in toxicological assessment.

  19. Table S1 Basic characteristics of 32 SNPs of neurotransmitter ...

    Indian Academy of Sciences (India)

    微软用户

    Basic characteristics of 32 SNPs in neurotransmitter-related genes. Gene .... rs45435444, rs80837467 and rs80980072, significant differences (P. *** * ... At the same age and environments, skin lesion scores on the ears (P < 0.001), front (P <.

  20. [Mutation analysis of seven patients with Waardenburg syndrome].

    Science.gov (United States)

    Hao, Ziqi; Zhou, Yongan; Li, Pengli; Zhang, Quanbin; Li, Jiao; Wang, Pengfei; Li, Xiangshao; Feng, Yong

    2016-06-01

    To perform genetic analysis for 7 patients with Waardenburg syndrome. Potential mutation of MITF, PAX3, SOX10 and SNAI2 genes was screened by polymerase chain reaction and direct sequencing. Functions of non-synonymous polymorphisms were predicted with PolyPhen2 software. Seven mutations, including c.649-651delAGA (p.R217del), c.72delG (p.G24fs), c.185T>C (p.M62T), c.118C>T (p.Q40X), c.422T>C (p.L141P), c.640C>T (p.R214X) and c.28G>T(p.G43V), were detected in the patients. Among these, four mutations of the PAX3 gene (c.72delG, c.185T>C, c.118C>T and c.128G>T) and one SOX10 gene mutation (c.422T>C) were not reported previously. Three non-synonymous SNPs (c.185T>C, c.128G>T and c.422T>C) were predicted as harmful. Genetic mutations have been detected in all patients with Waardenburg syndrome.

  1. Canonical Single Nucleotide Polymorphisms (SNPs for High-Resolution Subtyping of Shiga-Toxin Producing Escherichia coli (STEC O157:H7.

    Directory of Open Access Journals (Sweden)

    Sean M Griffing

    Full Text Available The objective of this study was to develop a canonical, parsimoniously-informative SNP panel for subtyping Shiga-toxin producing Escherichia coli (STEC O157:H7 that would be consistent with epidemiological, PFGE, and MLVA clustering of human specimens. Our group had previously identified 906 putative discriminatory SNPs, which were pared down to 391 SNPs based on their prevalence in a test set. The 391 SNPs were screened using a high-throughput form of TaqMan PCR against a set of clinical isolates that represent the most diverse collection of O157:H7 isolates from outbreaks and sporadic cases examined to date. Another 30 SNPs identified by others were also screened using the same method. Two additional targets were tested using standard TaqMan PCR endpoint analysis. These 423 SNPs were reduced to a 32 SNP panel with the almost the same discriminatory value. While the panel partitioned our diverse set of isolates in a manner that was consistent with epidemiological data and PFGE and MLVA phylogenies, it resulted in fewer subtypes than either existing method and insufficient epidemiological resolution in 10 of 47 clusters. Therefore, another round of SNP discovery was undertaken using comparative genomic resequencing of pooled DNA from the 10 clusters with insufficient resolution. This process identified 4,040 potential SNPs and suggested one of the ten clusters was incorrectly grouped. After its removal, there were 2,878 SNPs, of which only 63 were previously identified and 438 occurred across multiple clusters. Among highly clonal bacteria like STEC O157:H7, linkage disequilibrium greatly limits the number of parsimoniously informative SNPs. Therefore, it is perhaps unsurprising that our panel accounted for the potential discriminatory value of numerous other SNPs reported in the literature. We concluded published O157:H7 SNPs are insufficient for effective epidemiological subtyping. However, the 438 multi-cluster SNPs we identified may provide

  2. Non-Coding RNAs are Differentially Expressed by Nocardia brasiliensis in Vitro and in Experimental Actinomycetoma.

    Science.gov (United States)

    Cruz-Rabadán, Josué S; Miranda-Ríos, Juan; Espín-Ocampo, Guadalupe; Méndez-Tovar, Luis J; Maya-Pineda, Héctor Rubén; Hernández-Hernández, Francisca

    2017-01-01

    Nocardia spp. are common soil-inhabiting bacteria that frequently infect humans through traumatic injuries or inhalation routes and cause infections, such as actinomycetoma and nocardiosis, respectively. Nocardia brasiliensis is the main aetiological agent of actinomycetoma in various countries. Many bacterial non-coding RNAs are regulators of genes associated with virulence factors. The aim of this work was to identify non-coding RNAs (ncRNAs) expressed during infection conditions and in free-living form ( in vitro ) in Nocardia brasiliensis . The N. brasiliensis transcriptome (predominately brasiliensis infection compared with the in vitro conditions. The results of this work suggest a possible role for these transcripts in the regulation of virulence genes in actinomycetoma pathogenesis.

  3. Coverage and characteristics of the Affymetrix GeneChip Human Mapping 100K SNP set.

    Directory of Open Access Journals (Sweden)

    2006-05-01

    Full Text Available Improvements in technology have made it possible to conduct genome-wide association mapping at costs within reach of academic investigators, and experiments are currently being conducted with a variety of high-throughput platforms. To provide an appropriate context for interpreting results of such studies, we summarize here results of an investigation of one of the first of these technologies to be publicly available, the Affymetrix GeneChip Human Mapping 100K set of single nucleotide polymorphisms (SNPs. In a systematic analysis of the pattern and distribution of SNPs in the Mapping 100K set, we find that SNPs in this set are undersampled from coding regions (both nonsynonymous and synonymous and oversampled from regions outside genes, relative to SNPs in the overall HapMap database. In addition, we utilize a novel multilocus linkage disequilibrium (LD coefficient based on information content (analogous to the information content scores commonly used for linkage mapping that is equivalent to the familiar measure r2 in the special case of two loci. Using this approach, we are able to summarize for any subset of markers, such as the Affymetrix Mapping 100K set, the information available for association mapping in that subset, relative to the information available in the full set of markers included in the HapMap, and highlight circumstances in which this multilocus measure of LD provides substantial additional insight about the haplotype structure in a region over pairwise measures of LD.

  4. A Mismatch EndoNuclease Array-Based Methodology (MENA) for Identifying Known SNPs or Novel Point Mutations.

    Science.gov (United States)

    Comeron, Josep M; Reed, Jordan; Christie, Matthew; Jacobs, Julia S; Dierdorff, Jason; Eberl, Daniel F; Manak, J Robert

    2016-04-05

    Accurate and rapid identification or confirmation of single nucleotide polymorphisms (SNPs), point mutations and other human genomic variation facilitates understanding the genetic basis of disease. We have developed a new methodology (called MENA (Mismatch EndoNuclease Array)) pairing DNA mismatch endonuclease enzymology with tiling microarray hybridization in order to genotype both known point mutations (such as SNPs) as well as identify previously undiscovered point mutations and small indels. We show that our assay can rapidly genotype known SNPs in a human genomic DNA sample with 99% accuracy, in addition to identifying novel point mutations and small indels with a false discovery rate as low as 10%. Our technology provides a platform for a variety of applications, including: (1) genotyping known SNPs as well as confirming newly discovered SNPs from whole genome sequencing analyses; (2) identifying novel point mutations and indels in any genomic region from any organism for which genome sequence information is available; and (3) screening panels of genes associated with particular diseases and disorders in patient samples to identify causative mutations. As a proof of principle for using MENA to discover novel mutations, we report identification of a novel allele of the beethoven (btv) gene in Drosophila, which encodes a ciliary cytoplasmic dynein motor protein important for auditory mechanosensation.

  5. Landmark Image Retrieval Using Visual Synonyms

    NARCIS (Netherlands)

    Gavves, E.; Snoek, C.G.M.

    2010-01-01

    In this paper, we consider the incoherence problem of the visual words in bag-of-words vocabularies. Different from existing work, which performs assignment of words based solely on closeness in descriptor space, we focus on identifying pairs of independent, distant words - the visual synonyms -

  6. WASP: a Web-based Allele-Specific PCR assay designing tool for detecting SNPs and mutations

    Directory of Open Access Journals (Sweden)

    Assawamakin Anunchai

    2007-08-01

    Full Text Available Abstract Background Allele-specific (AS Polymerase Chain Reaction is a convenient and inexpensive method for genotyping Single Nucleotide Polymorphisms (SNPs and mutations. It is applied in many recent studies including population genetics, molecular genetics and pharmacogenomics. Using known AS primer design tools to create primers leads to cumbersome process to inexperience users since information about SNP/mutation must be acquired from public databases prior to the design. Furthermore, most of these tools do not offer the mismatch enhancement to designed primers. The available web applications do not provide user-friendly graphical input interface and intuitive visualization of their primer results. Results This work presents a web-based AS primer design application called WASP. This tool can efficiently design AS primers for human SNPs as well as mutations. To assist scientists with collecting necessary information about target polymorphisms, this tool provides a local SNP database containing over 10 million SNPs of various populations from public domain databases, namely NCBI dbSNP, HapMap and JSNP respectively. This database is tightly integrated with the tool so that users can perform the design for existing SNPs without going off the site. To guarantee specificity of AS primers, the proposed system incorporates a primer specificity enhancement technique widely used in experiment protocol. In particular, WASP makes use of different destabilizing effects by introducing one deliberate 'mismatch' at the penultimate (second to last of the 3'-end base of AS primers to improve the resulting AS primers. Furthermore, WASP offers graphical user interface through scalable vector graphic (SVG draw that allow users to select SNPs and graphically visualize designed primers and their conditions. Conclusion WASP offers a tool for designing AS primers for both SNPs and mutations. By integrating the database for known SNPs (using gene ID or rs number

  7. THE MIGHT OF RUSSIAN LANGUAGE ACCORDING TO SYNONYMIC DICTIONARY BY COMPUTER EVALUATION SYSTEM ASIS®

    Directory of Open Access Journals (Sweden)

    Vitaly N. Trishin

    2013-01-01

    Full Text Available The article describes electronic dictionary of synonyms in Russian language by ASIS® system (more than 500 000 words and collocations, 190 000 synonymic connections.The program can be used not just as a dictionary of synonyms and close meaning words, but also as spelling dictionary and definition dictionary of Russian language in order to check the orthography and define the meaning of unknown words. The dictionary is also designed to be an instrument of philological surveys and studies of the language trough the extensive query system on different characteristic of words (definition, composition, synonymy, etc.. Program’s lexical base includes words from dictionaries and guides in all subject areas - from astronomy to Japanese painting. Over compilation of dictionary developer used published dictionaries: spelling, synonymic, definition dictionaries, dictionary of collocations, dictionary of foreign words and etc. of 19-21 cc. Newspapers, magazines and web-resources were active used as well for appending the dictionary. This dictionary practically shows, that by the amount of words Russian language belongs with the most developed languages in the world, and by the scale and density of synonymic space, in the author’s opinion, it has no equal.

  8. Differential DNA methylation profiles of coding and non-coding genes define hippocampal sclerosis in human temporal lobe epilepsy

    Science.gov (United States)

    Miller-Delaney, Suzanne F.C.; Bryan, Kenneth; Das, Sudipto; McKiernan, Ross C.; Bray, Isabella M.; Reynolds, James P.; Gwinn, Ryder; Stallings, Raymond L.

    2015-01-01

    Temporal lobe epilepsy is associated with large-scale, wide-ranging changes in gene expression in the hippocampus. Epigenetic changes to DNA are attractive mechanisms to explain the sustained hyperexcitability of chronic epilepsy. Here, through methylation analysis of all annotated C-phosphate-G islands and promoter regions in the human genome, we report a pilot study of the methylation profiles of temporal lobe epilepsy with or without hippocampal sclerosis. Furthermore, by comparative analysis of expression and promoter methylation, we identify methylation sensitive non-coding RNA in human temporal lobe epilepsy. A total of 146 protein-coding genes exhibited altered DNA methylation in temporal lobe epilepsy hippocampus (n = 9) when compared to control (n = 5), with 81.5% of the promoters of these genes displaying hypermethylation. Unique methylation profiles were evident in temporal lobe epilepsy with or without hippocampal sclerosis, in addition to a common methylation profile regardless of pathology grade. Gene ontology terms associated with development, neuron remodelling and neuron maturation were over-represented in the methylation profile of Watson Grade 1 samples (mild hippocampal sclerosis). In addition to genes associated with neuronal, neurotransmitter/synaptic transmission and cell death functions, differential hypermethylation of genes associated with transcriptional regulation was evident in temporal lobe epilepsy, but overall few genes previously associated with epilepsy were among the differentially methylated. Finally, a panel of 13, methylation-sensitive microRNA were identified in temporal lobe epilepsy including MIR27A, miR-193a-5p (MIR193A) and miR-876-3p (MIR876), and the differential methylation of long non-coding RNA documented for the first time. The present study therefore reports select, genome-wide DNA methylation changes in human temporal lobe epilepsy that may contribute to the molecular architecture of the epileptic brain. PMID

  9. The association between individual SNPs or haplotypes of matrix metalloproteinase 1 and gastric cancer susceptibility, progression and prognosis.

    Directory of Open Access Journals (Sweden)

    Yong-Xi Song

    Full Text Available BACKGROUND: The single nucleotide polymorphisms (SNPs in matrix metalloproteinase 1(MMP-1 play important roles in some cancers. This study examined the associations between individual SNPs or haplotypes in MMP-1 and susceptibility, clinicopathological parameters and prognosis of gastric cancer in a large sample of the Han population in northern China. METHODS: In this case-controlled study, there were 404 patients with gastric cancer and 404 healthy controls. Seven SNPs were genotyped using the MALDI-TOF MS system. Then, SPSS software, Haploview 4.2 software, Haplo.states software and THEsias software were used to estimate the association between individual SNPs or haplotypes of MMP-1 and gastric cancer susceptibility, progression and prognosis. RESULTS: Among seven SNPs, there were no individual SNPs correlated to gastric cancer risk. Moreover, only the rs470206 genotype had a correlation with histologic grades, and the patients with GA/AA had well cell differentiation compared to the patients with genotype GG (OR=0.573; 95%CI: 0.353-0.929; P=0.023. Then, we constructed a four-marker haplotype block that contained 4 common haplotypes: TCCG, GCCG, TTCG and TTTA. However, all four common haplotypes had no correlation with gastric cancer risk and we did not find any relationship between these haplotypes and clinicopathological parameters in gastric cancer. Furthermore, neither individual SNPs nor haplotypes had an association with the survival of patients with gastric cancer. CONCLUSIONS: This study evaluated polymorphisms of the MMP-1 gene in gastric cancer with a MALDI-TOF MS method in a large northern Chinese case-controlled cohort. Our results indicated that these seven SNPs of MMP-1 might not be useful as significant markers to predict gastric cancer susceptibility, progression or prognosis, at least in the Han population in northern China.

  10. Non-binary unitary error bases and quantum codes

    Energy Technology Data Exchange (ETDEWEB)

    Knill, E.

    1996-06-01

    Error operator bases for systems of any dimension are defined and natural generalizations of the bit-flip/ sign-change error basis for qubits are given. These bases allow generalizing the construction of quantum codes based on eigenspaces of Abelian groups. As a consequence, quantum codes can be constructed form linear codes over {ital Z}{sub {ital n}} for any {ital n}. The generalization of the punctured code construction leads to many codes which permit transversal (i.e. fault tolerant) implementations of certain operations compatible with the error basis.

  11. Polarization-multiplexed rate-adaptive non-binary-quasi-cyclic-LDPC-coded multilevel modulation with coherent detection for optical transport networks.

    Science.gov (United States)

    Arabaci, Murat; Djordjevic, Ivan B; Saunders, Ross; Marcoccia, Roberto M

    2010-02-01

    In order to achieve high-speed transmission over optical transport networks (OTNs) and maximize its throughput, we propose using a rate-adaptive polarization-multiplexed coded multilevel modulation with coherent detection based on component non-binary quasi-cyclic (QC) LDPC codes. Compared to prior-art bit-interleaved LDPC-coded modulation (BI-LDPC-CM) scheme, the proposed non-binary LDPC-coded modulation (NB-LDPC-CM) scheme not only reduces latency due to symbol- instead of bit-level processing but also provides either impressive reduction in computational complexity or striking improvements in coding gain depending on the constellation size. As the paper presents, compared to its prior-art binary counterpart, the proposed NB-LDPC-CM scheme addresses the needs of future OTNs, which are achieving the target BER performance and providing maximum possible throughput both over the entire lifetime of the OTN, better.

  12. High-throughput SNP genotyping: combining tag SNPs and molecular beacons

    CSIR Research Space (South Africa)

    Barreiro, LB

    2009-10-01

    Full Text Available In the last decade, molecular beacons have emerged to become a widely used tool in the multiplex typing of single nucleotide polymorphisms (SNPs). Improvements in detection technologies in instrumentation and chemistries to label these probes have...

  13. NPTFit: A Code Package for Non-Poissonian Template Fitting

    International Nuclear Information System (INIS)

    Mishra-Sharma, Siddharth; Rodd, Nicholas L.; Safdi, Benjamin R.

    2017-01-01

    We present NPTFit, an open-source code package, written in Python and Cython, for performing non-Poissonian template fits (NPTFs). The NPTF is a recently developed statistical procedure for characterizing the contribution of unresolved point sources (PSs) to astrophysical data sets. The NPTF was first applied to Fermi gamma-ray data to provide evidence that the excess of ∼GeV gamma-rays observed in the inner regions of the Milky Way likely arises from a population of sub-threshold point sources, and the NPTF has since found additional applications studying sub-threshold extragalactic sources at high Galactic latitudes. The NPTF generalizes traditional astrophysical template fits to allow for the ability to search for populations of unresolved PSs that may follow a given spatial distribution. NPTFit builds upon the framework of the fluctuation analyses developed in X-ray astronomy, thus it likely has applications beyond those demonstrated with gamma-ray data. The NPTFit package utilizes novel computational methods to perform the NPTF efficiently. The code is available at http://github.com/bsafdi/NPTFit and up-to-date and extensive documentation may be found at http://nptfit.readthedocs.io.

  14. NPTFit: A Code Package for Non-Poissonian Template Fitting

    Energy Technology Data Exchange (ETDEWEB)

    Mishra-Sharma, Siddharth [Department of Physics, Princeton University, Princeton, NJ 08544 (United States); Rodd, Nicholas L.; Safdi, Benjamin R., E-mail: smsharma@princeton.edu, E-mail: nrodd@mit.edu, E-mail: bsafdi@mit.edu [Center for Theoretical Physics, Massachusetts Institute of Technology, Cambridge, MA 02139 (United States)

    2017-06-01

    We present NPTFit, an open-source code package, written in Python and Cython, for performing non-Poissonian template fits (NPTFs). The NPTF is a recently developed statistical procedure for characterizing the contribution of unresolved point sources (PSs) to astrophysical data sets. The NPTF was first applied to Fermi gamma-ray data to provide evidence that the excess of ∼GeV gamma-rays observed in the inner regions of the Milky Way likely arises from a population of sub-threshold point sources, and the NPTF has since found additional applications studying sub-threshold extragalactic sources at high Galactic latitudes. The NPTF generalizes traditional astrophysical template fits to allow for the ability to search for populations of unresolved PSs that may follow a given spatial distribution. NPTFit builds upon the framework of the fluctuation analyses developed in X-ray astronomy, thus it likely has applications beyond those demonstrated with gamma-ray data. The NPTFit package utilizes novel computational methods to perform the NPTF efficiently. The code is available at http://github.com/bsafdi/NPTFit and up-to-date and extensive documentation may be found at http://nptfit.readthedocs.io.

  15. Evolutionary analysis reveals regulatory and functional landscape of coding and non-coding RNA editing.

    Science.gov (United States)

    Zhang, Rui; Deng, Patricia; Jacobson, Dionna; Li, Jin Billy

    2017-02-01

    Adenosine-to-inosine RNA editing diversifies the transcriptome and promotes functional diversity, particularly in the brain. A plethora of editing sites has been recently identified; however, how they are selected and regulated and which are functionally important are largely unknown. Here we show the cis-regulation and stepwise selection of RNA editing during Drosophila evolution and pinpoint a large number of functional editing sites. We found that the establishment of editing and variation in editing levels across Drosophila species are largely explained and predicted by cis-regulatory elements. Furthermore, editing events that arose early in the species tree tend to be more highly edited in clusters and enriched in slowly-evolved neuronal genes, thus suggesting that the main role of RNA editing is for fine-tuning neurological functions. While nonsynonymous editing events have been long recognized as playing a functional role, in addition to nonsynonymous editing sites, a large fraction of 3'UTR editing sites is evolutionarily constrained, highly edited, and thus likely functional. We find that these 3'UTR editing events can alter mRNA stability and affect miRNA binding and thus highlight the functional roles of noncoding RNA editing. Our work, through evolutionary analyses of RNA editing in Drosophila, uncovers novel insights of RNA editing regulation as well as its functions in both coding and non-coding regions.

  16. Sequence-based heuristics for faster annotation of non-coding RNA families.

    Science.gov (United States)

    Weinberg, Zasha; Ruzzo, Walter L

    2006-01-01

    Non-coding RNAs (ncRNAs) are functional RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are extremely slow. Previously, we created rigorous filters, which provably sacrifice none of a CM's accuracy, while making searches significantly faster for virtually all ncRNA families. However, these rigorous filters make searches slower than heuristics could be. In this paper we introduce profile HMM-based heuristic filters. We show that their accuracy is usually superior to heuristics based on BLAST. Moreover, we compared our heuristics with those used in tRNAscan-SE, whose heuristics incorporate a significant amount of work specific to tRNAs, where our heuristics are generic to any ncRNA. Performance was roughly comparable, so we expect that our heuristics provide a high-quality solution that--unlike family-specific solutions--can scale to hundreds of ncRNA families. The source code is available under GNU Public License at the supplementary web site.

  17. Fragile X mental retardation protein participates in non-coding RNA pathways.

    Science.gov (United States)

    Li, En-Hui; Zhao, Xin; Zhang, Ce; Liu, Wei

    2018-02-20

    Fragile X syndrome is one of the most common forms of inherited intellectual disability. It is caused by mutations of the Fragile X mental retardation 1(FMR1) gene, resulting in either the loss or abnormal expression of the Fragile X mental retardation protein (FMRP). Recent research showed that FMRP participates in non-coding RNA pathways and plays various important roles in physiology, thereby extending our knowledge of the pathogenesis of the Fragile X syndrome. Initial studies showed that the Drosophila FMRP participates in siRNA and miRNA pathways by interacting with Dicer, Ago1 and Ago2, involved in neural activity and the fate determination of the germline stem cells. Subsequent studies showed that the Drosophila FMRP participates in piRNA pathway by interacting with Aub, Ago1 and Piwi in the maintenance of normal chromatin structures and genomic stability. More recent studies showed that FMRP is associated with lncRNA pathway, suggesting a potential role for the involvement in the clinical manifestations. In this review, we summarize the novel findings and explore the relationship between FMRP and non-coding RNA pathways, particularly the piRNA pathway, thereby providing critical insights on the molecular pathogenesis of Fragile X syndrome, and potential translational applications in clinical management of the disease.

  18. CRNDE: a long non-coding RNA involved in CanceR, Neurobiology and DEvelopment

    Directory of Open Access Journals (Sweden)

    Blake C. Ellis

    2012-11-01

    Full Text Available CRNDE is the gene symbol for Colorectal Neoplasia Differentially Expressed (non protein-coding, a long non-coding RNA (lncRNA gene that expresses multiple splice variants and displays a very tissue-specific pattern of expression. CRNDE was initially identified as a lncRNA whose expression is highly elevated in colorectal cancer, but it is also upregulated in many other solid tumors and in leukemias. Indeed, CRNDE is the most upregulated lncRNA in gliomas and here, as in other cancers, it is associated with a stemness signature. CRNDE is expressed in specific regions within the human and mouse brain; the mouse ortholog is high in induced pluripotent stem cells and increases further during neuronal differentiation. We suggest that CRNDE is a multifunctional lncRNA whose different splice forms provide specific functional scaffolds for regulatory complexes, such as the polycomb repressive complex 2 (PRC2 and CoREST chromatin-modifying complexes, which CRNDE helps pilot to target genes.

  19. Experimental benchmark of non-local-thermodynamic-equilibrium plasma atomic physics codes

    International Nuclear Information System (INIS)

    Nagels-Silvert, V.

    2004-09-01

    The main purpose of this thesis is to get experimental data for the testing and validation of atomic physics codes dealing with non-local-thermodynamical-equilibrium plasmas. The first part is dedicated to the spectroscopic study of xenon and krypton plasmas that have been produced by a nanosecond laser pulse interacting with a gas jet. A Thomson scattering diagnostic has allowed us to measure independently plasma parameters such as electron temperature, electron density and the average ionisation state. We have obtained time integrated spectra in the range between 5 and 10 angstroms. We have identified about one hundred xenon rays between 8.6 and 9.6 angstroms via the use of the Relac code. We have discovered unknown rays for the krypton between 5.2 and 7.5 angstroms. In a second experiment we have extended the wavelength range to the X UV domain. The Averroes/Transpec code has been tested in the ranges from 9 to 15 angstroms and from 10 to 130 angstroms, the first range has been well reproduced while the second range requires a more complex data analysis. The second part is dedicated to the spectroscopic study of aluminium, selenium and samarium plasmas in femtosecond operating rate. We have designed an interferometry diagnostic in the frequency domain that has allowed us to measure the expanding speed of the target's backside. Via the use of an adequate isothermal model this parameter has led us to know the plasma electron temperature. Spectra and emission times of various rays from the aluminium and selenium plasmas have been computed satisfactorily with the Averroes/Transpec code coupled with Film and Multif hydrodynamical codes. (A.C.)

  20. Investigation of genes coding for inflammatory components in Parkinson's disease.

    Science.gov (United States)

    Håkansson, Anna; Westberg, Lars; Nilsson, Staffan; Buervenich, Silvia; Carmine, Andrea; Holmberg, Björn; Sydow, Olof; Olson, Lars; Johnels, Bo; Eriksson, Elias; Nissbrandt, Hans

    2005-05-01

    Several findings obtained recently indicate that inflammation may contribute to the pathogenesis in Parkinson's disease (PD). Genetic variants of genes coding for components involved in immune reactions in the brain might therefore influence the risk of developing PD or the age of disease onset. Five single nucleotide polymorphisms (SNPs) in the genes coding for interferon-gamma (IFN-gamma; T874A in intron 1), interferon-gamma receptor 2 (IFN-gamma R2; Gln64Arg), interleukin-10 (IL-10; G1082A in the promoter region), platelet-activating factor acetylhydrolase (PAF-AH; Val379Ala), and intercellular adhesion molecule 1 (ICAM-1; Lys469Glu) were genotyped, using pyrosequencing, in 265 patients with PD and 308 controls. None of the investigated SNPs was found to be associated with PD; however, the G1082A polymorphism in the IL-10 gene promoter was found to be related to the age of disease onset. Linear regression showed a significantly earlier onset with more A-alleles (P = 0.0095; after Bonferroni correction, P = 0.048), resulting in a 5-year delayed age of onset of the disease for individuals having two G-alleles compared with individuals having two A-alleles. The results indicate that the IL-10 G1082A SNP could possibly be related to the age of onset of PD. Copyright 2005 Movement Disorder Society.

  1. Visual synonyms for landmark image retrieval

    NARCIS (Netherlands)

    Gavves, E.; Snoek, C.G.M.; Smeulders, A.W.M.

    2012-01-01

    In this paper, we address the incoherence problem of the visual words in bag-of-words vocabularies. Different from existing work, which assigns words based on closeness in descriptor space, we focus on identifying pairs of independent, distant words - the visual synonyms - that are likely to host

  2. Non-coding RNAs in endometriosis: a narrative review.

    Science.gov (United States)

    Panir, Kavita; Schjenken, John E; Robertson, Sarah A; Hull, M Louise

    2018-04-25

    Endometriosis is a benign gynaecological disorder, which affects 10% of reproductive-aged women and is characterized by endometrial cells from the lining of the uterus being found outside the uterine cavity. However, the pathophysiological mechanisms causing the development of this heterogeneous disease remain enigmatic, and a lack of effective biomarkers necessitates surgical intervention for diagnosis. There is international recognition that accurate non-invasive diagnostic tests and more effective therapies are urgently needed. Non-coding RNA (ncRNA) molecules, which are important regulators of cellular function, have been implicated in many chronic conditions. In endometriosis, transcriptome profiling of tissue samples and functional in vivo and in vitro studies demonstrate that ncRNAs are key contributors to the disease process. In this review, we outline the biogenesis of various ncRNAs relevant to endometriosis and then summarize the evidence indicating their roles in regulatory pathways that govern disease establishment and progression. Articles from 2000 to 2016 were selected for relevance, validity and quality, from results obtained in PubMed, MEDLINE and Google Scholar using the following search terms: ncRNA and reproduction; ncRNA and endometriosis; miRNA and endometriosis; lncRNA and endometriosis; siRNA and endometriosis; endometriosis; endometrial; cervical; ovary; uterus; reproductive tract. All articles were independently screened for eligibility by the authors. This review integrates extensive information from all relevant published studies focusing on microRNAs, long ncRNAs and short inhibitory RNAs in endometriosis. We outline the biological function and synthesis of microRNAs, long ncRNAs and short inhibitory RNAs and provide detailed findings from human research as well as functional studies carried out both in vitro and in vivo, including animal models. Although variability in findings between individual studies exists, collectively, the

  3. Non-standard model for electron heat transport for multidimensional hydrodynamic codes

    Energy Technology Data Exchange (ETDEWEB)

    Nicolai, Ph.; Busquet, M.; Schurtz, G. [CEA/DAM-Ile de France, 91 - Bruyeres Le Chatel (France)

    2000-07-01

    In simulations of laser-produced plasma, modeling of heat transport requires an artificial limitation of standard Spitzer-Haerm fluxes. To improve heat conduction processing, we have developed a multidimensional model which accounts for non-local features of heat transport and effects of self-generated magnetic fields. This consistent treatment of both mechanisms has been implemented in a two-dimensional radiation-hydrodynamic code. First results indicate good agreements between simulations and experimental data. (authors)

  4. Non-standard model for electron heat transport for multidimensional hydrodynamic codes

    International Nuclear Information System (INIS)

    Nicolai, Ph.; Busquet, M.; Schurtz, G.

    2000-01-01

    In simulations of laser-produced plasma, modeling of heat transport requires an artificial limitation of standard Spitzer-Haerm fluxes. To improve heat conduction processing, we have developed a multidimensional model which accounts for non-local features of heat transport and effects of self-generated magnetic fields. This consistent treatment of both mechanisms has been implemented in a two-dimensional radiation-hydrodynamic code. First results indicate good agreements between simulations and experimental data. (authors)

  5. Four new synonyms and a new combination in Parnassia (Celastraceae).

    Science.gov (United States)

    Shu, Yumin; Zhang, Zhixiang

    2017-01-01

    Parnassia yunnanensis had been previously described based on mixed specimens containing materials partially belonging to Parnassia cacuminum , which makes the application of Parnassia yunnanensis ambiguous. Therefore, we lectotypified Parnassia yunnanensis and meanwhile synonymized Parnassia lanceolata var. oblongipetala under it. Parnassia yunnanensis var. longistipitata was found more similar to Parnassia cacuminum rather than Parnassia yunnanensis , thus a new combination, Parnassia cacuminum var. longistipitata comb. nov. was proposed. Furthermore, other three names ( Parnassia vevusta , Parnassia degeensis and Parnassia kangdingensis ) were reduced to synonyms of Parnassia cacuminum too.

  6. Genomic Footprints in Selected and Unselected Beef Cattle Breeds in Korea.

    Directory of Open Access Journals (Sweden)

    Dajeong Lim

    Full Text Available Korean Hanwoo cattle have been subjected to intensive artificial selection over the past four decades to improve meat production traits. Another three cattle varieties very closely related to Hanwoo reside in Korea (Jeju Black and Brindle and in China (Yanbian. These breeds have not been part of a breeding scheme to improve production traits. Here, we compare the selected Hanwoo against these similar but presumed to be unselected populations to identify genomic regions that have been under recent selection pressure due to the breeding program. Rsb statistics were used to contrast the genomes of Hanwoo versus a pooled sample of the three unselected population (UN. We identified 37 significant SNPs (FDR corrected in the HW/UN comparison and 21 known protein coding genes were within 1 MB to the identified SNPs. These genes were previously reported to affect traits important for meat production (14 genes, reproduction including mammary gland development (3 genes, coat color (2 genes, and genes affecting behavioral traits in a broader sense (2 genes. We subsequently sequenced (Illumina HiSeq 2000 platform 10 individuals of the brown Hanwoo and the Chinese Yanbian to identify SNPs within the candidate genomic regions. Based on allele frequency differences, haplotype structures, and literature research, we singled out one non-synonymous SNP in the APP gene (APP: c.569C>T, Ala199Val and predicted the mutational effect on the protein structure. We found that protein-protein interactions might be impaired due to increased exposed hydrophobic surfaces of the mutated protein. The APP gene has also been reported to affect meat tenderness in pigs and obesity in humans. Meat tenderness has been linked to intramuscular fat content, which is one of the main breeding goals for brown Hanwoo, potentially supporting a causal influence of the herein described nsSNP in the APP gene.

  7. Genomic Footprints in Selected and Unselected Beef Cattle Breeds in Korea.

    Science.gov (United States)

    Lim, Dajeong; Strucken, Eva M; Choi, Bong Hwan; Chai, Han Ha; Cho, Yong Min; Jang, Gul Won; Kim, Tae-Hun; Gondro, Cedric; Lee, Seung Hwan

    2016-01-01

    Korean Hanwoo cattle have been subjected to intensive artificial selection over the past four decades to improve meat production traits. Another three cattle varieties very closely related to Hanwoo reside in Korea (Jeju Black and Brindle) and in China (Yanbian). These breeds have not been part of a breeding scheme to improve production traits. Here, we compare the selected Hanwoo against these similar but presumed to be unselected populations to identify genomic regions that have been under recent selection pressure due to the breeding program. Rsb statistics were used to contrast the genomes of Hanwoo versus a pooled sample of the three unselected population (UN). We identified 37 significant SNPs (FDR corrected) in the HW/UN comparison and 21 known protein coding genes were within 1 MB to the identified SNPs. These genes were previously reported to affect traits important for meat production (14 genes), reproduction including mammary gland development (3 genes), coat color (2 genes), and genes affecting behavioral traits in a broader sense (2 genes). We subsequently sequenced (Illumina HiSeq 2000 platform) 10 individuals of the brown Hanwoo and the Chinese Yanbian to identify SNPs within the candidate genomic regions. Based on allele frequency differences, haplotype structures, and literature research, we singled out one non-synonymous SNP in the APP gene (APP: c.569C>T, Ala199Val) and predicted the mutational effect on the protein structure. We found that protein-protein interactions might be impaired due to increased exposed hydrophobic surfaces of the mutated protein. The APP gene has also been reported to affect meat tenderness in pigs and obesity in humans. Meat tenderness has been linked to intramuscular fat content, which is one of the main breeding goals for brown Hanwoo, potentially supporting a causal influence of the herein described nsSNP in the APP gene.

  8. Long non-coding RNAs and mRNAs profiling during spleen development in pig.

    Science.gov (United States)

    Che, Tiandong; Li, Diyan; Jin, Long; Fu, Yuhua; Liu, Yingkai; Liu, Pengliang; Wang, Yixin; Tang, Qianzi; Ma, Jideng; Wang, Xun; Jiang, Anan; Li, Xuewei; Li, Mingzhou

    2018-01-01

    Genome-wide transcriptomic studies in humans and mice have become extensive and mature. However, a comprehensive and systematic understanding of protein-coding genes and long non-coding RNAs (lncRNAs) expressed during pig spleen development has not been achieved. LncRNAs are known to participate in regulatory networks for an array of biological processes. Here, we constructed 18 RNA libraries from developing fetal pig spleen (55 days before birth), postnatal pig spleens (0, 30, 180 days and 2 years after birth), and the samples from the 2-year-old Wild Boar. A total of 15,040 lncRNA transcripts were identified among these samples. We found that the temporal expression pattern of lncRNAs was more restricted than observed for protein-coding genes. Time-series analysis showed two large modules for protein-coding genes and lncRNAs. The up-regulated module was enriched for genes related to immune and inflammatory function, while the down-regulated module was enriched for cell proliferation processes such as cell division and DNA replication. Co-expression networks indicated the functional relatedness between protein-coding genes and lncRNAs, which were enriched for similar functions over the series of time points examined. We identified numerous differentially expressed protein-coding genes and lncRNAs in all five developmental stages. Notably, ceruloplasmin precursor (CP), a protein-coding gene participating in antioxidant and iron transport processes, was differentially expressed in all stages. This study provides the first catalog of the developing pig spleen, and contributes to a fuller understanding of the molecular mechanisms underpinning mammalian spleen development.

  9. NONCODE v2.0: decoding the non-coding.

    Science.gov (United States)

    He, Shunmin; Liu, Changning; Skogerbø, Geir; Zhao, Haitao; Wang, Jie; Liu, Tao; Bai, Baoyan; Zhao, Yi; Chen, Runsheng

    2008-01-01

    The NONCODE database is an integrated knowledge database designed for the analysis of non-coding RNAs (ncRNAs). Since NONCODE was first released 3 years ago, the number of known ncRNAs has grown rapidly, and there is growing recognition that ncRNAs play important regulatory roles in most organisms. In the updated version of NONCODE (NONCODE v2.0), the number of collected ncRNAs has reached 206 226, including a wide range of microRNAs, Piwi-interacting RNAs and mRNA-like ncRNAs. The improvements brought to the database include not only new and updated ncRNA data sets, but also an incorporation of BLAST alignment search service and access through our custom UCSC Genome Browser. NONCODE can be found under http://www.noncode.org or http://noncode.bioinfo.org.cn.

  10. Ontological function annotation of long non-coding RNAs through hierarchical multi-label classification.

    Science.gov (United States)

    Zhang, Jingpu; Zhang, Zuping; Wang, Zixiang; Liu, Yuting; Deng, Lei

    2018-05-15

    Long non-coding RNAs (lncRNAs) are an enormous collection of functional non-coding RNAs. Over the past decades, a large number of novel lncRNA genes have been identified. However, most of the lncRNAs remain function uncharacterized at present. Computational approaches provide a new insight to understand the potential functional implications of lncRNAs. Considering that each lncRNA may have multiple functions and a function may be further specialized into sub-functions, here we describe NeuraNetL2GO, a computational ontological function prediction approach for lncRNAs using hierarchical multi-label classification strategy based on multiple neural networks. The neural networks are incrementally trained level by level, each performing the prediction of gene ontology (GO) terms belonging to a given level. In NeuraNetL2GO, we use topological features of the lncRNA similarity network as the input of the neural networks and employ the output results to annotate the lncRNAs. We show that NeuraNetL2GO achieves the best performance and the overall advantage in maximum F-measure and coverage on the manually annotated lncRNA2GO-55 dataset compared to other state-of-the-art methods. The source code and data are available at http://denglab.org/NeuraNetL2GO/. leideng@csu.edu.cn. Supplementary data are available at Bioinformatics online.

  11. Examining Method Effect of Synonym and Antonym Test in Verbal Abilities Measure

    Directory of Open Access Journals (Sweden)

    Wahyu Widhiarso

    2015-08-01

    Full Text Available Many researchers have assumed that different methods could be substituted to measure the same attributes in assessment. Various models have been developed to accommodate the amount of variance attributable to the methods but these models application in empirical research is rare. The present study applied one of those models to examine whether method effects were presents in synonym and antonym tests. Study participants were 3,469 applicants to graduate school. The instrument used was the Graduate Academic Potential Test (PAPS, which includes synonym and antonym questions to measure verbal abilities. Our analysis showed that measurement models that using correlated trait–correlated methods minus one, CT-C(M–1, that separated trait and method effect into distinct latent constructs yielded slightly better values for multiple goodness-of-fit indices than one factor model. However, either for the synonym or antonym items, the proportion of variance accounted for by the method is smaller than trait variance. The correlation between factor scores of both methods is high (r = 0.994. These findings confirm that synonym and antonym tests represent the same attribute so that both tests cannot be treated as two unique methods for measuring verbal ability.

  12. Examining Method Effect of Synonym and Antonym Test in Verbal Abilities Measure.

    Science.gov (United States)

    Widhiarso, Wahyu; Haryanta

    2015-08-01

    Many researchers have assumed that different methods could be substituted to measure the same attributes in assessment. Various models have been developed to accommodate the amount of variance attributable to the methods but these models application in empirical research is rare. The present study applied one of those models to examine whether method effects were presents in synonym and antonym tests. Study participants were 3,469 applicants to graduate school. The instrument used was the Graduate Academic Potential Test (PAPS), which includes synonym and antonym questions to measure verbal abilities. Our analysis showed that measurement models that using correlated trait-correlated methods minus one, CT-C(M-1), that separated trait and method effect into distinct latent constructs yielded slightly better values for multiple goodness-of-fit indices than one factor model. However, either for the synonym or antonym items, the proportion of variance accounted for by the method is smaller than trait variance. The correlation between factor scores of both methods is high (r = 0.994). These findings confirm that synonym and antonym tests represent the same attribute so that both tests cannot be treated as two unique methods for measuring verbal ability.

  13. On the Comparative Performance Analysis of Turbo-Coded Non-Ideal Sigle-Carrier and Multi-Carrier Waveforms over Wideband Vogler-Hoffmeyer HF Channels

    Directory of Open Access Journals (Sweden)

    F. Genc

    2014-09-01

    Full Text Available The purpose of this paper is to compare the turbo-coded Orthogonal Frequency Division Multiplexing (OFDM and turbo-coded Single Carrier Frequency Domain Equalization (SC-FDE systems under the effects of Carrier Frequency Offset (CFO, Symbol Timing Offset (STO and phase noise in wide-band Vogler-Hoffmeyer HF channel model. In mobile communication systems multi-path propagation occurs. Therefore channel estimation and equalization is additionally necessary. Furthermore a non-ideal local oscillator generally is misaligned with the operating frequency at the receiver. This causes carrier frequency offset. Hence in coded SC-FDE and coded OFDM systems; a very efficient, low complex frequency domain channel estimation and equalization is implemented in this paper. Also Cyclic Prefix (CP based synchronization synchronizes the clock and carrier frequency offset.The simulations show that non-ideal turbo-coded OFDM has better performance with greater diversity than non-ideal turbo-coded SC-FDE system in HF channel.

  14. Four new synonyms and a new combination in Parnassia (Celastraceae

    Directory of Open Access Journals (Sweden)

    Yumin Shu

    2017-03-01

    Full Text Available Parnassia yunnanensis had been previously described based on mixed specimens containing materials partially belonging to P. cacuminum, which makes the application of P. yunnanensis ambiguous. Therefore, we lectotypified P. yunnanensis and meanwhile synonymized P. lanceolata var. oblongipetala under it. P. yunnanensis var. longistipitata was found more similar to P. cacuminum rather than P. yunnanensis, thus a new combination, P. cacuminum var. longistipitata comb. nov. was proposed. Furthermore, other three names (P. vevusta, P. degeensis and P. kangdingensis were reduced to synonyms of P. cacuminum too.

  15. Integration and visualization of non-coding RNA and protein interaction networks

    OpenAIRE

    Junge, Alexander; Refsgaard, Jan Christian; Garde, Christian; Pan, Xiaoyong; Santos Delgado, Alberto; Anthon, Christian; Alkan, Ferhat; von Mering, Christian; Workman, Christopher; Jensen, Lars Juhl; Gorodkin, Jan

    2015-01-01

    Non-coding RNAs (ncRNAs) fulfill a diverse set of biological functions relying on interactions with other molecular entities. The advent of new experimental and computational approaches makes it possible to study ncRNAs and their associations on an unprecedented scale. We present RAIN (RNA Association and Interaction Networks) - a database that combines ncRNA-ncRNA, ncRNA-mRNA and ncRNA-protein interactions with large-scale protein association networks available in the STRING database. By int...

  16. Identificação de SNPs para conteúdo de ácidos graxos em soja pela técnica HRM

    Directory of Open Access Journals (Sweden)

    Maria Fernanda Antunes da Cruz

    2013-12-01

    Full Text Available O objetivo deste trabalho foi identificar SNPs em genes associados ao conteúdo de ácidos graxos em soja e implementar a metodologia "high resolution melting" (HRM para genotipagem desses SNPs. Os iniciadores HRM foram desenhados para discriminar os alelos SNPs em duas populações de mapeamento (RILs e F2 e seguiram o padrão esperado de segregação. Os SNPs do gene ABI associaram-se significativamente ao conteúdo de ácido esteárico (R² = 12,14, e os do gene FAD3B, aos conteúdos de ácido oleico (R² = 14,69 e linolênico (R² = 10,62. A técnica de genotipagem dos SNPs por HRM é eficiente na discriminação das classes genotípicas.

  17. Long Non-Coding RNAs: The Key Players in Glioma Pathogenesis

    Energy Technology Data Exchange (ETDEWEB)

    Kiang, Karrie Mei-Yee; Zhang, Xiao-Qin; Leung, Gilberto Ka-Kit, E-mail: gilberto@hku.hk [Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Queen Mary Hospital, Hong Kong (China)

    2015-07-29

    Long non-coding RNAs (LncRNAs) represent a novel class of RNAs with no functional protein-coding ability, yet it has become increasingly clear that interactions between lncRNAs with other molecules are responsible for important gene regulatory functions in various contexts. Given their relatively high expressions in the brain, lncRNAs are now thought to play important roles in normal brain development as well as diverse disease processes including gliomagenesis. Intriguingly, certain lncRNAs are closely associated with the initiation, differentiation, progression, recurrence and stem-like characteristics in glioma, and may therefore be exploited for the purposes of sub-classification, diagnosis and prognosis. LncRNAs may also serve as potential therapeutic targets as well as a novel biomarkers in the treatment of glioma. In this article, the functional aspects of lncRNAs, particularly within the central nervous system (CNS), will be briefly discussed, followed by highlights of the important roles of lncRNAs in mediating critical steps during glioma development. In addition, the key lncRNA players and their possible mechanistic pathways associated with gliomagenesis will be addressed.

  18. Long Non-Coding RNAs: The Key Players in Glioma Pathogenesis

    International Nuclear Information System (INIS)

    Kiang, Karrie Mei-Yee; Zhang, Xiao-Qin; Leung, Gilberto Ka-Kit

    2015-01-01

    Long non-coding RNAs (LncRNAs) represent a novel class of RNAs with no functional protein-coding ability, yet it has become increasingly clear that interactions between lncRNAs with other molecules are responsible for important gene regulatory functions in various contexts. Given their relatively high expressions in the brain, lncRNAs are now thought to play important roles in normal brain development as well as diverse disease processes including gliomagenesis. Intriguingly, certain lncRNAs are closely associated with the initiation, differentiation, progression, recurrence and stem-like characteristics in glioma, and may therefore be exploited for the purposes of sub-classification, diagnosis and prognosis. LncRNAs may also serve as potential therapeutic targets as well as a novel biomarkers in the treatment of glioma. In this article, the functional aspects of lncRNAs, particularly within the central nervous system (CNS), will be briefly discussed, followed by highlights of the important roles of lncRNAs in mediating critical steps during glioma development. In addition, the key lncRNA players and their possible mechanistic pathways associated with gliomagenesis will be addressed

  19. Control of ribosome traffic by position-dependent choice of synonymous codons

    DEFF Research Database (Denmark)

    Mitarai, Namiko; Pedersen, Steen

    2013-01-01

    Messenger RNA (mRNA) encodes a sequence of amino acids by using codons. For most amino acids, there are multiple synonymous codons that can encode the amino acid. The translation speed can vary from one codon to another, thus there is room for changing the ribosome speed while keeping the amino...... acid sequence and hence the resulting protein. Recently, it has been noticed that the choice of the synonymous codon, via the resulting distribution of slow- and fast-translated codons, affects not only on the average speed of one ribosome translating the mRNA but also might have an effect on nearby...... ribosomes by affecting the appearance of 'traffic jams' where multiple ribosomes collide and form queues. To test this 'context effect' further, we here investigate the effect of the sequence of synonymous codons on the ribosome traffic by using a ribosome traffic model with codon-dependent rates, estimated...

  20. LncRNAWiki: harnessing community knowledge in collaborative curation of human long non-coding RNAs

    KAUST Repository

    Ma, L.; Li, A.; Zou, D.; Xu, X.; Xia, L.; Yu, J.; Bajic, Vladimir B.; Zhang, Z.

    2014-01-01

    Long non-coding RNAs (lncRNAs) perform a diversity of functions in numerous important biological processes and are implicated in many human diseases. In this report we present lncRNAWiki (http://lncrna.big.ac.cn), a wiki-based platform that is open

  1. Genetic variation in eleven phase I drug metabolism genes in an ethnically diverse population.

    Science.gov (United States)

    Solus, Joseph F; Arietta, Brenda J; Harris, James R; Sexton, David P; Steward, John Q; McMunn, Chara; Ihrie, Patrick; Mehall, Janelle M; Edwards, Todd L; Dawson, Elliott P

    2004-10-01

    The extent of genetic variation found in drug metabolism genes and its contribution to interindividual variation in response to medication remains incompletely understood. To better determine the identity and frequency of variation in 11 phase I drug metabolism genes, the exons and flanking intronic regions of the cytochrome P450 (CYP) isoenzyme genes CYP1A1, CYP1A2, CYP2A6, CYP2B6, CYP2C8, CYP2C9, CYP2C19, CYP2D6, CYP2E1, CYP3A4 and CYP3A5 were amplified from genomic DNA and sequenced. A total of 60 kb of bi-directional sequence was generated from each of 93 human DNAs, which included Caucasian, African-American and Asian samples. There were 388 different polymorphisms identified. These included 269 non-coding, 45 synonymous and 74 non-synonymous polymorphisms. Of these, 54% were novel and included 176 non-coding, 14 synonymous and 21 non-synonymous polymorphisms. Of the novel variants observed, 85 were represented by single occurrences of the minor allele in the sample set. Much of the variation observed was from low-frequency alleles. Comparatively, these genes are variation-rich. Calculations measuring genetic diversity revealed that while the values for the individual genes are widely variable, the overall nucleotide diversity of 7.7 x 10(-4) and polymorphism parameter of 11.5 x 10(-4) are higher than those previously reported for other gene sets. Several independent measurements indicate that these genes are under selective pressure, particularly for polymorphisms corresponding to non-synonymous amino acid changes. There is relatively little difference in measurements of diversity among the ethnic groups, but there are large differences among the genes and gene subfamilies themselves. Of the three CYP subfamilies involved in phase I drug metabolism (1, 2, and 3), subfamily 2 displays the highest levels of genetic diversity.

  2. Evaluation of Agency Non-Code Layered Pressure Vessels (LPVs)

    Science.gov (United States)

    Prosser, William H.

    2014-01-01

    In coordination with the Office of Safety and Mission Assurance and the respective Center Pressure System Managers (PSMs), the NASA Engineering and Safety Center (NESC) was requested to formulate a consensus draft proposal for the development of additional testing and analysis methods to establish the technical validity, and any limitation thereof, for the continued safe operation of facility non-code layered pressure vessels. The PSMs from each NASA Center were asked to participate as part of the assessment team by providing, collecting, and reviewing data regarding current operations of these vessels. This report contains the outcome of the assessment and the findings, observations, and NESC recommendations to the Agency and individual NASA Centers.

  3. CONDOR: a database resource of developmentally associated conserved non-coding elements

    Directory of Open Access Journals (Sweden)

    Smith Sarah

    2007-08-01

    Full Text Available Abstract Background Comparative genomics is currently one of the most popular approaches to study the regulatory architecture of vertebrate genomes. Fish-mammal genomic comparisons have proved powerful in identifying conserved non-coding elements likely to be distal cis-regulatory modules such as enhancers, silencers or insulators that control the expression of genes involved in the regulation of early development. The scientific community is showing increasing interest in characterizing the function, evolution and language of these sequences. Despite this, there remains little in the way of user-friendly access to a large dataset of such elements in conjunction with the analysis and the visualization tools needed to study them. Description Here we present CONDOR (COnserved Non-coDing Orthologous Regions available at: http://condor.fugu.biology.qmul.ac.uk. In an interactive and intuitive way the website displays data on > 6800 non-coding elements associated with over 120 early developmental genes and conserved across vertebrates. The database regularly incorporates results of ongoing in vivo zebrafish enhancer assays of the CNEs carried out in-house, which currently number ~100. Included and highlighted within this set are elements derived from duplication events both at the origin of vertebrates and more recently in the teleost lineage, thus providing valuable data for studying the divergence of regulatory roles between paralogs. CONDOR therefore provides a number of tools and facilities to allow scientists to progress in their own studies on the function and evolution of developmental cis-regulation. Conclusion By providing access to data with an approachable graphics interface, the CONDOR database presents a rich resource for further studies into the regulation and evolution of genes involved in early development.

  4. A Bayesian Hierarchical Model for Relating Multiple SNPs within Multiple Genes to Disease Risk

    Directory of Open Access Journals (Sweden)

    Lewei Duan

    2013-01-01

    Full Text Available A variety of methods have been proposed for studying the association of multiple genes thought to be involved in a common pathway for a particular disease. Here, we present an extension of a Bayesian hierarchical modeling strategy that allows for multiple SNPs within each gene, with external prior information at either the SNP or gene level. The model involves variable selection at the SNP level through latent indicator variables and Bayesian shrinkage at the gene level towards a prior mean vector and covariance matrix that depend on external information. The entire model is fitted using Markov chain Monte Carlo methods. Simulation studies show that the approach is capable of recovering many of the truly causal SNPs and genes, depending upon their frequency and size of their effects. The method is applied to data on 504 SNPs in 38 candidate genes involved in DNA damage response in the WECARE study of second breast cancers in relation to radiotherapy exposure.

  5. Non-coding RNA may be associated with cytoplasmic male sterility in Silene vulgaris

    Czech Academy of Sciences Publication Activity Database

    Stone, James D.; Koloušková, Pavla; Sloan, D.B.; Štorchová, Helena

    2017-01-01

    Roč. 68, č. 7 (2017), s. 1599-1612 ISSN 0022-0957 R&D Projects: GA ČR(CZ) GA16-09220S Institutional support: RVO:61389030 Keywords : Cytoplasmic male sterility * Editing * Mitochondrion * Non-coding RNA * Silene vulgaris * Splicing * Transcriptome Subject RIV: EF - Botanics OBOR OECD: Plant sciences, botany Impact factor: 5.830, year: 2016

  6. Detection of two non-synonymous SNPs in SLC45A2 on BTA20 as candidate causal mutations for oculocutaneous albinism in Braunvieh cattle.

    Science.gov (United States)

    Rothammer, Sophie; Kunz, Elisabeth; Seichter, Doris; Krebs, Stefan; Wassertheurer, Martina; Fries, Ruedi; Brem, Gottfried; Medugorac, Ivica

    2017-10-05

    Cases of albinism have been reported in several species including cattle. So far, research has identified many genes that are involved in this eye-catching phenotype. Thus, when two paternal Braunvieh half-sibs with oculocutaneous albinism were detected on a private farm, we were interested in knowing whether their phenotype was caused by an already known gene/mutation. Analysis of genotyping data (50K) of the two albino individuals, their mothers and five other relatives identified a 47.61-Mb candidate haplotype on Bos taurus chromosome BTA20. Subsequent comparisons of the sequence of this haplotype with sequence data from four Braunvieh sires and the Aurochs genome identified two possible candidate causal mutations at positions 39,829,806 bp (G/A; R45Q) and 39,864,148 bp (C/T; T444I) that were absent in 1682 animals from various bovine breeds included in the 1000 bull genomes project. Both polymorphisms represent coding variants in the SLC45A2 gene, for which the human equivalent harbors numerous variants associated with oculocutaneous albinism type 4. We demonstrate an association of R45Q and T444I with the albino phenotype by targeted genotyping. Although the candidate gene SLC45A2 is known to be involved in albinism in different species, to date in cattle only mutations in the TYR and MITF genes were reported to be associated with albinism or albinism-like phenotypes. Thus, our study extends the list of genes that are associated with bovine albinism. However, further research and more samples from related animals are needed to elucidate if only one of these two single nucleotide polymorphisms or the combination of both is the actual causal variant.

  7. Long Non-Coding RNAs Associated with Metabolic Traits in Human White Adipose Tissue

    Directory of Open Access Journals (Sweden)

    Hui Gao

    2018-04-01

    Full Text Available Long non-coding RNAs (lncRNAs belong to a recently discovered class of molecules proposed to regulate various cellular processes. Here, we systematically analyzed their expression in human subcutaneous white adipose tissue (WAT and found that a limited set was differentially expressed in obesity and/or the insulin resistant state. Two lncRNAs herein termed adipocyte-specific metabolic related lncRNAs, ASMER-1 and ASMER-2 were enriched in adipocytes and regulated by both obesity and insulin resistance. Knockdown of either ASMER-1 or ASMER-2 by antisense oligonucleotides in in vitro differentiated human adipocytes revealed that both genes regulated adipogenesis, lipid mobilization and adiponectin secretion. The observed effects could be attributed to crosstalk between ASMERs and genes within the master regulatory pathways for adipocyte function including PPARG and INSR. Altogether, our data demonstrate that lncRNAs are modulators of the metabolic and secretory functions in human fat cells and provide an emerging link between WAT and common metabolic conditions. Keywords: White adipose tissue, Adipocytes, Long non-coding RNAs, Metabolic traits, Lipolysis, Adiponectin

  8. Natural functional SNPs in miR-155 alter its expression level, blood cell counts and immune responses

    Directory of Open Access Journals (Sweden)

    Congcong Li

    2016-08-01

    Full Text Available miR-155 has been confirmed to be a key factor in immune responses in humans and other mammals. Therefore, investigation of variations in miR-155 could be useful for understanding the differences in immunity between individuals. In this study, four SNPs in miR-155 were identified in mice (Mus musculus and humans (Homo sapiens. In mice, the four SNPs were closely linked and formed two miR-155 haplotypes (A and B. Ten distinct types of blood parameters were associated with miR-155 expression under normal conditions. Additionally, 4 and 14 blood parameters were significantly different between these two genotypes under normal and lipopolysaccharide (LPS stimulation conditions, respectively. Moreover, the expression levels of miR-155, the inflammatory response to LPS stimulation and the lethal ratio following Salmonella typhimurium infection were significantly increased in mice harboring the AA genotype. Further, two SNPs, one in the loop region and the other near the 3' terminal of pre-miR-155, were confirmed to be responsible for the differential expression of miR-155 in mice. Interestingly, two additional SNPs, one in the loop region and the other in the middle of miR-155*, modulated the function of miR-155 in humans. Predictions of secondary RNA structure using RNAfold showed that these SNPs affected the structure of miR-155 in both mice and humans. Our results provide novel evidence of the natural functional SNPs of miR-155 in both mice and humans, which may affect the expression levels of mature miR-155 by modulating its secondary structure. The SNPs of human miR-155 may be considered as causal mutations for some immune-related diseases in the clinic. The two genotypes of mice could be used as natural models for studying the mechanisms of immune diseases caused by abnormal expression of miR-155 in humans.

  9. Iterative demodulation and decoding of coded non-square QAM

    Science.gov (United States)

    Li, L.; Divsalar, D.; Dolinar, S.

    2003-01-01

    Simulation results show that, with iterative demodulation and decoding, coded NS-8QAM performs 0.5 dB better than standard 8QAM and 0.7 dB better than 8PSK at BER= 10(sup -5), when the FEC code is the (15, 11) Hamming code concatenated with a rate-1 accumulator code, while coded NS-32QAM performs 0.25 dB better than standard 32QAM.

  10. The role of non-coding RNAs in cytoplasmic male sterility in flowering plants

    Czech Academy of Sciences Publication Activity Database

    Štorchová, Helena

    2017-01-01

    Roč. 18, č. 11 (2017), č. článku 2429. E-ISSN 1422-0067 R&D Projects: GA ČR GA16-09220S Institutional support: RVO:61389030 Keywords : Cytoplasmic male sterility * Gene expression * Global transcriptome * Non-coding RNA * Pollen development Subject RIV: EF - Botanics OBOR OECD: Plant sciences, botany Impact factor: 3.226, year: 2016

  11. Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs

    DEFF Research Database (Denmark)

    Lee, S Hong; Ripke, Stephan; Neale, Benjamin M

    2013-01-01

    Most psychiatric disorders are moderately to highly heritable. The degree to which genetic variation is unique to individual disorders or shared across disorders is unclear. To examine shared genetic etiology, we use genome-wide genotype data from the Psychiatric Genomics Consortium (PGC) for cases...... and controls in schizophrenia, bipolar disorder, major depressive disorder, autism spectrum disorders (ASD) and attention-deficit/hyperactivity disorder (ADHD). We apply univariate and bivariate methods for the estimation of genetic variation within and covariation between disorders. SNPs explained 17......-29% of the variance in liability. The genetic correlation calculated using common SNPs was high between schizophrenia and bipolar disorder (0.68 ± 0.04 s.e.), moderate between schizophrenia and major depressive disorder (0.43 ± 0.06 s.e.), bipolar disorder and major depressive disorder (0.47 ± 0.06 s.e.), and ADHD...

  12. Discovery of Gene Sources for Economic Traits in Hanwoo by Whole-genome Resequencing

    Directory of Open Access Journals (Sweden)

    Younhee Shin

    2016-09-01

    Full Text Available Hanwoo, a Korean native cattle (Bos taurus coreana, has great economic value due to high meat quality. Also, the breed has genetic variations that are associated with production traits such as health, disease resistance, reproduction, growth as well as carcass quality. In this study, next generation sequencing technologies and the availability of an appropriate reference genome were applied to discover a large amount of single nucleotide polymorphisms (SNPs in ten Hanwoo bulls. Analysis of whole-genome resequencing generated a total of 26.5 Gb data, of which 594,716,859 and 592,990,750 reads covered 98.73% and 93.79% of the bovine reference genomes of UMD 3.1 and Btau 4.6.1, respectively. In total, 2,473,884 and 2,402,997 putative SNPs were discovered, of which 1,095,922 (44.3% and 982,674 (40.9% novel SNPs were discovered against UMD3.1 and Btau 4.6.1, respectively. Among the SNPs, the 46,301 (UMD 3.1 and 28,613 SNPs (Btau 4.6.1 that were identified as Hanwoo-specific SNPs were included in the functional genes that may be involved in the mechanisms of milk production, tenderness, juiciness, marbling of Hanwoo beef and yellow hair. Most of the Hanwoo-specific SNPs were identified in the promoter region, suggesting that the SNPs influence differential expression of the regulated genes relative to the relevant traits. In particular, the non-synonymous (ns SNPs found in CORIN, which is a negative regulator of Agouti, might be a causal variant to determine yellow hair of Hanwoo. Our results will provide abundant genetic sources of variation to characterize Hanwoo genetics and for subsequent breeding.

  13. The ALI-ARMS Code for Modeling Atmospheric non-LTE Molecular Band Emissions: Current Status and Applications

    Science.gov (United States)

    Kutepov, A. A.; Feofilov, A. G.; Manuilova, R. O.; Yankovsky, V. A.; Rezac, L.; Pesnell, W. D.; Goldberg, R. A.

    2008-01-01

    The Accelerated Lambda Iteration (ALI) technique was developed in stellar astrophysics at the beginning of 1990s for solving the non-LTE radiative transfer problem in atomic lines and multiplets in stellar atmospheres. It was later successfully applied to modeling the non-LTE emissions and radiative cooling/heating in the vibrational-rotational bands of molecules in planetary atmospheres. Similar to the standard lambda iterations ALI operates with the matrices of minimal dimension. However, it provides higher convergence rate and stability due to removing from the iterating process the photons trapped in the optically thick line cores. In the current ALI-ARMS (ALI for Atmospheric Radiation and Molecular Spectra) code version additional acceleration of calculations is provided by utilizing the opacity distribution function (ODF) approach and "decoupling". The former allows replacing the band branches by single lines of special shape, whereas the latter treats non-linearity caused by strong near-resonant vibration-vibrational level coupling without additional linearizing the statistical equilibrium equations. Latest code application for the non-LTE diagnostics of the molecular band emissions of Earth's and Martian atmospheres as well as for the non-LTE IR cooling/heating calculations are discussed.

  14. Contributions towards a monograph of Phoma (Coelomycetes) — III. 2. Misapplications of the type species name and the generic synonyms of section Plenodomus (Excluded species)

    NARCIS (Netherlands)

    Boerema, G.H.; Loerakker, W.M.; Hamers, Maria E.C.

    1996-01-01

    Various old records of Phoma lingam (teleomorph Leptosphaeria maculans) on non-cruciferous plants proved to be based on misidentifications. In the past the fungus has also often been confused with other fungi occurring on crucifers. Forty-five species formerly classified under the generic synonyms

  15. Comprehensive survey of SNPs in the Affymetrix exon array using the 1000 Genomes dataset.

    Directory of Open Access Journals (Sweden)

    Eric R Gamazon

    Full Text Available Microarray gene expression data has been used in genome-wide association studies to allow researchers to study gene regulation as well as other complex phenotypes including disease risks and drug response. To reach scientifically sound conclusions from these studies, however, it is necessary to get reliable summarization of gene expression intensities. Among various factors that could affect expression profiling using a microarray platform, single nucleotide polymorphisms (SNPs in target mRNA may lead to reduced signal intensity measurements and result in spurious results. The recently released 1000 Genomes Project dataset provides an opportunity to evaluate the distribution of both known and novel SNPs in the International HapMap Project lymphoblastoid cell lines (LCLs. We mapped the 1000 Genomes Project genotypic data to the Affymetrix GeneChip Human Exon 1.0ST array (exon array, which had been used in our previous studies and for which gene expression data had been made publicly available. We also evaluated the potential impact of these SNPs on the differentially spliced probesets we had identified previously. Though the 1000 Genomes Project data allowed a comprehensive survey of the SNPs in this particular array, the same approach can certainly be applied to other microarray platforms. Furthermore, we present a detailed catalogue of SNP-containing probesets (exon-level and transcript clusters (gene-level, which can be considered in evaluating findings using the exon array as well as benefit the design of follow-up experiments and data re-analysis.

  16. Snpdat: Easy and rapid annotation of results from de novo snp discovery projects for model and non-model organisms

    Directory of Open Access Journals (Sweden)

    Doran Anthony G

    2013-02-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are the most abundant genetic variant found in vertebrates and invertebrates. SNP discovery has become a highly automated, robust and relatively inexpensive process allowing the identification of many thousands of mutations for model and non-model organisms. Annotating large numbers of SNPs can be a difficult and complex process. Many tools available are optimised for use with organisms densely sampled for SNPs, such as humans. There are currently few tools available that are species non-specific or support non-model organism data. Results Here we present SNPdat, a high throughput analysis tool that can provide a comprehensive annotation of both novel and known SNPs for any organism with a draft sequence and annotation. Using a dataset of 4,566 SNPs identified in cattle using high-throughput DNA sequencing we demonstrate the annotations performed and the statistics that can be generated by SNPdat. Conclusions SNPdat provides users with a simple tool for annotation of genomes that are either not supported by other tools or have a small number of annotated SNPs available. SNPdat can also be used to analyse datasets from organisms which are densely sampled for SNPs. As a command line tool it can easily be incorporated into existing SNP discovery pipelines and fills a niche for analyses involving non-model organisms that are not supported by many available SNP annotation tools. SNPdat will be of great interest to scientists involved in SNP discovery and analysis projects, particularly those with limited bioinformatics experience.

  17. Identification of polymorphisms in and genes and their associations with plumage colors in Asian duck breeds

    Directory of Open Access Journals (Sweden)

    Hasina Sultana

    2018-02-01

    Full Text Available Objective The aim of this study was to investigate the effect of single nucleotide polymorphisms (SNPs of the melanogenesis associated transcription factor (MITF and dopachrome tautomerase (DCT genes on plumage coloration in Asian native duck breeds. MITF encodes a protein for microphthalmia-associated transcription factor, which regulates the development and function of melanocytes for pigmentation of skin, hair, and eyes. Among the tyrosinase-related family genes, DCT is a pigment cell-specific gene that plays important roles in the melanin synthesis pathway and the expression of skin, feather, and retina color. Methods Five Asian duck varieties (black Korean native, white Korean native, commercial Peking, Nageswari, and Bangladeshi Deshi white ducks were investigated to examine the polymorphisms associated with plumage colors. Among previously identified SNPs, three synonymous SNPs and one indel of MITF and nine SNPs in exon regions of DCT were genotyped. The allele frequencies for SNPs of the black and white plumage color populations were estimated and Fisher’s exact test was conducted to assess the association between the allele frequencies of these two populations. Results Two synonymous SNPs (c.114T>G and c.147T>C and a 14-bp indel (GCTGCAAAC AGATG in intron 7 of MITF were significantly associated with the black- and white-colored breeds (pG (p.His313Arg] in DCT, was highly significantly associated (pG was significantly associated (p<0.05 with black and white color plumage in the studied duck populations. Conclusion The results of this study provide a basis for further investigations of the associations between polymorphisms and plumage color phenotypes in Asian duck breeds.

  18. Scaling Non-Regular Shared-Memory Codes by Reusing Custom Loop Schedules

    Directory of Open Access Journals (Sweden)

    Dimitrios S. Nikolopoulos

    2003-01-01

    Full Text Available In this paper we explore the idea of customizing and reusing loop schedules to improve the scalability of non-regular numerical codes in shared-memory architectures with non-uniform memory access latency. The main objective is to implicitly setup affinity links between threads and data, by devising loop schedules that achieve balanced work distribution within irregular data spaces and reusing them as much as possible along the execution of the program for better memory access locality. This transformation provides a great deal of flexibility in optimizing locality, without compromising the simplicity of the shared-memory programming paradigm. In particular, the programmer does not need to explicitly distribute data between processors. The paper presents practical examples from real applications and experiments showing the efficiency of the approach.

  19. Integration and visualization of non-coding RNA and protein interaction networks

    DEFF Research Database (Denmark)

    Junge, Alexander; Refsgaard, Jan Christian; Garde, Christian

    Non-coding RNAs (ncRNAs) fulfill a diverse set of biological functions relying on interactions with other molecular entities. The advent of new experimental and computational approaches makes it possible to study ncRNAs and their associations on an unprecedented scale. We present RAIN (RNA Associ......) co-occurrences found by text mining Medline abstracts. Each resource was assigned a reliability score by assessing its agreement with a gold standard set of microRNA-target interactions. RAIN is available at: http://rth.dk/resources/rain...

  20. Comparison of Antidepressant Efficacy-related SNPs Among Taiwanese and Four Populations in the HapMap Database

    Directory of Open Access Journals (Sweden)

    Mei-Hung Chi

    2011-07-01

    Full Text Available The genetic influence of single nucleotide polymorphisms (SNPs on antidepressant efficacy has been previously demonstrated. To evaluate whether there are ethnic differences, we compared the allele frequencies of antidepressant efficacy-related SNPs between the Taiwanese population and four other populations in the HapMap database. We recruited 198 Taiwanese major depression patients and 106 Taiwanese controls. A panel of possible relevant SNPs (in brain-derived neurotrophic factor, 5-hydroxytryptamine receptor 2A, interleukin 1 beta, and G-protein beta 3 subunit genes was selected for comparisons of allele frequencies using the χ2 test. Our results suggested no difference between Taiwanese patients and controls, but there were significant differences among Taiwanese controls and the other four ethnic groups in brain-derived neurotrophic factor, 5-hydroxytryptamine receptor 2A, interleukin 1 beta and G-protein beta 3 subunit genes. We conclude that there are ethnic differences in the allele frequencies of antidepressant efficacy-related SNPs, and that the degree of variations is consistent with geographic distances. Further investigation is required to verify the attribution of genetic differences to ethnic-specific antidepressant responses.

  1. NOAA Weather Radio - EAS Event Codes

    Science.gov (United States)

    Non-Zero All Hazards Logo Emergency Alert Description Event Codes Fact Sheet FAQ Organization Search Coding Using SAME SAME Non-Zero Codes DOCUMENTS NWR Poster NWR Brochure NWR Brochure Printing Notes

  2. Development of a computer program to support an efficient non-regression test of a thermal-hydraulic system code

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Jun Yeob; Jeong, Jae Jun [School of Mechanical Engineering, Pusan National University, Busan (Korea, Republic of); Suh, Jae Seung [System Engineering and Technology Co., Daejeon (Korea, Republic of); Kim, Kyung Doo [Korea Atomic Energy Research Institute, Daejeon (Korea, Republic of)

    2014-10-15

    During the development process of a thermal-hydraulic system code, a non-regression test (NRT) must be performed repeatedly in order to prevent software regression. The NRT process, however, is time-consuming and labor-intensive. Thus, automation of this process is an ideal solution. In this study, we have developed a program to support an efficient NRT for the SPACE code and demonstrated its usability. This results in a high degree of efficiency for code development. The program was developed using the Visual Basic for Applications and designed so that it can be easily customized for the NRT of other computer codes.

  3. The Long Non-coding RNA HOTTIP Enhances Pancreatic Cancer Cell Proliferation, Survival and Migration

    Science.gov (United States)

    ABSTRACTHOTTIP is a long non-coding RNA (lncRNA) transcribed from the 5' tip of the HOXA locus and is associated with the polycomb repressor complex 2 (PRC2) and WD repeat containing protein 5 (WDR5)/mixed lineage leukemia 1 (MLL1) chromatin modifying complexes. HOTTIP is expres...

  4. Control of ribosome traffic by position-dependent choice of synonymous codons

    International Nuclear Information System (INIS)

    Mitarai, Namiko; Pedersen, Steen

    2013-01-01

    Messenger RNA (mRNA) encodes a sequence of amino acids by using codons. For most amino acids, there are multiple synonymous codons that can encode the amino acid. The translation speed can vary from one codon to another, thus there is room for changing the ribosome speed while keeping the amino acid sequence and hence the resulting protein. Recently, it has been noticed that the choice of the synonymous codon, via the resulting distribution of slow- and fast-translated codons, affects not only on the average speed of one ribosome translating the mRNA but also might have an effect on nearby ribosomes by affecting the appearance of ‘traffic jams’ where multiple ribosomes collide and form queues. To test this ‘context effect’ further, we here investigate the effect of the sequence of synonymous codons on the ribosome traffic by using a ribosome traffic model with codon-dependent rates, estimated from experiments. We compare the ribosome traffic on wild-type (WT) sequences and sequences where the synonymous codons were swapped randomly. By simulating translation of 87 genes, we demonstrate that the WT sequences, especially those with a high bias in codon usage, tend to have the ability to reduce ribosome collisions, hence optimizing the cellular investment in the translation apparatus. The magnitude of such reduction of the translation time might have a significant impact on the cellular growth rate and thereby have importance for the survival of the species. (paper)

  5. Development of a multiplex PCR assay detecting 52 autosomal SNPs

    DEFF Research Database (Denmark)

    Sanchez Sanchez, Juan Jose; Phillips, C.; Børsting, Claus

    2006-01-01

    for amplifying 52 genomic DNA fragments, each containing one SNP, in a single tube, and accurately genotyping the PCR product mixture using two single base extension reactions. This multiplex approach reduces the cost of SNP genotyping and requires as little as 0.5 ng of genomic DNA to detect 52 SNPs. We used...

  6. THE EXISTENCE OF SYNONYMS IN A LANGUAGE - 2 FORMS BUT ONE, OR RATHER 2 MEANINGS

    NARCIS (Netherlands)

    DEJONGE, B

    1993-01-01

    This paper focuses on the problem of synonymy. In practice, the existence of synonyms is generally accepted, but in theory, synonymy is undesirable. In this article an attempt is made to show that functional differences in meaning can distinguish two apparently synonymous verbs in Modern Italian.

  7. QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species

    Directory of Open Access Journals (Sweden)

    Voorrips Roeland E

    2006-10-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are important tools in studying complex genetic traits and genome evolution. Computational strategies for SNP discovery make use of the large number of sequences present in public databases (in most cases as expressed sequence tags (ESTs and are considered to be faster and more cost-effective than experimental procedures. A major challenge in computational SNP discovery is distinguishing allelic variation from sequence variation between paralogous sequences, in addition to recognizing sequencing errors. For the majority of the public EST sequences, trace or quality files are lacking which makes detection of reliable SNPs even more difficult because it has to rely on sequence comparisons only. Results We have developed a new algorithm to detect reliable SNPs and insertions/deletions (indels in EST data, both with and without quality files. Implemented in a pipeline called QualitySNP, it uses three filters for the identification of reliable SNPs. Filter 1 screens for all potential SNPs and identifies variation between or within genotypes. Filter 2 is the core filter that uses a haplotype-based strategy to detect reliable SNPs. Clusters with potential paralogs as well as false SNPs caused by sequencing errors are identified. Filter 3 screens SNPs by calculating a confidence score, based upon sequence redundancy and quality. Non-synonymous SNPs are subsequently identified by detecting open reading frames of consensus sequences (contigs with SNPs. The pipeline includes a data storage and retrieval system for haplotypes, SNPs and alignments. QualitySNP's versatility is demonstrated by the identification of SNPs in EST datasets from potato, chicken and humans. Conclusion QualitySNP is an efficient tool for SNP detection, storage and retrieval in diploid as well as polyploid species. It is available for running on Linux or UNIX systems. The program, test data, and user manual are available at

  8. Coded excitation for infrared non-destructive testing of carbon fiber reinforced plastics.

    Science.gov (United States)

    Mulaveesala, Ravibabu; Venkata Ghali, Subbarao

    2011-05-01

    This paper proposes a Barker coded excitation for defect detection using infrared non-destructive testing. Capability of the proposed excitation scheme is highlighted with recently introduced correlation based post processing approach and compared with the existing phase based analysis by taking the signal to noise ratio into consideration. Applicability of the proposed scheme has been experimentally validated on a carbon fiber reinforced plastic specimen containing flat bottom holes located at different depths.

  9. Synonymous Mutations at the Beginning of the Influenza A Virus Hemagglutinin Gene Impact Experimental Fitness.

    Science.gov (United States)

    Canale, Aneth S; Venev, Sergey V; Whitfield, Troy W; Caffrey, Daniel R; Marasco, Wayne A; Schiffer, Celia A; Kowalik, Timothy F; Jensen, Jeffrey D; Finberg, Robert W; Zeldovich, Konstantin B; Wang, Jennifer P; Bolon, Daniel N A

    2018-04-13

    The fitness effects of synonymous mutations can provide insights into biological and evolutionary mechanisms. We analyzed the experimental fitness effects of all single-nucleotide mutations, including synonymous substitutions, at the beginning of the influenza A virus hemagglutinin (HA) gene. Many synonymous substitutions were deleterious both in bulk competition and for individually isolated clones. Investigating protein and RNA levels of a subset of individually expressed HA variants revealed that multiple biochemical properties contribute to the observed experimental fitness effects. Our results indicate that a structural element in the HA segment viral RNA may influence fitness. Examination of naturally evolved sequences in human hosts indicates a preference for the unfolded state of this structural element compared to that found in swine hosts. Our overall results reveal that synonymous mutations may have greater fitness consequences than indicated by simple models of sequence conservation, and we discuss the implications of this finding for commonly used evolutionary tests and analyses. Copyright © 2018. Published by Elsevier Ltd.

  10. Molecular Evolution of the non-coding Eosinophil Granule Ontogeny Transcript EGOT

    Directory of Open Access Journals (Sweden)

    Dominic eRose

    2011-10-01

    Full Text Available Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs. The evolutionary history of mlncRNAs is still largely uncharted territory.In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT, an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs. EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyse patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrat here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved and thermodynamic stable secondary structures.Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element.

  11. TRPV6 exhibits unusual patterns of polymorphism and divergence in worldwide populations.

    Science.gov (United States)

    Akey, Joshua M; Swanson, Willie J; Madeoy, Jennifer; Eberle, Michael; Shriver, Mark D

    2006-07-01

    A striking footprint of positive selection was recently identified on chromosome 7q34-35 that spans at least 115 kb and encompasses four known genes (KEL, TRPV5, TRPV6, EPHB6). The signature of selection was observed in only one of the two populations analyzed suggesting the action of geographically restricted selective pressures. However, as only two populations were analyzed, it remains unknown whether the signature of selection extends to additional populations. To address this issue and begin to dissect the evolutionary history of this region in more detail, we performed an in-depth population genetic analysis on TRPV6, which is a calcium-permeable ion channel thought to mediate the rate-limiting step of dietary calcium absorption. We demonstrate that the rate of TRPV6 protein evolution is significantly accelerated in the human lineage, but only for a haplotype defined by three non-synonymous SNPs (C157R, M378V and M681T) that are nearly fixed for the derived alleles in non-African populations. Interestingly, we found that these three non-synonymous SNPs have high posterior probabilities for being targets of positive selection and are therefore strong candidates for mediating the population-specific signatures of selection in this region. In addition, we resequenced the exons corresponding to the C157R, M378V and M681T polymorphisms in 90 geographically diverse individuals and characterized their global allele frequency distribution by genotyping them in 1064 individuals from 52 populations. These data strongly suggest that the TRPV6 haplotype defined by the derived alleles at C157R, M378V and M681T conferred a selective advantage that varied spatially, and perhaps temporally, during human history.

  12. Analysis of EFL Students' Ability in Reading Vocabulary of Synonyms and Antonyms

    OpenAIRE

    Vina Fathira

    2017-01-01

    Reading is an important thing for academic level. Every student must have many vocabularies to encourage her/his reading skill. The aim of this research is to analyze the students' understanding of reading vocabularies of synonyms and antonyms in the higher education level. Synonyms and antonyms are two important things should be mastered to get better reading comprehension. The method used in this research was quantitative with survey design. The population same as the sample of this researc...

  13. Genetic Determinants of Metabolism and Benign Prostate Enlargement: Associations with Prostate Volume.

    Directory of Open Access Journals (Sweden)

    Ayush Giri

    Full Text Available Prostate enlargement leading to clinical benign prostatic hyperplasia (BPH is associated with metabolic dysregulation and obesity. The genetic basis of this association is unclear. Our objective was to evaluate whether single nucleotide polymorphisms (SNPs previously associated with metabolic disorders are also associated with prostate volume (PV. Participants included 876 men referred for prostate biopsy and found to be prostate cancer free. PV was measured by transrectal ultrasound. Samples were genotyped using the Illumina Cardio-MetaboChip platform. Multivariable adjusted linear regression models were used to evaluate SNPs (additive coding in relation to natural-log transformed (log PV. We compared SNP-PV results from biopsy-negative men to 442 men with low-grade prostate cancer with similar levels of obesity and PV. Beta-coefficients from the discovery and replication samples were then aggregated with fixed effects inverse variance weighted meta-analysis. SNP rs11736129 (near the pseudo-gene LOC100131429 was significantly associated with log-PV (beta: 0.16, p-value 1.16x10(-8 after adjusting for multiple testing. Other noteworthy SNPs that were nominally associated (p-value < 1x10(-4 with log-PV included rs9583484 (intronic SNP in COL4A2, rs10146527 (intronic SNP in NRXN3, rs9909466 (SNP near RPL32P31, and rs2241606 (synonymous SNP in SLC12A7. We found several SNPs in metabolic loci associated with PV. Further studies are needed to confirm our results and elucidate the mechanism between these genetic loci, PV, and clinical BPH.

  14. Decoding the Emerging Patterns Exhibited in Non-coding RNAs Characteristic of Lung Cancer with Regard to their Clinical Significance.

    Science.gov (United States)

    Sonea, Laura; Buse, Mihail; Gulei, Diana; Onaciu, Anca; Simon, Ioan; Braicu, Cornelia; Berindan-Neagoe, Ioana

    2018-05-01

    Lung cancer continues to be the leading topic concerning global mortality rate caused by can-cer; it needs to be further investigated to reduce these dramatic unfavorable statistic data. Non-coding RNAs (ncRNAs) have been shown to be important cellular regulatory factors and the alteration of their expression levels has become correlated to extensive number of pathologies. Specifically, their expres-sion profiles are correlated with development and progression of lung cancer, generating great interest for further investigation. This review focuses on the complex role of non-coding RNAs, namely miR-NAs, piwi-interacting RNAs, small nucleolar RNAs, long non-coding RNAs and circular RNAs in the process of developing novel biomarkers for diagnostic and prognostic factors that can then be utilized for personalized therapies toward this devastating disease. To support the concept of personalized medi-cine, we will focus on the roles of miRNAs in lung cancer tumorigenesis, their use as diagnostic and prognostic biomarkers and their application for patient therapy.

  15. Coherent Synchrotron Radiation A Simulation Code Based on the Non-Linear Extension of the Operator Splitting Method

    CERN Document Server

    Dattoli, Giuseppe

    2005-01-01

    The coherent synchrotron radiation (CSR) is one of the main problems limiting the performance of high intensity electron accelerators. A code devoted to the analysis of this type of problems should be fast and reliable: conditions that are usually hardly achieved at the same time. In the past, codes based on Lie algebraic techniques have been very efficient to treat transport problem in accelerators. The extension of these method to the non-linear case is ideally suited to treat CSR instability problems. We report on the development of a numerical code, based on the solution of the Vlasov equation, with the inclusion of non-linear contribution due to wake field effects. The proposed solution method exploits an algebraic technique, using exponential operators implemented numerically in C++. We show that the integration procedure is capable of reproducing the onset of an instability and effects associated with bunching mechanisms leading to the growth of the instability itself. In addition, parametric studies a...

  16. Roles, Functions, and Mechanisms of Long Non-coding RNAs in Cancer

    Directory of Open Access Journals (Sweden)

    Yiwen Fang

    2016-02-01

    Full Text Available Long non-coding RNAs (lncRNAs play important roles in cancer. They are involved in chromatin remodeling, as well as transcriptional and post-transcriptional regulation, through a variety of chromatin-based mechanisms and via cross-talk with other RNA species. lncRNAs can function as decoys, scaffolds, and enhancer RNAs. This review summarizes the characteristics of lncRNAs, including their roles, functions, and working mechanisms, describes methods for identifying and annotating lncRNAs, and discusses future opportunities for lncRNA-based therapies using antisense oligonucleotides.

  17. Computer codes for three dimensional mass transport with non-linear sorption

    International Nuclear Information System (INIS)

    Noy, D.J.

    1985-03-01

    The report describes the mathematical background and data input to finite element programs for three dimensional mass transport in a porous medium. The transport equations are developed and sorption processes are included in a general way so that non-linear equilibrium relations can be introduced. The programs are described and a guide given to the construction of the required input data sets. Concluding remarks indicate that the calculations require substantial computer resources and suggest that comprehensive preliminary analysis with lower dimensional codes would be important in the assessment of field data. (author)

  18. Influence of ghrelin gene polymorphisms on hypertension and atherosclerotic disease.

    Science.gov (United States)

    Berthold, Heiner K; Giannakidou, Eleni; Krone, Wilhelm; Trégouët, David-Alexandre; Gouni-Berthold, Ioanna

    2010-02-01

    Ghrelin is involved in several metabolic and cardiovascular processes. Recent evidence suggests its involvement in blood pressure regulation and hypertension. The aim of the study was to determine associations of single-nucleotide polymorphisms (SNPs) and haplotypes of the ghrelin gene (GHRL) with hypertension and atherosclerotic disease. Six GHRL SNPs (rs27647, rs26802, rs34911341, rs696217, rs4684677 and a -473G/A (with no assigned rsID)) were investigated in a sample of 1143 hypertensive subjects and 1489 controls of Caucasian origin. Both single-locus and haplotype association analyses were performed. In single-locus analyses, only the non-synonymous rs34911341 was associated with hypertension (odds ratio (OR)=1.95 (95% confidence interval (CI): 1.26-3.02), P=0.003). Six common haplotypes with frequency >1% were inferred from the studied GHRL SNPs, and their frequency distribution was significantly different between hypertensive subjects and controls (chi(2)=12.96 with 5 d.f. (degree of freedom), P=0.024). The effect of rs26802 was found to be significantly (P=0.017) modulated by other GHRL SNPs, as its C allele conferred either an increased risk (OR=1.30 (1.08-1.57), P=0.005) or a decreased risk (OR=0.50 (0.23-1.06), P=0.07) of hypertension according to the two different haplotypes on which it can be found. No association of GHRL SNPs or haplotypes with atherosclerotic disease was observed. In conclusion, we observed statistical evidence for association between GHRL SNPs and risk of hypertension.

  19. Analysis of Case-Control Association Studies: SNPs, Imputation and Haplotypes

    KAUST Repository

    Chatterjee, Nilanjan; Chen, Yi-Hau; Luo, Sheng; Carroll, Raymond J.

    2009-01-01

    Although prospective logistic regression is the standard method of analysis for case-control data, it has been recently noted that in genetic epidemiologic studies one can use the "retrospective" likelihood to gain major power by incorporating various population genetics model assumptions such as Hardy-Weinberg-Equilibrium (HWE), gene-gene and gene-environment independence. In this article we review these modern methods and contrast them with the more classical approaches through two types of applications (i) association tests for typed and untyped single nucleotide polymorphisms (SNPs) and (ii) estimation of haplotype effects and haplotype-environment interactions in the presence of haplotype-phase ambiguity. We provide novel insights to existing methods by construction of various score-tests and pseudo-likelihoods. In addition, we describe a novel two-stage method for analysis of untyped SNPs that can use any flexible external algorithm for genotype imputation followed by a powerful association test based on the retrospective likelihood. We illustrate applications of the methods using simulated and real data. © Institute of Mathematical Statistics, 2009.

  20. Analysis of Case-Control Association Studies: SNPs, Imputation and Haplotypes

    KAUST Repository

    Chatterjee, Nilanjan

    2009-11-01

    Although prospective logistic regression is the standard method of analysis for case-control data, it has been recently noted that in genetic epidemiologic studies one can use the "retrospective" likelihood to gain major power by incorporating various population genetics model assumptions such as Hardy-Weinberg-Equilibrium (HWE), gene-gene and gene-environment independence. In this article we review these modern methods and contrast them with the more classical approaches through two types of applications (i) association tests for typed and untyped single nucleotide polymorphisms (SNPs) and (ii) estimation of haplotype effects and haplotype-environment interactions in the presence of haplotype-phase ambiguity. We provide novel insights to existing methods by construction of various score-tests and pseudo-likelihoods. In addition, we describe a novel two-stage method for analysis of untyped SNPs that can use any flexible external algorithm for genotype imputation followed by a powerful association test based on the retrospective likelihood. We illustrate applications of the methods using simulated and real data. © Institute of Mathematical Statistics, 2009.

  1. Is α‐T catenin (VR22) an Alzheimer's disease risk gene?

    Science.gov (United States)

    Bertram, Lars; Mullin, Kristina; Parkinson, Michele; Hsiao, Monica; Moscarillo, Thomas J; Wagner, Steven L; Becker, K David; Velicelebi, Gonul; Blacker, Deborah; Tanzi, Rudolph E

    2007-01-01

    Background Recently, conflicting reports have been published on the potential role of genetic variants in the α‐T catenin gene (VR22; CTNNA3) on the risk for Alzheimer's disease. In these papers, evidence for association is mostly observed in multiplex families with Alzheimer's disease, whereas case–control samples of sporadic Alzheimer's disease are predominantly negative. Methods After sequencing VR22 in multiplex families with Alzheimer's disease linked to chromosome 10q21, we identified a novel non‐synonymous (Ser596Asn; rs4548513) single nucleotide polymorphism (SNP). This and four non‐coding SNPs were assessed in two independent samples of families with Alzheimer's disease, one with 1439 subjects from 437 multiplex families with Alzheimer's disease and the other with 489 subjects from 217 discordant sibships. Results A weak association with the Ser596Asn SNP in the multiplex sample, predominantly in families with late‐onset Alzheimer's disease (p = 0.02), was observed. However, this association does not seem to contribute substantially to the chromosome 10 Alzheimer's disease linkage signal that we and others have reported previously. No evidence was found of association with any of the four additional SNPs tested in the multiplex families with Alzheimer's disease. Finally, the Ser596Asn change was not associated with the risk for Alzheimer's disease in the independent discordant sibship sample. Conclusions This is the first study to report evidence of an association between a potentially functional, non‐synonymous SNP in VR22 and the risk for Alzheimer's disease. As the underlying effects are probably small, and are only seen in families with multiple affected members, the population‐wide significance of this finding remains to be determined. PMID:17209133

  2. BAC-end sequence-based SNPs and Bin mapping for rapid integration of physical and genetic maps in apple.

    Science.gov (United States)

    Han, Yuepeng; Chagné, David; Gasic, Ksenija; Rikkerink, Erik H A; Beever, Jonathan E; Gardiner, Susan E; Korban, Schuyler S

    2009-03-01

    A genome-wide BAC physical map of the apple, Malus x domestica Borkh., has been recently developed. Here, we report on integrating the physical and genetic maps of the apple using a SNP-based approach in conjunction with bin mapping. Briefly, BAC clones located at ends of BAC contigs were selected, and sequenced at both ends. The BAC end sequences (BESs) were used to identify candidate SNPs. Subsequently, these candidate SNPs were genetically mapped using a bin mapping strategy for the purpose of mapping the physical onto the genetic map. Using this approach, 52 (23%) out of 228 BESs tested were successfully exploited to develop SNPs. These SNPs anchored 51 contigs, spanning approximately 37 Mb in cumulative physical length, onto 14 linkage groups. The reliability of the integration of the physical and genetic maps using this SNP-based strategy is described, and the results confirm the feasibility of this approach to construct an integrated physical and genetic maps for apple.

  3. Streptomyces phaeopurpureus Shinobu 1957 (Approved Lists 1980) and Streptomyces griseorubiginosus (Ryabova and Preobrazhenskaya 1957) Pridham et al. 1958 (Approved Lists 1980) are heterotypic subjective synonyms.

    Science.gov (United States)

    Kämpfer, Peter; Rückert, Christian; Blom, Jochen; Goesmann, Alexander; Wink, Joachim; Kalinowski, Jörn; Glaeser, Stefanie P

    2017-08-01

    On the basis of whole genome comparisons of Streptomyces griseorubiginosus and Streptomyces phaeopurpureus it could by shown that these two species are subjective synonyms. The names of both species have been published in the Approved Lists of Bacterial Names and, in such a case, normally Rule 24b (1) of the Prokaryotic Code applies, which reads: 'If two names compete for priority and if both names date from 1 January 1980 on an Approved List, the priority shall be determined by the date of the original publication of the name before 1 January 1980'. Streptomyces griseorubiginosus and Streptomyces phaeopurpureus were both effectively published in 1957, and for both publications, the exact date cannot be obtained. In this case a further statement of Rule 24 applies, which reads: 'If the names or epithets are of the same date, the author who first unites the taxa has the right to choose one of them, and his choice must be followed.' Hence we propose that Streptomyces phaeopurpureus is a later heterotypic subjective synonym of Streptomyces griseorubiginosus.

  4. Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology

    Science.gov (United States)

    Ramos, Antonio M.; Crooijmans, Richard P. M. A.; Affara, Nabeel A.; Amaral, Andreia J.; Archibald, Alan L.; Beever, Jonathan E.; Bendixen, Christian; Churcher, Carol; Clark, Richard; Dehais, Patrick; Hansen, Mark S.; Hedegaard, Jakob; Hu, Zhi-Liang; Kerstens, Hindrik H.; Law, Andy S.; Megens, Hendrik-Jan; Milan, Denis; Nonneman, Danny J.; Rohrer, Gary A.; Rothschild, Max F.; Smith, Tim P. L.; Schnabel, Robert D.; Van Tassell, Curt P.; Taylor, Jeremy F.; Wiedmann, Ralph T.; Schook, Lawrence B.; Groenen, Martien A. M.

    2009-01-01

    Background The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illumina's Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pigs. PMID:19654876

  5. The Role of Non-­Coding RNA in Plant Stress

    KAUST Repository

    MacPherson, Cameron R.

    2012-12-01

    Post-transcriptional gene silencing (PTGS) is a powerful mechanism that can be adapted to genetically modify crop plants. PTGS operates in many plant signaling pathways including those mediating stress responses. Given the small number of miRNAs known, research on the characterization of stress-related micro-RNA (miRNA) and their targets could provide the basis for engineering stress tolerant traits in crops. Indeed, several examples of miRNA mediated crop tolerance have been reported. In the research presented here, we aimed to analyze the role of small non-coding RNA (smRNA) pathways involved in plant stress. In particular, we focused on miRNA-mediated PTGS in phosphate (Pi) starvation. The analysis was split into two research projects. First, to identify potential miRNA targets we began by analyzing the response and recovery of coding and long non-coding RNAs (lncRNA) to Pi starvation in shoot and root. The results obtained were the first genome-wide description of the root’s Pi starvation response and recovery. We found that the root\\'s response involved a widely different set of genes than that of the shoot. In the second research project, the results of the first project were correlated with the responses of miRNA and trans-acting small-interfering RNA (tasiRNA) during Pi starvation. Many miRNA circuits have been predicted before, however, tasiRNA circuits are not as well defined. Therefore, we made use of the double-stranded RNA-binding protein 4 (DRB4) smRNA libraries to enhance our prediction of tasiRNAs. Altogether, we provided evidence to support the following miRNA-mRNA pairs that may function in Pi starvation: IPS1:miR399:PHO2; miR399:RS4; miR399:NF-YA10; miR398:CSD1/2; miR2111:TPS11; miR164:NAC6; miR157:TMO7; miR157:PSB28; RPS2:miR169:IPS2; miR397:LAC2; TAS4:PAP1; NR1:PAP1; and Chr3_1967672:TMO7. In general, we found that non-miR399 related circuits were active only during the root’s recovery from Pi starvation. The functional roles of the genes

  6. Utilizing elements of the CSAU phenomena identification and ranking table (PIRT) to qualify a PWR non-LOCA transients system code

    Energy Technology Data Exchange (ETDEWEB)

    Greene, K.R.; Fletcher, C.D.; Gottula, R.C.; Lindquist, T.R.; Stitt, B.D. [Framatome ANP, Richland, WA (United States)

    2001-07-01

    Licensing analyses of Nuclear Regulatory Commission (NRC) Standard Review Plan (SRP) Chapter 15 non-LOCA transients are an important part of establishing operational safety limits and design limits for nuclear power plants. The applied codes and methods are generally qualified using traditional methods of benchmarking and assessment, sample problems, and demonstration of conservatism. Rigorous formal methods for developing code and methodology have been created and applied to qualify realistic methods for Large Break Loss-of-Coolant Accidents (LBLOCA's). This methodology, Code Scaling, Applicability, and Uncertainty (CSAU), is a very demanding, resource intensive, process to apply. It would be challenging to apply a comprehensive and complete CSAU level of analysis, individually, to each of the more than 30 non-LOCA transients that comprise Chapter 15 events. However, certain elements of the process can be easily adapted to improve quality of the codes and methods used to analyze non- LOCA transients. One of these elements is the Phenomena Identification and Ranking Table (PIRT). This paper presents the results of an informally constructed PIRT that applies to non-LOCA transients for Pressurized Water Reactors (PWR's) of the Westinghouse and Combustion Engineering design. A group of experts in thermal-hydraulics and safety analysis identified and ranked the phenomena. To begin the process, the PIRT was initially performed individually by each expert. Then through group interaction and discussion, a consensus was reached on both the significant phenomena and the appropriate ranking. The paper also discusses using the PIRT as an aid to qualify a 'conservative' system code and methodology. Once agreement was obtained on the phenomena and ranking, the table was divided into six functional groups, by nature of the transients, along the same lines as Chapter 15. Then, assessment and disposition of the significant phenomena was performed. The PIRT and

  7. The Nym Family: Synonyms, Antonyms, Homonyms, Acronyms.

    Science.gov (United States)

    Cummings, Melodie

    Intended to help students improve their vocabulary and spelling skills, this booklet offers activities on synonyms, antonyms, homonyms (including homophones and homographs), and acronyms. It is suggested that the teacher present these types of words as members of the "Nym Family." Ideas for posters and books to be used as instructional…

  8. 118 SNPs of folate-related genes and risks of spina bifida and conotruncal heart defects

    Directory of Open Access Journals (Sweden)

    Shaw Gary M

    2009-06-01

    Full Text Available Abstract Background Folic acid taken in early pregnancy reduces risks for delivering offspring with several congenital anomalies. The mechanism by which folic acid reduces risk is unknown. Investigations into genetic variation that influences transport and metabolism of folate will help fill this data gap. We focused on 118 SNPs involved in folate transport and metabolism. Methods Using data from a California population-based registry, we investigated whether risks of spina bifida or conotruncal heart defects were influenced by 118 single nucleotide polymorphisms (SNPs associated with the complex folate pathway. This case-control study included 259 infants with spina bifida and a random sample of 359 nonmalformed control infants born during 1983–86 or 1994–95. It also included 214 infants with conotruncal heart defects born during 1983–86. Infant genotyping was performed blinded to case or control status using a designed SNPlex assay. We examined single SNP effects for each of the 118 SNPs, as well as haplotypes, for each of the two outcomes. Results Few odds ratios (ORs revealed sizable departures from 1.0. With respect to spina bifida, we observed ORs with 95% confidence intervals that did not include 1.0 for the following SNPs (heterozygous or homozygous relative to the reference genotype: BHMT (rs3733890 OR = 1.8 (1.1–3.1, CBS (rs2851391 OR = 2.0 (1.2–3.1; CBS (rs234713 OR = 2.9 (1.3–6.7; MTHFD1 (rs2236224 OR = 1.7 (1.1–2.7; MTHFD1 (hcv11462908 OR = 0.2 (0–0.9; MTHFD2 (rs702465 OR = 0.6 (0.4–0.9; MTHFD2 (rs7571842 OR = 0.6 (0.4–0.9; MTHFR (rs1801133 OR = 2.0 (1.2–3.1; MTRR (rs162036 OR = 3.0 (1.5–5.9; MTRR (rs10380 OR = 3.4 (1.6–7.1; MTRR (rs1801394 OR = 0.7 (0.5–0.9; MTRR (rs9332 OR = 2.7 (1.3–5.3; TYMS (rs2847149 OR = 2.2 (1.4–3.5; TYMS (rs1001761 OR = 2.4 (1.5–3.8; and TYMS (rs502396 OR = 2.1 (1.3–3.3. However, multiple SNPs observed for a given gene showed evidence of linkage disequilibrium indicating

  9. PCA-based bootstrap confidence interval tests for gene-disease association involving multiple SNPs

    Directory of Open Access Journals (Sweden)

    Xue Fuzhong

    2010-01-01

    Full Text Available Abstract Background Genetic association study is currently the primary vehicle for identification and characterization of disease-predisposing variant(s which usually involves multiple single-nucleotide polymorphisms (SNPs available. However, SNP-wise association tests raise concerns over multiple testing. Haplotype-based methods have the advantage of being able to account for correlations between neighbouring SNPs, yet assuming Hardy-Weinberg equilibrium (HWE and potentially large number degrees of freedom can harm its statistical power and robustness. Approaches based on principal component analysis (PCA are preferable in this regard but their performance varies with methods of extracting principal components (PCs. Results PCA-based bootstrap confidence interval test (PCA-BCIT, which directly uses the PC scores to assess gene-disease association, was developed and evaluated for three ways of extracting PCs, i.e., cases only(CAES, controls only(COES and cases and controls combined(CES. Extraction of PCs with COES is preferred to that with CAES and CES. Performance of the test was examined via simulations as well as analyses on data of rheumatoid arthritis and heroin addiction, which maintains nominal level under null hypothesis and showed comparable performance with permutation test. Conclusions PCA-BCIT is a valid and powerful method for assessing gene-disease association involving multiple SNPs.

  10. Characterization of LEDGF/p75 genetic variants and association with HIV-1 disease progression.

    Directory of Open Access Journals (Sweden)

    Peter Messiaen

    Full Text Available BACKGROUND: As Lens epithelium-derived growth factor (LEDGF/p75 is an important co-factor involved in HIV-1 integration, the LEDGF/p75-IN interaction is a promising target for the new class of allosteric HIV integrase inhibitors (LEDGINs. Few data are available on the genetic variability of LEDGF/p75 and the influence on HIV disease in vivo. This study evaluated the relation between LEDGF/p75 genetic variation, mRNA expression and HIV-1 disease progression in order to guide future clinical use of LEDGINs. METHODS: Samples were derived from a therapy-naïve cohort at Ghent University Hospital and a Spanish long-term-non-progressor cohort. High-resolution melting curve analysis and Sanger sequencing were used to identify all single nucleotide polymorphisms (SNPs in the coding region, flanking intronic regions and full 3'UTR of LEDGF/p75. In addition, two intronic tagSNPs were screened based on previous indication of influencing HIV disease. LEDGF/p75 mRNA was quantified in patient peripheral blood mononuclear cells (PBMC using RT-qPCR. RESULTS: 325 samples were investigated from patients of Caucasian (n = 291 and African (n = 34 origin, including Elite (n = 49 and Viremic controllers (n = 62. 21 SNPs were identified, comprising five in the coding region and 16 in the non-coding regions and 3'UTR. The variants in the coding region were infrequent and had no major impact on protein structure according to SIFT and PolyPhen score. One intronic SNP (rs2737828 was significantly under-represented in Caucasian patients (P<0.0001 compared to healthy controls (HapMap. Two SNPs showed a non-significant trend towards association with slower disease progression but not with LEDGF/p75 expression. The observed variation in LEDGF/p75 expression was not correlated with disease progression. CONCLUSIONS: LEDGF/p75 is a highly conserved protein. Two non-coding polymorphisms were identified indicating a correlation with disease outcome, but further

  11. Assessment of 12 CHF prediction methods, for an axially non-uniform heat flux distribution, with the RELAP5 computer code

    Energy Technology Data Exchange (ETDEWEB)

    Ferrouk, M. [Laboratoire du Genie Physique des Hydrocarbures, University of Boumerdes, Boumerdes 35000 (Algeria)], E-mail: m_ferrouk@yahoo.fr; Aissani, S. [Laboratoire du Genie Physique des Hydrocarbures, University of Boumerdes, Boumerdes 35000 (Algeria); D' Auria, F.; DelNevo, A.; Salah, A. Bousbia [Dipartimento di Ingegneria Meccanica, Nucleare e della Produzione, Universita di Pisa (Italy)

    2008-10-15

    The present article covers the evaluation of the performance of twelve critical heat flux methods/correlations published in the open literature. The study concerns the simulation of an axially non-uniform heat flux distribution with the RELAP5 computer code in a single boiling water reactor channel benchmark problem. The nodalization scheme employed for the considered particular geometry, as modelled in RELAP5 code, is described. For this purpose a review of critical heat flux models/correlations applicable to non-uniform axial heat profile is provided. Simulation results using the RELAP5 code and those obtained from our computer program, based on three type predictions methods such as local conditions, F-factor and boiling length average approaches were compared.

  12. SNP mining porcine ESTs with MAVIANT, a novel tool for SNP evaluation and annotation

    DEFF Research Database (Denmark)

    Panitz, Frank; Stengaard, Henrik; Hornshoj, Henrik

    2007-01-01

    MOTIVATION: Single nucleotide polymorphisms (SNPs) analysis is an important means to study genetic variation. A fast and cost-efficient approach to identify large numbers of novel candidates is the SNP mining of large scale sequencing projects. The increasing availability of sequence trace data...... manual annotation, which is immediately accessible and can be easily shared with external collaborators. RESULTS: Large-scale SNP mining of polymorphisms bases on porcine EST sequences yielded more than 7900 candidate SNPs in coding regions (cSNPs), which were annotated relative to the human genome. Non...

  13. Long Non-Coding RNAs: Emerging and Versatile Regulators in Host–Virus Interactions

    Directory of Open Access Journals (Sweden)

    Xing-Yu Meng

    2017-11-01

    Full Text Available Long non-coding RNAs (lncRNAs are a class of non-protein-coding RNA molecules, which are involved in various biological processes, including chromatin modification, cell differentiation, pre-mRNA transcription and splicing, protein translation, etc. During the last decade, increasing evidence has suggested the involvement of lncRNAs in both immune and antiviral responses as positive or negative regulators. The immunity-associated lncRNAs modulate diverse and multilayered immune checkpoints, including activation or repression of innate immune signaling components, such as interleukin (IL-8, IL-10, retinoic acid inducible gene I, toll-like receptors 1, 3, and 8, and interferon (IFN regulatory factor 7, transcriptional regulation of various IFN-stimulated genes, and initiation of the cell apoptosis pathways. Additionally, some virus-encoded lncRNAs facilitate viral replication through individually or synergistically inhibiting the host antiviral responses or regulating multiple steps of the virus life cycle. Moreover, some viruses are reported to hijack host-encoded lncRNAs to establish persistent infections. Based on these amazing discoveries, lncRNAs are an emerging hotspot in host–virus interactions. In this review, we summarized the current findings of the host- or virus-encoded lncRNAs and the underlying mechanisms, discussed their impacts on immune responses and viral replication, and highlighted their critical roles in host–virus interactions.

  14. Genome-wide identification and characterization of long intergenic non-coding RNAs in Ganoderma lucidum.

    Directory of Open Access Journals (Sweden)

    Jianqin Li

    Full Text Available Ganoderma lucidum is a white-rot fungus best-known for its medicinal activities. We have previously sequenced its genome and annotated the protein coding genes. However, long non-coding RNAs in G. lucidum genome have not been analyzed. In this study, we have identified and characterized long intergenic non-coding RNAs (lincRNA in G. lucidum systematically. We developed a computational pipeline, which was used to analyze RNA-Seq data derived from G. lucidum samples collected from three developmental stages. A total of 402 lincRNA candidates were identified, with an average length of 609 bp. Analysis of their adjacent protein-coding genes (apcGenes revealed that 46 apcGenes belong to the pathways of triterpenoid biosynthesis and lignin degradation, or families of cytochrome P450, mating type B genes, and carbohydrate-active enzymes. To determine if lincRNAs and these apcGenes have any interactions, the corresponding pairs of lincRNAs and apcGenes were analyzed in detail. We developed a modified 3' RACE method to analyze the transcriptional direction of a transcript. Among the 46 lincRNAs, 37 were found unidirectionally transcribed, and 9 were found bidirectionally transcribed. The expression profiles of 16 of these 37 lincRNAs were found to be highly correlated with those of the apcGenes across the three developmental stages. Among them, 11 are positively correlated (r>0.8 and 5 are negatively correlated (r<-0.8. The co-localization and co-expression of lincRNAs and those apcGenes playing important functions is consistent with the notion that lincRNAs might be important regulators for cellular processes. In summary, this represents the very first study to identify and characterize lincRNAs in the genomes of basidiomycetes. The results obtained here have laid the foundation for study of potential lincRNA-mediated expression regulation of genes in G. lucidum.

  15. MASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing

    DEFF Research Database (Denmark)

    Lindgreen, Stinus; Gardner, Paul P; Krogh, Anders

    2007-01-01

    function that considers sequence conservation, covariation and basepairing probabilities. The results show that the method is very competitive to similar programs available today, both in terms of accuracy and computational efficiency. AVAILABILITY: Source code available from http://mastr.binf.ku.dk/......MOTIVATION: As more non-coding RNAs are discovered, the importance of methods for RNA analysis increases. Since the structure of ncRNA is intimately tied to the function of the molecule, programs for RNA structure prediction are necessary tools in this growing field of research. Furthermore......, it is known that RNA structure is often evolutionarily more conserved than sequence. However, few existing methods are capable of simultaneously considering multiple sequence alignment and structure prediction. RESULT: We present a novel solution to the problem of simultaneous structure prediction...

  16. APCR, factor V gene known and novel SNPs and adverse pregnancy outcomes in an Irish cohort of pregnant women

    LENUS (Irish Health Repository)

    Sedano-Balbas, Sara

    2010-03-10

    Abstract Background Activated Protein C Resistance (APCR), a poor anticoagulant response of APC in haemostasis, is the commonest heritable thrombophilia. Adverse outcomes during pregnancy have been linked to APCR. This study determined the frequency of APCR, factor V gene known and novel SNPs and adverse outcomes in a group of pregnant women. Methods Blood samples collected from 907 pregnant women were tested using the Coatest® Classic and Modified functional haematological tests to establish the frequency of APCR. PCR-Restriction Enzyme Analysis (PCR-REA), PCR-DNA probe hybridisation analysis and DNA sequencing were used for molecular screening of known mutations in the factor V gene in subjects determined to have APCR based on the Coatest® Classic and\\/or Modified functional haematological tests. Glycosylase Mediated Polymorphism Detection (GMPD), a SNP screening technique and DNA sequencing, were used to identify SNPs in the factor V gene of 5 APCR subjects. Results Sixteen percent of the study group had an APCR phenotype. Factor V Leiden (FVL), FV Cambridge, and haplotype (H) R2 alleles were identified in this group. Thirty-three SNPs; 9 silent SNPs and 24 missense SNPs, of which 20 SNPs were novel, were identified in the 5 APCR subjects. Adverse pregnancy outcomes were found at a frequency of 35% in the group with APCR based on Classic Coatest® test only and at 45% in the group with APCR based on the Modified Coatest® test. Forty-eight percent of subjects with FVL had adverse outcomes while in the group of subjects with no FVL, adverse outcomes occurred at a frequency of 37%. Conclusions Known mutations and novel SNPs in the factor V gene were identified in the study cohort determined to have APCR in pregnancy. Further studies are required to investigate the contribution of these novel SNPs to the APCR phenotype. Adverse outcomes including early pregnancy loss (EPL), preeclampsia (PET) and intrauterine growth restriction (IGUR) were not significantly more

  17. Local non-Calderbank-Shor-Steane quantum error-correcting code on a three-dimensional lattice

    International Nuclear Information System (INIS)

    Kim, Isaac H.

    2011-01-01

    We present a family of non-Calderbank-Shor-Steane quantum error-correcting code consisting of geometrically local stabilizer generators on a 3D lattice. We study the Hamiltonian constructed from ferromagnetic interaction of overcomplete set of local stabilizer generators. The degenerate ground state of the system is characterized by a quantum error-correcting code whose number of encoded qubits are equal to the second Betti number of the manifold. These models (i) have solely local interactions; (ii) admit a strong-weak duality relation with an Ising model on a dual lattice; (iii) have topological order in the ground state, some of which survive at finite temperature; and (iv) behave as classical memory at finite temperature.

  18. Local non-Calderbank-Shor-Steane quantum error-correcting code on a three-dimensional lattice

    Science.gov (United States)

    Kim, Isaac H.

    2011-05-01

    We present a family of non-Calderbank-Shor-Steane quantum error-correcting code consisting of geometrically local stabilizer generators on a 3D lattice. We study the Hamiltonian constructed from ferromagnetic interaction of overcomplete set of local stabilizer generators. The degenerate ground state of the system is characterized by a quantum error-correcting code whose number of encoded qubits are equal to the second Betti number of the manifold. These models (i) have solely local interactions; (ii) admit a strong-weak duality relation with an Ising model on a dual lattice; (iii) have topological order in the ground state, some of which survive at finite temperature; and (iv) behave as classical memory at finite temperature.

  19. From Poule de Luxe to Geisha: Source Languages behind the Present-Day English Synonyms of Prostitute

    Directory of Open Access Journals (Sweden)

    Bożena Duda

    2014-11-01

    Full Text Available This paper aims at drawing a picture, as complete as possible, of an anthropocentric reality hidden in the synonyms of prostitute which have been incorporated into the English lexico-semantic system from other languages since the beginning of the 19th century. The body of Present-day English synonyms of prostitute to be analysed includes horizontal, geisha, shawl and poule de luxe. Apart from providing the source languages from which English borrowed the afore-mentioned synonyms of prostitute, an attempt will be made at discovering the plausible cultural and sociological justification for the lexical borrowings to have taken place. In order to make the onomasiological picture of the sense ‘prostitute’ as complete as it can be within the limits of this paper, a mention will be made of the lexical heritage within the range of the synonyms of prostitute which were incorporated into the English language in the course of Middle English, Early Modern English and Late Modern English.

  20. SNPs in genes implicated in radiation response are associated with radiotoxicity and evoke roles as predictive and prognostic biomarkers

    International Nuclear Information System (INIS)

    Alsbeih, Ghazi; El-Sebaie, Medhat; Al-Harbi, Najla; Al-Hadyan, Khaled; Shoukri, Mohamed; Al-Rajhi, Nasser

    2013-01-01

    Biomarkers are needed to individualize cancer radiation treatment. Therefore, we have investigated the association between various risk factors, including single nucleotide polymorphisms (SNPs) in candidate genes and late complications to radiotherapy in our nasopharyngeal cancer patients. A cohort of 155 patients was included. Normal tissue fibrosis was scored using RTOG/EORTC grading system. A total of 45 SNPs in 11 candidate genes (ATM, XRCC1, XRCC3, XRCC4, XRCC5, PRKDC, LIG4, TP53, HDM2, CDKN1A, TGFB1) were genotyped by direct genomic DNA sequencing. Patients with severe fibrosis (cases, G3-4, n = 48) were compared to controls (G0-2, n = 107). Univariate analysis showed significant association (P < 0.05) with radiation complications for 6 SNPs (ATM G/A rs1801516, HDM2 promoter T/G rs2279744 and T/A rs1196333, XRCC1 G/A rs25487, XRCC5 T/C rs1051677 and TGFB1 C/T rs1800469). In addition, Kaplan-Meier analyses have also highlighted significant association between genotypes and length of patients’ follow-up after radiotherapy. Multivariate logistic regression has further sustained these results suggesting predictive and prognostic roles of SNPs. Univariate and multivariate analysis suggest that radiation toxicity in radiotherapy patients are associated with certain SNPs, in genes including HDM2 promoter studied for the 1st time. These results support the use of SNPs as genetic predictive markers for clinical radiosensitivity and evoke a prognostic role for length of patients’ follow-up after radiotherapy