Catsburg, Arnold; van der Zwet, Wil C.; Morre, Servaas A.; Ouburg, Sander; Vandenbroucke-Grauls, Christina M. J. E.; Savelkoul, Paul H. M.
Reliable analysis of single nucleotide polymorphisms (SNPs) in DNA derived from samples containing low numbers of cells or from suboptimal sources can be difficult. A new procedure to characterize multiple SNPs in traces of DNA from plasma and old dried blood samples was developed. Six SNPs in the
Bruno, Damien L; Stark, Zornitza; Amor, David J; Burgess, Trent; Butler, Kathy; Corrie, Sylvea; Francis, David; Ganesamoorthy, Devika; Hills, Louise; James, Paul A; O'Rielly, Darren; Oertel, Ralph; Savarirayan, Ravi; Prabhakara, Krishnamurthy; Salce, Nicholas; Slater, Howard R
Microarray analysis has provided significant advances in the diagnosis of conditions resulting from submicroscopic chromosome abnormalities. It has been recommended that array testing should be a "first tier" test in the evaluation of individuals with intellectual disability, developmental delay, congenital anomalies, and autism. The availability of arrays with increasingly high probe coverage and resolution has increased the detection of decreasingly small copy number changes (CNCs) down to the intragenic or even exon level. Importantly, arrays that genotype SNPs also detect extended regions of homozygosity. We describe 14 examples of single gene disorders caused by intragenic changes from a consecutive set of 6,500 tests using high-resolution SNP microarrays. These cases illustrate the increased scope of cytogenetic testing beyond dominant chromosome rearrangements that typically contain many genes. Nine of the cases confirmed the clinical diagnosis, that is, followed a "phenotype to genotype" approach. Five were diagnosed by the laboratory analysis in the absence of a specific clinical diagnosis, that is, followed a "genotype to phenotype" approach. Two were clinically significant, incidental findings. The importance of astute clinical assessment and laboratory-clinician consultation is emphasized to optimize the value of microarrays in the diagnosis of disorders caused by single gene copy number and sequence mutations. © 2011 Wiley-Liss, Inc.
Hedayat, Amin A; Linos, Konstantinos; Jung, Hou-Sung; Tafe, Laura J; Yan, Shaofeng; LeBlanc, Robert E; Lefferts, Joel A
Histopathology is the gold standard for diagnosing melanocytic lesions; however, distinguishing benign versus malignant is not always clear histologically. Single nucleotide polymorphism (SNP) microarray analysis may help in making a definitive diagnosis. Here, we share our experience with the Oncoscan FFPE Assay and demonstrate its diagnostic utility in the context of ambiguous melanocytic lesions. Eleven archival melanocytic lesions, including three benign nevi, four melanomas, three BAP1-deficient Spitzoid nevi and one nevoid melanoma were selected for validation. SNP-array was performed according to the manufacturer's protocol, using the recommended 80ng of DNA; however, as little as 15ng was used if the extraction yield was lower. Concordance was assessed with H&E and various combinations of BAP1 and p16 immunohistochemical stains (IHC) and external reference laboratory chromosomal microarray results. After validation, the SNP array was utilized to make definitive diagnoses in four challenging cases. Oncoscan SNP array findings were in concordance with H&E, IHC, and reference laboratory chromosomal microarray testing. The SNP-based microarray can accurately detect copy number changes and aid in making a more definitive diagnosis of challenging melanocytic lesions. This can be accomplished using significantly less DNA than is required by other microarray technologies. Copyright © 2017. Published by Elsevier Inc.
Tea is an important cash crop, representing a $40 billion-a-year global market. Differentiation of the tea market has resulted in increasing demand for tea products that are sustainably and responsibly produced. Tea authentication is important because of growing concerns about fraud involving premium tea products. Analytical technologies are needed for protection and value enhancement of high-quality brands. For loose-leaf teas, the challenge is that the authentication needs to be established on the basis of a single leaf, so that the products can be traced back to the original varieties. A new generation of molecular markers offers an ideal solution for authentication of processed agricultural products. Using a nanofluidic array to identify variant SNP sequences, we tested genetic identities using DNA extracted from single leaves of 14 processed commercial tea products. Based on the profiles of 60 SNP markers, the genetic identity of each tea sample was unambiguously identified by multilocus matching and ordination analysis. Results for repeated samples of multiple tea leaves from the same products (using three independent DNA extractions) showed 100% concordance, indicating that the nanofluidic system is a reliable platform for generating tea DNA fingerprints with high accuracy. The method worked well on green, oolong, and black teas, and can handle a large number of samples in a short period of time. It is robust and cost-effective, thus showing high potential for practical application in the value chain of the tea industry.
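The multilocus matching step mentioned in the tea abstract can be illustrated with a minimal sketch. It assumes each sample is a list of unphased genotype calls (one two-letter string per SNP, None for a missing call) and scores a query leaf against reference profiles by the fraction of jointly called loci with identical genotypes. The function names, variety names and genotypes are hypothetical and are not taken from the study.

```python
def match_fraction(profile_a, profile_b):
    """Fraction of jointly called loci where two profiles share a genotype."""
    compared = 0
    matched = 0
    for a, b in zip(profile_a, profile_b):
        if a is None or b is None:
            continue  # skip loci with a missing call in either sample
        compared += 1
        if sorted(a) == sorted(b):  # "AG" and "GA" are the same unphased genotype
            matched += 1
    return matched / compared if compared else 0.0


def best_match(query, references):
    """Return (reference name, score) for the closest reference profile."""
    scores = {name: match_fraction(query, prof) for name, prof in references.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]


# Hypothetical reference fingerprints at five SNP loci.
references = {
    "variety_A": ["AA", "AG", "CC", "TT", "CT"],
    "variety_B": ["AG", "GG", "CC", "CT", "TT"],
}
# Hypothetical single-leaf sample with one failed call.
leaf = ["AA", "GA", "CC", None, "CT"]
result = best_match(leaf, references)
print(result)
```

A real panel would use all 60 markers and would typically also require a minimum number of called loci before declaring a match.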
Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for association and conservation studies. Genome-wide SNP development in many teleost species is still challenging because of genome complexity and the cost of re-sequencing. Genotyping-by-sequencing (GBS) provides an efficient reduced-representation method that lowers the cost of SNP detection; however, most recent GBS applications have been reported in plants. In this work, we applied an EcoRI-NlaIII based GBS protocol to the teleost large yellow croaker, an important commercial fish in China and East Asia, and report the first whole-genome SNP development for the species. A total of 69,845 high-quality SNP markers, evenly distributed along the genome, were detected in at least 80% of 500 individuals. Nearly 95% of randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. Association studies with muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, contributing up to ∼63% of the genetic variance explained by all markers. Functional genes involved in the fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, the PPT2 gene, previously identified in an association study of plasma n-3 and n-6 polyunsaturated fatty acid levels in humans, was rediscovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS can produce quality SNP markers in a cost-efficient manner in a teleost genome. The developed SNP markers and the EPA- and DHA-associated SNP loci provide invaluable resources for studies of population structure, conservation genetics and genomic selection of large yellow croaker and other fish.
Rizzi, Giovanni; Østerberg, Frederik Westergaard; Dufva, Martin
We present a magnetoresistive sensor platform for hybridization assays and demonstrate its applicability on single nucleotide polymorphism (SNP) genotyping. The sensor relies on anisotropic magnetoresistance in a new geometry with a local negative reference and uses the magnetic field from...
Canine atopic dermatitis (cAD) is a common inflammatory skin disease that is considered to be a naturally occurring, spontaneous model of human atopic dermatitis (eczema). The aim of the paper was to identify the SNP rs22114085 in different dog breeds. The material comprised 52 dogs from 5 different breeds. Canine genomic DNA was isolated from saliva by a modified method using DNAzol® and a linear polyacrylamide (LPA) carrier, and from blood using the commercial kit NucleospinBlood, and was used to determine rs22114085 SNP genotypes by the PCR-RFLP method. The PCR products were digested with the DdeI restriction enzyme. The C allele was found in Czech Pointer, Chihuahua and German Wirehaired Pointer, with an allele frequency ranging from 0.4545 to 1.00. In the Czech Pointer population we detected all genotypes, CC, CT and TT, with frequencies of 0.25, 0.5833 and 0.1667 in males and 0.2728, 0.3636 and 0.3636 in females, respectively. In German Wirehaired Pointer, the homozygous genotype CC was detected in males and the heterozygous genotype CT in females, each with a frequency of 1. In Chihuahua, the homozygous genotype CC and the heterozygous genotype CT were observed with frequencies of 0.3333 and 0.6667, respectively. In Golden Retriever and Pinscher we detected only genotype TT, with a frequency of 1.
The aim of the paper was to identify the SNP rs23472497 associated with canine atopic dermatitis (cAD). cAD is a common inflammatory skin disease that is considered to be a naturally occurring, spontaneous model of human atopic dermatitis (eczema). The material comprised 60 dogs from 6 different breeds. Canine genomic DNA was isolated from saliva by a modified method using DNAzol® and a linear polyacrylamide (LPA) carrier, and from blood using the commercial kit NucleospinBlood, and was used to determine rs23472497 SNP genotypes by the ACRS-PCR method. The PCR products were digested with the NlaIII restriction enzyme. In the Czech Pointer and Slovak Wirehaired Pointer populations we detected all genotypes, AA, AG and GG, with frequencies of 0.0732, 0.5122 and 0.4146 for Czech Pointer and 0.1818, 0.5455 and 0.2727 for Slovak Wirehaired Pointer. In Border Collie, the heterozygous genotype AG and the homozygous genotype GG were observed with frequencies of 0.6667 and 0.3333, respectively. In German Wirehaired Pointer, Australian Shepherd and American Staffordshire Terrier we detected only genotype AG, with a frequency of 1. The A allele frequency ranged from 0.3293 to 0.5, and the G allele frequency ranged from 0.5 to 0.6707.
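The allele frequency ranges reported above follow from the genotype frequencies by simple allele counting: each homozygote contributes its full frequency to one allele and each heterozygote contributes half to each. The sketch below reproduces the arithmetic for the Czech Pointer figures quoted in the abstract; the function name and the tolerance check are illustrative choices, not part of the paper.

```python
def allele_frequencies(f_hom1, f_het, f_hom2, tol=1e-3):
    """Allele frequencies (p, q) from genotype frequencies of a biallelic SNP.

    f_hom1: frequency of the first homozygote (e.g. AA)
    f_het:  frequency of the heterozygote (e.g. AG)
    f_hom2: frequency of the second homozygote (e.g. GG)
    """
    total = f_hom1 + f_het + f_hom2
    if abs(total - 1.0) > tol:
        raise ValueError(f"genotype frequencies sum to {total}, expected 1")
    p = f_hom1 + f_het / 2  # frequency of the first allele (A)
    q = f_hom2 + f_het / 2  # frequency of the second allele (G)
    return p, q


# Czech Pointer genotype frequencies for rs23472497 (AA, AG, GG) from the abstract.
p_a, q_g = allele_frequencies(0.0732, 0.5122, 0.4146)
print(round(p_a, 4), round(q_g, 4))
```

For the Czech Pointer this gives A = 0.0732 + 0.5122/2 = 0.3293 and G = 0.4146 + 0.5122/2 = 0.6707, which are the lower bound of the reported A range and the upper bound of the reported G range.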
The nature of the single nucleotide polymorphism (SNP) marker was validated by DNA sequencing of the parental PCR products. Using high resolution melt (HRM) profiles and normalised difference plots, we successfully differentiated the homozygous dominant (wild type), homozygous recessive (LPA) and heterozygous ...
Tsui, Circe; Coleman, Laura E.; Griffith, Jacqulyn L.; Bennett, E. Andrew; Goodson, Summer G.; Scott, Jason D.; Pittard, W. Stephen; Devine, Scott E.
An international effort is underway to generate a comprehensive haplotype map (HapMap) of the human genome represented by an estimated 300 000 to 1 million ‘tag’ single nucleotide polymorphisms (SNPs). Our analysis indicates that the current human SNP map is not sufficiently dense to support the HapMap project. For example, 24.6% of the genome currently lacks SNPs at the minimal density and spacing that would be required to construct even a conservative tag SNP map containing 300 000 SNPs. In an effort to improve the human SNP map, we identified 140 696 additional SNP candidates using a new bioinformatics pipeline. Over 51 000 of these SNPs mapped to the largest gaps in the human SNP map, leading to significant improvements in these regions. Our SNPs will be immediately useful for the HapMap project, and will allow for the inclusion of many additional genomic intervals in the final HapMap. Nevertheless, our results also indicate that additional SNP discovery projects will be required both to define the haplotype architecture of the human genome and to construct comprehensive tag SNP maps that will be useful for genetic linkage studies in humans. PMID:12907734
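The idea of locating regions that lack SNPs at a required spacing can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the density and spacing criteria used, so `max_spacing`, the function names and the toy coordinates are hypothetical.

```python
def find_gaps(positions, chrom_length, max_spacing):
    """Return (start, end) intervals where adjacent SNPs are more than max_spacing apart.

    The chromosome ends (0 and chrom_length) are treated as boundaries.
    """
    points = [0] + sorted(positions) + [chrom_length]
    gaps = []
    for left, right in zip(points, points[1:]):
        if right - left > max_spacing:
            gaps.append((left, right))
    return gaps


def gap_fraction(gaps, chrom_length):
    """Fraction of the chromosome covered by the listed gaps."""
    return sum(end - start for start, end in gaps) / chrom_length


# Toy chromosome of 10,000 bp with four SNPs and a 1,000 bp spacing requirement.
snps = [100, 200, 5000, 5100]
gaps = find_gaps(snps, chrom_length=10_000, max_spacing=1_000)
print(gaps, gap_fraction(gaps, 10_000))
```

New SNP candidates can then be prioritized by checking which of them fall inside the largest intervals returned by `find_gaps`.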
Galeano, Carlos H; Fernández, Andrea C; Gómez, Marcela; Blair, Matthew W
Expressed sequence tags (ESTs) are an important source of gene-based markers such as those based on insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). Several gel-based methods have been reported for the detection of sequence variants; however, they have not been widely exploited in common bean, an important legume crop of the developing world. The objectives of this project were to develop and map EST-based markers using analysis of single strand conformation polymorphisms (SSCPs), to create a transcript map for common bean and to compare synteny of the common bean map with sequenced chromosomes of other legumes. A set of 418 EST-based amplicons were evaluated for parental polymorphisms using the SSCP technique and 26% of these presented a clear conformational or size polymorphism between Andean and Mesoamerican genotypes. The amplicon-based markers were then used for genetic mapping with segregation analysis performed in the DOR364 × G19833 recombinant inbred line (RIL) population. A total of 118 new marker loci were placed into an integrated molecular map for common bean consisting of 288 markers. Of these, 218 were used for synteny analysis and 186 presented homology with segments of the soybean genome with an e-value lower than 7 × 10^-12. The synteny analysis with soybean showed a mosaic pattern of syntenic blocks with most segments of any one common bean linkage group associated with two soybean chromosomes. The analysis with Medicago truncatula and Lotus japonicus presented fewer syntenic regions, consistent with the more distant phylogenetic relationship between the galegoid and phaseoloid legumes. The SSCP technique is a useful and inexpensive alternative to other SNP or Indel detection techniques for saturating the common bean genetic map with functional markers that may be useful in marker assisted selection. In addition, the genetic markers based on ESTs allowed the construction of a transcript map and given their high conservation
Therefore, we developed the first SNP assay to test stratification between Chinese and Japanese populations living in East Asia. The ancestry ... The SNP assay showed excellent promise as a highly potential application to test population stratification in case-control studies of association in Eastern Asians. Key words: ...
Shi, Ainong; Chen, Pengyin; Vierling, Richard; Zheng, Cuming; Li, Dexiao; Dong, Dekun; Shakiba, Ehsan; Cervantez, Innan
Soybean mosaic virus (SMV) is one of the most destructive viral diseases in soybean (Glycine max). Three independent loci for SMV resistance have been identified in soybean germplasm. The use of genetic resistance is the most effective method of controlling this disease. Marker assisted selection (MAS) has become very important and useful in the effort of selecting genes for SMV resistance. Single nucleotide polymorphism (SNP), because of its abundance and high-throughput potential, is a powerful tool in genome mapping, association studies, diversity analysis, and tagging of important genes in plant genomics. In this study, a 10 SNPs plus one insert/deletion (InDel) multiplex assay was developed for SMV resistance: two SNPs were developed from the candidate gene 3gG2 at Rsv1 locus, two SNPs selected from the clone N11PF linked to Rsv1, one 'BARC' SNP screened from soybean chromosome 13 [linkage group (LG) F] near Rsv1, two 'BARC' SNPs from probe A519 linked to Rsv3, one 'BARC' SNP from chromosome 14 (LG B2) near Rsv3, and two 'BARC' SNPs from chromosome 2 (LG D1b) near Rsv4, plus one InDel marker from expressed sequence tag (EST) AW307114 linked to Rsv4. This 11 SNP/InDel multiplex assay showed polymorphism among 47 diverse soybean germplasm, indicating this assay can be used to investigate the mode of inheritance in a SMV resistant soybean line carrying Rsv1, Rsv3, and/or Rsv4 through a segregating population with phenotypic data, and to select a specific gene or pyramid two or three genes for SMV resistance through MAS in soybean breeding program. The presence of two SMV resistance genes (Rsv1 and Rsv3) in J05 soybean was confirmed by the SNP assay.
The papers published in this Special Issue “SNP arrays” (Single Nucleotide Polymorphism Arrays) focus on several perspectives associated with arrays of this type. The papers range from a case report to reviews, thereby targeting wider audiences working in this field. The research focus of SNP arrays is often human cancers but this Issue expands that focus to include areas such as rare conditions, animal breeding and bioinformatics tools. Given the limited scope, the spectrum of papers is nothing short of remarkable and even from a technical point of view these papers will contribute to the field at a general level. Three of the papers published in this Special Issue focus on the use of various SNP array approaches in the analysis of three different cancer types. Two of the papers concentrate on two very different rare conditions, applying the SNP arrays slightly differently. Finally, two other papers evaluate the use of the SNP arrays in the context of genetic analysis of livestock. The findings reported in these papers help to close gaps in the current literature and also to give guidelines for future applications of SNP arrays.
Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette
Background: Allergic rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of AR, as twin and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects, with AR cases consisting of 434 sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done with the Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using an additive effect model, GWAS was conducted in the discovery sample, the genotypes...
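A polygenic risk score under an additive model is commonly computed as a weighted sum of effect-allele dosages (0, 1 or 2 copies per SNP), with weights taken from GWAS effect estimates. The sketch below shows that general form only; the abstract is truncated before the authors' actual PRS procedure is described, and the SNP identifiers, weights and function name here are hypothetical.

```python
def polygenic_risk_score(dosages, weights):
    """Weighted sum of effect-allele dosages over the SNPs that have a weight.

    dosages: dict mapping SNP id -> number of effect alleles (0, 1 or 2)
    weights: dict mapping SNP id -> effect size (e.g. log odds ratio)
    SNPs missing from `dosages` are skipped rather than imputed.
    """
    score = 0.0
    for snp, beta in weights.items():
        dosage = dosages.get(snp)
        if dosage is None:
            continue  # genotype not available for this individual
        score += beta * dosage
    return score


# Hypothetical effect sizes from a discovery sample.
weights = {"snp1": 0.5, "snp2": -0.25, "snp3": 0.1}
# Hypothetical individual with no call at snp3.
person = {"snp1": 2, "snp2": 1}
prs = polygenic_risk_score(person, weights)
print(prs)
```

In practice the set of SNPs entering the score is usually restricted by a p-value threshold and pruned for linkage disequilibrium before the sum is taken.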
Jiang, Chao; Huang, Lu-Qi; Yuan, Yuan; Chen, Min; Hou, Jing-Yi; Wu, Zhi-Gang; Lin, Shu-Fang
Single nucleotide polymorphisms (SNPs) are important molecular markers in traditional Chinese medicine (TCM) research and are widely used in TCM authentication. The present study established a new genotyping method that combines restriction endonuclease digestion with melting curve analysis; it is a stable, rapid and easy-to-perform SNP genotyping method. The new method was used to genotype two chloroplast SNPs located inside or outside the endonuclease recognition site. The results showed that when a 14 bp GC-clamp (cggcgggagggcgg) was attached to the 5' end of the primer and a suitable endonuclease was selected to digest the amplification products, the melting curves of Lonicera japonica and Atractylodes macrocephala all showed double peaks, whereas those of the adulterants Shan-yin-hua and A. lancea showed single peaks. The results indicated that the method had good stability and reproducibility for distinguishing authentic medicines from their adulterants. It is a potential SNP genotyping method and was named restriction endonuclease digest - melting curve analysis.
Many genetic association studies have used single nucleotide polymorphism (SNP) data to identify genetic variants for complex diseases. Although SNP-based associations are most common in genome-wide association studies (GWAS), gene-based association analysis has received increasing attention in understanding genetic etiologies for complex diseases. While both methods have been used to analyze the same data, few genome-wide association studies compare the results or observe the connection between them. We performed a comprehensive analysis of the data from the Study of Addiction: Genetics and Environment (SAGE) and compared the results from the SNP-based and gene-based analyses. Our results suggest that the gene-based method complements the individual SNP-based analysis, and conceptually they are closely related. In terms of gene findings, our results validate many genes that were either reported from the analysis of the same dataset or based on animal studies for substance dependence.
Rodrigo Iván Contreras-Soto
Mapping quantitative trait loci through the use of linkage disequilibrium (LD) in populations of unrelated individuals provides a valuable approach for dissecting the genetic basis of complex traits in soybean (Glycine max). The haplotype-based genome-wide association study (GWAS) has now been proposed as a complementary approach to intensify benefits from LD, enabling assessment of the genetic determinants of agronomic traits. In this study a GWAS was undertaken to identify genomic regions that control 100-seed weight (SW), plant height (PH) and seed yield (SY) in a soybean association mapping panel using single nucleotide polymorphism (SNP) markers and haplotype information. The soybean cultivars (N = 169) were field-evaluated across four locations of southern Brazil. The genome-wide haplotype association analysis (941 haplotypes) identified eleven, seventeen and fifty-nine SNP-based haplotypes significantly associated with SY, SW and PH, respectively. Although most marker-trait associations were environment and trait specific, stable haplotype associations were identified for SY and SW across environments (i.e., haplotypes Gm12_Hap12). The haplotype block 42 on Chr19 (Gm19_Hap42) was confirmed to be associated with PH in two environments. These findings enable us to refine the breeding strategy for tropical soybean and confirm that haplotype-based GWAS can provide new insights on the genetic determinants that are not captured by the single-marker approach.
Journal of Genetics; Volume 96; Issue 2. cDNA cloning, structural analysis, SNP detection and tissue ... Abstract. Insulin-like growth factor 1 (IGF1) plays an important role in growth, reproduction, foetal development and cell proliferation. The present study was conducted to clone and sequence the ...
[Naicy T., Venkatachalapathy T., Aravindakshan T., Raghavan K. C., Mini M. and Shyama K. 2017 cDNA cloning, structural analysis, SNP detection and tissue expression profile of the IGF1 gene in Malabari and Attappady Black goats of India. J. Genet. 96, xx–xx]. Introduction. Insulin-like growth factor 1 (IGF1), an important ...
Ting, Jason C; Ye, Ying; Thomas, George H; Ruczinski, Ingo; Pevsner, Jonathan
A variety of diseases are caused by chromosomal abnormalities such as aneuploidies (having an abnormal number of chromosomes), microdeletions, microduplications, and uniparental disomy. High density single nucleotide polymorphism (SNP) microarrays provide information on chromosomal copy number changes, as well as genotype (heterozygosity and homozygosity). SNP array studies generate multiple types of data for each SNP site, some with more than 100,000 SNPs represented on each array. The identification of different classes of anomalies within SNP data has been challenging. We have developed SNPscan, a web-accessible tool to analyze and visualize high density SNP data. It enables researchers (1) to visually and quantitatively assess the quality of user-generated SNP data relative to a benchmark data set derived from a control population, (2) to display SNP intensity and allelic call data in order to detect chromosomal copy number anomalies (duplications and deletions), (3) to display uniparental isodisomy based on loss of heterozygosity (LOH) across genomic regions, (4) to compare paired samples (e.g. tumor and normal), and (5) to generate a file type for viewing SNP data in the University of California, Santa Cruz (UCSC) Human Genome Browser. SNPscan accepts data exported from Affymetrix Copy Number Analysis Tool as its input. We validated SNPscan using data generated from patients with known deletions, duplications, and uniparental disomy. We also inspected previously generated SNP data from 90 apparently normal individuals from the Centre d'Etude du Polymorphisme Humain (CEPH) collection, and identified three cases of uniparental isodisomy, four females having an apparently mosaic X chromosome, two mislabelled SNP data sets, and one microdeletion on chromosome 2 with mosaicism from an apparently normal female. These previously unrecognized abnormalities were all detected using SNPscan. The microdeletion was independently confirmed by fluorescence in situ
Qiu, Ping; Wang, Luquan; Kostich, Mitch; Ding, Wei; Simon, Jason S; Greene, Jonathan R
Carcinogenesis occurs, at least in part, due to the accumulation of mutations in critical genes that control the mechanisms of cell proliferation, differentiation and death. Publicly accessible databases contain millions of expressed sequence tag (EST) and single nucleotide polymorphism (SNP) records, which have the potential to assist in the identification of SNPs overrepresented in tumor tissue. An in silico SNP-tumor association study was performed utilizing tissue library and SNP information available in NCBI's dbEST (release 092002) and dbSNP (build 106). A total of 4865 SNPs were identified which were present at higher allele frequencies in tumor compared to normal tissues. A subset of 327 (6.7%) SNPs induce amino acid changes to the protein coding sequences. This approach identified several SNPs which have been previously associated with carcinogenesis, as well as a number of SNPs that now warrant further investigation. This novel in silico approach can assist in prioritization of genes and SNPs in the effort to elucidate the genetic mechanisms underlying the development of cancer.
You, Qian; Yang, Xiping; Peng, Ze; Xu, Liping; Wang, Jianping
Polyploid species play significant roles in agriculture and food production. Many crop species are polyploid, such as potato, wheat, strawberry, and sugarcane. Genotyping has been a daunting task for genetic studies of polyploid crops, which lag far behind the diploid crop species. The single nucleotide polymorphism (SNP) array is considered to be one of the high-throughput, relatively cost-efficient and automated genotyping approaches. However, there are significant challenges for SNP identification in complex, polyploid genomes, which have seriously slowed SNP discovery and array development in polyploid species. Ploidy is a significant factor impacting SNP quality and validation rates of SNP markers in SNP arrays, which have been proven to be a very important tool for genetic studies and molecular breeding. In this review, we (1) discussed the pros and cons of SNP arrays in general for high-throughput genotyping, (2) presented the challenges of and solutions to SNP calling in polyploid species, (3) summarized the SNP selection criteria and considerations of SNP array design for polyploid species, (4) illustrated SNP array applications in several different polyploid crop species, then (5) discussed challenges, available software, and their accuracy comparisons for genotype calling based on SNP array data in polyploids, and finally (6) provided a series of SNP array design and genotype calling recommendations. This review presents a complete overview of SNP array development and applications in polyploid crops, which will benefit research in molecular breeding and genetics of crops with complex genomes. PMID:29467780
Fang, Wan-Ping; Meinhardt, Lyndel W; Tan, Hua-Wei; Zhou, Lin; Mischke, Sue; Zhang, Dapeng
Apart from water, tea is the world's most widely consumed beverage. Tea is produced in more than 50 countries with an annual production of approximately 4.7 million tons. The market segment for specialty tea has been expanding rapidly owing to increased demand, resulting in higher revenues and profits for tea growers and the industry. Accurate varietal identification is critically important to ensure traceability and authentication of premium tea products, which in turn contribute to on-farm conservation of tea genetic diversity. Using a set of single nucleotide polymorphism (SNP) markers developed from the expressed sequence tag (EST) database of Camellia sinensis, we genotyped deoxyribonucleic acid (DNA) samples extracted from a diverse group of tea varieties, including both fresh and processed commercial loose-leaf teas. The validation led to the designation of 60 SNPs that unambiguously identified all 40 tested tea varieties with high statistical rigor (p...) ... authenticity and genetic relationships among the analyzed cultivars were further characterized by ordination and Bayesian clustering analysis. These SNP markers, in combination with a high-throughput genotyping protocol, effectively established and verified specific DNA fingerprints for all tested tea varieties. This method provides a powerful tool for variety authentication and quality control for the tea industry. It is also highly useful for the management of tea genetic resources and breeding, where accurate and efficient genotype identification is essential.
Rotherham, D; Harbison, S A
Cannabis sativa is both an illegal drug and a legitimate crop. The differentiation of illegal drug Cannabis from non-drug forms of Cannabis is relevant in the context of the growth of fibre and seed oil varieties of Cannabis for commercial purposes. This differentiation is currently determined based on the levels of tetrahydrocannabinol (THC) in adult plants. DNA-based methods have the potential to assay Cannabis material unsuitable for analysis using conventional means, including seeds, pollen and severely degraded material. The purpose of this research was to develop a single nucleotide polymorphism (SNP) assay for the differentiation of "drug" and "non-drug" Cannabis plants. An assay was developed based on four polymorphisms within a 399 bp fragment of the tetrahydrocannabinolic acid (THCA) synthase gene, utilising the SNaPshot multiplex kit. This SNP assay was tested on 94 Cannabis plants, which included 10 blind samples, and was able to differentiate between "drug" and "non-drug" Cannabis in all cases, while also differentiating between Cannabis and other species. Non-drug plants were found to be homozygous at the four sites assayed while drug Cannabis plants were either homozygous or heterozygous. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Single nucleotide polymorphisms (SNPs) may be considered the ultimate genetic markers as they represent the finest resolution of a DNA sequence (a single nucleotide), and are generally abundant in populations with a low mutation rate. SNPs are important tools in studying complex genetic traits and genome evolution.
Accurate identification of individual genotypes is important for cacao (Theobroma cacao L.) breeding, germplasm conservation and seed propagation. The development of single nucleotide polymorphism (SNP) markers in cacao offers an effective way to use a high-throughput genotyping system for cacao gen...
Seo, Seunghee; Takayama, Kanako; Uno, Kyosuke; Ohi, Kazutaka; Hashimoto, Ryota; Nishizawa, Daisuke; Ikeda, Kazutaka; Ozaki, Norio; Nabeshima, Toshitaka; Miyamoto, Yoshiaki; Nitta, Atsumi
The single nucleotide polymorphism (SNP) rs13438494 in intron 24 of PCLO was significantly associated with bipolar disorder in a meta-analysis of genome-wide association studies. In this study, we performed functional minigene analysis and bioinformatics prediction of splicing regulatory sequences to characterize the deep intronic SNP rs13438494. We constructed minigenes with A and C alleles containing exon 24, intron 24, and exon 25 of PCLO to assess the genetic effect of rs13438494 on splicing. We found that the C allele of rs13438494 reduces the splicing efficiency of the PCLO minigene. In addition, prediction analysis of enhancer/silencer motifs using the Human Splice Finder web tool indicated that rs13438494 induces the abrogation or creation of such binding sites. Our results indicate that rs13438494 alters splicing efficiency by creating or disrupting a splicing motif, which functions by binding of splicing regulatory proteins, and may ultimately result in bipolar disorder in affected people. PMID:24167553
Liu, Jingyu; Pearlson, Godfrey; Calhoun, Vince; Windemuth, Andreas
There is current interest in understanding genetic influences on brain function in both the healthy and the disordered brain. Parallel independent component analysis, a new method for analyzing multimodal data, is proposed in this paper and applied to functional magnetic resonance imaging (fMRI) and a single nucleotide polymorphism (SNP) array. The method aims to identify the independent components of each modality and the relationship between the two modalities. We analyzed 92 participants, including 29 schizophrenia (SZ) patients, 13 unaffected SZ relatives, and 50 healthy controls. We found a correlation of 0.79 between one fMRI component and one SNP component. The fMRI component consists of activations in cingulate gyrus, multiple frontal gyri, and superior temporal gyrus. The related SNP component is contributed to significantly by 9 SNPs located in sets of genes, including those coding for apolipoprotein A-I, and C-III, malate dehydrogenase 1 and the gamma-aminobutyric acid alpha-2 receptor. A significant difference in the presences of this SNP component is found between the SZ group (SZ patients and their relatives) and the control group. In summary, we constructed a framework to identify the interactions between brain functional and genetic information; our findings provide new insight into understanding genetic influences on brain function in a common mental disorder.
Gross, Arnd; Tönjes, Anke; Scholz, Markus
When testing for SNP (single nucleotide polymorphism) associations in related individuals, observations are not independent. Simple linear regression assuming independent normally distributed residuals results in an increased type I error, and the power of the test is also affected in a more complicated manner. Inflation of type I error is often successfully corrected by genomic control. However, this reduces the power of the test when relatedness is of concern. In the present paper, we derive explicit formulae to investigate how heritability and strength of relatedness contribute to variance inflation of the effect estimate of the linear model. Further, we study the consequences of variance inflation on hypothesis testing and compare the results with those of genomic control correction. We apply the developed theory to the publicly available HapMap trio data (N=129), the Sorbs (a self-contained population with N=977 characterised by a cryptic relatedness structure) and synthetic family studies with different sample sizes (ranging from N=129 to N=999) and different degrees of relatedness. We derive explicit and easy-to-apply approximation formulae to estimate the impact of relatedness on the variance of the effect estimate of the linear regression model. Variance inflation increases with increasing heritability. Relatedness structure also impacts the degree of variance inflation, as shown for example family structures. Variance inflation is smallest for HapMap trios, followed by a synthetic family study corresponding to the trio data but with larger sample size than HapMap. The next strongest inflation is observed for the Sorbs, and finally, for a synthetic family study with a more extreme relatedness structure but with similar sample size as the Sorbs. Type I error increases rapidly with increasing inflation. However, for smaller significance levels, power increases with increasing inflation while the opposite holds for larger significance levels. When genomic control
Cunningham, J M; Vierkant, R A; Sellers, T A
BACKGROUND: Dysregulation of the cell cycle is a hallmark of many cancers including ovarian cancer, a leading cause of gynaecologic cancer mortality worldwide. METHODS: We examined single nucleotide polymorphisms (SNPs) (n=288) from 39 cell cycle regulation genes, including cyclins, cyclin...... in a replication population, and the association remained suggestive in the combined analysis [OR(BB vs AA) 1.59 (1.08-2.34), P=0.02]. No other SNP associations remained suggestive in the replication populations. CONCLUSION: ABL1 has been implicated in multiple processes including cell division, cell adhesion...
Chen, Bowang; Wilkening, Stefan; Drechsel, Marion; Hemminki, Kari
Single nucleotide polymorphism (SNP) genotyping is a major activity in biomedical research. Scientists prefer to have easy access to the results, which may require conversions between data formats. First-hand SNP data is often entered or saved in the MS-Excel format, but this software lacks genetic and epidemiological functions. A general tool for basic genetic and epidemiological analysis and data conversion in MS-Excel is needed. The SNP_tools package is prepared as an add-in for MS-Excel. The code is written in Visual Basic for Applications, embedded in the Microsoft Office package. This add-in is an easy-to-use tool for users with basic computer knowledge (and requirements for basic statistical analysis). Our implementation for Microsoft Excel 2000-2007 in Microsoft Windows 2000, XP, Vista and Windows 7 beta can handle files in different formats and convert them into other formats. It is free software.
Wan, Yan; Wu, Wei; Yin, Zhihua; Guan, Peng; Zhou, Baosen
The tumor suppressor gene p53 is involved in multiple cellular pathways including apoptosis, transcriptional control, and cell cycle regulation. In the last decade it has been demonstrated that the single nucleotide polymorphism (SNP) at codon 72 of the p53 gene is associated with the risk for development of various neoplasms. MDM2 SNP309 is a single nucleotide T to G polymorphism located in the MDM2 gene promoter. From the time that this well-characterized functional polymorphism was identified, a variety of case-control studies have been published that investigate the possible association between MDM2 SNP309 and cancer risk. However, the results of the published studies, as well as the subsequent meta-analyses, remain contradictory. To investigate whether currently published epidemiological studies can clarify the potential interaction between MDM2 SNP309 and the functional genetic variant in p53 codon72 (Arg72Pro) and p53 mutation status, we performed a meta-analysis of the risk estimate on 27,813 cases with various tumor types and 30,295 controls. The data we reviewed indicated that variant homozygote 309GG and heterozygote 309TG were associated with a significant increased risk of all tumor types (homozygote comparison: odds ratio (OR) = 1.25, 95% confidence interval (CI) = 1.13-1.37; heterozygote comparison: OR = 1.10, 95% CI = 1.03-1.17). We also found that the combination of GG and Pro/Pro, TG and Pro/Pro, GG and Arg/Arg significantly increased the risk of cancer (OR = 3.38, 95% CI = 1.77-6.47; OR = 1.88, 95% CI = 1.26-2.81; OR = 1.96, 95% CI = 1.01-3.78, respectively). In a stratified analysis by tumor location, we also found a significant increased risk in brain, liver, stomach and uterus cancer (OR = 1.47, 95% CI = 1.06-2.03; OR = 2.24, 95%CI = 1.57-3.18; OR = 1.54, 95%CI = 1.04-2.29; OR = 1.34, 95%CI = 1.07-1.29, respectively). However, no association was seen between MDM2 SNP309 and tumor susceptibility in the stratified analysis by p53 mutation status
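The odds ratios and 95% confidence intervals quoted above are the standard summary of a case-control comparison. As a minimal sketch of how one such estimate is obtained from a single 2x2 genotype table, the following uses the usual log-scale (Woolf) interval. The function names and the counts are hypothetical, and the pooling across studies that a meta-analysis performs is not shown.

```python
from math import exp, log, sqrt


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and approximate 95% CI from a 2x2 table.

    a: cases with the variant genotype     b: cases with the reference genotype
    c: controls with the variant genotype  d: controls with the reference genotype
    All four counts are assumed to be non-zero.
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = exp(log(odds_ratio) - z * se_log_or)
    upper = exp(log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


def estimate_inside_interval(odds_ratio, lower, upper):
    """Sanity check: a point estimate must lie within its own interval."""
    return lower <= odds_ratio <= upper


# Hypothetical counts, for illustration only.
or_, lo, hi = odds_ratio_ci(20, 10, 10, 20)
print(round(or_, 2), round(lo, 2), round(hi, 2))
```

A check such as `estimate_inside_interval` is also useful when reading published tables: applied to the uterus estimate as printed above (OR = 1.34, 95% CI = 1.07-1.29) it returns False, which suggests a transcription error in that interval rather than a real result.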
Sarah L. Nickerson
Autosomal recessive cerebellar ataxia encompasses a large and heterogeneous group of neurodegenerative disorders. We employed single nucleotide polymorphism (SNP) analysis and whole exome sequencing to investigate a consanguineous Maori pedigree segregating ataxia. We identified a novel mutation in exon 10 of the SACS gene: c.7962T>G p.(Tyr2654*), establishing the diagnosis of autosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS). Our findings expand both the genetic and phenotypic spectrum of this rare disorder, and highlight the value of high-density SNP analysis and whole exome sequencing as powerful and cost-effective tools in the diagnosis of genetically heterogeneous disorders such as the hereditary ataxias.
Cregan Perry B
BACKGROUND: Single nucleotide polymorphisms (SNPs) as defined here are single base sequence changes or short insertion/deletions between or within individuals of a given species. As a result of their abundance and the availability of high-throughput analysis technologies, SNP markers have begun to replace other traditional markers such as restriction fragment length polymorphisms (RFLPs), amplified fragment length polymorphisms (AFLPs) and simple sequence repeats (SSRs) or microsatellite markers for fine mapping and association studies in several species. For SNP discovery from chromatogram data, several bioinformatics programs have to be combined to generate an analysis pipeline. Results have to be stored in a relational database to facilitate interrogation through queries or to generate data for further analyses such as determination of linkage disequilibrium and identification of common haplotypes. Although these tasks are routinely performed by several groups, an integrated open source SNP discovery pipeline that can be easily adapted by new groups interested in SNP marker development is currently unavailable. RESULTS: We developed SNP-PHAGE (SNP discovery Pipeline with additional features for identification of common haplotypes within a sequence tagged site (Haplotype Analysis) and GenBank (-dbSNP) submissions). This tool was applied for analyzing sequence traces from diverse soybean genotypes to discover over 10,000 SNPs. This package was developed on the UNIX/Linux platform, is written in Perl and uses a MySQL database. Scripts to generate a user-friendly web interface are also provided with common queries for preliminary data analysis. A machine learning tool developed by this group for increasing the efficiency of SNP discovery is integrated as a part of this package as an optional feature. The SNP-PHAGE package is being made available open source at http://bfgl.anri.barc.usda.gov/ML/snp-phage/. CONCLUSION: SNP-PHAGE provides a bioinformatics
Mehta, Bhavik; Daniel, Runa; Phillips, Chris; McNevin, Dennis
Short tandem repeats are the gold standard for human identification but are not informative for forensic DNA phenotyping (FDP). Single-nucleotide polymorphisms (SNPs) as genetic markers can be applied to both identification and FDP. The concept of DNA intelligence emerged with the potential for SNPs to infer biogeographical ancestry (BGA) and externally visible characteristics (EVCs), which together enable the FDP process. For more than a decade, the SNaPshot® technique has been utilised to analyse identity and FDP-associated SNPs in forensic DNA analysis. SNaPshot® is a single-base extension (SBE) assay with capillary electrophoresis as its detection system. This multiplexing technique offers the advantage of easy integration into operational forensic laboratories without the requirement for any additional equipment. Further, the SNP panels from SNaPshot® assays can be incorporated into customised panels for massively parallel sequencing (MPS). Many SNaPshot® assays are available for identity, BGA and EVC profiling with examples including the well-known SNPforID 52-plex identity assay, the SNPforID 34-plex BGA assay and the HIrisPlex EVC assay. This review lists the major forensically relevant SNaPshot® assays for human DNA SNP analysis and can be used as a guide for selecting the appropriate assay for specific identity and FDP applications.
IGF1 gene might play important roles in almost all body functions especially growth, reproduction and meat quality traits in goats. Further studies are required to establish the role of the detected SNP in the IGF1 gene expression and association with economic traits in goats. The results of the present study suggests that IGF1 ...
Johnson Peter W
BACKGROUND: The MDM2 gene encodes a negative regulator of the p53 tumour suppressor protein. A single nucleotide polymorphism (SNP) in the MDM2 promoter (a T to G exchange at nucleotide 309) has been reported to produce accelerated tumour formation in individuals with inherited p53 mutations. We have investigated the effect of the MDM2 SNP309 on clinical outcome in a cohort of patients with germline mutations of BRCA1. METHODS: Genomic DNA was obtained for 102 healthy controls and 116 patients with established pathogenic mutations of BRCA1, and Pyrosequencing technology™ was used to determine the genotype at the MDM2 SNP309 locus. RESULTS: The polymorphism was present in 52.9% of the controls (G/T in 37.3% and G/G in 15.6%) and 58.6% of the BRCA1 mutation carriers (47.4% G/T and 11.2% G/G). Incidence of malignancy in female BRCA1 carriers was not significantly higher in SNP309 carriers than in wildtype (T/T) individuals (72.7% vs. 75.6%, p = 1.00). Mean age of diagnosis of first breast cancer was 41.2 years in the SNP309 G/G genotype carriers, 38.6 years in those with the SNP309 G/T genotype and 39.0 years in wildtype subjects (p = 0.80). CONCLUSION: We found no evidence that the MDM2 SNP309 accelerates tumour development in carriers of known pathogenic germline mutations of BRCA1.
Cervical cancer is the most fatal disease among Indonesian women. In recognition of the substantial variation in the intrinsic response of individuals to radiation, efforts have been made to identify genetic markers, primarily single nucleotide polymorphisms (SNPs), that are associated with the responsiveness of cancer cells to radiation therapy. One gene carrying such SNPs is X-ray repair cross-complementing protein 1 (XRCC1), one of the most important genes in deoxyribonucleic acid (DNA) repair pathways. A meta-analysis of the association of XRCC1 polymorphisms with cervical cancer revealed the potential role of XRCC1 polymorphisms in predicting cell response to radiotherapy. Our preliminary study with real-time polymerase chain reaction (RT-PCR) showed that radiotherapy affected the XRCC1 gene analyzed in the blood of cervical cancer patients. Another published study found three SNPs of XRCC1 (Arg194Trp, Arg280His, and Arg399Gln) that cause amino acid substitutions. Of these, only Arg194Trp was associated with a high risk of cervical cancer. Additionally, the structure and function of this protein can be altered by functional SNPs, which may lead to the susceptibility of individuals to cancers. Another study found G399A polymorphisms. We concluded that SNPs of this DNA repair gene have been found to be good predictors of the efficacy of radiotherapy.
Venables, Samantha J; Mehta, Bhavik; Daniel, Runa; Walsh, Simon J; van Oorschot, Roland A H; McNevin, Dennis
High resolution melting (HRM) analysis is a simple, cost effective, closed tube SNP genotyping technique with high throughput potential. The effectiveness of HRM for forensic SNP genotyping was assessed with five commercially available HRM kits evaluated on the ViiA™ 7 Real Time PCR instrument. Four kits performed satisfactorily against forensically relevant criteria. One was further assessed to determine the sensitivity, reproducibility, and accuracy of HRM SNP genotyping. The manufacturer's protocol using 0.5 ng input DNA and 45 PCR cycles produced accurate and reproducible results for 17 of the 19 SNPs examined. Problematic SNPs had GC rich flanking regions which introduced additional melting domains into the melting curve (rs1800407) or included homozygotes that were difficult to distinguish reliably (rs16891982; a G to C SNP). A proof of concept multiplexing experiment revealed that multiplexing a small number of SNPs may be possible after further investigation. HRM enables genotyping of a number of SNPs in a large number of samples without extensive optimization. However, it requires more genomic DNA as template in comparison to SNaPshot®. Furthermore, suitably modifying pre-existing forensic intelligence SNP panels for HRM analysis may pose difficulties due to the properties of some SNPs. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Tvedegaard, Kristine C.; Parner, Erik; Hooper, Craig W.
is a further development of allele specific primer extension (ASPE) for multiplex SNP analysis based on the Luminex 100 IS platform. It uses isobases (isoC and isoG) and the software MultiCode-PLx platform for data analysis and data handling. We validate the EraGen multicode system in two 6-plex assays used...
Wang, Xiaoge; Lu, Xuke; Wang, Junjuan; Wang, Delong; Yin, Zujun; Fan, Weili; Wang, Shuai; Ye, Wuwei
Salinity stress is a major abiotic factor that affects crop output, and because cotton is a pioneer crop on saline and alkaline land, the study of its salt tolerance is particularly important. In our experiment, four salt-tolerant varieties with different salt tolerance indexes, CRI35 (65.04%), Kanghuanwei164 (56.19%), Zhong9807 (55.20%) and CRI44 (50.50%), as well as four salt-sensitive cotton varieties, Hengmian3 (48.21%), GK50 (40.20%), Xinyan96-48 (34.90%) and ZhongS9612 (24.80%), were used as the materials. These materials were divided into a salt-tolerant group (ST) and a salt-sensitive group (SS). The Illumina Cotton SNP 70K Chip was used to detect SNPs in the different cotton varieties. SNPv (SNP variation of the same seedling before and after salt stress) in different varieties were screened; polymorphic SNPs and SNPr (SNPs related to salt tolerance) were obtained. Annotation and analysis of these SNPs showed that (1) the induction efficiency of salinity stress on SNPv differed among cotton materials with different salt tolerance indexes, with the induction efficiency on salt-sensitive materials significantly higher than that on salt-tolerant materials, and the induction of SNPv by salt stress was obviously biased; (2) SNPv induced by salt stress may be related to methylation changes under salt stress; and (3) SNPr may influence the salt tolerance of plants by affecting the expression of salt-tolerance-related genes.
1Centre of Biotechnology of Sfax, P.B. '1177', 3018 Sfax, Tunisia. 2Laboratory of Physics-Mathematics and Applications, Faculty of Sciences of Sfax, BP 1177, 3000 Sfax, Tunisia. [Ben Ayed R., Kallel I., Ben Hassen H. and Rebai A. 2014 SNP marker analysis for validating the authenticity of Tunisian olive oil. J. Genet.
Rippe, Ralph Christian Alexander
Fluorescence bias in signals from individual SNP arrays can be calibrated using linear models. Given the data, the system of equations is very large, so a specialized symbolic algorithm was developed. These models are also used to illustrate that genomic waves do not exist, but are merely an
Gilbey, John; Cauwelier, Eef; Coulson, Mark W; Stradmeyer, Lee; Sampayo, James N; Armstrong, Anja; Verspoor, Eric; Corrigan, Laura; Shelley, Jonathan; Middlemas, Stuart
Understanding the habitat use patterns of migratory fish, such as Atlantic salmon (Salmo salar L.), and the natural and anthropogenic impacts on them, is aided by the ability to identify individuals to their stock of origin. Presented here are the results of an analysis of informative single nucleotide polymorphic (SNP) markers for detecting genetic structuring in Atlantic salmon in Scotland and NE England and their ability to allow accurate genetic stock identification. 3,787 fish from 147 sites covering 27 rivers were screened at 5,568 SNP markers. In order to identify a cost-effective subset of SNPs, they were ranked according to their ability to differentiate between fish from different rivers. A panel of 288 SNPs was used to examine both individual assignments and mixed stock fisheries and eighteen assignment units were defined. The results improved greatly on previously available methods and, for the first time, fish caught in the marine environment can be confidently assigned to geographically coherent units within Scotland and NE England, including individual rivers. As such, this SNP panel has the potential to aid understanding of the various influences acting upon Atlantic salmon on their marine migrations, be they natural environmental variations and/or anthropogenic impacts, such as mixed stock fisheries and interactions with marine power generation installations.
Cingolani, Pablo; Platts, Adrian; Wang, Le Lily; Coon, Melissa; Nguyen, Tung; Wang, Luan; Land, Susan J.; Lu, Xiangyi; Ruden, Douglas M.
We describe a new computer program, SnpEff, for rapidly categorizing the effects of variants in genome sequences. Once a genome is sequenced, SnpEff annotates variants based on their genomic locations and predicts coding effects. Annotated genomic locations include intronic, untranslated region, upstream, downstream, splice site, or intergenic regions. Coding effects such as synonymous or non-synonymous amino acid replacement, start codon gains or losses, stop codon gains or losses, or frame shifts can be predicted. Here the use of SnpEff is illustrated by annotating ~356,660 candidate SNPs in ~117 Mb unique sequences, representing a substitution rate of ~1/305 nucleotides, between the Drosophila melanogaster w1118; iso-2; iso-3 strain and the reference y1; cn1 bw1 sp1 strain. We show that ~15,842 SNPs are synonymous and ~4,467 SNPs are non-synonymous (N/S ~0.28). The remaining SNPs are in other categories, such as stop codon gains (38 SNPs), stop codon losses (8 SNPs), and start codon gains (297 SNPs) in the 5′UTR. We found, as expected, that the SNP frequency is proportional to the recombination frequency (i.e., highest in the middle of chromosome arms). We also found that start-gain or stop-lost SNPs in Drosophila melanogaster often result in additions of N-terminal or C-terminal amino acids that are conserved in other Drosophila species. It appears that the 5′ and 3′ UTRs are reservoirs for genetic variations that change the termini of proteins during evolution of the Drosophila genus. As genome sequencing is becoming inexpensive and routine, SnpEff enables rapid analyses of whole-genome sequencing data to be performed by an individual laboratory. PMID:22728672
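The coding-effect categories named above (synonymous, non-synonymous, stop gained, stop lost) can be illustrated with a short sketch that compares the amino acid encoded before and after a single-base change. This is not SnpEff's code or its output vocabulary: the function name and the category labels are chosen for the example, and only the standard genetic code is assumed.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def classify_coding_snp(ref_codon, position, alt_base):
    """Classify a single-base change inside a codon.

    ref_codon: three-letter reference codon on the coding strand
    position:  0-based index of the changed base within the codon
    alt_base:  the alternative base at that position
    """
    alt_codon = ref_codon[:position] + alt_base + ref_codon[position + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    if ref_aa == alt_aa:
        return "synonymous"
    if alt_aa == "*":
        return "stop_gained"
    if ref_aa == "*":
        return "stop_lost"
    return "non_synonymous"


print(classify_coding_snp("CTT", 2, "C"))  # Leu -> Leu
print(classify_coding_snp("GAA", 0, "A"))  # Glu -> Lys
print(classify_coding_snp("TGG", 2, "A"))  # Trp -> stop
print(classify_coding_snp("TAA", 0, "C"))  # stop -> Gln

# The N/S ratio quoted in the abstract follows from its two counts.
n_over_s = 4467 / 15842
print(round(n_over_s, 2))
```

A full annotator also has to map genomic coordinates onto transcripts, handle strand and reading frame, and cover the non-coding categories, none of which is attempted here.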
Brown, Allan F; Yousef, Gad G; Chebrolu, Kranthi K; Byrd, Robert W; Everhart, Koyt W; Thomas, Aswathy; Reid, Robert W; Parkin, Isobel A P; Sharpe, Andrew G; Oliver, Rebekah; Guzman, Ivette; Jackson, Eric W
A high-resolution genetic linkage map of B. oleracea was developed from a B. napus SNP array. The work will facilitate genetic and evolutionary studies in Brassicaceae. A broccoli population, VI-158 × BNC, consisting of 150 F2:3 families was used to create a saturated Brassica oleracea (diploid: CC) linkage map using a recently developed rapeseed (Brassica napus) (tetraploid: AACC) Illumina Infinium single nucleotide polymorphism (SNP) array. The map consisted of 547 non-redundant SNP markers spanning 948.1 cM across nine chromosomes with an average interval size of 1.7 cM. As the SNPs are anchored to the genomic reference sequence of the rapid cycling B. oleracea TO1000, we were able to estimate that the map provides 96% coverage of the diploid genome. Carotenoid analysis of 2 years of data identified 3 QTLs on two chromosomes that are associated with up to half of the phenotypic variation associated with the accumulation of total or individual compounds. By searching the genome sequences of the two related diploid species (B. oleracea and B. rapa), we further identified putative carotenoid candidate genes in the region of these QTLs. This is the first description of the use of a B. napus SNP array to rapidly construct high-density genetic linkage maps of one of the constituent diploid species. The unambiguous nature of these markers with regard to genomic sequences provides evidence as to the nature of the genes underlying the QTL, and demonstrates the value and impact this resource will have on Brassica research.
In this paper, we discuss the datasets stored in SNP-Seek, the architecture of the database and web application, and the interoperability methodologies in place, and present a few use cases demonstrating the utility of SNP-Seek for diversity analysis and molecular breeding.
Chen, Jin-Bor; Chuang, Li-Yeh; Lin, Yu-Da; Liou, Chia-Wei; Lin, Tsu-Kung; Lee, Wen-Chin; Cheng, Ben-Chung; Chang, Hsueh-Wei; Yang, Cheng-Hong
Single nucleotide polymorphism (SNP) interaction analysis can simultaneously evaluate the complex SNP interactions present in complex diseases. However, it is less commonly applied to evaluate predisposition to chronic dialysis, and its computational analysis remains challenging. In this study, we aimed to improve the analysis of SNP-SNP interactions within the mitochondrial D-loop in chronic dialysis. The SNP-SNP interactions between 77 reported SNPs within the mitochondrial D-loop in a chronic dialysis study were evaluated in terms of SNP barcodes (different SNP combinations with their corresponding genotypes). We propose a genetic algorithm (GA) to generate SNP barcodes. The χ² values were then calculated from the occurrences of the specific SNP barcodes and their non-specific combinations in cases and controls. Each SNP barcode (2- to 7-SNP) with the highest value in the χ² test was regarded as the best SNP barcode (11.304 to 23.310; p [...] algorithm to address the SNP-SNP interactions and demonstrated that many non-significant SNPs within the mitochondrial D-loop may have joint effects on chronic dialysis susceptibility.
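The scoring step described here, a χ² value computed from how often a given SNP barcode occurs in cases and controls, can be sketched as follows. This is only an illustration under stated assumptions: the helper names, the SNP identifiers, and the genotype data are hypothetical, a plain Pearson χ² on a 2×2 table is assumed, and the genetic algorithm that searches over barcodes is not shown.

```python
def count_barcode(samples, barcode):
    """Count samples whose genotypes match every SNP in the barcode.

    samples -- list of dicts mapping SNP name -> genotype
    barcode -- dict mapping SNP name -> required genotype
    """
    return sum(
        all(sample.get(snp) == genotype for snp, genotype in barcode.items())
        for sample in samples
    )


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("a row or column of the table is empty")
    return n * (a * d - b * c) ** 2 / denominator


def barcode_chi_square(cases, controls, barcode):
    """Chi-square for barcode carriers vs. non-carriers in cases vs. controls."""
    case_hits = count_barcode(cases, barcode)
    control_hits = count_barcode(controls, barcode)
    return chi_square_2x2(
        case_hits, len(cases) - case_hits,
        control_hits, len(controls) - control_hits,
    )


# Hypothetical toy data: two D-loop SNPs with made-up names and genotypes.
cases = [{"snp1": "T", "snp2": "C"}] * 10 + [{"snp1": "C", "snp2": "C"}] * 20
controls = [{"snp1": "T", "snp2": "C"}] * 20 + [{"snp1": "C", "snp2": "T"}] * 10
score = barcode_chi_square(cases, controls, {"snp1": "T", "snp2": "C"})
```

A search procedure such as a genetic algorithm would call a scoring function like `barcode_chi_square` repeatedly and keep the barcode with the highest value for each barcode size.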
Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis
Almasy, L, Blangero, J. (2009) Human QTL linkage mapping. Genetica 136:333-340. Amos, CI. (2007) Successful design and conduct of genome-wide...quantitative trait loci. Genetica 136:237-243. Skol AD, Scott LJ, Abecasis GR, Boehnke M. (2006) Joint analysis is more efficient than replication
Wei, Yi-Liang; Wei, Li; Zhao, Lei; Sun, Qi-Fan; Jiang, Li; Zhang, Tao; Liu, Hai-Bo; Chen, Jian-Gang; Ye, Jian; Hu, Lan; Li, Cai-Xia
A single-tube multiplex assay of a small set of ancestry-informative markers (AIMs) for effectively estimating individual ancestry and admixture is an ideal forensic tool to trace the population origin of an unknown DNA sample. We present a newly developed 27-plex single nucleotide polymorphism (SNP) panel with highly robust and balanced differential power to perfectly assign individuals to African, European, and East Asian ancestries. Evaluating 968 previously described intercontinental AIMs from three HapMap population genotyping datasets (Yoruban in Ibadan, Nigeria (YRI); Utah residents with Northern and Western European ancestry from the Centre d'Etude du Polymorphisme Humain (CEPH) collection (CEU); and Han Chinese in Beijing, China (CHB)), the best set of markers was selected on the basis of Hardy-Weinberg equilibrium (p > 0.00001), population-specific allele frequency (two of three δ values >0.5), according to linkage disequilibrium (r² [...] ancestry of the 11 populations in the HapMap project. Then, we tested the 27-plex SNP assay with 1164 individuals from 17 additional populations. The results demonstrated that the SNP panel was successful for ancestry inference of individuals with African, European, and East Asian ancestry. Furthermore, the system performed well when inferring the admixture of Eurasians (EUR/EAS) after analyzing admixed populations from Xinjiang (Central Asian) as follows: Tajik (68:27), Uyghur (49:46), Kirgiz (40:57), and Kazak (36:60). For individual analyses, we interpreted each sample with a three-ancestry component percentage and a population match probability sequence. This multiplex assay is a convenient and cost-effective tool to assist in criminal investigations, as well as to correct for the effects of population stratification for case-control studies.
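The allele-frequency criterion in this abstract (two of three δ values above 0.5) can be sketched in Python. The sketch assumes δ is the absolute difference in the frequency of one allele between two populations; the function names and the example frequencies are hypothetical, and the Hardy-Weinberg and linkage-disequilibrium filters are not shown.

```python
from itertools import combinations


def pairwise_deltas(freqs):
    """Absolute allele-frequency difference for every pair of populations.

    freqs -- dict mapping population label -> frequency of one allele
    """
    return {
        (a, b): abs(freqs[a] - freqs[b])
        for a, b in combinations(sorted(freqs), 2)
    }


def passes_delta_filter(freqs, threshold=0.5, min_pairs=2):
    """True if at least `min_pairs` pairwise delta values exceed `threshold`."""
    deltas = pairwise_deltas(freqs)
    return sum(delta > threshold for delta in deltas.values()) >= min_pairs


# Hypothetical allele frequencies, not taken from HapMap.
informative_marker = {"YRI": 0.95, "CEU": 0.10, "CHB": 0.15}
weak_marker = {"YRI": 0.40, "CEU": 0.30, "CHB": 0.35}

keep_informative = passes_delta_filter(informative_marker)
keep_weak = passes_delta_filter(weak_marker)
```

With these made-up frequencies, the first marker separates YRI from both CEU and CHB and is kept, while the second shows no large difference between any pair and is dropped.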
The other SNP, called SOD, is an insertion/deletion polymorphism localized in the Cu–Zn superoxide dismutase gene, which is involved in oxidative stress, one of .... Authenticity of olive oil by SNP markers. Table 1. Characteristics of SNP markers used for DNA amplification. Size of SNP amplicon. Gene name. SNPs.
Authors: Kai Wang & Maja Bucan. Introduction: High-density single nucleotide polymorphism (SNP) genotyping arrays have recently been used for copy number variation (CNV) detection and analysis, because the arrays can serve a dual role for SNP- and CNV-based association studies. They also can provide considerably higher precision and resolution than traditional techniques. Here we describe PennCNV, a computational protocol designed for CNV detection from high-density SNP genotyping d...
Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David
Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions
Background Diagnostic analysis of patients with developmental disorders has improved over recent years largely due to the use of microarray technology. Array methods that facilitate copy number analysis have enabled the diagnosis of up to 20% more patients with previously normal karyotyping results. A substantial number of patients remain undiagnosed, however. Methods and Results Using the Genome-Wide Human SNP array 6.0, we analyzed 35 patients with a developmental disorder of unknown cause and normal array comparative genomic hybridization (array CGH) results, in order to characterize previously undefined genomic aberrations. We detected no seemingly pathogenic copy number aberrations. Most of the vast amount of data produced by the array was polymorphic and non-informative. Filtering of this data, based on copy number variant (CNV) population frequencies as well as phenotypically relevant genes, enabled pinpointing regions of allelic homozygosity that included candidate genes correlating to the phenotypic features in four patients, but results could not be confirmed. Conclusions In this study, the use of an ultra high-resolution SNP array did not contribute to the further diagnosis of patients with developmental disorders of unknown cause. The statistical power of these results is limited by the small size of the patient cohort, and interpretation of these negative results can only be applied to the patients studied here. We present the results of our study and the recurrence of clustered allelic homozygosity present in this material, as detected by the SNP 6.0 array.
Johnson, Neil; Speirs, Valerie; Curtin, Nicola J; Hall, Andrew G
Allelic imbalance is a common feature of many malignancies. We have measured allelic imbalance in genomic DNA from the breast cancer cell lines T47D, MDA-MB-231, two antiestrogen sensitive (MCF7N and MCF7L) and two resistant MCF7 cell lines (MMU2 and LCC9) using single nucleotide polymorphism (SNP) oligonucleotide microarrays. DNA from MCF7L and MMU2 cells was also analysed by comparative genome hybridisation (CGH) to compare with SNP microarray data. Proteins previously determined to be involved in disease progression were quantified by Western blot and compared to array data. The SNP and CGH arrays both detected cytogenetic abnormalities commonly found in breast cancer: amplification of chromosomes 11q13-14.1, 17q and 20q containing cyclin D1, BCAS1 and 3 (Breast Cancer Amplified Sequence) and AIB1 (Amplified in Breast cancer) genes; losses at 6q, 9p and X chromosomes, which included ERalpha (Estrogen Receptor alpha) and p16 (INK4A) genes. However, the SNP chip array data additionally identified regions of loss of heterozygosity (LOH) followed by duplication of the remaining allele, that is, uniparental disomy (UPD). Good concordance between SNP arrays and CGH analyses was observed; however, there was poor correlation between gene copy number and protein levels between the cell lines. There were reductions in ERalpha, cyclin D1 and p27 protein levels whilst p21 protein levels were elevated in antiestrogen resistant MCF7 cell lines. Although protein levels varied there was no difference in gene copy number. This study shows SNP and CGH array analysis are powerful tools for analysis of allelic imbalance in breast cancer. However, the antiestrogen resistant phenotype was likely to be due to changes in gene expression and protein degradation rather than to altered gene copy number.
Børsting, Claus; Rockenbauer, Eszter; Morling, Niels
cases and 33 twin cases were typed at least twice for the 49 SNPs. All electropherograms were analysed independently by two expert analysts prior to approval. Based on these results, detailed guidelines for analysis of the SBE products were developed. With these guidelines, the peak height ratio...... of a heterozygous allele call or the signal to noise ratio of a homozygous allele call is compared with previously obtained ratios. A laboratory protocol for analysis of SBE products was developed where allele calls with unusual ratios were highlighted to facilitate the analysis of difficult allele calls......A multiplex assay with 49 autosomal single nucleotide polymorphisms (SNPs) developed for human identification was validated for forensic genetic casework and accredited according to the ISO 17025 standard. The multiplex assay was based on the SNPforID 52plex SNP assay [J.J. Sanchez, C. Phillips, C...
Polymerase chain reaction-amplified product length polymorphism (PCR-APLP) is one of the most convenient and reliable methods for single nucleotide polymorphism (SNP) analysis. This method is based on PCR, but uses allele-specific primers containing SNP sites at the 3'-terminus of each primer. To use this method, at least two allele-specific primers and one "counter-primer", which serves as a common forward or reverse primer of the allele-specific primers, are required. The allele-specific primers have SNP sites at the 3'-terminus, and another primer should have a few non-complementary flaps at the 5'-terminus to detect SNPs by determining the difference of amplicon length by PCR and subsequent electrophoresis. A major disadvantage of the addition of a non-complementary flap is the non-specific annealing of the primer with non-complementary flaps. However, a design principle for avoiding this undesired annealing has not been fully established; therefore, it is often difficult to design effective APLP primers. Here, we report allele-specific primers with an inosine chain at the 5'-terminus for PCR-APLP analysis. This unique design improves the competitiveness of allele-specific primers and the reliability of SNP analysis when using the PCR-APLP method.
Bailey Dione K
Background DNA copy number aberration (CNA) is one of the key characteristics of cancer cells. Recent studies demonstrated the feasibility of utilizing high density single nucleotide polymorphism (SNP) genotyping arrays to detect CNA. Compared with the two-color array-based comparative genomic hybridization (array-CGH), the SNP arrays offer much higher probe density and lower signal-to-noise ratio at the single SNP level. To accurately identify small segments of CNA from SNP array data, segmentation methods that are sensitive to CNA while resistant to noise are required. Results We have developed a highly sensitive algorithm for the edge detection of copy number data which is especially suitable for the SNP array-based copy number data. The method consists of an over-sensitive edge-detection step and a test-based forward-backward edge selection step. Conclusion Using simulations constructed from real experimental data, the method shows high sensitivity and specificity in detecting small copy number changes in focused regions. The method is implemented in an R package FASeg, which includes data processing and visualization utilities, as well as libraries for processing Affymetrix SNP array data.
U.S. Department of Health & Human Services — dbSNP is a database of single nucleotide polymorphisms (SNPs) and multiple small-scale variations that include insertions/deletions, microsatellites, and...
Patil, Gunvant; Do, Tuyen; Vuong, Tri D.; Valliyodan, Babu; Lee, Jeong-Dong; Chaudhary, Juhi; Shannon, J. Grover; Nguyen, Henry T.
Soil salinity is a limiting factor of crop yield. The soybean is sensitive to soil salinity, and a dominant gene, Glyma03g32900, is primarily responsible for salt-tolerance. The identification of high throughput and robust markers as well as the deployment of salt-tolerant cultivars are effective approaches to minimize yield loss under saline conditions. We utilized high quality (15x) whole-genome resequencing (WGRS) on 106 diverse soybean lines and identified three major structural variants and allelic variation in the promoter and genic regions of the GmCHX1 gene. The discovery of single nucleotide polymorphisms (SNPs) associated with structural variants facilitated the design of six KASPar assays. Additionally, haplotype analysis and pedigree tracking of 93 U.S. ancestral lines were performed using publicly available WGRS datasets. Identified SNP markers were validated, and a strong correlation was observed between the genotype and salt treatment phenotype (leaf scorch, chlorophyll content and Na+ accumulation) using a panel of 104 soybean lines and an interspecific bi-parental population (F8) from PI483463 x Hutcheson. These markers precisely identified salt-tolerant/sensitive genotypes (>91%), and different structural-variants (>98%). These SNP assays, supported by accurate phenotyping, haplotype analyses and pedigree tracking information, will accelerate marker-assisted selection programs to enhance the development of salt-tolerant soybean cultivars. PMID:26781337
Cronin, Matthew A; Cánovas, Angela; Bannasch, Danika L; Oberbauer, Anita M; Medrano, Juan F
There is considerable interest in the genetics of wolves (Canis lupus) because of their close relationship to domestic dogs (C. familiaris) and the need for informed conservation and management. This includes wolf populations in Southeast Alaska for which we determined genotypes of 305 wolves at 173662 single nucleotide polymorphism (SNP) loci. After removal of invariant and linked SNP, 123801 SNP were used to quantify genetic differentiation of wolves in Southeast Alaska and wolves, coyotes (C. latrans), and dogs from other areas in North America. There is differentiation of SNP allele frequencies between the species (wolves, coyotes, and dogs), although differentiation is relatively low between some wolf and coyote populations. There are varying levels of differentiation among populations of wolves, including low differentiation of wolves in interior Alaska, British Columbia, and the northern US Rocky Mountains. There is considerable differentiation of SNP allele frequencies of wolves in Southeast Alaska from wolves in other areas. However, wolves in Southeast Alaska are not a genetically homogeneous group and there are comparable levels of genetic differentiation among areas within Southeast Alaska and between Southeast Alaska and other geographic areas. SNP variation and other genetic data are discussed regarding taxonomy and management. © The American Genetic Association 2014. All rights reserved. For permissions, please e-mail: email@example.com.
Li, Shengting; Ma, Lijia; Li, Heng
Snap (Single Nucleotide Polymorphism Annotation Platform) is a server designed to comprehensively analyze single genes and relationships between genes basing on SNPs in the human genome. The aim of the platform is to facilitate the study of SNP finding and analysis within the framework of medical...
Chang, Hsueh-Wei; Yang, Cheng-Hong; Chang, Phei-Lang; Cheng, Yu-Huei; Chuang, Li-Yeh
The restriction fragment length polymorphism (RFLP) is a common laboratory method for the genotyping of single nucleotide polymorphisms (SNPs). Here, we describe a web-based software, named SNP-RFLPing, which provides the restriction enzyme for RFLP assays on a batch of SNPs and genes from the human, rat, and mouse genomes. Three user-friendly inputs are included: 1) NCBI dbSNP "rs" or "ss" IDs; 2) NCBI Entrez gene ID and HUGO gene name; 3) any formats of SNP-in-sequence, are allowed to perform the SNP-RFLPing assay. These inputs are auto-programmed to SNP-containing sequences and their complementary sequences for the selection of restriction enzymes. All SNPs with available RFLP restriction enzymes of each input genes are provided even if many SNPs exist. The SNP-RFLPing analysis provides the SNP contig position, heterozygosity, function, protein residue, and amino acid position for cSNPs, as well as commercial and non-commercial restriction enzymes. This web-based software solves the input format problems in similar softwares and greatly simplifies the procedure for providing the RFLP enzyme. Mixed free forms of input data are friendly to users who perform the SNP-RFLPing assay. SNP-RFLPing offers a time-saving application for association studies in personalized medicine and is freely available at http://bio.kuas.edu.tw/snp-rflp/.
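The core idea behind choosing an RFLP enzyme for a SNP, finding a recognition site that is present with one allele and absent with the other, can be sketched in Python. This is not the SNP-RFLPing implementation: the function name and the example sequence are hypothetical, and only four well-known palindromic recognition sites are included, so scanning one strand is sufficient.

```python
# Recognition sites of a few common restriction enzymes (all palindromic).
ENZYMES = {
    "EcoRI": "GAATTC",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
    "TaqI": "TCGA",
}


def discriminating_enzymes(left_flank, alleles, right_flank, enzymes=ENZYMES):
    """Return enzymes whose site is present for some alleles but not others.

    left_flank, right_flank -- sequence on either side of the SNP
    alleles                 -- iterable of single-base alleles, e.g. ("T", "C")
    """
    result = {}
    for name, site in enzymes.items():
        has_site = {
            allele: site in (left_flank + allele + right_flank).upper()
            for allele in alleles
        }
        # Keep the enzyme only if the alleles differ in site presence.
        if len(set(has_site.values())) > 1:
            result[name] = has_site
    return result


# Hypothetical SNP-in-sequence input: ACGTGAAT[T/C]CAGGT
hits = discriminating_enzymes("ACGTGAAT", ("T", "C"), "CAGGT")
```

In this made-up example the T allele completes a GAATTC site, so only the T allele would be cut by EcoRI. A production tool would also need to handle non-palindromic and degenerate sites and additional sites elsewhere in the amplicon, which this sketch ignores.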
Gregory W. Peterson
Genotyping-by-sequencing (GBS) has recently emerged as a promising genomic approach for exploring plant genetic diversity on a genome-wide scale. However, many uncertainties and challenges remain in the application of GBS, particularly in non-model species. Here, we present a GBS protocol we developed and use for plant genetic diversity analysis. It uses two restriction enzymes to reduce genome complexity, applies Illumina multiplexing indexes for barcoding and has a custom bioinformatics pipeline for genotyping. This genetic diversity-focused GBS (gd-GBS) protocol can serve as an easy-to-follow lab guide to assist a researcher through every step of a GBS application with five main components: sample preparation, library assembly, sequencing, SNP calling and diversity analysis. Specifically, in this presentation, we provide a brief overview of the GBS approach, describe the gd-GBS procedures, illustrate it with an application to analyze genetic diversity in 20 flax (Linum usitatissimum L.) accessions and discuss related issues in GBS application. Following these lab bench procedures and using the custom bioinformatics pipeline, one could generate genome-wide SNP genotype data for a conventional genetic diversity analysis of a non-model plant species.
Tang, J.; Vosman, B.; Voorrips, R.E.; Linden, van der C.G.; Leunissen, J.A.M.
Background - Single nucleotide polymorphisms (SNPs) are important tools in studying complex genetic traits and genome evolution. Computational strategies for SNP discovery make use of the large number of sequences present in public databases (in most cases as expressed sequence tags (ESTs)) and are
Chang, Ling; Zhao, Nan; Wei, Yuan; Zhong, Su; Liu, Ping; Qiao, Jie
To compare the roles of single nucleotide polymorphism (SNP) array and karyotype analysis in the prenatal diagnosis of high-risk pregnant women. From July 2012 to December 2013, a total of 141 high-risk pregnant women undergoing prenatal diagnosis in the Department of Obstetrics and Gynecology, Peking University Third Hospital, were enrolled; 78 underwent umbilical cord puncture and 63 underwent amniotic cavity puncture, and all samples were tested by both SNP array and karyotype analysis. The rate of abnormal karyotypes was 6.4%, the rate of abnormal SNP array results was 11.3%, and the abnormal rate when the two methods were combined was 12.1%. The difference between SNP array and karyotype analysis was significant (P=0.039). Integrating SNP array and karyotype analysis is an effective way to detect genetic disease in prenatal diagnosis.
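The reported percentages can be turned back into approximate counts and compared as paired results. The following Python sketch does this under two loud assumptions: the counts are back-calculated from rounded percentages, and an exact McNemar test is used because both methods were applied to the same 141 women. The abstract does not name the test the authors used.

```python
from math import comb


def mcnemar_exact(b, c):
    """Two-sided exact McNemar p-value from the two discordant counts."""
    n = b + c
    k = min(b, c)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


total = 141
karyotype_abnormal = round(0.064 * total)  # abnormal by karyotype
array_abnormal = round(0.113 * total)      # abnormal by SNP array
either_abnormal = round(0.121 * total)     # abnormal by at least one method

# Inclusion-exclusion gives the overlap and the discordant pairs.
both_abnormal = karyotype_abnormal + array_abnormal - either_abnormal
karyotype_only = karyotype_abnormal - both_abnormal
array_only = array_abnormal - both_abnormal

p_value = mcnemar_exact(karyotype_only, array_only)
```

Under these assumptions the discordant pairs are 1 (karyotype only) and 8 (SNP array only), and the exact two-sided p-value is 20/512 ≈ 0.039, which is consistent with the reported P=0.039.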
Contreras-Sesvold, Carmen L; Sambuughin, Nyamkhishig; Blokhin, Andrei; Deuster, Patricia A
Heat shock proteins act as molecular chaperones, assist in peptide maturation, and transport nascent peptides across membranes. One commonly studied single nucleotide polymorphism (SNP) for one of the proteins is HSPA1B (+A1538G). However, several studies of this polymorphism have failed to achieve Hardy-Weinberg equilibrium (HWE) for their sample. We compared various published procedures for analyzing the HSPA1B +A1538G SNP and report reasons for HWE discrepancies. Samples from 141 apparently healthy, physically active, volunteers (99 men and 42 women) were analyzed. The first protocol, initially described by Schröder et al., resulted in a genotypic distribution of 22 GG (15.6%), 119 AG (84.4%), and 0 AA; results were confirmed by reanalysis and sequencing. Two other published protocols, one described by Klausz et al. and another by Fekete et al., were used to confirm these results: both resulted in 22 GG (15.6%), 46 AA (32.6%), and 73 AG (51.7%). Additionally, the results were within HWE and confirmed by sequence analysis. Of the original 119 subjects genotyped as AG by the Schröder protocol, 46 of those were confirmed as AA with the Klausz and Fekete methods. Mixing primers from the Schröder and Klausz protocol resulted in 100% concordance with the data generated by the Klausz and Fekete protocols. Some published data on HSP genotyping deviate from HWE; thus, primers used for analyzing these highly homologous genes must be carefully considered. Our results highlight the importance of reinvestigating data when HWE is not achieved for the HSPA1B, or another, polymorphism.
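The Hardy-Weinberg comparison in this abstract can be reproduced from the reported genotype counts. The sketch below assumes the usual one-degree-of-freedom χ² goodness-of-fit test for a biallelic locus (critical value 3.84 at the 0.05 level); the function name is hypothetical and the abstract does not say which HWE test the authors applied.

```python
def hwe_chi_square(n_gg, n_ag, n_aa):
    """Chi-square statistic for deviation from Hardy-Weinberg proportions."""
    n = n_gg + n_ag + n_aa
    p = (2 * n_gg + n_ag) / (2 * n)  # frequency of the G allele
    q = 1 - p                        # frequency of the A allele
    observed = (n_gg, n_ag, n_aa)
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    # Skip classes with zero expectation (only possible for a fixed allele).
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


CRITICAL_VALUE = 3.84  # chi-square, 1 degree of freedom, alpha = 0.05

# Genotype counts as reported in the abstract (GG, AG, AA).
schroeder_protocol = hwe_chi_square(22, 119, 0)
klausz_fekete_protocols = hwe_chi_square(22, 73, 46)
```

With these counts the first protocol gives a statistic of roughly 75, far above the critical value, while the other two protocols give roughly 0.6, in line with the statement that only the latter results were within HWE.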
The mouse double minute 2 (MDM2) gene encodes a phosphoprotein that interacts with P53 and negatively regulates its activity. The SNP309 polymorphism (T-G) in the promoter of the MDM2 gene has been reported to be associated with enhanced MDM2 expression and tumor development. Studies investigating the association between MDM2 SNP309 polymorphism and colorectal cancer (CRC) risk reported conflicting results. We performed a meta-analysis of all available studies to explore the association of this polymorphism with CRC risk. All studies published up to July 2013 on the association between MDM2 SNP309 polymorphism and CRC risk were identified by searching the electronic databases PubMed, EMBASE, and the Chinese Biomedical Literature database (CBM). The association between the MDM2 SNP309 polymorphism and CRC risk was assessed by odds ratios (ORs) together with their 95% confidence intervals (CIs). A total of 14 case-control studies including 4460 CRC cases and 4828 controls were identified. We did not find a significant association between the MDM2 SNP309 polymorphism and CRC risk in any genetic model in the overall population. However, in subgroup analysis by ethnicity, significant associations were found in Asians (TG vs. TT: OR = 1.197, 95% CI = 1.055-1.358, P=0.005; GG+TG vs. TT: OR = 1.246, 95% CI = 1.106-1.404, P=0.000) and Africans. When stratified by HWE in controls, significantly increased risk was also found among the studies consistent with HWE (TG vs. TT: OR = 1.166, 95% CI = 1.037-1.311, P=0.010). In subgroup analysis according to p53 mutation status and gender, no significant association was detected. The present meta-analysis suggests that MDM2 is a candidate gene for CRC susceptibility. The MDM2 SNP309 polymorphism may be a risk factor for CRC in Asians.
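The effect measure used throughout this abstract, an odds ratio with a 95% confidence interval, can be computed for a single case-control study as sketched below. The sketch assumes the standard log-odds (Woolf) interval; the function name and the genotype counts are hypothetical, and the pooling across studies that a meta-analysis performs is not shown.

```python
from math import exp, log, sqrt


def odds_ratio_ci(case_exposed, case_unexposed, control_exposed, control_unexposed, z=1.96):
    """Odds ratio and Woolf (log-based) confidence interval for a 2x2 table."""
    a, b, c, d = case_exposed, case_unexposed, control_exposed, control_unexposed
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for this method")
    odds_ratio = (a * d) / (b * c)
    standard_error = sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = exp(log(odds_ratio) - z * standard_error)
    upper = exp(log(odds_ratio) + z * standard_error)
    return odds_ratio, lower, upper


# Hypothetical single-study counts for a TG vs. TT comparison:
# 120 TG and 100 TT among cases, 100 TG and 100 TT among controls.
or_tg_vs_tt, ci_lower, ci_upper = odds_ratio_ci(120, 100, 100, 100)
```

An interval that excludes 1, as in the Asian subgroup results quoted above, indicates a statistically significant association at the 5% level; in this made-up example the interval includes 1.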
Tang, Jifeng; Vosman, Ben; Voorrips, Roeland E; van der Linden, C Gerard; Leunissen, Jack AM
Background Single nucleotide polymorphisms (SNPs) are important tools in studying complex genetic traits and genome evolution. Computational strategies for SNP discovery make use of the large number of sequences present in public databases (in most cases as expressed sequence tags (ESTs)) and are considered to be faster and more cost-effective than experimental procedures. A major challenge in computational SNP discovery is distinguishing allelic variation from sequence variation between paralogous sequences, in addition to recognizing sequencing errors. For the majority of the public EST sequences, trace or quality files are lacking which makes detection of reliable SNPs even more difficult because it has to rely on sequence comparisons only. Results We have developed a new algorithm to detect reliable SNPs and insertions/deletions (indels) in EST data, both with and without quality files. Implemented in a pipeline called QualitySNP, it uses three filters for the identification of reliable SNPs. Filter 1 screens for all potential SNPs and identifies variation between or within genotypes. Filter 2 is the core filter that uses a haplotype-based strategy to detect reliable SNPs. Clusters with potential paralogs as well as false SNPs caused by sequencing errors are identified. Filter 3 screens SNPs by calculating a confidence score, based upon sequence redundancy and quality. Non-synonymous SNPs are subsequently identified by detecting open reading frames of consensus sequences (contigs) with SNPs. The pipeline includes a data storage and retrieval system for haplotypes, SNPs and alignments. QualitySNP's versatility is demonstrated by the identification of SNPs in EST datasets from potato, chicken and humans. Conclusion QualitySNP is an efficient tool for SNP detection, storage and retrieval in diploid as well as polyploid species. It is available for running on Linux or UNIX systems. The program, test data, and user manual are available at and as Additional files
Linguo Xie, Yan Sun, Tao Chen, Dawei Tian, Yujuan Li, Yu Zhang, Na Ding, Zhonghua Shen, Hao Xu, Xuewu Nian, Nan Sha, Ruifa Han, Hailong Hu, Changli Wu
Objective: Human murine double minute 2 protein (MDM2) is mainly a negative regulator of the p53 tumor suppressor pathway. We aimed to investigate the association between MDM2 SNP309 polymorphism and bladder cancer risk. Methods: A total of 535 bladder cancer patients and 649 healthy controls were recruited for our study. MDM2 SNP309 T>G polymorphism was genotyped by the polymerase chain reaction-ligase detection reaction method. Logistic regression was used to analyze the relationship between the genotype and susceptibility to bladder cancer. Kaplan–Meier estimates and the log-rank test were used to analyze the association between the genotype and risk of recurrence in nonmuscle-invasive bladder cancer patients. A multivariable Cox proportional hazards model was fitted to identify independent prognostic factors. To further investigate the association, we conducted a meta-analysis including six studies. Results: The frequency of the MDM2 SNP309 T>G polymorphism showed no significant difference between cases and controls (all P>0.05). In the stratification analysis, the results showed that G allele carriers had a significantly decreased risk of low-grade bladder cancer (adjusted odds ratio: 0.613, 95% confidence interval: 0.427–0.881), and the G variant was associated with a significantly reduced risk of recurrence in nonmuscle-invasive bladder cancer patients with or without chemotherapy (P<0.05). The results of the meta-analysis showed that the G allele and GG genotype of the MDM2 SNP309 polymorphism were significantly associated with increased risk of bladder cancer in Caucasians (both P<0.05), and no association was observed in total populations and Asians (P>0.05). Conclusion: MDM2 SNP309 T>G polymorphism has no influence on bladder cancer risk in Asians, but
Daniel, R; Santos, C; Phillips, C; Fondevila, M; van Oorschot, R A H; Carracedo, A; Lareu, M V; McNevin, D
Forensic phenotyping can provide useful intelligence regarding the biogeographical ancestry (BGA) and externally visible characteristics (EVCs) of the donor of an evidentiary sample. Currently, single nucleotide polymorphism (SNP) based inference of BGA and EVCs is performed most commonly using SNaPshot(®), a single base extension (SBE) assay. However, a single SNaPshot multiplex PCR is limited to 30-40 SNPs. Next generation sequencing (NGS) offers the potential to genotype hundreds to thousands of SNPs from multiple samples in a single experimental run. The PCR multiplexes from five SNaPshot assays (SNPforID 52plex, SNPforID 34plex, Eurasiaplex, IrisPlex and an unpublished BGA assay) were applied to three different DNA template amounts (0.1, 0.2 and 0.3 ng) in three samples (9947A and 007 control DNAs and a male donor). The pooled PCR amplicons containing 136 unique SNPs were sequenced using Life Technologies' Ion Torrent™ PGM system. Approximately 72 Mb of sequence was generated from two 10 Mb Ion 314™ v1 chips. Accurate genotypes were readily obtained from all three template amounts. Of a total of 408 genotypes, 395 (97%) were fully concordant with SNaPshot across all three template amounts. Of those genotypes discordant with SNaPshot, six Ion Torrent sequences (1.5%) were fully concordant with Sanger sequencing across the three template amounts. Seven SNPs (1.7%) were either discordant between template amounts or discordant with Sanger sequencing. Sequence coverage observed in the negative control, and, allele coverage variation for heterozygous genotypes highlights the need to establish a threshold for background levels of sequence output and heterozygous balance. This preliminary study of the Ion Torrent PGM system has demonstrated considerable potential for use in forensic DNA analyses as a low to medium throughput NGS platform using established SNaPshot assays. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
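The concordance figures quoted above follow from simple counting: 136 SNPs typed in three samples give 408 genotypes. A minimal Python sketch of that arithmetic is given here; the function name and its arguments are illustrative and do not come from the study.

```python
def concordance_summary(n_snps, n_samples, concordant, sanger_confirmed, unresolved):
    """Return the genotype total and the percentage falling in each category.

    All parameter names are hypothetical labels for the counts in the abstract.
    """
    total = n_snps * n_samples
    if concordant + sanger_confirmed + unresolved != total:
        raise ValueError("categories must add up to the total genotype count")

    def pct(count):
        # Percentage of all genotypes, rounded to one decimal place.
        return round(100.0 * count / total, 1)

    return {
        "total_genotypes": total,
        "concordant_pct": pct(concordant),
        "sanger_confirmed_pct": pct(sanger_confirmed),
        "unresolved_pct": pct(unresolved),
    }


# Counts as reported: 136 SNPs, 3 samples, 395 / 6 / 7 genotypes per category.
summary = concordance_summary(136, 3, 395, 6, 7)
print(summary)
```

With the reported counts the three categories sum to 408, which is consistent with the percentages given in the abstract (about 97%, 1.5% and 1.7%).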
Butler, John M.; Budowle, B.; Gill, P.
Six scientists presented their views and experience with single nucleotide polymorphism (SNP) markers, multiplexes, and methods regarding their potential application in forensic identity and relationship testing. Benefits and limitations of SNPs were reviewed, as were different SNP marker...
Interferon alpha (IFNα) is approved for adjuvant treatment of stage III melanoma in Europe and the US. Its clinical efficacy, however, is restricted to a subpopulation of patients, while side effects occur in most treated patients. Thus, the identification of predictive biomarkers would be highly beneficial to improve the benefit-to-risk ratio. In this regard, STAT3 is important for signaling of the IFNα receptor. Moreover, the STAT3 single nucleotide polymorphism (SNP) rs4796793 has recently been reported to be associated with IFNα sensitivity in metastatic renal cell carcinoma. To translate this notion to melanoma, we scrutinized the impact of rs4796793 functionally and clinically in this cancer. Interestingly, melanoma cells carrying the minor allele of rs4796793 were the most sensitive to IFNα in vitro. However, we did not detect a correlation between SNP genotype and STAT3 mRNA expression for either melanoma cells or peripheral blood lymphocytes. Next, we analyzed the impact of rs4796793 on the clinical outcome of 259 stage III melanoma patients, of whom one third had received adjuvant IFNα treatment. These analyses did not reveal a significant association between the STAT3 rs4796793 SNP and patients' progression-free or overall survival when IFN-treated and untreated patients were compared. In conclusion, the STAT3 rs4796793 SNP is not a predictive marker for the efficacy of adjuvant IFNα treatment in melanoma patients.
The analysis of next-generation sequence (NGS) data is often a fragmented step-wise process. For example, multiple pieces of software are typically needed to map NGS reads, extract variant sites, and construct a DNA sequence matrix containing only single nucleotide polymorphisms (i.e., a SNP matrix) for a set of individuals. The management and chaining of these software pieces and their outputs can often be a cumbersome and difficult task. Here, we present CFSAN SNP Pipeline, which combines into a single package the mapping of NGS reads to a reference genome with Bowtie2, processing of those mapping (BAM) files using SAMtools, identification of variant sites using VarScan, and production of a SNP matrix using custom Python scripts. We also introduce a Python package (CFSAN SNP Mutator) that, when given a reference genome, will generate variants of known position against which we validate our pipeline. We created 1,000 simulated Salmonella enterica sp. enterica Serovar Agona genomes at 100× and 20× coverage, each containing 500 SNPs, 20 single-base insertions and 20 single-base deletions. For the 100× dataset, the CFSAN SNP Pipeline recovered 98.9% of the introduced SNPs and had a false positive rate of 1.04 × 10⁻⁶; for the 20× dataset, 98.8% of SNPs were recovered and the false positive rate was 8.34 × 10⁻⁷. Based on these results, CFSAN SNP Pipeline is a robust and accurate tool that is among the first to combine into a single executable the myriad steps required to produce a SNP matrix from NGS data. Such a tool is useful to those working in an applied setting (e.g., food safety traceback investigations) as well as to those interested in evolutionary questions.
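Validation against simulated genomes of this kind amounts to comparing called SNP positions with the positions that were deliberately introduced. The Python sketch below illustrates one way to compute a recovery rate and a false positive rate. The function name, the toy positions, and the choice of denominator for the false positive rate (all reference positions that were not mutated) are assumptions made for illustration; the abstract does not state how the pipeline's authors defined these quantities.

```python
def evaluate_calls(introduced, called, genome_length):
    """Compare called SNP positions with the known, introduced positions.

    introduced, called: iterables of 0-based reference positions.
    genome_length: number of positions in the reference genome.
    The denominators used here are assumptions, not the published definition.
    """
    introduced = set(introduced)
    called = set(called)
    true_positives = len(introduced & called)
    false_positives = len(called - introduced)
    # Positions where no SNP was introduced are the possible false positives.
    negatives = genome_length - len(introduced)
    return {
        "recovery": true_positives / len(introduced),
        "false_positive_rate": false_positives / negatives,
    }


# Toy example: 4 introduced SNPs, 3 recovered, 1 spurious call.
result = evaluate_calls({10, 20, 30, 40}, {10, 20, 30, 99}, genome_length=1004)
print(result)
```

In the toy example three of four introduced SNPs are recovered and one call out of 1,000 unmutated positions is spurious.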
Andreassen, Christian Nicolaj; Rosenstein, Barry S; Kerns, Sarah L
PURPOSE: Several small studies have indicated that the ATM rs1801516 SNP is associated with risk of normal tissue toxicity after radiotherapy. However, the findings have not been consistent. In order to test this SNP in a well-powered study, an individual patient data meta-analysis was carried ou...
Gardner, Shea [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Slezak, Tom [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs. The method is fast to compute, finding SNPs and building a SNP phylogeny in seconds to hours. We use it to identify thousands of putative SNPs from all publicly available Filoviridae, Poxviridae, foot-and-mouth disease virus, Bacillus, and Escherichia coli genomes and plasmids. The SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle as input hundreds of gigabases of sequence in a single run. The algorithm is based on k-mer analysis using a suffix array, so we call it saSNP.
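The general idea of finding SNPs by k-mer analysis can be sketched in a few lines of Python. This is a deliberately simplified illustration and not the saSNP implementation: it uses plain dictionaries rather than a suffix array, ignores reverse complements, and all names and the toy sequences are hypothetical. A candidate SNP is reported where the flanking sequence of an odd-length k-mer is present exactly once per genome but the central base differs between genomes.

```python
from collections import defaultdict


def kmer_snp_candidates(genomes, k=5):
    """Return {(left_flank, right_flank): {genome_name: central_base}}.

    genomes: dict mapping a genome name to its sequence string.
    k: odd k-mer length; the middle base is the candidate SNP position.
    """
    if k % 2 == 0:
        raise ValueError("k must be odd so that the k-mer has a central base")
    half = k // 2
    # flank -> genome name -> set of central bases observed with that flank
    per_flank = defaultdict(lambda: defaultdict(set))
    for name, seq in genomes.items():
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            flank = (kmer[:half], kmer[half + 1:])
            per_flank[flank][name].add(kmer[half])

    candidates = {}
    for flank, by_genome in per_flank.items():
        if len(by_genome) != len(genomes):
            continue  # flank is missing from at least one genome
        if any(len(bases) != 1 for bases in by_genome.values()):
            continue  # ambiguous within a genome (e.g. repeated flank)
        alleles = {name: next(iter(bases)) for name, bases in by_genome.items()}
        if len(set(alleles.values())) > 1:
            candidates[flank] = alleles
    return candidates


toy = {"g1": "TTGACCATGCA", "g2": "TTGACGATGCA"}
print(kmer_snp_candidates(toy, k=5))
```

For the two toy sequences, which differ at a single position, the only flank reported is ("AC", "AT") with central bases C and G. A production method would also need to handle both strands and scale to many genomes, which is where an index such as a suffix array becomes relevant.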
Kim, Ki-Seung; Unfried, Jair R; Hyten, David L; Frederick, Reid D; Hartman, Glen L; Nelson, Randall L; Song, Qijian; Diers, Brian W
Soybean rust (SBR), caused by Phakopsora pachyrhizi Sydow, is one of the most economically important and destructive diseases of soybean [Glycine max (L.) Merr.] and the discovery of novel SBR resistance genes is needed because of virulence diversity in the pathogen. The objectives of this research were to map SBR resistance in plant introduction (PI) 561356 and to identify single nucleotide polymorphism (SNP) haplotypes within the region on soybean chromosome 18 where the SBR resistance gene Rpp1 maps. One-hundred F(2:3) lines derived from a cross between PI 561356 and the susceptible experimental line LD02-4485 were genotyped with genetic markers and phenotyped for resistance to P. pachyrhizi isolate ZM01-1. The segregation ratio of reddish brown versus tan lesion type in the population supported that resistance was controlled by a single dominant gene. The gene was mapped to a 1-cM region on soybean chromosome 18 corresponding to the same interval as Rpp1. A haplotype analysis of diverse germplasm across a 213-kb interval that included Rpp1 revealed 21 distinct haplotypes of which 4 were present among 5 SBR resistance sources that have a resistance gene in the Rpp1 region. Four major North American soybean ancestors belong to the same SNP haplotype as PI 561356 and seven belong to the same haplotype as PI 594538A, the Rpp1-b source. There were no North American soybean ancestors belonging to the SNP haplotypes found in PI 200492, the source of Rpp1, or PI 587886 and PI 587880A, additional sources with SBR resistance mapping to the Rpp1 region.
Rajasuriar, Reena; Kong, Yong Yean; Nadarajah, Reshika; Abdullah, Noor Kamila; Spelman, Tim; Yuhana, Muhamad Yazli; Ponampalavanar, Sasheela; Kamarulzaman, Adeeba; Lewin, Sharon R
HIV-infected individuals have an increased risk of cardiovascular disease (CVD). T-allele carriers of the CD14 C-260T single-nucleotide polymorphism (SNP) have reported increased expression of the LPS-binding receptor, CD14 and inflammation in the general population. Our aim was to explore the relationship of this SNP with monocyte/macrophage activation and inflammation and its association with sub-clinical atherosclerosis in HIV-infected individuals. Patients with no pre-existing CVD risk factors on suppressive antiretroviral therapy were recruited from University Malaya Medical Centre, Malaysia (n = 84). The CD14 C-260T and TLR4 SNPs, Asp299Gly and Thr399Ile were genotyped and soluble(s) CD14 and sCD163 and high-sensitivity C-reactive protein, hsCRP were measured in plasma. Subclinical atherosclerosis was assessed by measuring carotid intima media thickness (cIMT). The association between CD14 C-260T SNP carriage and cIMT was assessed in a multivariable quantile regression model where a p-value of CVD risk profile.
Xu, Qian; Wu, Ye-Feng; Li, Ying; He, Cai-Yun; Sun, Li-Ping; Liu, Jing-Wei; Yuan, Yuan
Gastric cancer (GC) is a multistep complex disease involving multiple genes, and gene-gene interactions have a greater effect than a single gene in determining cancer susceptibility. This study aimed to explore the interaction of the let-7e rs8111742, miR-365b rs121224, and miR-4795 rs1002765 single nucleotide polymorphisms (SNPs) with SNPs of the predicted target gene PGC and Helicobacter pylori status in GC and atrophic gastritis (AG) risk. Three miRNA SNPs and seven PGC SNPs were detected in 2448 cases using the Sequenom MassArray platform. Two pairwise combinations of miRNA and PGC SNPs were associated with increased AG risk (let-7e rs8111742 - PGC rs6458238 and miR-4795 rs1002765 - PGC rs9471643). Individually, miR-365b rs121224 and PGC rs6912200 had no effect, but in combination they demonstrated an epistatic interaction associated with AG risk. Similarly, let-7e rs8111742 and miR-4795 rs1002765 SNPs interacted with H. pylori infection to increase GC risk (rs8111742: Pinteraction = 0.024; rs1002765: Pinteraction = 0.031, respectively). A three-dimensional interaction analysis found that miR-4795 rs1002765, PGC rs9471643, and H. pylori infection positively interacted to increase AG risk (Pinteraction = 0.027). Also, let-7e rs8111742, PGC rs6458238, and H. pylori infection positively interacted to increase GC risk (Pinteraction = 0.036). Furthermore, both of these three-dimensional interactions had a dosage-effect correspondence (Ptrend < 0.001) and were verified by MDR. In conclusion, the miRNA SNPs (let-7e rs8111742 and miR-4795 rs1002765) might predict GC or AG risk more effectively when combined with PGC SNPs and/or H. pylori status than any single SNP on its own.
Evidence for the impact of mislabeling and/or pollen contamination on consistency of field performance has been lacking to reinforce the need for strict adherence to quality control protocols in cacao seed garden and germplasm plot management. The present study used SNP fingerprinting at 64 loci to ...
Lee, Jong-Min; Gillis, Tammy; Mysore, Jayalakshmi Srinidhi; Ramos, Eliana Marisa; Myers, Richard H.; Hayden, Michael R.; Morrison, Patrick J.; Nance, Martha; Ross, Christopher A.; Margolis, Russell L.; Squitieri, Ferdinando; Griguoli, Annamaria; Di Donato, Stefano; Gomez-Tortosa, Estrella; Ayuso, Carmen; Suchowersky, Oksana; Trent, Ronald J.; McCusker, Elizabeth; Novelletto, Andrea; Frontali, Marina; Jones, Randi; Ashizawa, Tetsuo; Frank, Samuel; Saint-Hilaire, Marie-Helene; Hersch, Steven M.; Rosas, Herminia D.; Lucente, Diane; Harrison, Madaline B.; Zanko, Andrea; Abramson, Ruth K.; Marder, Karen; Sequeiros, Jorge; MacDonald, Marcy E.; Gusella, James F.
Age at the onset of motor symptoms in Huntington disease (HD) is determined largely by the length of a CAG repeat expansion in HTT but is also influenced by other genetic factors. We tested whether common genetic variation near the mutation site is associated with differences in the distribution of expanded CAG alleles or age at the onset of motor symptoms. To define disease-associated single-nucleotide polymorphisms (SNPs), we compared 4p16.3 SNPs in HD subjects with population controls in a case:control strategy, which revealed that the strongest signals occurred at a great distance from the HD mutation as a result of “synthetic association” with SNP alleles that are of low frequency in population controls. Detailed analysis delineated a prominent ancestral haplotype that accounted for ∼50% of HD chromosomes and extended to at least 938 kb on about half of these. Together, the seven most abundant haplotypes accounted for ∼83% of HD chromosomes. Neither the extended shared haplotype nor the individual local HTT haplotypes were associated with altered CAG-repeat length distribution or residual age at the onset of motor symptoms, arguing against modification of these disease features by common cis-regulatory elements. Similarly, the 11 most frequent control haplotypes showed no trans-modifier effect on age at the onset of motor symptoms. Our results argue against common local regulatory variation as a factor influencing HD pathogenesis, suggesting that genetic modifiers be sought elsewhere in the genome. They also indicate that genome-wide association analysis with a small number of cases can be effective for regional localization of genetic defects, even when a founder effect accounts for only a fraction of the disorder. PMID:22387017
BACKGROUND: Blunt snout bream (Megalobrama amblycephala) is an herbivorous freshwater fish species native to China and has been recognized as a main aquaculture species in the Chinese freshwater polyculture system with high economic value. Currently, only limited EST resources are available for M. amblycephala. Recent advances in large-scale RNA sequencing provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes. METHODOLOGY AND PRINCIPAL FINDINGS: Using 454 pyrosequencing, a total of 1,409,706 high quality reads (total length 577 Mbp) were generated from the normalized cDNA of pooled M. amblycephala individuals. These sequences were assembled into 26,802 contigs and 73,675 singletons. After BLAST searches against the NCBI non-redundant (NR) and UniProt databases with an arbitrary expectation value of E(-10), over 40,000 unigenes were functionally annotated and classified using the FunCat functional annotation scheme. A comparative genomics approach revealed a substantial proportion of genes expressed in the M. amblycephala transcriptome to be shared across the genomes of zebrafish, medaka, tetraodon, fugu, stickleback, human, mouse, and chicken, and identified a substantial number of potentially novel M. amblycephala genes. A total of 4,952 SSRs were found and 116 polymorphic loci have been characterized. A significant number of SNPs (25,697) and indels (23,287) were identified based on specific filter criteria in M. amblycephala. CONCLUSIONS: This study is the first comprehensive transcriptome analysis for a fish species belonging to the genus Megalobrama. These large EST resources are expected to be valuable for the development of molecular markers, construction of gene-based linkage map, and large-scale expression analysis of M. amblycephala, as well as comparative genome analysis for the genus
Beekeeping activities, especially queen trading, have shaped the distribution of honey bee (Apis mellifera) subspecies in Europe, and have resulted in extensive introductions of two eastern European C-lineage subspecies (A. m. ligustica and A. m. carnica) into the native range of the M-lineage A. m. mellifera subspecies in Western Europe. As a consequence, replacement and gene flow between native and commercial populations have occurred at varying levels across western European populations. Genetic identification and introgression analysis using molecular markers is an important tool for management and conservation of honey bee subspecies. Previous studies have monitored introgression by using microsatellite and PCR-RFLP markers and, most recently, high density assays using single nucleotide polymorphism (SNP) markers. While the latter are almost prohibitively expensive, the information gained to date can be exploited to create a reduced panel containing the most ancestry-informative markers (AIMs) for those purposes with very little loss of information. The objective of this study was to design reduced panels of AIMs to verify the origin of A. m. mellifera individuals and to provide accurate estimates of the level of C-lineage introgression into their genome. The discriminant power of the SNPs was evaluated using a variety of metrics and approaches, including Weir & Cockerham's FST, an FST-based outlier test, Delta, informativeness (In), and PCA. This study shows that reduced AIMs panels assign individuals to the correct origin and calculate the admixture level with a high degree of accuracy. These panels provide an essential tool in Europe for genetic stock identification and estimation of admixture levels, which can assist management strategies and monitor honey bee conservation programs.
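Of the metrics listed, Delta is the simplest to illustrate: it is commonly defined as the absolute difference in allele frequency between two reference populations, and markers with the largest Delta are the most ancestry-informative. The Python sketch below ranks SNPs on that basis. The function name, the SNP labels and the allele frequencies are invented for illustration and are not taken from the study, which combined several metrics.

```python
def rank_by_delta(freq_pop1, freq_pop2, top_n):
    """Rank SNPs by Delta = |p1 - p2| and return the top_n as (snp, delta) pairs.

    freq_pop1, freq_pop2: dicts mapping SNP identifiers to the frequency of the
    same reference allele in each population (hypothetical inputs).
    """
    shared = freq_pop1.keys() & freq_pop2.keys()
    deltas = {snp: abs(freq_pop1[snp] - freq_pop2[snp]) for snp in shared}
    # Largest frequency difference first; ties broken by SNP name for stability.
    ranked = sorted(deltas.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_n]


# Made-up allele frequencies for an M-lineage and a C-lineage reference sample.
m_lineage = {"snp1": 0.95, "snp2": 0.50, "snp3": 0.10}
c_lineage = {"snp1": 0.05, "snp2": 0.45, "snp3": 0.80}
for snp, delta in rank_by_delta(m_lineage, c_lineage, top_n=2):
    print(snp, round(delta, 2))
```

A reduced panel built this way would keep only the highest-ranked markers; whether a given panel size loses information has to be checked empirically, as the study describes.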
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela
High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.
Panitz, Frank; Stengaard, Henrik; Hornshoj, Henrik
MOTIVATION: Single nucleotide polymorphisms (SNPs) analysis is an important means to study genetic variation. A fast and cost-efficient approach to identify large numbers of novel candidates is the SNP mining of large scale sequencing projects. The increasing availability of sequence trace data...... in public repositories makes it feasible to evaluate SNP predictions on the DNA chromatogram level. MAVIANT, a platform-independent Multipurpose Alignment VIewing and Annotation Tool, provides DNA chromatogram and alignment views and facilitates evaluation of predictions. In addition, it supports direct...... manual annotation, which is immediately accessible and can be easily shared with external collaborators. RESULTS: Large-scale SNP mining of polymorphisms bases on porcine EST sequences yielded more than 7900 candidate SNPs in coding regions (cSNPs), which were annotated relative to the human genome. Non...
The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA) study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10⁻¹²). Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.
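The scale of an exhaustive pairwise search explains why such a stringent significance level is needed. The Python sketch below counts the SNP pairs among roughly 300,000 markers and applies a Bonferroni correction. Both the family-wise alpha of 0.05 and the use of Bonferroni are assumptions made for illustration; the abstract does not say how its threshold was derived.

```python
import math


def pairwise_tests(n_snps):
    """Number of unordered SNP pairs, i.e. n choose 2."""
    return math.comb(n_snps, 2)


def bonferroni_threshold(n_tests, alpha=0.05):
    """Per-test significance level under a Bonferroni correction (assumed)."""
    return alpha / n_tests


n_pairs = pairwise_tests(300_000)
threshold = bonferroni_threshold(n_pairs)
print(f"{n_pairs:,} pairs, per-test threshold {threshold:.2e}")
```

Under these assumptions there are about 4.5 × 10¹⁰ pairs and the per-test threshold is on the order of 10⁻¹², the same order of magnitude as the genome-wide significance level quoted above.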
Background: Single-nucleotide polymorphisms (SNPs) are considered to be useful polymorphic markers for genetic studies of polygenic traits. Single-stranded conformational polymorphism (SSCP) analysis has been widely applied to detect SNPs, including point mutations in cancer and congenital diseases. In this study, we describe an application of the fluorescent labeling of PCR fragments using a fluorescent-adapted primer for SSCP analysis as a novel method. Methods: Single-nucleotide polymorphisms (SNPs) of the inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) gene were analyzed using a fluorescence-adapted SSCP method. The method was constructed from two procedures: (1) a fluorescent labeling reaction of PCR fragments using fluorescence-adapted primers in a single tube, and (2) electrophoresis on a non-denaturing polyacrylamide gel. Results: This method was more economical and convenient than the single-stranded conformational polymorphism (SSCP) methods previously reported for the detection of the labeled fragments obtained. In this study, eight SNPs of the IHRP gene were detected by the fluorescence-adapted SSCP. One of the SNPs was a new SNP resulting in an amino acid substitution, while the other SNPs have already been reported in the public databases. Six SNPs of the IHRP were associated with two haplotypes. Conclusions: The fluorescence-adapted SSCP was useful for detecting and genotyping SNPs.
Background: The eastern oyster, Crassostrea virginica (Gmelin 1791), is an economically important species cultured in many areas in North America. It is also ecologically important because of the impact of its filter feeding behaviour on water quality. Populations of C. virginica have been threatened by overfishing, habitat degradation, and diseases. Through genome research, strategies are being developed to reverse its population decline. However, large-scale expressed sequence tag (EST) resources have been lacking for this species. Efficient generation of EST resources from this species has been hindered by a high redundancy of transcripts. The objectives of this study were to construct a normalized cDNA library for efficient EST analysis, to generate thousands of ESTs, and to analyze the ESTs for microsatellites and potential single nucleotide polymorphisms (SNPs). Results: A normalized and subtracted C. virginica cDNA library was constructed from pooled RNA isolated from hemocytes, mantle, gill, gonad and digestive tract, muscle, and a whole juvenile oyster. A total of 6,528 clones were sequenced from this library generating 5,542 high-quality EST sequences. Cluster analysis indicated the presence of 635 contigs and 4,053 singletons, generating a total of 4,688 unique sequences. About 46% (2,174) of the unique ESTs had significant hits (E-value ≤ 1e-05) to the non-redundant protein database; 1,104 of which were annotated using Gene Ontology (GO) terms. A total of 35 microsatellites were identified from the ESTs, with 18 having sufficient flanking sequences for primer design. A total of 6,533 putative SNPs were also identified using all existing and the newly generated EST resources of the eastern oysters. Conclusion: A high quality normalized cDNA library was constructed. A total of 5,542 ESTs were generated representing 4,688 unique sequences. Putative microsatellite and SNP markers were identified. These genome resources provide the
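The clustering figures in this abstract can be cross-checked with a short calculation: contigs plus singletons give the number of unique sequences, and the annotated subset is expressed as a share of that total. The Python sketch below does this; the function and argument names are illustrative only.

```python
def unique_sequence_summary(contigs, singletons, with_hits):
    """Return (unique sequences, percentage of them with significant hits)."""
    unique = contigs + singletons
    # Share of unique sequences with a database hit, to one decimal place.
    hit_pct = round(100.0 * with_hits / unique, 1)
    return unique, hit_pct


# Counts as reported: 635 contigs, 4,053 singletons, 2,174 sequences with hits.
unique, hit_pct = unique_sequence_summary(635, 4053, 2174)
print(unique, hit_pct)
```

The result agrees with the 4,688 unique sequences and the "about 46%" quoted in the abstract.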
Bulk segregant analysis (BSA) using microarrays, and extreme array mapping (XAM) have recently been used to rapidly identify genomic regions associated with phenotypes in multiple species. These experiments, however, require the identification of single feature polymorphisms (SFP) between the cross parents for each new combination of genotypes, which raises the cost of experiments. The availability of the genomic polymorphism data in Arabidopsis thaliana, coupled with the efficient designs of Single Nucleotide Polymorphism (SNP) genotyping arrays, removes the requirement for SFP detection and lowers the per array cost, thereby lowering the overall cost per experiment. To demonstrate that these approaches would be functional on SNP arrays and determine confidence intervals, we analyzed hybridizations of natural accessions to the Arabidopsis ATSNPTILE array and simulated BSA or XAM given a variety of gene models, populations, and bulk selection parameters. Our results show a striking degree of correlation between the genotyping output of both methods, which suggests that the benefit of SFP genotyping in the context of BSA can be had with the cheaper, more efficient SNP arrays. As a final proof of concept, we hybridized the DNA from bulks of an F2 mapping population of a Sulfur and Selenium ionomics mutant to both the Arabidopsis ATTILE1R and ATSNPTILE arrays, which produced almost identical results. We have produced R scripts that prompt the user for the required parameters and perform the BSA analysis using the ATSNPTILE1 array and have provided them as supplemental data files.
The rs5934505 single nucleotide polymorphism (SNP) is associated with low testosterone and late-onset hypogonadism, but the rs10822184 SNP is associated with overweight and obesity in a Chinese Han population: a case-control study.
Chen, Y-P; Nie, L-L; Li, H-G; Liu, T-H; Fang, F; Zhao, K; Yang, R-F; Ma, X-L; Kong, X-B; Zhang, H-P; Guan, H-T; Xia, W; Hong, W-X; Duan, S; Zeng, X-C; Shang, X-J; Zhou, Y-Z; Gu, Y-Q; Wu, W-X; Xiong, C-L
Low testosterone is associated with late-onset hypogonadism (LOH) and obesity. Recently, studies have shown that four single nucleotide polymorphisms (SNPs), rs12150660, rs727428, rs5934505, and rs10822184, are associated with testosterone levels in populations of European descent. Therefore, we investigated whether the SNP loci are related to low testosterone, LOH, or obesity in a Chinese Han population. Ruling out co-morbidities, DNA was prepared from 409 men (aged 40-65 years) with low serum testosterone (defined as total testosterone testosterone and calculated free testosterone (CFT) levels (p = 0.045 and p = 0.021). After adjustment for other factors, rs5934505 (C>T) was associated with an increased risk of low total testosterone, low CFT, and LOH, with odds ratios (ORs) of 2.01 (1.34-3.01), 2.14 (1.42-3.20), and 1.64 (1.04-2.58), respectively. rs10822184 was significantly correlated with weight and BMI (p = 0.035 and p = 0.027). After adjustment for other factors, rs10822184 (T>C) was associated with an increased risk of overweight and obesity, with ORs of 1.94 (1.36-2.78) and 1.56 (1.00-2.43), respectively. In summary, our study provided convincing evidence that rs5934505 (C>T) was associated with the risk of low testosterone and LOH in Chinese populations. We were the first to find that rs10822184 (T>C) was significantly correlated with the risk of overweight and obesity in Chinese populations. However, further large and functional studies are warranted to confirm our findings. © 2015 American Society of Andrology and European Academy of Andrology.
Konishi, Saeko; Izawa, Takeshi; Lin, Shao Yang; Ebana, Kaworu; Fukuta, Yoshimichi; Sasaki, Takuji; Yano, Masahiro
Loss of seed shattering was a key event in the domestication of major cereals. We revealed that the qSH1 gene, a major quantitative trait locus of seed shattering in rice, encodes a BEL1-type homeobox gene and demonstrated that a single-nucleotide polymorphism (SNP) in the 5' regulatory region of the qSH1 gene caused loss of seed shattering owing to the absence of abscission layer formation. Haplotype analysis and association analysis in various rice collections revealed that the SNP was highly associated with shattering among japonica subspecies of rice, implying that it was a target of artificial selection during rice domestication.
Velázquez-Fernández, David; Mercado-Celis, Gabriela; Flores-Morales, Jeny; Clavellina-Gaytán, Diana; Vidrio, Ramón; Vidrio, Eduardo; Mosti, Maureen; Sánchez-Aguilar, Hugo; Rodriguez, Donaji; León, Pablo; Herrera, Miguel F
Obesity is the result of a complex interaction between multiple genetic traits and psychological, behavioral, nutritional and environmental factors. The aims of the study were (1) to comparatively evaluate the presence of 20 candidate gene single nucleotide polymorphisms (SNPs) in morbidly obese patients, (2) to assess their association with comorbid conditions and (3) to assess their impact on weight loss after a Roux-en-Y gastric bypass (RYGB). Two hundred forty-nine patients were eligible for this study. Clinical, anthropometric, biochemical and demographic variables were analyzed. Body mass index (BMI) and composition were assessed by bioelectrical impedance. Twenty SNPs were included for analysis. There were 168 Mexican mestizos (67.5%) and 81 (32.5%) patients with other ancestral origin. One hundred fifty-nine (64.1%) were females. Mean ± SD age of the general cohort was 41.1 ± 11.3 years (17-71). Preoperative mean ± SD BMI was 42.5 ± 6.5 kg/m². There were no significant differences between mestizo and non-mestizo patients for most SNPs except for IFI, LIPC, and ST8SIA2. FTO (OR = 1.71; 95% CI = 1.14-2.57; p = 0.008) and APOB (OR = 0.31; 95% CI = 0.14-0.72; p = 0.004) were statistically associated with high blood pressure, and FTO (OR = 2.0; 95% CI = 1.3-3.1; p = 0.001), GNB3 (OR = 2.69; 95% CI = 1.0-7.2; p = 0.04), IFI30 (OR = 2.0; 95% CI = 1.16-3.6; p = 0.01), and MC4R (OR = 1.81; 95% CI = 1.13-2.9; p = 0.01) with type 2 diabetes (T2D). Based on ANOVA, POMC (rs1042571) was the SNP most significantly associated with a higher weight loss after RYGB. Obese patients have similar SNP frequencies. Several SNPs are statistically associated with high blood pressure and T2D. POMC was significantly associated with a higher surgically induced weight loss.
Wang, Binbin; Zhou, Sirui; Hong, Fuchang; Wang, Jing; Liu, Xiaoli; Cai, Yumao; Wang, Feng; Feng, Tiejian; Ma, Xu
Male sexual orientation has been proposed to have genetic components, but previously suggested candidate genes have all received negative results. The human sonic hedgehog (SHH) gene is located in the 7q36 region, which was linked to male sexual orientation in a previous genome-wide association study. SHH is known to play an important role in embryo patterning, and there is evidence connecting it to sexual orientation. In this study, we performed an association analysis of the SHH tag single nucleotide polymorphism rs9333613 in 361 subjects and 319 Chinese male controls. We find a significant difference in genotype and allele distribution between identified homosexuals and heterosexual control subjects, suggesting that the SHH gene could potentially be associated with male sexual orientation.
Nava, Caroline; Keren, Boris; Mignot, Cyril; Rastetter, Agnès; Chantot-Bastaraud, Sandra; Faudet, Anne; Fonteneau, Eric; Amiet, Claire; Laurent, Claudine; Jacquette, Aurélia; Whalen, Sandra; Afenjar, Alexandra; Périsse, Didier; Doummar, Diane; Dorison, Nathalie; Leboyer, Marion; Siffroi, Jean-Pierre; Cohen, David; Brice, Alexis; Héron, Delphine; Depienne, Christel
Copy number variants (CNVs) have repeatedly been found to cause or predispose to autism spectrum disorders (ASDs). For diagnostic purposes, we screened 194 individuals with ASDs for CNVs using Illumina SNP arrays. In several probands, we also analyzed candidate genes located in inherited deletions to unmask autosomal recessive variants. Three CNVs, a de novo triplication of chromosome 15q11-q12 of paternal origin, a deletion on chromosome 9p24 and a de novo 3q29 deletion, were identified as the cause of the disorder in one individual each. An autosomal recessive cause was considered possible in two patients: a homozygous 1p31.1 deletion encompassing PTGER3 and a deletion of the entire DOCK10 gene associated with a rare hemizygous missense variant. We also identified multiple private or recurrent CNVs, the majority of which were inherited from asymptomatic parents. Although highly penetrant CNVs or variants inherited in an autosomal recessive manner were detected in rare cases, our results mainly support the hypothesis that most CNVs contribute to ASDs in association with other CNVs or point variants located elsewhere in the genome. Identification of these genetic interactions in individuals with ASDs constitutes a formidable challenge.
Birdsell, Dawn N; Pearson, Talima; Price, Erin P; Hornstra, Heidie M; Nera, Roxanne D; Stone, Nathan; Gruendike, Jeffrey; Kaufman, Emily L; Pettus, Amanda H; Hurbon, Audriana N; Buchhagen, Jordan L; Harms, N Jane; Chanturia, Gvantsa; Gyuranecz, Miklos; Wagner, David M; Keim, Paul S
Single nucleotide polymorphisms (SNPs) are abundant in genomes of all species and biologically informative markers extensively used across broad scientific disciplines. Newly identified SNP markers are publicly available at an ever-increasing rate due to advancements in sequencing technologies. Efficient, cost-effective SNP genotyping methods to screen sample populations are in great demand in well-equipped laboratories, but also in developing world situations. Dual Probe TaqMan assays are robust but can be cost-prohibitive and require specialized equipment. The Mismatch Amplification Mutation Assay, coupled with melt analysis (Melt-MAMA), is flexible, efficient and cost-effective. However, Melt-MAMA traditionally suffers from high rates of assay design failures and knowledge gaps on assay robustness and sensitivity. In this study, we identified strategies that improved the success of Melt-MAMA. We examined the performance of 185 Melt-MAMAs across eight different pathogens using various optimization parameters. We evaluated the effects of genome size and %GC content on assay development. When used collectively, specific strategies markedly improved the rate of successful assays at the first design attempt from ~50% to ~80%. We observed that Melt-MAMA accurately genotypes across a broad DNA range (~100 ng to ~0.1 pg). Genomic size and %GC content influence the rate of successful assay design in an independent manner. Finally, we demonstrated the versatility of these assays by the creation of a duplex Melt-MAMA real-time PCR (two SNPs) and conversion to a size-based genotyping system, which uses agarose gel electrophoresis. Melt-MAMA is comparable to Dual Probe TaqMan assays in terms of design success rate and accuracy. Although sensitivity is less robust than Dual Probe TaqMan assays, Melt-MAMA is superior in terms of cost-effectiveness, speed of development and versatility. We detail the parameters most important for the successful application of Melt-MAMA, which
Huang, Yong-Zhen; Zhan, Zhao-Yang; Li, Xin-Yi; Wu, Sheng-Ru; Sun, Yu-Jia; Xue, Jing; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Jia, Yu-Tang; Chen, Hong
Insulin-like growth factor 2 (IGF2) is a potent cell growth and differentiation factor and is implicated in mammalian growth and development. The objective of this study was to evaluate the association of mutations in the bovine IGF2 gene with growth traits in Chinese Qinchuan cattle. Four single nucleotide polymorphisms (SNPs) were detected in the bovine IGF2 gene by DNA pool sequencing and forced polymerase chain reaction-restriction fragment length polymorphism (forced PCR-RFLP) methods. We also investigated haplotype structure and linkage disequilibrium (LD) coefficients for the four SNPs in 817 individuals representing two main cattle breeds from China. Haplotype analysis showed eight different haplotypes and 27 combined genotypes within the study population. The statistical analyses indicated that the four SNPs, combined genotypes and haplotypes are associated with the withers height, body length, chest breadth, chest depth and body weight in the Qinchuan cattle population (P growth traits; the heterozygote diplotype was associated with higher growth traits compared to the wild-type homozygote. Our results provide evidence that polymorphisms in the IGF2 gene are associated with growth traits, and may be used for marker-assisted selection in beef cattle breeding programs.
Bai, Bing; Zhao, Wen-Ming; Tang, Bi-Xia; Wang, Yan-Qing; Wang, Lu; Zhang, Zhang; Yang, He-Chuan; Liu, Yan-Hu; Zhu, Jun-Wei; Irwin, David M; Wang, Guo-Dong; Zhang, Ya-Ping
The rapid advancement of next-generation sequencing technology has generated a deluge of genomic data from domesticated dogs and their wild ancestor, grey wolves, which have simultaneously broadened our understanding of domestication and diseases that are shared by humans and dogs. To address the scarcity of single nucleotide polymorphism (SNP) data provided by authorized databases and to make SNP data more accessible and easier to use, we propose DoGSD (http://dogsd.big.ac.cn), the first Canidae-specific database, which focuses on whole genome SNP data from domesticated dogs and grey wolves. DoGSD is a web-based, open-access resource comprising ∼19 million high-quality whole-genome SNPs. In addition to the dbSNP data set (build 139), DoGSD incorporates a comprehensive collection of SNPs from two newly sequenced samples (1 wolf and 1 dog) and SNPs collected from three recent dog/wolf genetic studies (7 wolves and 68 dogs), which were analyzed together using the population genetic statistic Fst. In addition, DoGSD integrates closely related information including SNP annotation, summary lists of SNPs located in genes, synonymous and non-synonymous SNPs, sampling location and breed information. All these features make DoGSD a useful resource for in-depth analysis in dog- and wolf-related studies. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Dozmorov, Mikhail G; Cara, Lukas R; Giles, Cory B; Wren, Jonathan D
The growing amount of regulatory data from the ENCODE, Roadmap Epigenomics and other consortia provides a wealth of opportunities to investigate the functional impact of single nucleotide polymorphisms (SNPs). Yet, given the large number of regulatory datasets, researchers face the challenge of how to efficiently utilize them to interpret the functional impact of SNP sets. We developed the GenomeRunner web server to automate systematic statistical analysis of SNP sets within a regulatory context. Besides defining the functional impact of SNP sets, GenomeRunner implements novel regulatory similarity/differential analyses, and cell type-specific regulatory enrichment analysis. Validated against literature- and disease ontology-based approaches, analysis of 39 disease/trait-associated SNP sets demonstrated that the functional impact of SNP sets corresponds to known disease relationships. We identified a group of autoimmune diseases with SNPs distinctly enriched in the enhancers of T helper cell subpopulations, and demonstrated relevant cell type-specificity of the functional impact of other SNP sets. In summary, we show how systematic analysis of genomic data within a regulatory context can help interpret the functional impact of SNP sets. The GenomeRunner web server is freely available at http://www.integrativegenomics.org/. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
Wang, Yunsheng; Shahid, Muhammad Qasim; Lin, Shunquan; Chen, Chengjie; Hu, Chen
The process of crop domestication has long been a major area of research to gain insights into the history of human civilization and to understand the process of evolution. Loquat (Eriobotrya japonica Lindl.) is a typical subtropical fruit tree, which was domesticated in China at least 2000 years ago. In the present study, we re-sequenced the genomes of nine wild loquat accessions collected from a wide geographical range and 10 representative loquat cultivars using a RAD-tag approach to examine the molecular footprints of domestication. We obtained 26.4 Gb of clean sequencing data from 19 loquat accessions, with an average of 32.64 M reads per genotype. We identified more than 80,000 SNPs distributed throughout the loquat genome. The SNP density and numbers were slightly higher in the wild loquat populations than in the cultivated populations. All cultivars clustered together in structure, phylogenetic and PCA analyses. The modern loquat cultivars have experienced a non-significant genetic bottleneck during domestication, and originated from a single domestication event. Moreover, our study revealed that Hubei province of China is probably the center of origin of cultivated loquat.
Single nucleotide polymorphism (SNP) discovery in duplicated genomes: intron-primed exon-crossing (IPEC) as a strategy for avoiding amplification of duplicated loci in Atlantic salmon (Salmo salar) and other salmonid fishes
Primmer Craig R
Background: Single nucleotide polymorphisms (SNPs) represent the most abundant type of DNA variation in the vertebrate genome, and their applications as genetic markers in numerous studies of molecular ecology and conservation of natural populations are emerging. Recent large-scale sequencing projects in several fish species have provided a vast amount of data in public databases, which can be utilized in novel SNP discovery in salmonids. However, the suggested duplicated nature of the salmonid genome may hamper SNP characterization if the primers designed in conserved gene regions amplify multiple loci. Results: Here we introduce a new intron-primed exon-crossing (IPEC) method in an attempt to overcome this duplication problem, and also evaluate different priming methods for SNP discovery in Atlantic salmon (Salmo salar) and other salmonids. A total of 69 loci with differing priming strategies were screened in S. salar, and 27 of these produced ~13 kb of high-quality sequence data consisting of 19 SNPs or indels (one per 680 bp). The SNP frequency and the overall nucleotide diversity (3.99 × 10⁻⁴) in S. salar were lower than reported in a majority of other organisms, which may suggest a relatively young population history for Atlantic salmon. A subset of primers used in cross-species analyses revealed considerable variation in the SNP frequencies and nucleotide diversities in other salmonids. Conclusion: Sequencing success was significantly higher with the new IPEC primers; thus the total number of loci to screen in order to identify one potential polymorphic site was six times lower with this new strategy. Given that duplication may hamper SNP discovery in some species, the IPEC method reported here is an alternative way of identifying novel polymorphisms in such cases.
Two methods of SNP pre-selection based on single marker regression for the estimation of genomic breeding values (G-EBVs) were compared using simulated data provided by the XII QTL-MAS workshop: (i) Bonferroni correction of the significance threshold and (ii) a permutation test to obtain the reference distribution under the null hypothesis and identify significant markers at P<0.01 and P<0.001 significance thresholds. From the set of markers significant at P<0.001, random subsets of 50% and 25% of the markers were extracted to evaluate the effect of further reducing the number of significant SNPs on G-EBV predictions. The Bonferroni correction method allowed the identification of 595 significant SNPs that gave the best G-EBV accuracies in prediction generations (82.80%). The permutation methods gave slightly lower G-EBV accuracies even though a larger number of SNPs were significant (2,053 and 1,352 for the 0.01 and 0.001 significance thresholds, respectively). Interestingly, halving or quartering the number of SNPs significant at P<0.001 resulted in only a slight decrease in G-EBV accuracies. The genetic structure of the simulated population, with few QTL carrying large effects, might have favoured the Bonferroni method.
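The two pre-selection rules compared in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the absolute-correlation test statistic, the function names, the toy genotype and phenotype vectors, and the marker count of 1,000 are all assumptions made for illustration.

```python
import random


def abs_corr(x, y):
    """Absolute Pearson correlation, used here as a stand-in test statistic
    for single marker regression (an assumption, not the authors' statistic)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy) / (sxx * syy) ** 0.5


def bonferroni_threshold(alpha, n_markers):
    """Per-marker significance threshold after Bonferroni correction."""
    return alpha / n_markers


def permutation_p(genotypes, phenotypes, n_perm=999, seed=0):
    """Empirical p-value: shuffle phenotypes to build the null distribution."""
    rng = random.Random(seed)
    observed = abs_corr(genotypes, phenotypes)
    shuffled = list(phenotypes)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs_corr(genotypes, shuffled) >= observed:
            hits += 1
    # The +1 terms count the observed data as one permutation.
    return (hits + 1) / (n_perm + 1)


# Hypothetical toy data: genotypes coded 0/1/2, phenotype driven by genotype.
genotypes = [0, 1, 2] * 10
phenotypes = [2.0 * g + 0.1 * ((i * 7) % 5) for i, g in enumerate(genotypes)]

threshold = bonferroni_threshold(0.05, 1000)   # hypothetical 1,000 markers
p_value = permutation_p(genotypes, phenotypes)
```

With `n_perm` permutations the smallest attainable empirical p-value is 1/(n_perm + 1), so a P<0.001 cut-off needs more than 999 permutations per marker.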
Fusarium spp. cause severe damage in many agricultural crops including sugar beet. Sugar beet needs to be protected from these soil borne pathogens to guarantee an optimal sugar yield in the field. The genetic control is the key to overcoming this disease. Identification of single nucleotide polymor...
Tibetan (TB) and Bama (BM) miniature pigs are two popular pig breeds that are used as experimental animals in China due to their small body size. Here, we analyzed single-nucleotide polymorphisms (SNPs) in gene fragments that are closely related to growth traits [growth hormone (GH), growth hormone receptor (GHR), and insulin-like growth factor 1 (IGF-1)] in these pig breeds and a Large White (LW) control pig breed. On the basis of the analysis of 100 BMs, 108 TBs, and 50 LWs, the polymorphic distribution levels of GH, GHR, and IGF-1 were significantly different among these three pig breeds. According to correlation analyses between SNPs and five growth traits, namely body weight (BW), body length (BL), withers height (WH), chest circumference (CC), and abdomen circumference (AC), three SNP loci in BMs and four SNP loci in TBs significantly affected growth traits. SNPs located in the GH gene fragment significantly affected BL and CC at locus 12 and BL at locus 45 in BMs, and also BW, WH, CC, and AC at locus 45 and WH and CC at locus 93 in TBs. One SNP at locus 85 in the BM GHR gene fragment significantly affected all growth traits. All indices were significantly reduced with a mixture of alleles at locus 85. These results provide more information regarding the genetic background of these minipig breeds and indicate useful selection markers for pig breeding programs.
Evelyn Mendoza Torres
CONTEXT: Genotyping of single nucleotide polymorphism (SNP) C/T-13910 located upstream of the lactase gene is used to determine adult-type hypolactasia/lactase persistence in North-European Caucasian subjects. The applicability of this polymorphism has been studied by comparing it with the standard diagnostic methods in different populations. OBJECTIVE: To compare the lactose hydrogen breath test with the genetic test in a sample of the Colombian Caribbean population. METHODS: Lactose hydrogen breath test and genotyping of SNP C/T-13910 were applied to 128 healthy individuals (mean age 35 ± 1). A positive lactose hydrogen breath test was indicative of hypolactasia. Genotyping was done using polymerase chain reaction/restriction fragment length polymorphism. The kappa index was used to establish agreement between the two methods. RESULTS: Seventy-six subjects (59%) were lactose-maldigesters (hypolactasia) and 52 subjects (41%) were lactose-digesters (lactase persistence). The frequencies of the CC, CT and TT genotypes were 80%, 20% and 0%, respectively. Genotyping had 97% sensitivity and 46% specificity. The kappa index of 0.473 indicates moderate agreement between the genotyping of SNP C/T-13910 and the lactose hydrogen breath test. CONCLUSION: The moderate agreement indicates that the genotyping of the SNP C/T-13910 is not applicable to determine adult-type hypolactasia/lactase persistence in the population participating in this study.
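The reported sensitivity, specificity and kappa index can be cross-checked with a short sketch. The abstract does not give the 2x2 table; the counts below are an assumption, back-calculated from the reported totals (76 maldigesters, 52 digesters) and percentages, treating the CC genotype as the test result that predicts hypolactasia.

```python
def cohen_kappa(tp, fp, fn, tn):
    """Cohen's kappa for a 2x2 agreement table between two binary methods."""
    n = tp + fp + fn + tn
    observed = (tp + tn) / n
    # Chance agreement from the row and column marginals.
    expected = ((tp + fn) * (tp + fp) + (fp + tn) * (fn + tn)) / n ** 2
    return (observed - expected) / (1 - expected)


# Assumed counts (not reported in the abstract):
tp = 74  # CC genotype, positive breath test (maldigester)
fn = 2   # CT genotype, positive breath test
fp = 28  # CC genotype, negative breath test (digester)
tn = 24  # CT genotype, negative breath test

sensitivity = tp / (tp + fn)
specificity = tn / (tn + fp)
cc_frequency = (tp + fp) / (tp + fp + fn + tn)
kappa = cohen_kappa(tp, fp, fn, tn)

print(f"sensitivity={sensitivity:.0%} specificity={specificity:.0%} "
      f"CC={cc_frequency:.0%} kappa={kappa:.3f}")
```

Under these assumed counts the sketch reproduces the figures in the abstract (97% sensitivity, 46% specificity, 80% CC genotype, kappa of about 0.473), which suggests the reported values are internally consistent.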
Jensen, Marie Pødenphant
Isolation and manipulation of single cells have gained an increasing interest from researchers because of the heterogeneity of cells from the same cell culture. Single cell analysis can ensure a better understanding of differences between individual cells and potentially solve a variety of clinical...... problems. In this thesis lab on a chip systems for rare single cell analysis are investigated. The focus was to develop a commercial, disposable device for circulating tumour cell (CTC) analysis. Such a device must be able to separate rare cells from blood samples and subsequently capture the specific...... cells, and simultaneously be fabricated and operated at low costs and be user-friendly. These challenges were addressed through development of two microfluidic devices, one for rare cell isolation based on pinched flow fractionation (PFF) and one for single cell capture based on hydrodynamic trapping...
Therkildsen, Nina Overgaard; Hansen, Jakob Hemmer; Hedeholm, R.B.
Accurate prediction of species distribution shifts in the face of climate change requires a sound understanding of population diversity and local adaptations. Previous modeling has suggested that global warming will lead to increased abundance of Atlantic cod (Gadus morhua) in the ocean around...... Greenland, but the dynamics of earlier abundance fluctuations are not well understood. We applied a retrospective spatiotemporal population genomics approach to examine the temporal stability of cod population structure in this region and to search for signatures of divergent selection over a 78-year period...... spanning major demographic changes. Analyzing >900 gene-associated single nucleotide polymorphisms in 847 individuals, we identified four genetically distinct groups that exhibited varying spatial distributions with considerable overlap and mixture. The genetic composition had remained stable over decades...
Within a sheep population, a small number of animals are aggressive and require special housing and routine management. The purpose of this study was to identify variation in single nucleotide polymorphism (SNP) DNA markers as genetic markers for the aggressive trait in several sheep breeds. Identifying point mutations in exon 8 of the MAO-A gene associated with aggressive behavior in sheep may be useful for developing DNA markers for the aggressive trait. Five sheep breeds were used: Barbados Black Belly Cross (BC), Composite Garut (KG), Local Garut (LG), Composite Sumatra (KS) and St. Croix Cross (SC). The duration of ten behavior traits, blood serotonin concentrations and the DNA sequence of exon 8 of the MAO-A gene were recorded for aggressive and nonaggressive sheep. PROC GLM of SAS Ver. 9.0 was used to analyze the behavior variables and blood serotonin concentrations. DNA polymorphism in exon 8 of the MAO-A gene was analyzed using MEGA software Ver. 4.0. The results show that the percentage of aggressive rams in each breed was less than 10 percent, except for KS sheep, in which it was higher (23%). Based on the duration of behavior, the aggressive group was not significantly different from the nonaggressive group, except for the duration of care-giving and drinking behavior. Blood serotonin concentration did not differ significantly between aggressive and nonaggressive rams. The aggressive trait in sheep may therefore have a mechanism or cause different from that in mice and humans. In this study, aggressive behavior in sheep was not associated with a mutation in exon 8 of the MAO-A gene.
Cho, Young-Il; Ahn, Yul-Kyun; Tripathi, Swati; Kim, Jeong-Ho; Lee, Hye-Eun; Kim, Do-Sun
Numerous studies using single nucleotide polymorphisms (SNPs) have been conducted in humans, and other animals, and in major crops, including rice, soybean, and Chinese cabbage. However, the number of SNP studies in cabbage is limited. In this present study, we evaluated whether 7,645 SNPs previously identified as molecular markers linked to disease resistance in the Brassica rapa genome could be applied to B. oleracea. In a BLAST analysis using the SNP sequences of B. rapa and B. oleracea genomic sequence data registered in the NCBI database, 256 genes for which SNPs had been identified in B. rapa were found in B. oleracea. These genes were classified into three functional groups: molecular function (64 genes), biological process (96 genes), and cellular component (96 genes). A total of 693 SNP markers, including 145 SNP markers [BRH--developed from the B. rapa genome for high-resolution melt (HRM) analysis], 425 SNP markers (BRP--based on the B. rapa genome that could be applied to B. oleracea), and 123 new SNP markers (BRS--derived from BRP and designed for HRM analysis), were investigated for their ability to amplify sequences from cabbage genomic DNA. In total, 425 of the SNP markers (BRP-based on B. rapa genome), selected from 7,645 SNPs, were successfully applied to B. oleracea. Using PCR, 108 of 145 BRH (74.5%), 415 of 425 BRP (97.6%), and 118 of 123 BRS (95.9%) showed amplification, suggesting that it is possible to apply SNP markers developed based on the B. rapa genome to B. oleracea. These results provide valuable information that can be utilized in cabbage genetics and breeding programs using molecular markers derived from other Brassica species.
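The amplification rates quoted in this abstract follow directly from the reported counts, as the small check below shows. The dictionary layout and variable names are illustrative assumptions; the counts themselves are taken from the abstract.

```python
# (amplified, tested) counts per marker set, as reported in the abstract.
markers = {
    "BRH": (108, 145),
    "BRP": (415, 425),
    "BRS": (118, 123),
}

# Percentage of markers in each set that amplified from cabbage genomic DNA.
rates = {name: round(100 * ok / total, 1) for name, (ok, total) in markers.items()}

total_tested = sum(total for _, total in markers.values())
total_amplified = sum(ok for ok, _ in markers.values())

for name, rate in rates.items():
    print(f"{name}: {rate}%")
print(f"all sets: {total_amplified} of {total_tested} markers amplified")
```

The three sets sum to the 693 markers stated in the abstract, and the per-set percentages match the reported 74.5%, 97.6% and 95.9%.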
The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...
The extraction of elemental information from single cultured cells using nuclear microscopy is an area of great potential because it can provide both quantitative information on the uptake of elements by the cell, and also its elemental response to a wide variety of external stimuli. A recent technique based on nuclear physics technology enables the analysis of single cells down to the parts per million level to be achieved
Lin, Hui Yi; Chen, Dung Tsa; Huang, Po Yu
Motivation: Testing SNP-SNP interactions is considered as a key for overcoming bottlenecks of genetic association studies. However, related statistical methods for testing SNP-SNP interactions are underdeveloped. Results: We propose the SNP Interaction Pattern Identifier (SIPI), which tests 45 bi...
Results based on 124 cases of breast cancer and 97 controls from Taiwan suggest that the single nucleotide polymorphism (SNP309) in the MDM2 gene promoter is associated with earlier onset and increased risk of breast cancer.
Background: It has been suggested that the single nucleotide polymorphism 309 (SNP309, T -> G) in the promoter region of the MDM2 gene is important for tumor development; however, with regard to breast cancer, inconsistent associations have been reported worldwide. It is speculated that these conflicting results may have arisen due to the different patient subgroups and ethnicities studied. For the first time, this study explores the effect of the MDM2 SNP309 genotype on Taiwanese breast cancer patients. Methods: Genomic DNA was obtained from the whole blood of 124 breast cancer patients and 97 cancer-free healthy women living in Taiwan. MDM2 SNP309 genotyping was carried out by restriction fragment length polymorphism (RFLP) assay. Multivariate logistic regression and the Kaplan-Meier method were used for analyzing the risk association and the significance of age at diagnosis among different MDM2 SNP309 genotypes, respectively. Results: Compared to the TT genotype, an increased risk association with breast cancer was apparent for the GG genotype (OR = 3.05, 95% CI = 1.04 to 8.95) and for the TG genotype (OR = 2.12, 95% CI = 0.90 to 5.00) after adjusting for age, cardiovascular disease/diabetes, oral contraceptive usage, and body mass index, which exhibited significant differences between cases and controls. Furthermore, the average ages at diagnosis for breast cancer patients were 53.6, 52 and 47 years for those harboring TT, TG and GG genotypes, respectively. A significant difference in median age of onset for breast cancer between GG and TT+TG genotypes was obtained by the log-rank test (p = 0.0067). Conclusion: Findings based on the current sample size suggest that the MDM2 SNP309 GG genotype may be associated with both the risk of breast cancer and an earlier age of onset in Taiwanese women.
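The odds ratios and confidence intervals reported above come from a multivariate (adjusted) logistic regression. As a simpler illustration of what an OR with a 95% CI expresses, the following minimal sketch computes an unadjusted odds ratio and a Wald interval from a 2x2 table. The function name and the counts are hypothetical and are not taken from the study.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    a, b: cases with / without the genotype of interest
    c, d: controls with / without the genotype of interest
    z:    normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts, for illustration only
or_value, ci_low, ci_high = odds_ratio_ci(20, 30, 10, 40)
print(f"OR = {or_value:.2f}, 95% CI = {ci_low:.2f} to {ci_high:.2f}")
```

An interval whose lower bound lies above 1 (as for the GG genotype above) indicates an association at the chosen level, whereas an interval spanning 1 (as for TG) does not.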
Tian, Hong-Li; Wang, Feng-Ge; Zhao, Jiu-Ran; Yi, Hong-Mei; Wang, Lu; Wang, Rui; Yang, Yang; Song, Wei
Single nucleotide polymorphisms (SNPs) are abundant and evenly distributed throughout the maize (Zea mays L.) genome. SNPs have several advantages over simple sequence repeats, such as ease of data comparison and integration, high-throughput processing of loci, and identification of associated phenotypes. SNPs are thus ideal for DNA fingerprinting, genetic diversity analysis, and marker-assisted breeding. Here, we developed a high-throughput and compatible SNP array, maizeSNP3072, containing 3072 SNPs developed from the maizeSNP50 array. To improve genotyping efficiency, a high-quality cluster file, maizeSNP3072_GT.egt, was constructed. All 3072 SNP loci were localized within different genes, where they were distributed in exons (43 %), promoters (21 %), 3' untranslated regions (UTRs; 22 %), 5' UTRs (9 %), and introns (5 %). The average genotyping failure rate using these SNPs was only 6 %, or 3 % using the cluster file to call genotypes. The genotype consistency of repeat sample analysis on Illumina GoldenGate versus Infinium platforms exceeded 96.4 %. The minor allele frequency (MAF) of the SNPs averaged 0.37 based on data from 309 inbred lines. The 3072 SNPs were highly effective for distinguishing among 276 examined hybrids. Comparative analysis using Chinese varieties revealed that the 3072SNP array showed a better marker success rate and higher average MAF values, evaluation scores, and variety-distinguishing efficiency than the maizeSNP50K array. The maizeSNP3072 array thus can be successfully used in DNA fingerprinting identification of Chinese maize varieties and shows potential as a useful tool for germplasm resource evaluation and molecular marker-assisted breeding.
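The minor allele frequency quoted above is a per-locus quantity averaged over the array. A minimal sketch of the per-locus calculation follows, assuming a biallelic SNP, diploid genotypes written as two-character strings, and missing calls encoded as None. The function name and the example genotypes are hypothetical, and this is not the authors' pipeline.

```python
from collections import Counter

def minor_allele_frequency(genotypes):
    """Return the MAF at one biallelic locus.

    genotypes: iterable of two-character strings such as "AG";
               None marks a failed (missing) genotype call.
    """
    allele_counts = Counter()
    for genotype in genotypes:
        if genotype is None:
            continue  # skip missing calls
        allele_counts.update(genotype)  # counts each of the two alleles
    if len(allele_counts) < 2:
        return 0.0  # monomorphic (or empty) locus
    total = sum(allele_counts.values())
    return min(allele_counts.values()) / total

# Hypothetical genotypes: allele A occurs 5 times, allele G 3 times
print(minor_allele_frequency(["AA", "AG", "GG", "AA", None]))
```

By definition the MAF of a biallelic locus cannot exceed 0.5, so an array-wide average of 0.37 indicates that most selected loci are close to evenly balanced between alleles.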
This study is the first to use genome-wide association study (GWAS) data to evaluate the multidimensional genetic architecture underlying nasopharyngeal cancer. Since analysis of data from GWAS confirms a close and consistent association between elevated risk for nasopharyngeal carcinoma (NPC) and major histocompatibility complex class 1 genes, our goal here was to explore lesser effects of gene-gene interactions. We conducted an exhaustive genome-wide analysis of GWAS data of NPC, revealing two-locus interactions occurring between single nucleotide polymorphisms (SNPs), and identified a number of suggestive interaction loci which were missed by traditional GWAS analyses. Although none of the interaction pairs we identified passed the genome-wide Bonferroni-adjusted threshold for significance, using independent GWAS data from the same population (Stage 2), we selected 66 SNP pairs in 39 clusters with P<0.01. We identified that in several chromosome regions, multiple suggestive interactions group to form a block-like signal, effectively reducing the rate of false discovery. The strongest cluster of interactions involved the CREB5 gene and a SNP rs1607979 on chromosome 17q22 (P = 9.86×10⁻¹¹), which also show trans-expression quantitative loci (eQTL) association in the Chinese population. We then detected a complicated cis-interaction pattern around the NPC-associated HLA-B locus, which is immediately adjacent to copy-number variations implicated in male susceptibility for NPC. While it remains to be seen exactly how and to what degree SNP-SNP interactions such as these affect susceptibility for nasopharyngeal cancer, future research on these questions holds great promise for increasing our understanding of this disease's genetic etiology, and possibly also that of other gene-related cancers.
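The remark that no interaction pair passed the genome-wide Bonferroni-adjusted threshold is easier to appreciate with the arithmetic written out: an exhaustive two-locus scan tests every unordered SNP pair, so the number of tests grows quadratically with the number of SNPs. The sketch below assumes a family-wise alpha of 0.05; the SNP count is hypothetical and is not the number used in the study.

```python
from math import comb

def pairwise_bonferroni_threshold(n_snps, alpha=0.05):
    """Number of unordered SNP pairs and the Bonferroni per-test threshold."""
    n_pairs = comb(n_snps, 2)  # n_snps * (n_snps - 1) / 2
    return n_pairs, alpha / n_pairs

# Hypothetical panel of 1,000 SNPs
n_pairs, threshold = pairwise_bonferroni_threshold(1000)
print(n_pairs, threshold)
```

With genome-wide panels of several hundred thousand SNPs the pair count reaches the order of 10^10 or more, which pushes the per-test threshold to extremely small values.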
Marjolein van Gent
To monitor changes in Bordetella pertussis populations, two typing methods are mainly used: Pulsed-Field Gel Electrophoresis (PFGE) and Multiple-Locus Variable-Number Tandem Repeat Analysis (MLVA). In this study, a single nucleotide polymorphism (SNP) typing method, based on 87 SNPs, was developed and compared with PFGE and MLVA. The discriminatory indices of SNP typing, PFGE and MLVA were found to be 0.85, 0.95 and 0.83, respectively. Phylogenetic analysis, using SNP typing as Gold Standard, revealed false homoplasies in the PFGE and MLVA trees. Further, in contrast to the SNP-based tree, the PFGE- and MLVA-based trees did not reveal a positive correlation between root-to-tip distance and the isolation year of strains. Thus PFGE and MLVA do not allow an estimation of the relative age of the selected strains. In conclusion, SNP typing was found to be phylogenetically more informative than PFGE and more discriminative than MLVA. Further, in contrast to PFGE, it is readily standardized, allowing interlaboratory comparisons. We applied SNP typing to study strains with a novel allele for the pertussis toxin promoter, ptxP3, which have a worldwide distribution and which have replaced the resident ptxP1 strains in the last 20 years. Previously, we showed that ptxP3 strains showed increased pertussis toxin expression and that their emergence was associated with increased notification in The Netherlands. SNP typing showed that the ptxP3 strains isolated in the Americas, Asia, Australia and Europe formed a monophyletic branch which recently diverged from ptxP1 strains. Two predominant ptxP3 SNP types were identified which spread worldwide. The widespread use of SNP typing will enhance our understanding of the evolution and global epidemiology of B. pertussis.
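The abstract does not state how the discriminatory indices were calculated. A common choice for comparing typing methods is the Hunter-Gaston index (Simpson's index of diversity), which is the probability that two strains drawn at random without replacement receive different types. The sketch below assumes that definition; the function name and the type labels are hypothetical.

```python
from collections import Counter

def discriminatory_index(type_labels):
    """Hunter-Gaston discriminatory index for a list of assigned types.

    D = 1 - sum_j n_j (n_j - 1) / (N (N - 1)),
    where n_j is the number of strains of type j and N the total.
    """
    n_strains = len(type_labels)
    if n_strains < 2:
        raise ValueError("at least two strains are required")
    type_counts = Counter(type_labels)
    same_type_pairs = sum(n * (n - 1) for n in type_counts.values())
    return 1 - same_type_pairs / (n_strains * (n_strains - 1))

# Hypothetical typing result: four strains, three distinct types
print(discriminatory_index(["ST1", "ST1", "ST2", "ST3"]))
```

Under this definition a value of 1 means every strain has a unique type and 0 means all strains share one type.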
Increasing evidence shows that the heterogeneity of individual cells within a genetically identical population can be critical to their particular function and fate. Conventional cell-based assays mainly analyze the average response of a population of cells, so differences among individual cells are often masked. Cell size, RNA transcripts and protein expression levels differ considerably among individual cells, and these variations are key to answering questions in cancer, neurobiology, stem cell biology, immunology and developmental biology. To better understand cell-to-cell variation, single cell analysis can provide much more detailed information, which may be helpful for therapeutic decisions in an increasingly personalized medicine. In this review, we focus on recent developments in single cell analysis, including methods used in single cell isolation and analysis, and some application examples. The review provides the historical background to single cell analysis and discusses limitations and current and future possibilities in this exciting field of research.
Grigoris D Amoutzias
The European brown hare (Lepus europaeus, Pallas 1778) is an important small game species in Europe. Due to its size and position in the food chain, as well as its life history, phenotypic variation and the relatively recent speciation events, brown hare plays an important role in the structure of various ecosystems and has emerged as an important species for population management and evolutionary studies. In order to identify informative SNPs for such studies, heart and liver tissues of three samples from the European lineage and a three-sample pool from the Anatolian lineage were subjected to RNA-Sequencing analysis. This effort resulted in 9496 well-assembled protein-coding sequences with close homology to human. After applying very stringent filtering criteria, 66185 polymorphic sites were identified in 7665 genes/cds and 2050 of those polymorphic sites are potentially capable of distinguishing the European from the Anatolian lineage. From these distinguishing mutations we focused on those in genes that are involved in cellular energy production, namely the glycolysis, Krebs cycle and the OXPHOS machinery. A selected set of SNPs was also validated by Sanger sequencing. By simulating the three European individuals as one pool, no substantial informative-SNP identification was lost, making it a cost-efficient approach. To our knowledge this is the first attempt to correlate the differentiation in both nuclear and mitochondrial genome between the two different lineages of L. europaeus with the observed spatial partitioning of the lineages of the species, proposing a possible mechanism that is maintaining the reproductive isolation of the lineages.
Valbusa, Ugo; Ierardi, Vincenzo
The knowledge of the effects of single-nucleotide polymorphisms (SNPs) in the human genome greatly contributes to better comprehension of the relation between genetic factors and diseases. Sequence analysis of genomic DNA in different individuals reveals positions where variations that involve individual base substitutions can occur. Single-nucleotide polymorphisms are highly abundant and can have different consequences at phenotypic level. Several attempts were made to apply atomic force microscopy (AFM) to detect and map SNP sites in DNA strands. The most promising approach is the study of DNA mutations producing heteroduplex DNA strands and identifying the mismatches by means of a protein that labels the mismatches. MutS is a protein that is part of a well-known complex of mismatch repair, which initiates the process of repairing when the MutS binds to the mismatched DNA filament. The position of MutS on the DNA filament can be easily recorded by means of AFM imaging.
Qiao, Xian; Su, Rui; Wang, Yang; Wang, Ruijun; Yang, Ting; Li, Xiaokai; Chen, Wei; He, Shiyang; Jiang, Yu; Xu, Qiwu; Wan, Wenting; Zhang, Yaolei; Zhang, Wenguang; Chen, Jiang; Liu, Bin; Liu, Xin; Fan, Yixing; Chen, Duoyuan; Jiang, Huaizhi; Fang, Dongming; Liu, Zhihong; Wang, Xiaowen; Zhang, Yanjun; Mao, Danqing; Wang, Zhiying; Di, Ran; Zhao, Qianjun; Zhong, Tao; Yang, Huanming; Wang, Jian; Wang, Wen; Dong, Yang; Chen, Xiaoli; Xu, Xun; Li, Jinquan
Compared with the commercially available single nucleotide polymorphism (SNP) chip based on the Bead Chip technology, the solution hybrid selection (SHS)-based target enrichment SNP chip is not only design-flexible, but also cost-effective for genotype sequencing. In this study, we propose to design an animal SNP chip using the SHS-based target enrichment strategy for the first time. As an update to the international collaboration on goat research, a 66 K SNP chip for cashmere goat was created from the whole-genome sequencing data of 73 individuals. Verification of this 66 K SNP chip with the whole-genome sequencing data of 436 cashmere goats showed that the SNP call rate was between 95.3% and 99.8%. The average sequencing depth for target SNPs was 40X. The capture regions were shown to be 200 bp that flank target SNPs. This chip was further tested in a genome-wide association analysis of cashmere fineness (fiber diameter). Several top hit loci were found marginally associated with signaling pathways involved in hair growth. These results demonstrate that the 66 K SNP chip is a useful tool in the genomic analyses of cashmere goats. The successful chip design shows that the SHS-based target enrichment strategy could be applied to SNP chip design in other species.
Yang, Cheng-Hong; Chuang, Li-Yeh; Lin, Yu-Da
Single-nucleotide polymorphism (SNP)-SNP interactions (SSIs) are popular markers for understanding disease susceptibility. Multifactor dimensionality reduction (MDR) can successfully detect considerable SSIs. Currently, MDR-based methods mainly adopt a single-objective function (a single measure based on contingency tables) to detect SSIs. However, generally, a single-measure function might not yield favorable results due to potential model preferences and disease complexities. This study proposes a multiobjective MDR (MOMDR) method that is based on a contingency table of MDR as an objective function. MOMDR considers the incorporated measures, including correct classification and likelihood rates, to detect SSIs and adopts set theory to predict the most favorable SSIs with cross-validation consistency. MOMDR enables simultaneously using multiple measures to determine potential SSIs. Three simulation studies were conducted to compare the detection success rates of MOMDR and single-objective MDR (SOMDR), revealing that MOMDR had higher detection success rates than SOMDR. Furthermore, the Wellcome Trust Case Control Consortium data set was analyzed by MOMDR to detect SSIs associated with coronary artery disease.
Khrustaleva, A.M.; Limborg, Morten; Seeb, J. E.
Sockeye salmon samples from six populations from Kamchatka Peninsula were tested for polymorphism at six microsatellite (STR) and forty-five single nucleotide polymorphism (SNP) loci. These populations included the five largest populations in the region. Statistically significant genetic...... differentiation among the local populations from this part of the species range examined was demonstrated. The STR variability points to pronounced genetic divergence of the populations from two geographical regions, Eastern and Western Kamchatka. The results of SNP analysis further revealed that the populations...
Gutacker, Michaela M; Mathema, Barun; Soini, Hanna; Shashkina, Elena; Kreiswirth, Barry N; Graviss, Edward A; Musser, James M
We studied genetic relationships among 5069 Mycobacterium tuberculosis strains recovered from patients enrolled in 4 population-based studies in the United States and Europe, by analysis of 36 synonymous single-nucleotide polymorphisms (SNPs). All strains were assigned to 1 of 9 major genetic clusters based on sSNP profile. The same 9 genetic clusters were revealed by analysis of 227 nonsynonymous SNPs, 121 intergenic SNPs, and concatenated profiles of 578 SNPs available for a subset of 48 representative strains. IS6110 profiles, spoligotypes, and mycobacterial interspersed repetitive unit patterns were nonrandomly associated with SNP-based phylogenetic lineages, together indicating a strongly clonal population structure. Isolates of the 9 genetic clusters were not distributed with equal frequency in all localities, reflecting geographic subdivision. The SNP-based phylogenetic framework provides new insight into the worldwide evolution of M. tuberculosis and a gateway for investigating genotype-disease phenotype relationships in large samples of strains.
Shiokai, Sachiko; Kitashiba, Hiroyasu; Nishio, Takeshi
Although the dot-blot-SNP technique is a simple cost-saving technique suitable for genotyping of many plant individuals, optimization of hybridization and washing conditions for each SNP marker requires much time and labor. For prediction of the optimum hybridization conditions for each probe, we compared Tm values estimated from nucleotide sequences using the DINAMelt web server, measured Tm values, and hybridization conditions yielding allele-specific signals. The estimated Tm values were comparable to the measured Tm values with small differences of less than 3 °C for most of the probes. There were differences of approximately 14 °C between the specific signal detection conditions and estimated Tm values. Change of one level of SSC concentrations of 0.1, 0.2, 0.5, and 1.0x SSC corresponded to a difference of approximately 5 °C in optimum signal detection temperature. Increasing the sensitivity of signal detection by shortening the exposure time to X-ray film changed the optimum hybridization condition for specific signal detection. Addition of competitive oligonucleotides to the hybridization mixture increased the suitable hybridization conditions by 1.8. Based on these results, optimum hybridization conditions for newly produced dot-blot-SNP markers will become predictable.
Magri, Chiara; Marchina, Eleonora; Bertini, Valeria; Traversa, Michele; Savio, Giulia; Pilotta, Alba; Piovani, Giovanna
In about one third of healthy subjects, the microscopic analysis of chromosomes reveals heteromorphisms with no clinical implications: for example changes in size of the short arm of acrocentric chromosomes. In patients with a pathological phenotype, however, a large acrocentric short arm can mask a genomic imbalance and should be investigated in more detail. We report the first case of a chromosome 22 with a large acrocentric short arm masking a partial trisomy of the distal long arm, characterized by SNP array. We suggest a possible molecular mechanism underlying the rearrangement. We report the case of a 15-year-old dysmorphic girl with low grade psychomotor retardation characterized by a karyotype with a large acrocentric short arm of one chromosome 22. Cytogenetic analysis revealed a normal karyotype with a very intense Q-fluorescent and large satellite on the chromosome 22 short arm. Fluorescence in situ hybridisation analysis showed a de novo partial trisomy of the 22q13.2-qter chromosome region attached to the short arm of chromosome 22. SNP-array analysis showed that the duplication was 8.5 Mb long and originated from the paternal chromosome. Haplotype analysis revealed that the two paternal copies of the distal part of chromosome 22 have the same haplotype and, therefore, both originated from the same paternal chromosome 22. A possible molecular mechanism that could explain this scenario is a break-induced replication (BIR) which is involved in non-reciprocal translocation events. The combined use of FISH and SNP arrays was crucial for a better understanding of the molecular mechanism underlying this rearrangement. This strategy could be applied for a better understanding of the molecular mechanisms underlying cryptic chromosomal rearrangements.
Silva, Deborah S B S; Sawitzki, Fernanda R; De Toni, Elisa C; Graebin, Pietra; Picanco, Juliane B; Abujamra, Ana Lucia; de Farias, Caroline B; Roesler, Rafael; Brunetto, Algemir L; Alho, Clarice S
We aimed to investigate single nucleotide polymorphisms (SNPs) in the EWS gene breaking region in order to analyze Ewing's sarcoma susceptibility. The SNPs were investigated in a healthy subject population and in Ewing's sarcoma patients from Southern Brazil. Genotyping was performed by TaqMan® assay for allelic discrimination using Real-Time PCR. The analysis of incidence of SNPs or different SNP-arrangements revealed a higher presence of homozygote TT-rs4820804 in Ewing's sarcoma patients (p=0.02; Chi Square Test). About 300 bp from the rs4820804 SNP lies a palindromic hexamer (5'-GCTAGC-3') and three nucleotides (GTC), which were previously identified to be in close vicinity of the breakpoint junction in both EWS and FLI1 genes. This DNA segment surrounding the rs4820804 SNP is likely to indicate a breakpoint region. If the T-rs4820804 allele predisposes a DNA fragment to breakage, homozygotes (TT-rs4820804) would have double the chance of having a chromosome break, increasing the chances for a translocation to occur. In conclusion, the TT-rs4820804 EWS genotype can be associated with Ewing's sarcoma and the SNP rs4820804 can be a candidate marker to understand Ewing's sarcoma susceptibility. Copyright © 2012 Elsevier B.V. All rights reserved.
Background: Ancestry informative markers (AIMs) are a type of genetic marker that is informative for tracing the ancestral ethnicity of individuals. Application of AIMs has gained substantial attention in population genetics, forensic sciences, and medical genetics. Single nucleotide polymorphisms (SNPs), the materials of AIMs, are useful for classifying individuals from distinct continental origins but cannot discriminate individuals with subtle genetic differences from closely related ancestral lineages. Proof-of-principle studies have shown that gene expression (GE) is also a heritable human variation that exhibits differential intensity distributions among ethnic groups. GE supplies ethnic information supplemental to SNPs; this motivated us to integrate SNP and GE markers to construct AIM panels with a reduced number of required markers and provide high accuracy in ancestry inference. Few studies in the literature have considered GE in this aspect, and none have integrated SNP and GE markers to aid classification of samples from closely related ethnic populations. Results: We integrated a forward variable selection procedure into flexible discriminant analysis to identify key SNP and/or GE markers with the highest cross-validation prediction accuracy. By analyzing genome-wide SNP and/or GE markers in 210 independent samples from four ethnic groups in the HapMap II Project, we found that average testing accuracies for a majority of classification analyses were quite high, except for SNP-only analyses that were performed to discern study samples containing individuals from two close Asian populations. The average testing accuracies ranged from 0.53 to 0.79 for SNP-only analyses and increased to around 0.90 when GE markers were integrated together with SNP markers for the classification of samples from closely related Asian populations. Compared to GE-only analyses, integrative analyses of SNP and GE markers showed comparable testing
Ramírez-Soriano, Anna; Nielsen, Rasmus
Most single-nucleotide polymorphism (SNP) data suffer from an ascertainment bias caused by the process of SNP discovery followed by SNP genotyping. The final genotyped data are biased toward an excess of common alleles compared to directly sequenced data, making standard genetic methods of analysis...... the variances and covariances of these estimators and provide a corrected version of Tajima's D statistic. We reanalyze a human genomewide SNP data set and find substantial differences in the results with or without ascertainment bias correction....
Luna, Augustin; Nicodemus, Kristin K
snp.plotter is a newly developed R package which produces high-quality plots of results from genetic association studies. The main features of the package include options to display a linkage disequilibrium (LD) plot below the P-value plot using either the r² or D' LD metric, to set the X-axis to equal spacing or to use the physical map of markers, and to specify plot labels, colors, symbols and LD heatmap color scheme. snp.plotter can plot single SNP and/or haplotype data and simultaneously plot multiple sets of results. R is a free software environment for statistical computing and graphics available for most platforms. The proposed package provides a simple way to convey both association and LD information in a single appealing graphic for genetic association studies. Downloadable R package and example datasets are available at http://cbdb.nimh.nih.gov/~kristin/snp.plotter.html and http://www.r-project.org.
Therkildsen, Nina Overgaard; Hansen, Jakob Hemmer; Als, Thomas Damm
of Atlantic cod (Gadus morhua) studied over an 80-year period. Screening of >1000 gene-associated single-nucleotide polymorphisms (SNPs) identified 77 loci that showed highly elevated levels of differentiation, likely as an effect of directional selection, in either time, space or both. Exploratory analysis...... the 1928 to 1960 period showed almost complete stability during later decades. The contrasting microevolutionary trajectories among populations resulted in sequential shifts in spatial outliers, with no locus maintaining elevated spatial differentiation throughout the study period. Simulations of migration...
We characterized 11 single nucleotide polymorphism (SNP) markers for the Chinese black sleeper, Bostrychus sinensis. These markers were isolated from a genomic library and tested in ten geographically distant individuals of B. sinensis. Polymorphisms of these SNP loci were assessed using a wild population including ...
Onay, Venüs Ümmiye; Ozcelik, Hilmi; Briollais, Laurent; Knight, Julia A; Shi, Ellen; Wang, Yuanyuan; Wells, Sean; Li, Hong; Rajendram, Isaac; Andrulis, Irene L
Breast cancer predisposition genes identified to date (e.g., BRCA1 and BRCA2) are responsible for less than 5% of all breast cancer cases. Many studies have shown that the cancer risks associated with individual commonly occurring single nucleotide polymorphisms (SNPs) are incremental. However, polygenic models suggest that multiple commonly occurring low to modestly penetrant SNPs of cancer related genes might have a greater effect on a disease when considered in combination. In an attempt to identify the breast cancer risk conferred by SNP interactions, we have studied 19 SNPs from genes involved in major cancer related pathways. All SNPs were genotyped by TaqMan 5'nuclease assay. The association between the case-control status and each individual SNP, measured by the odds ratio and its corresponding 95% confidence interval, was estimated using unconditional logistic regression models. At the second stage, two-way interactions were investigated using multivariate logistic models. The robustness of the interactions, which were observed among SNPs with stronger functional evidence, was assessed using a bootstrap approach, and correction for multiple testing based on the false discovery rate (FDR) principle. None of these SNPs contributed to breast cancer risk individually. However, we have demonstrated evidence for gene-gene (SNP-SNP) interaction among these SNPs, which were associated with increased breast cancer risk. Our study suggests cross talk between the SNPs of the DNA repair and immune system (XPD-[Lys751Gln] and IL10-[G(-1082)A]), cell cycle and estrogen metabolism (CCND1-[Pro241Pro] and COMT-[Met108/158Val]), cell cycle and DNA repair (BARD1-[Pro24Ser] and XPD-[Lys751Gln]), and within carcinogen metabolism (GSTP1-[Ile105Val] and COMT-[Met108/158Val]) pathways. The importance of these pathways and their communication in breast cancer predisposition has been emphasized previously, but their biological interactions through SNPs have not been described
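The abstract says only that multiple testing was corrected using the false discovery rate (FDR) principle and does not name a procedure. The Benjamini-Hochberg step-up procedure is one widely used way to control the FDR, and the sketch below assumes it purely for illustration. The function name and the p-values are hypothetical.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return a list of booleans marking which hypotheses are rejected
    by the Benjamini-Hochberg step-up procedure at FDR level q."""
    m = len(p_values)
    # Indices of the p-values in ascending order
    order = sorted(range(m), key=lambda i: p_values[i])
    largest_passing_rank = 0
    for rank, index in enumerate(order, start=1):
        if p_values[index] <= rank * q / m:
            largest_passing_rank = rank
    rejected = [False] * m
    # Reject every hypothesis ranked at or below the largest passing rank
    for rank, index in enumerate(order, start=1):
        if rank <= largest_passing_rank:
            rejected[index] = True
    return rejected

# Hypothetical interaction p-values
print(benjamini_hochberg([0.001, 0.04, 0.03, 0.5]))
```

Because the procedure is step-up, a p-value that misses its own rank threshold is still rejected whenever a larger p-value passes at a higher rank.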
Background: Breast cancer predisposition genes identified to date (e.g., BRCA1 and BRCA2) are responsible for less than 5% of all breast cancer cases. Many studies have shown that the cancer risks associated with individual commonly occurring single nucleotide polymorphisms (SNPs) are incremental. However, polygenic models suggest that multiple commonly occurring low to modestly penetrant SNPs of cancer related genes might have a greater effect on a disease when considered in combination. Methods: In an attempt to identify the breast cancer risk conferred by SNP interactions, we have studied 19 SNPs from genes involved in major cancer related pathways. All SNPs were genotyped by TaqMan 5'nuclease assay. The association between the case-control status and each individual SNP, measured by the odds ratio and its corresponding 95% confidence interval, was estimated using unconditional logistic regression models. At the second stage, two-way interactions were investigated using multivariate logistic models. The robustness of the interactions, which were observed among SNPs with stronger functional evidence, was assessed using a bootstrap approach, and correction for multiple testing based on the false discovery rate (FDR) principle. Results: None of these SNPs contributed to breast cancer risk individually. However, we have demonstrated evidence for gene-gene (SNP-SNP) interaction among these SNPs, which were associated with increased breast cancer risk. Our study suggests cross talk between the SNPs of the DNA repair and immune system (XPD-[Lys751Gln] and IL10-[G(-1082)A]), cell cycle and estrogen metabolism (CCND1-[Pro241Pro] and COMT-[Met108/158Val]), cell cycle and DNA repair (BARD1-[Pro24Ser] and XPD-[Lys751Gln]), and within carcinogen metabolism (GSTP1-[Ile105Val] and COMT-[Met108/158Val]) pathways. Conclusion: The importance of these pathways and their communication in breast cancer predisposition has been emphasized previously, but their
Background: Single nucleotide polymorphisms (SNPs) represent the most widespread type of DNA variation in vertebrates and may be used as genetic markers for a range of applications. This has led to an increased interest in identification of SNP markers in non-model species and farmed animals. The in silico SNP mining method used for discovery of most known SNPs in Atlantic salmon (Salmo salar) has applied a global (genome-wide) approach. In this study we present a targeted 3'UTR-primed SNP discovery strategy that utilizes sequence data from Salmo salar full length sequenced cDNAs (FLIcs). We compare the efficiency of this new strategy to the in silico SNP mining method when using both methods for targeted SNP discovery. Results: The SNP discovery efficiency of the two methods was tested in a set of FLIc target genes. The 3'UTR-primed SNP discovery method detected novel SNPs in 35% of the target genes while the in silico SNP mining method detected novel SNPs in 15% of the target genes. Furthermore, the 3'UTR-primed SNP discovery strategy was the less labor intensive one and revealed a higher success rate than the in silico SNP mining method in the initial amplification step. When testing the methods we discovered 112 novel bi-allelic polymorphisms (type I markers) in 88 salmon genes [dbSNP: ss179319972-179320081, ss250608647-250608648], and three of the SNPs discovered were missense substitutions. Conclusions: Full length insert cDNAs (FLIcs) are important genomic resources that have been developed in many farmed animals. The 3'UTR-primed SNP discovery strategy successfully utilized FLIc data to detect novel SNPs in the partially tetraploid Atlantic salmon. This strategy may therefore be useful for targeted SNP discovery in several species, and particularly useful in species that, like salmonids, have duplicated genomes.
Abduljaleel, Zainularifeen; Al-Allaf, Faisal A; Khan, Wajahatullah; Athar, Mohammad; Shahzad, Naiyer; Taher, Mohiuddin M; Alanazi, Mohammed; Elrobh, Mohamed; Reddy, Narasimha P
We screened for the major essential single-nucleotide polymorphism (SNP) variant that might be associated with the MSH2 gene based on the data available from three types of human tissue samples [156 lymphoblastoid cell variations (LCL), 160 epidermis, 166 fat]. An association analysis confirmed that the KCNK12 SNP variant (rs748780) was highly associated (p value 9 × 10(-4)) with the MSH2 gene for all three samples. Using SNP identification, we further found that the recognized SNP was also relevant among Hapmap populations. Techniques that display specific SNPs associated with the gene of interest or nearby genes provide more reliable genetic associations than techniques that rely on data from individual SNPs. We investigated the MSH2 gene regional linkage association with the determined SNP (rs748780), KCNK12 variant (Allele T>C) in the intronic region, in HapMap3 full dataset populations, Yoruba in Ibadan, Nigeria (YRI), Utah residents with ancestry from northern Europe (CEU), Han Chinese in Beijing, China (CHB), and a population of Mexican ancestry in Los Angeles, California (MEX). A gene-based SNP association analysis analyzes the combined impact of every variant within the gene while creating referrals to linkage disequilibrium or connections between markers. Our results indicated that among the four populations studied, this association was highest in the MEX population based on the r(2) value; a similar pattern was also observed in the other three populations. The relevant SNP rs748780 in KCNK12 is related to a superfamily of potassium channel pore-forming P-domain proteins as well as to other non-pore-forming proteins and has been shown to be relevant to neurological disorder predisposition in MEX as well as in other populations.
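The r(2) statistic used above to compare populations is the usual pairwise linkage disequilibrium measure. The following is a minimal sketch of how it can be computed, assuming phased two-locus haplotype counts are available; the function name, argument names and the counts in the usage note are illustrative and are not taken from the study.

```python
def ld_r2(n_AB, n_Ab, n_aB, n_ab):
    """Return r^2 between two biallelic loci from phased haplotype counts.

    n_AB: haplotypes carrying allele A at locus 1 and allele B at locus 2,
    n_Ab, n_aB, n_ab: the other three haplotype classes (hypothetical labels).
    """
    n = n_AB + n_Ab + n_aB + n_ab
    if n == 0:
        raise ValueError("no haplotypes supplied")
    p_A = (n_AB + n_Ab) / n   # frequency of allele A at locus 1
    p_B = (n_AB + n_aB) / n   # frequency of allele B at locus 2
    d = n_AB / n - p_A * p_B  # disequilibrium coefficient D
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    if denom == 0:
        raise ValueError("r^2 is undefined when a locus is monomorphic")
    return d * d / denom
```

With made-up counts, `ld_r2(50, 0, 0, 50)` describes two loci in complete disequilibrium, while `ld_r2(25, 25, 25, 25)` describes independent loci. Tools that work from unphased genotypes estimate haplotype frequencies first, which this sketch does not attempt.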
Kim, Song E; Kim, Han-Na; Yun, Yeo-Jun; Heo, Seong Gu; Cho, Juhee; Kwon, Min-Jung; Chang, Yoosoo; Ryu, Seungho; Shin, Hocheol; Shin, Chol; Cho, Nam H; Sung, Yeon Ah; Kim, Hyung-Lae
Neuroticism is a heritable personality trait that is comprised of distinct sub-factors, or facets. Sub-factors of neuroticism are linked to different emotional states or psychiatric symptoms and studying the genetic variants associated with these facets may help reveal the biological mechanisms underlying psychiatric disorders. In the present study, a meta-analysis of genome-wide association studies for six facets of neuroticism was performed in 5584 participants from three cohorts. Additionally, a Gene Set Enrichment Analysis was conducted to find biological pathways associated with each facet. Six neuroticism facets (N1: anxiety, N2: angry hostility, N3: depression, N4: self-consciousness, N5: impulsivity and N6: vulnerability) were assessed using the Korean version of the Revised NEO Personality Inventory. In the single-nucleotide polymorphism-based analysis, results showed genome-wide significance for N2 within the MIR548H3 gene (rs1360001, P = 4.14 × 10(-9)). Notable genes with suggestive associations (Pfacets of neuroticism (N2, N4 and N6). The focal adhesion and extracellular matrix receptor interaction pathways were significantly associated with N2 and N3. Our findings revealed genetic influences and biological pathways that are associated with facets of neuroticism.
Bjoerheim, Jens; Abrahamsen, Torveig Weum; Kristensen, Annette Torgunrud; Gaudernack, Gustav; Ekstroem, Per O.
Melting gel techniques have proven to be amenable and powerful tools in point mutation and single nucleotide polymorphism (SNP) analysis. With the introduction of commercially available capillary electrophoresis instruments, a partly automated platform for denaturant capillary electrophoresis with potential for routine screening of selected target sequences has been established. The aim of this article is to demonstrate the use of automated constant denaturant capillary electrophoresis (ACDCE) in single nucleotide polymorphism analysis of various target sequences. Optimal analysis conditions for different single nucleotide polymorphisms on ACDCE are evaluated with the Poland algorithm. Laboratory procedures include only PCR and electrophoresis. For direct genotyping of individual SNPs, the samples are analyzed with an internal standard and the alleles are identified by co-migration of sample and standard peaks. In conclusion, SNPs suitable for melting gel analysis based on theoretical thermodynamics were separated by ACDCE under appropriate conditions. With this instrumentation (ABI 310 Genetic Analyzer), 48 samples could be analyzed without any intervention. Several institutions have capillary instrumentation in-house, thus making this SNP analysis method accessible to large groups of researchers without any need for instrument modification
Identification of single nucleotide polymorphisms (SNPs) and mutations is important for the discovery of genetic predisposition to complex diseases. PCR resequencing is the method of choice for de novo SNP discovery. However, manual curation of putative SNPs has been a major bottleneck in the application of this method to high-throughput screening. Therefore it is critical to develop a more sensitive and accurate computational method for automated SNP detection. We developed a software tool, SNPdetector, for automated identification of SNPs and mutations in fluorescence-based resequencing reads. SNPdetector was designed to model the process of human visual inspection and has a very low false positive and false negative rate. We demonstrate the superior performance of SNPdetector in SNP and mutation analysis by comparing its results with those derived by human inspection, PolyPhred (a popular SNP detection tool), and independent genotype assays in three large-scale investigations. The first study identified and validated inter- and intra-subspecies variations in 4,650 traces of 25 inbred mouse strains that belong to either the Mus musculus species or the M. spretus species. Unexpected heterozygosity in CAST/Ei strain was observed in two out of 1,167 mouse SNPs. The second study identified 11,241 candidate SNPs in five ENCODE regions of the human genome covering 2.5 Mb of genomic sequence. Approximately 50% of the candidate SNPs were selected for experimental genotyping; the validation rate exceeded 95%. The third study detected ENU-induced mutations (at 0.04% allele frequency) in 64,896 traces of 1,236 zebra fish. Our analysis of three large and diverse test datasets demonstrated that SNPdetector is an effective tool for genome-scale research and for large-sample clinical studies. SNPdetector runs on Unix/Linux platform and is available publicly (http://lpg.nci.nih.gov).
High-throughput sequencing of RNA (RNA-Seq) was developed primarily to analyze global gene expression in different tissues. It is also an efficient way to discover coding SNPs, and when multiple individuals with different genetic backgrounds are used, RNA-Seq is very effective for the identification of SNPs. The objective of this study was to perform SNP and INDEL discovery in the human airway transcriptome of healthy never smokers, healthy current smokers, smokers without lung cancer and smokers with lung cancer. A preliminary comparative analysis of these four data sets is expected to yield SNP and INDEL patterns responsible for lung cancer. A total of 85,028 SNPs and 5738 INDELs in healthy never smokers, 32,671 SNPs and 1561 INDELs in healthy current smokers, 50,205 SNPs and 3008 INDELs in smokers without lung cancer and 51,299 SNPs and 3138 INDELs in smokers with lung cancer were identified. The analysis of the SNPs and INDELs in genes that were reported earlier as differentially expressed was also performed. Smokers were found to have SNPs at positions 62,186,542 and 62,190,293 in the SCGB1A1 gene and 180,017,251, 180,017,252, and 180,017,597 in the SCGB3A1 gene, and INDELs at position 35,871,168 in the NFKBIA gene and 180,017,797 in the SCGB3A1 gene. The SNPs identified in this study provide a resource for genetic studies in smokers and should contribute to the development of personalized medicine. This study is only preliminary, and more rigorous data analysis and wet lab validation are required.
Strain-specific genomic diversity in the Mycobacterium tuberculosis complex (MTBC) is an important factor in pathogenesis that may affect virulence, transmissibility, host response and emergence of drug resistance. Several systems have been proposed to classify MTBC strains into distinct lineages and families. Here, we investigate single-nucleotide polymorphisms (SNPs) as robust (stable) markers of genetic variation for phylogenetic analysis. We identify ∼92k SNP across a global collection of 1,601 genomes. The SNP-based phylogeny is consistent with the gold-standard regions of difference (RD) classification system. Of the ∼7k strain-specific SNPs identified, 62 markers are proposed to discriminate known circulating strains. This SNP-based barcode is the first to cover all main lineages, and classifies a greater number of sublineages than current alternatives. It may be used to classify clinical isolates to evaluate tools to control the disease, including therapeutics and vaccines whose effectiveness may vary by strain type. © 2014 Macmillan Publishers Limited.
Laying performance is an important economic trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, linear regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiple-gene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the
Background: Single nucleotide polymorphisms (SNPs) are the most abundant type of genetic variation in eukaryotic genomes and have recently become the marker of choice in a wide variety of ecological and evolutionary studies. The advent of next-generation sequencing (NGS) technologies has made it possible to efficiently genotype a large number of SNPs in the non-model organisms with no or limited genomic resources. Most NGS-based genotyping methods require a reference genome to perform accurate SNP calling. Little effort, however, has yet been devoted to developing or improving algorithms for accurate SNP calling in the absence of a reference genome. Results: Here we describe an improved maximum likelihood (ML) algorithm called iML, which can achieve high genotyping accuracy for SNP calling in the non-model organisms without a reference genome. The iML algorithm incorporates the mixed Poisson/normal model to detect composite read clusters and can efficiently prevent incorrect SNP calls resulting from repetitive genomic regions. Through analysis of simulation and real sequencing datasets, we demonstrate that in comparison with ML or a threshold approach, iML can remarkably improve the accuracy of de novo SNP genotyping and is especially powerful for the reference-free genotyping in diploid genomes with high repeat contents. Conclusions: The iML algorithm can efficiently prevent incorrect SNP calls resulting from repetitive genomic regions, and thus outperforms the original ML algorithm by achieving much higher genotyping accuracy. Our algorithm is therefore very useful for accurate de novo SNP genotyping in the non-model organisms without a reference genome. Reviewers: This article was reviewed by Dr. Richard Durbin, Dr. Liliana Florea (nominated by Dr. Steven Salzberg) and Dr. Arcady Mushegian.
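For orientation, a plain maximum likelihood genotype call of the kind iML is compared against can be sketched as follows. This is not the iML algorithm and does not include its mixed Poisson/normal model for composite read clusters; it assumes a diploid site, independent reads and a single symmetric sequencing error rate, and the function name and default error rate are illustrative choices.

```python
import math


def call_genotype(ref_reads, alt_reads, error_rate=0.01):
    """Pick the diploid genotype with the highest binomial log-likelihood.

    ref_reads / alt_reads: read counts supporting each allele at one site.
    error_rate: assumed per-read probability of observing the wrong allele.
    Returns (best_genotype, log_likelihoods); the binomial coefficient is
    omitted because it is the same for every genotype.
    """
    if not 0.0 < error_rate < 0.5:
        raise ValueError("error_rate must lie strictly between 0 and 0.5")
    # Probability that a single read shows the alternative allele.
    p_alt = {"RR": error_rate, "RA": 0.5, "AA": 1.0 - error_rate}
    log_lik = {
        g: alt_reads * math.log(p) + ref_reads * math.log(1.0 - p)
        for g, p in p_alt.items()
    }
    best = max(log_lik, key=log_lik.get)
    return best, log_lik
```

A site with 10 reference and 10 alternative reads is called heterozygous under this model. Reads from collapsed repeats can mimic exactly that pattern, which is the failure mode the abstract says iML is designed to prevent.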
Background: Mitochondrial single nucleotide polymorphisms (mtSNPs) constitute important data when trying to shed some light on human diseases and cancers. Unfortunately, providing relevant mtSNP genotyping information in mtDNA databases in a neatly organized and transparent visual manner still remains a challenge. Amongst the many methods reported for SNP genotyping, determining the restriction fragment length polymorphisms (RFLPs) is still one of the most convenient and cost-saving methods. In this study, we prepared the visualization of the mtDNA genome in a way that integrates the RFLP genotyping information with mitochondria related cancers and diseases in a user-friendly, intuitive and interactive manner. The inherent problem associated with mtDNA sequences in BLAST of the NCBI database was also solved. Description: V-MitoSNP provides complete mtSNP information for four different kinds of inputs: (1) color-coded visual input by selecting genes of interest on the genome graph, (2) keyword search by locus, disease and mtSNP rs# ID, (3) visualized input of nucleotide range by clicking the selected region of the mtDNA sequence, and (4) sequences mtBLAST. The V-MitoSNP output provides 500 bp (base pairs) flanking sequences for each SNP coupled with the RFLP enzyme and the corresponding natural or mismatched primer sets. The output format enables users to see the SNP genotype pattern of the RFLP by virtual electrophoresis of each mtSNP. The rate of successful design of enzymes and primers for RFLPs in all mtSNPs was 99.1%. The RFLP information was validated by actual agarose electrophoresis and showed successful results for all mtSNPs tested. The mtBLAST function in V-MitoSNP provides the gene information within the input sequence rather than providing the complete mitochondrial chromosome as in the NCBI BLAST database. All mtSNPs with rs number entries in NCBI are integrated in the corresponding SNP in V-MitoSNP. Conclusion: V-MitoSNP is a web
Erben, Philipp; Nowak, Daniel; Sauer, Christian; Ströbel, Philipp; Hofmann, Wolf-Karsten; Hofheinz, Ralf-Dieter; Hohenberger, Peter; Kasper, Bernd
Background: Desmoid tumors are neoplastic proliferations of connective tissues. The mutation status of the gene coding for catenin (cadherin-associated protein) beta 1 (CTNNB1) and trisomy 8 on the chromosomal level have been described to have prognostic relevance. Patients and Methods: In order to elucidate new molecular mechanisms underlying these tumors, we carried out a molecular analysis with a genome-wide human high-density single-nucleotide polymorphism (SNP) array, in 9 patients. Resu...
Individual cells in an organism are variable, which strongly impacts cellular processes. Advances in sequencing technologies have enabled single-cell genomic analysis to become widespread, addressing shortcomings of analyses conducted on populations of bulk cells. While the field of single-cell plant genomics is in its infancy, there is great potential to gain insights into cell lineage and functional cell types to help understand complex cellular interactions in plants. In this review, we discuss current approaches for single-cell plant genomic analysis, with a focus on single-cell isolation, DNA amplification, next-generation sequencing, and bioinformatics analysis. We outline the technical challenges of analysing material from a single plant cell, and then examine applications of single-cell genomics and the integration of this approach with genome editing. Finally, we indicate future directions we expect in the rapidly developing field of plant single-cell genomic analysis.
Keywords. ESR1 gene; polymorphism; prediction; bioinformatic tools; association studies. Abstract. Single-nucleotide polymorphism (SNP) association studies have become crucial in uncovering the genetic correlations of genomic variants with complex diseases, quantitative traits and physiological responses to drugs.
Kwong, Qi Bin; Teh, Chee Keng; Ong, Ai Ling; Heng, Huey Ying; Lee, Heng Leng; Mohamed, Mohaimi; Low, Joel Zi-Bin; Apparow, Sukganah; Chew, Fook Tim; Mayes, Sean; Kulaveerasingam, Harikrishna; Tammi, Martti; Appleton, David Ross; Sime Darby Technology Centre
High-density single nucleotide polymorphism (SNP) genotyping arrays are powerful tools that can measure the level of genetic polymorphism within a population. To develop a whole-genome SNP array for oil palms, SNP discovery was performed using deep resequencing of eight libraries derived from 132 Elaeis guineensis and Elaeis oleifera palms belonging to 59 origins, resulting in the discovery of >3 million putative SNPs. After SNP filtering, the Illumina OP200K custom array was built with 170 8...
Rao, Y S; Li, J; Zhang, R; Lin, X R; Xu, J G; Xie, L; Xu, Z Q; Wang, L; Gan, J K; Xie, X J; He, J; Zhang, X Q
Copy number variation (CNV) is an important source of genetic variation in organisms and a main factor that affects phenotypic variation. A comprehensive study of chicken CNV can provide valuable information on genetic diversity and facilitate future analyses of associations between CNV and economically important traits in chickens. In the present study, an F2 full-sib chicken population (554 individuals), established from a cross between Xinghua and White Recessive Rock chickens, was used to explore CNV in the chicken genome. Genotyping was performed using a chicken 60K SNP BeadChip. A total of 1,875 CNV were detected with the PennCNV algorithm, and the average number of CNV was 3.42 per individual. The CNV were distributed across 383 independent CNV regions (CNVR) and covered 41 megabases (3.97%) of the chicken genome. Seven CNVR in 108 individuals were validated by quantitative real-time PCR, and 81 of these individuals (75%) also were detected with the PennCNV algorithm. In total, 274 CNVR (71.54%) identified in the current study were previously reported. Of these, 147 (38.38%) were reported in at least 2 studies. Additionally, 109 of the CNVR (28.46%) discovered here are novel. A total of 709 genes within or overlapping with the CNVR was retrieved. Out of the 2,742 quantitative trait loci (QTL) collected in the chicken QTL database, 43 QTL had confidence intervals overlapping with the CNVR, and 32 CNVR encompassed one or more functional genes. The functional genes located in the CNVR are likely to be the QTG that are associated with underlying economic traits. This study considerably expands our insight into the structural variation in the genome of chickens and provides an important resource for genomic variation, especially for genomic structural variation related to economic traits in chickens. © 2016 Poultry Science Association Inc.
Background: Genome-wide association studies (GWAS) have provided a large set of genetic loci influencing the risk for many common diseases. Association studies typically analyze one specific trait in single populations in an isolated fashion without taking into account the potential phenotypic and genetic correlation between traits. However, GWA data can be efficiently used to identify overlapping loci with analogous or contrasting effects on different diseases. Results: Here, we describe a new approach to systematically prioritize and interpret available GWA data. We focus on the analysis of joint and disjoint genetic determinants across diseases. Using network analysis, we show that variant-based approaches are superior to locus-based analyses. In addition, we provide a prioritization of disease loci based on network properties and discuss the roles of hub loci across several diseases. We demonstrate that, in general, agonistic associations appear to reflect current disease classifications, and present the potential use of effect sizes in refining and revising these agonistic signals. We further identify potential branching points in disease etiologies based on antagonistic variants and describe plausible small-scale models of the underlying molecular switches. Conclusions: The observation that a surprisingly high fraction (>15%) of the SNPs considered in our study are associated both agonistically and antagonistically with related as well as unrelated disorders indicates that the molecular mechanisms influencing causes and progress of human diseases are in part interrelated. Genetic overlaps between two diseases also suggest the importance of the affected entities in the specific pathogenic pathways and should be investigated further.
Hilal Betul Kaya
BACKGROUND: The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. METHODOLOGY/PRINCIPAL FINDINGS: Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. CONCLUSIONS/SIGNIFICANCE: This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. High levels
Fernández Ana I
Background: The traditional strategy to map QTL is to use linkage analysis employing a limited number of markers. These analyses report wide QTL confidence intervals, making it very difficult to identify the gene and polymorphisms underlying the QTL effects. The arrival of genome-wide panels of SNPs makes available thousands of markers, increasing the information content and therefore the likelihood of detecting and fine mapping QTL regions. The aims of the current study are to confirm previous QTL regions for growth and body composition traits in different generations of an Iberian x Landrace intercross (IBMAP) and especially to identify new ones with narrow confidence intervals by employing the PorcineSNP60 BeadChip in linkage analyses. Results: Three generations (F3, Backcross 1 and Backcross 2) of the IBMAP and their related animals were genotyped with PorcineSNP60 BeadChip. A total of 8,417 SNPs equidistantly distributed across autosomes were selected after filtering by quality, position and frequency to perform the QTL scan. The joint and separate analyses of the different IBMAP generations confirmed QTL regions previously identified in chromosomes 4 and 6 and revealed new ones, mainly for backfat thickness in chromosomes 4, 5, 11, 14 and 17 and shoulder weight in chromosomes 1, 2, 9 and 13, and many others at the chromosome-wide significance level. In addition, most of the detected QTLs displayed narrow confidence intervals, making the selection of positional candidate genes easier. Conclusions: The use of a higher density of markers has made it possible to confirm results obtained in previous QTL scans carried out with microsatellites. Moreover, several new QTL regions have now been identified in regions probably not covered by markers in previous scans; most of these QTLs displayed narrow confidence intervals. Finally, prominent putative biological and positional candidate genes underlying those QTL effects are listed based on recent porcine
Chuang, Li-Yeh; Moi, Sin-Hua; Lin, Yu-Da; Yang, Cheng-Hong
Evolutionary algorithms could overcome the computational limitations for the statistical evaluation of large datasets for high-order single nucleotide polymorphism (SNP) barcodes. Previous studies have proposed several chaotic particle swarm optimization (CPSO) methods to detect SNP barcodes for disease analysis (e.g., for breast cancer and chronic diseases). This work evaluated additional chaotic maps combined with the particle swarm optimization (PSO) method to detect SNP barcodes using a high-dimensional dataset. Nine chaotic maps were used to improve PSO method results, and the searching ability of all CPSO methods was compared. The XOR and ZZ disease models were used to compare all chaotic maps combined with the PSO method. Efficacy evaluations of CPSO methods were based on statistical values from the chi-square test (χ²). The results showed that chaotic maps could improve the searching ability of the PSO method when the population is trapped in a local optimum. The minor allele frequency (MAF) indicated that, amongst all CPSO methods, the numbers of SNPs, sample size, and the highest χ² value in all datasets were found in the Sinai chaotic map combined with the PSO method. We used the simple linear regression results of the gbest values in all generations to compare all methods. The Sinai chaotic map combined with the PSO method provided the highest β values (β≥0.32 in the XOR disease model and β≥0.04 in the ZZ disease model) and a significant p-value (p-value<0.001 in both the XOR and ZZ disease models). The Sinai chaotic map was found to effectively enhance the fitness values (χ²) of the PSO method, indicating that the Sinai chaotic map combined with the PSO method is more effective at detecting potential SNP barcodes in both the XOR and ZZ disease models. Copyright © 2016 Elsevier B.V. All rights reserved.
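The fitness value that the swarm maximizes can be illustrated with a small sketch. It assumes that a SNP barcode is a fixed genotype at each of several SNPs and that fitness is the Pearson chi-square statistic (without continuity correction) of a 2x2 table of barcode carriers against case/control status; the abstract does not spell out the exact table layout, so this layout, the function names and the data structures are assumptions made for illustration.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square for the table [[a, b], [c, d]] (no correction).

    a: cases carrying the barcode, b: cases not carrying it,
    c: controls carrying the barcode, d: controls not carrying it.
    """
    n = a + b + c + d
    margins = (a + b) * (c + d) * (a + c) * (b + d)
    if margins == 0:
        raise ValueError("chi-square is undefined when a margin is zero")
    return n * (a * d - b * c) ** 2 / margins


def carries_barcode(genotypes, barcode):
    """True if a sample (dict: SNP id -> genotype) matches every barcode SNP."""
    return all(genotypes.get(snp) == g for snp, g in barcode.items())


def barcode_fitness(cases, controls, barcode):
    """Chi-square fitness of one candidate barcode (hypothetical helper)."""
    a = sum(carries_barcode(s, barcode) for s in cases)
    c = sum(carries_barcode(s, barcode) for s in controls)
    return chi_square_2x2(a, len(cases) - a, c, len(controls) - c)
```

In a PSO search each particle would encode one candidate barcode and `barcode_fitness` would score it; the chaotic maps studied in the paper replace the random numbers in the particle update and are not shown here.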
This study investigated the single nucleotide polymorphism (SNP) in the interferon regulatory factor 6 (IRF6) gene in healthy residents of Guangdong Province, China, for further analysis of their associations with the development of cleft lip with or without palate (CL/P).
Sambo, Francesco; Di Camillo, Barbara; Toffolo, Gianna; Cobelli, Claudio
The increasing interest in rare genetic variants and epistatic genetic effects on complex phenotypic traits is currently pushing genome-wide association study design towards datasets of increasing size, both in the number of studied subjects and in the number of genotyped single nucleotide polymorphisms (SNPs). This, in turn, is leading to a compelling need for new methods for compression and fast retrieval of SNP data. We present a novel algorithm and file format for compressing and retrieving SNP data, specifically designed for large-scale association studies. Our algorithm is based on two main ideas: (i) compress linkage disequilibrium blocks in terms of differences with a reference SNP and (ii) compress reference SNPs exploiting information on their call rate and minor allele frequency. Tested on two SNP datasets and compared with several state-of-the-art software tools, our compression algorithm is shown to be competitive in terms of compression rate and to outperform all tools in terms of time to load compressed data. Our compression and decompression algorithms are implemented in a C++ library, are released under the GNU General Public License and are freely downloadable from http://www.dei.unipd.it/~sambofra/snpack.html. © The Author 2014. Published by Oxford University Press. All rights reserved.
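Idea (i) above can be illustrated with a toy encoder. This is only a sketch of difference coding against a reference SNP, written in Python for brevity; it does not reproduce the authors' C++ library or file format. It assumes genotypes are stored per subject as minor-allele counts 0, 1 or 2 (with None for a missing call), and all names are illustrative.

```python
def encode_against_reference(reference, snp):
    """Keep only the (subject index, genotype) pairs where snp differs.

    reference, snp: equal-length sequences of genotype codes for the same
    subjects. SNPs in strong linkage disequilibrium with the reference
    differ at few positions, so the returned list is short.
    """
    if len(reference) != len(snp):
        raise ValueError("reference and snp must cover the same subjects")
    return [(i, g) for i, (r, g) in enumerate(zip(reference, snp)) if r != g]


def decode_against_reference(reference, diffs):
    """Rebuild the full genotype vector from the reference and the diffs."""
    out = list(reference)
    for i, g in diffs:
        out[i] = g
    return out
```

For example, with a reference of `[0, 1, 2, 0, 1, 0]` and a linked SNP of `[0, 1, 2, 0, 2, 0]`, only the single pair `(4, 2)` needs to be stored, and decoding restores the original vector exactly.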
Pendergrass Sarah A
Background: Initial genome-wide association study (GWAS) discoveries are being further explored through the use of large cohorts across multiple and diverse populations involving meta-analyses within large consortia and networks. Many of the additional studies characterize less than 100 single nucleotide polymorphisms (SNPs), often include multiple and correlated phenotypic measurements, and can include data from multiple sites, multiple studies, as well as multiple race/ethnicities. New approaches for visualizing resultant data are necessary in order to fully interpret results and obtain a broad view of the trends between DNA variation and phenotypes, as well as provide information on specific SNP and phenotype relationships. Results: The Synthesis-View software tool was designed to visually synthesize the results of the aforementioned types of studies. Presented herein are multiple examples of the ways Synthesis-View can be used to report results from association studies of DNA variation and phenotypes, including the visual integration of p-values or other metrics of significance, allele frequencies, sample sizes, effect size, and direction of effect. Conclusions: To truly allow a user to visually integrate the multiple pieces of information typical of a genetic association study, innovative views are needed. As a result, we have created "Synthesis-View" software for the visualization of genotype-phenotype association data in multiple cohorts. Synthesis-View is freely available for non-commercial research institutions; for full details see https://chgr.mc.vanderbilt.edu/synthesisview.
Zhang, Le-Le; Fang, Jian-Qiao; Shao, Xiao; Lian, Lin-Li; Yu, Xiao-Jing; Dong, Zhen-Hua; Mo, Ya-Di
To observe the effect on gastric dynamics of transcutaneous electrical acupoint stimulation (TEAS) combined with general anesthesia when controlled hypotension dropped to 60% of the mean arterial pressure (MAP) baseline, and to provide experimental evidence for organ protection in clinical controlled hypotension. Eighteen male beagles were randomly divided into three groups, the general anesthesia group (blank), the general anesthesia induced controlled hypotension group (control), and the general anesthesia combined with TEAS induced controlled hypotension group (experiment), 6 in each group. Controlled hypotension was performed in the latter two groups with isoflurane inhalation and intravenous injection of sodium nitroprusside (SNP). The mean arterial pressure (MAP) was lowered to 60% of the MAP baseline and kept for 60 min. Controlled hypotension was not performed in Beagles of the blank group. For Beagles in the experiment group, TEAS [2/100 Hz, (4 ± 1) mA] was applied to bilateral Hegu (LI4), Quchi (LI11), Zusanli (ST36), and Sanyinjiao (SP6) from stable physiological conditions to the end of maintaining stages. Changes of EGG frequencies and EGG amplitudes were monitored. Serum levels of gastrin (GAS) and motilin (MTL) were also detected at corresponding time points during and after experiment. As for the pressure control effect of TEAS combined with general anesthesia in the controlled hypotension, during the process of controlled hypotension (T1-T4), MAP levels of two controlled pressure groups remained relatively stable, and were kept at 60% of the MAP baseline. When the blood pressure dropped to the target low MAP and maintained at 60 min (T1-T4), EGG amplitudes of Beagles in all the three groups showed decreasing tendency. But it was more obviously lower than its basic level in the control group (P control group and the blank group (P control group. During this period, EGG frequencies of the two controlled hypotension groups decreased more than those of the blank
Esteras, Cristina; Gómez, Pedro; Monforte, Antonio J; Blanca, José; Vicente-Dólera, Nelly; Roig, Cristina; Nuez, Fernando; Picó, Belén
Cucurbita pepo is a member of the Cucurbitaceae family, the second-most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most of these markers are located in
Abstract Background Cucurbita pepo is a member of the Cucurbitaceae family, the second-most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). Results We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Conclusion Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research
Fondevila, M; Børsting, C; Phillips, C
This review explores the key factors that influence the optimization, routine use, and profile interpretation of the SNaPshot single-base extension (SBE) system applied to forensic single-nucleotide polymorphism (SNP) genotyping. Despite being a mainly complementary DNA genotyping technique...... to routine STR profiling, use of SNaPshot is an important part of the development of SNP sets for a wide range of forensic applications with these markers, from genotyping highly degraded DNA with very short amplicons to the introduction of SNPs to ascertain the ancestry and physical characteristics...... of an unidentified contact trace donor. However, this technology, as resourceful as it is, displays several features that depart from the usual STR genotyping far enough to demand a certain degree of expertise from the forensic analyst before tackling the complex casework on which SNaPshot application provides...
US Agency for International Development — Polymorphism of SNP (single nucleotide polymorphism) markers was assessed on 24 parental lines of the ISRA sorghum breeding program. About 1300 SNP have been used...
Fragomeni, Breno de Oliveira; Misztal, Ignacy; Lourenco, Daniela Lino; Aguilar, Ignacio; Okimoto, Ronald; Muir, William M
The purpose of this study was to determine if the set of genomic regions inferred as accounting for the majority of genetic variation in quantitative traits remain stable over multiple generations of selection. The data set contained phenotypes for five generations of broiler chicken for body weight, breast meat, and leg score. The population consisted of 294,632 animals over five generations and also included genotypes of 41,036 single nucleotide polymorphism (SNP) for 4,866 animals, after quality control. The SNP effects were calculated by a GWAS type analysis using single step genomic BLUP approach for generations 1-3, 2-4, 3-5, and 1-5. Variances were calculated for windows of 20 SNP. The top ten windows for each trait that explained the largest fraction of the genetic variance across generations were examined. Across generations, the top 10 windows explained more than 0.5% but less than 1% of the total variance. Also, the pattern of the windows was not consistent across generations. The windows that explained the greatest variance changed greatly among the combinations of generations, with a few exceptions. In many cases, a window identified as top for one combination, explained less than 0.1% for the other combinations. We conclude that identification of top SNP windows for a population may have little predictive power for genetic selection in the following generations for the traits here evaluated.
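The window-based comparison described in this abstract can be illustrated with a minimal Python sketch. It assumes that the variance attributed to a window is the variance, across animals, of the summed genotype-times-effect contributions of its SNPs, divided by the variance of the total genomic value; the abstract does not spell out the exact formula, and the function names below are hypothetical. Because SNPs in different windows can be correlated, the shares need not sum to one.

```python
from statistics import pvariance


def window_variance_shares(genotypes, effects, window=20):
    """Share of total genomic variance attributed to each window of SNPs.

    genotypes: list of rows (one per animal) of allele counts (0, 1, 2).
    effects:   list of estimated SNP effects, same order as the columns.
    window:    number of consecutive SNPs per window (20 in the abstract).
    """
    n_snps = len(effects)
    # Total genomic value of each animal: sum over all SNPs.
    total = [sum(g * a for g, a in zip(row, effects)) for row in genotypes]
    total_var = pvariance(total)
    shares = []
    for start in range(0, n_snps, window):
        stop = min(start + window, n_snps)
        # Contribution of this window alone to each animal's genomic value.
        local = [sum(row[j] * effects[j] for j in range(start, stop))
                 for row in genotypes]
        shares.append(pvariance(local) / total_var)
    return shares


def top_windows(shares, k=10):
    """Indices of the k windows with the largest variance share."""
    order = sorted(range(len(shares)), key=lambda i: shares[i], reverse=True)
    return set(order[:k])


def overlap(shares_a, shares_b, k=10):
    """Number of top-k windows shared by two generation combinations."""
    return len(top_windows(shares_a, k) & top_windows(shares_b, k))
```

Running `overlap` on the shares obtained from two generation combinations gives one simple way to quantify how stable the top windows are, which is the question the study asks.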
Chang, Hsueh-Wei; Yang, Cheng-Hong; Chang, Phei-Lang; Cheng, Yu-Huei; Chuang, Li-Yeh
Abstract Background The restriction fragment length polymorphism (RFLP) is a common laboratory method for the genotyping of single nucleotide polymorphisms (SNPs). Here, we describe a web-based software, named SNP-RFLPing, which provides the restriction enzyme for RFLP assays on a batch of SNPs and genes from the human, rat, and mouse genomes. Results Three user-friendly inputs are included: 1) NCBI dbSNP "rs" or "ss" IDs; 2) NCBI Entrez gene ID and HUGO gene name; 3) any formats of SNP-in-se...
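The core idea behind choosing a restriction enzyme for an RFLP assay can be sketched in Python: an enzyme is informative when its recognition site is present for one allele and absent for the other. This is only an illustration under stated assumptions, not the SNP-RFLPing implementation; the function names and the short enzyme table are chosen here for the example, and only one strand is scanned, which is sufficient because the listed recognition sites are palindromic.

```python
import re

# Recognition sequences of a few common restriction enzymes (N = any base).
ENZYMES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "TaqI": "TCGA",
    "MspI": "CCGG",
    "HinfI": "GANTC",
}


def count_sites(seq, site):
    """Count (possibly overlapping) occurrences of a recognition site."""
    pattern = re.compile("(?=" + site.replace("N", "[ACGT]") + ")")
    return len(pattern.findall(seq))


def discriminating_enzymes(left, alleles, right):
    """Return enzymes whose site count differs between the two alleles.

    left, right: flanking sequence on either side of the SNP.
    alleles:     pair of single bases, e.g. ("A", "G").
    The result maps enzyme name -> (sites with allele 1, sites with allele 2).
    """
    first, second = alleles
    seq_first = (left + first + right).upper()
    seq_second = (left + second + right).upper()
    hits = {}
    for name, site in ENZYMES.items():
        n_first = count_sites(seq_first, site)
        n_second = count_sites(seq_second, site)
        if n_first != n_second:
            hits[name] = (n_first, n_second)
    return hits
```

For a made-up flanking sequence, `discriminating_enzymes("ACGTGA", ("A", "G"), "TTCAGT")` reports EcoRI, because only the A allele completes the GAATTC site.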
BACKGROUND: The suppressor of cytokine signalling 3 (SOCS3) provides a link between cytokine action and their negative consequences on insulin signalling. Thus SOCS3 is a potential candidate gene for type 2 diabetes (T2DM). METHODOLOGY/PRINCIPAL FINDINGS: Based on HapMap we identified the polymorphism A+930-->G (rs4969168) as a haplotype tagging SNP (htSNP) sufficiently covering the genetic variation of the whole gene. We therefore examined the association between rs4969168 within SOCS3 and T2DM in three independent study populations; one prospective case-cohort study and two cross-sectional study populations. Due to the low frequency of individuals being homozygous for the polymorphism a dominant model of inheritance was assumed. The case-cohort study with 2,957 individuals (764 of them with incident T2DM) showed no effect of the polymorphism on diabetes risk (hazard ratio (95% CI): 0.86 (0.66-1.13); p = 0.3). Within the MeSyBePo-study population 325 subjects had T2DM from a total of 1,897 individuals, while the second cross-sectional cohort included 851 cases of T2DM within a total of 1,653 subjects. According to the results in the prospective study, no association with T2DM was found (odds ratio (95% CI): 0.78 (0.54-1.12) for MeSyBePo and 1.13 (0.90-1.42) for the Leipzig study population). There was also no association with metabolic subtraits such as insulin sensitivity (p = 0.7), insulin secretion (p = 0.8) or the hyperbolic relation of both, the disposition index (p = 0.7). In addition, no evidence for interaction with BMI or sex was found. We subsequently performed a meta-analysis, additionally including the publicly available data from the T2DM-subcohort of the WTCCC (n = 4,855). The overall odds ratio within that meta-analysis was 0.96 (0.88-1.06). CONCLUSIONS/SIGNIFICANCE: There is no strong effect of the common genetic variation within the SOCS3 gene on the development of T2DM.
Tofighi, Davood; Thoemmes, Felix
Mediation analysis is a statistical approach used to examine how the effect of an independent variable on an outcome is transmitted through an intervening variable (mediator). In this article, we provide a gentle introduction to single-level and multilevel mediation analyses. Using single-level data, we demonstrate an application of structural…
Hoeffding, L. K.; Kock, K. F.; Johnsen, Iben Birgit Gade
The single nucleotide polymorphism (SNP) microarray technology has emerged as a powerful tool to screen the whole genome for sub-microscopic duplications and deletions that are not detectable by traditional cytogenetic analysis. Case: We report a case of a female twin born at 27th week of gestati...... to maturation of the lungs or the perinatal death of one of the twins. However, disruptions in the biosynthesis of gangliosides have been previously associated with premature death in mice....
Ferchaud, Anne-Laure; Pedersen, Susanne H.; Bekkevold, Dorte
for rapid and cost efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. Results...... have developed a cost-efficient low-density SNP array that allows for rapid screening of polymorphisms in threespine stickleback. The array provides a valuable tool for analyzing adaptive divergence between freshwater and marine stickleback populations beyond the well-established candidate gene...
Kristensen, Lasse S; Dobrovic, Alexander
High-resolution melting (HRM) shows great promise for high-throughput, rapid genotyping of individual polymorphic loci. We have developed HRM assays for genotyping single nucleotide polymorphisms (SNP) in several key genes that are involved in methyl metabolism and may directly or indirectly affect the methylation status of the DNA. The SNPs are in the 5,10-methylenetetrahydrofolate reductase (MTHFR; C677T and A1298C), methionine synthetase (MTR; 5-methyltetrahydrofolate-homocysteine methyltransferase; A2756G), and DNA methyltransferase 3b (DNMT3b; C46359T and C31721T) loci. The choice of short amplicons led to greater melting temperature (Tm) differences between the two homozygous genotypes, which allowed accurate genotyping without the use of probes or spiking with control DNA. In the case of MTHFR, there is a second rarer SNP (rs4846051) close to the A1298C SNP that may result in inaccurate genotyping. We masked this second SNP by placing the primer over it and choosing a base at the polymorphic position that was equally mismatched to both alleles. The HRM assays were done on HRM capable real-time PCR machines rather than stand-alone HRM machines. Monitoring the amplification allows ready identification of samples that may give rise to aberrant melting curves because of PCR abnormalities. We show that samples amplifying markedly late can give rise to shifted melting curves without alteration of shapes and potentially lead to misclassification of genotypes. In conclusion, rapid and high-throughput SNP analysis can be done with probe-free HRM if sufficient attention is paid to amplicon design and quality control to omit aberrantly amplifying samples.
Kohane Isaac S
Abstract Background Single Nucleotide Polymorphisms (SNPs) are an increasingly important tool for genetic and biomedical research. Although current genomic databases contain information on several million SNPs and are growing at a very fast rate, the true value of a SNP in this context is a function of the quality of the annotations that characterize it. Retrieving and analyzing such data for a large number of SNPs often represents a major bottleneck in the design of large-scale association studies. Description SNPper is a web-based application designed to facilitate the retrieval and use of human SNPs for high-throughput research purposes. It provides a rich local database generated by combining SNP data with the Human Genome sequence and with several other data sources, and offers the user a variety of querying, visualization and data export tools. In this paper we describe the structure and organization of the SNPper database, we review the available data export and visualization options, and we describe how the architecture of SNPper and its specialized data structures support high-volume SNP analysis. Conclusions The rich annotation database and the powerful data manipulation and presentation facilities it offers make SNPper a very useful online resource for SNP research. Its success proves the great need for integrated and interoperable resources in the field of computational biology, and shows how such systems may play a critical role in supporting the large-scale computational analysis of our genome.
Lomonaco, Sara; Furumoto, Emily J; Loquasto, Joseph R; Morra, Patrizia; Grassi, Ausilia; Roberts, Robert F
Identification at the genus, species, and strain levels is desirable when a probiotic microorganism is added to foods. Strains of Bifidobacterium animalis ssp. lactis (BAL) are commonly used worldwide in dairy products supplemented with probiotic strains. However, strain discrimination is difficult because of the high degree of genome identity (99.975%) between different genomes of this subspecies. Typing of monomorphic species can be carried out efficiently by targeting informative single nucleotide polymorphisms (SNP). Findings from a previous study analyzing both reference and commercial strains of BAL identified SNP that could be used to discriminate common strains into 8 groups. This paper describes development of a minisequencing assay based on the primer extension reaction (PER) targeting multiple SNP that can allow strain differentiation of BAL. Based on previous data, 6 informative SNP were selected for further testing, and a multiplex preliminary PCR was optimized to amplify the DNA regions containing the selected SNP. Extension primers (EP) annealing immediately adjacent to the selected SNP were developed and tested in simplex and multiplex PER to evaluate their performance. Twenty-five strains belonging to 9 distinct genomic clusters of B. animalis ssp. lactis were selected and analyzed using the developed minisequencing assay, simultaneously targeting the 6 selected SNP. Fragment analysis was subsequently carried out in duplicate and demonstrated that the assay yielded 8 specific profiles separating the most commonly used commercial strains. This novel multiplex PER approach provides a simple, rapid, flexible SNP-based subtyping method for proper characterization and identification of commercial probiotic strains of BAL from fermented dairy products. To assess the usefulness of this method, DNA was extracted from yogurt manufactured with and without the addition of B. animalis ssp. lactis BB-12. Extracted DNA was then subjected to the minisequencing
In this study, we investigated the relationship between unsaturated fatty acids influencing beef flavor and four SNPs (c.280A>G, c.388G>A, c.408G>C and c.456A>G) located at exons 2, 3 and 4 of the FABP4 gene, which encodes fatty acid binding protein 4, in Korean cattle (n = 513). When analyzing the relationship between single genotypes, fatty acids and carcass traits, individuals with the homozygous GG, GG, CC and GG genotypes had a higher content of unsaturated fatty acids and higher marbling scores than other genotypes (p<0.05). Then, the haplotype block showed strong significant relationships not only with unsaturated fatty acids (54.73%), but also with marbling scores (5.82) in the ht1×ht1 group (p<0.05). This ht1×ht1 group showed significant differences in the unsaturated fatty acids and marbling scores that affect beef flavor in Korean cattle. Therefore, it can be inferred that the ht1×ht1 types might be valuable new markers for use in the improvement of Korean cattle.
van den Brink, Floris Teunis Gerardus; Gool, Elmar; Frimat, Jean-Philippe; Bomer, Johan G.; van den Berg, Albert; le Gac, Severine
We report a PDMS microfluidic platform for parallel single-cell analysis (PaSCAl) as a powerful tool to decipher the heterogeneity found in cell populations. Cells are trapped individually in dedicated pockets, and thereafter, a number of invasive or non-invasive analysis schemes are performed.
To study crab immunogenetics of individuals, newly hatched Eriocheir sinensis larvae were stimulated with a mixture of three pathogen strains (Gram-positive bacteria Micrococcus luteus, Gram-negative bacteria Vibrio alginolyticus and fungi Pichia pastoris; 10^8 cfu·mL^-1). A total of 44,767,566 Illumina clean reads corresponding to 4.52 Gb nucleotides were generated and assembled into 100,252 unigenes (average length: 1,042 bp; range: 201-19,357 bp). 17,097 (26.09%) of 65,535 non-redundant unigenes were annotated in the NCBI non-redundant protein (Nr) database. Moreover, 23,188 (35.38%) unigenes were assigned to three Gene Ontology (GO) categories, 15,071 (23.00%) to twenty-six Clusters of Orthologous Groups (COG) and 8,574 (13.08%) to six Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, respectively. Numerous genes were further identified to be associated with multiple immune pathways, including Toll, immune deficiency (IMD), janus kinase (JAK)-signal transducers and activators of transcription (STAT) and mitogen-activated protein kinase (MAPK) pathways. Some of them, such as tumor necrosis factor receptor associated factor 6 (TRAF6), fibroblast growth factor (FGF), protein-tyrosine phosphatase (PTP), JNK-interacting protein 1 (JIP1), were first identified in E. sinensis. TRAF6 was even first discovered in crabs. Additionally, 49,555 single nucleotide polymorphisms (SNPs) were developed from over 13,309 unigenes. This is the first transcriptome report of whole bodies of E. sinensis larvae after immune challenge. Data generated here not only provide detailed information to identify novel genes in genome reference-free E. sinensis, but also facilitate our understanding of host immunity and the defense mechanism of the crab at the whole transcriptome level.
Interim Report on SNP analysis and forensic microarray probe design for South American hemorrhagic fever viruses, tick-borne encephalitis virus, henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever viruses, Rift Valley fever
Jaing, C; Gardner, S
The goal of this project is to develop forensic genotyping assays for select agent viruses, enhancing the current capabilities for the viral bioforensics and law enforcement community. We used a multipronged approach combining bioinformatics analysis, PCR-enriched samples, microarrays and TaqMan assays to develop high resolution and cost effective genotyping methods for strain level forensic discrimination of viruses. We have leveraged substantial experience and efficiency gained through year 1 on software development, SNP discovery, TaqMan signature design and phylogenetic signature mapping to scale up the development of forensics signatures in year 2. In this report, we have summarized the whole genome wide SNP analysis and microarray probe design for forensics characterization of South American hemorrhagic fever viruses, tick-borne encephalitis viruses and henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus and Japanese encephalitis virus.
Abstract Background Until recently, only a small number of low- and mid-throughput methods have been used for single nucleotide polymorphism (SNP) discovery and genotyping in grapevine (Vitis vinifera L.). However, following completion of the sequence of the highly heterozygous genome of Pinot Noir, it has been possible to identify millions of electronic SNPs (eSNPs), thus providing a valuable source for high-throughput genotyping methods. Results Herein we report the first application of the SNPlex™ genotyping system in grapevine aiming at the anchoring of a eukaryotic genome. This approach combines robust SNP detection with automated assay readout and data analysis. 813 candidate eSNPs were developed from non-repetitive contigs of the assembled genome of Pinot Noir and tested in 90 progeny of a Syrah × Pinot Noir cross. 563 new SNP-based markers were obtained and mapped. The efficiency rate of 69% was enhanced to 80% when multiple displacement amplification (MDA) methods were used for preparation of genomic DNA for the SNPlex assay. Conclusion Unlike other SNP genotyping methods used to investigate thousands of SNPs in a few genotypes, or a few SNPs in around a thousand genotypes, the SNPlex genotyping system represents a good compromise to investigate several hundred SNPs in a hundred or more samples simultaneously. Therefore, the use of the SNPlex assay, coupled with whole genome amplification (WGA), is a good solution for future applications in well-equipped laboratories.
Shahid, Saleem Ullah; Shabana; Humphries, Steve
rs10911021 (a single nucleotide polymorphism present upstream of the GLUL gene) affects glutamic acid metabolism, and was shown to be associated with coronary heart disease (CHD) in patients with T2DM, but a definite mechanism is unknown. It may affect the glutathione cycle, an important effector in the antioxidant defense mechanism, in the cells. We checked the association of this SNP with CHD and the oxidative stress biomarkers malondialdehyde (MDA), GSH and GSSG in Pakistani patients. A total of 650 subjects (425 CHD cases and 225 controls) were genotyped by the TaqMan allelic discrimination technique. The levels of MDA, GSH and GSSG were measured by standard protocols. The risk allele frequency was higher in cases than controls, but the difference was insignificant (p = 0.55). The SNP was not associated with CHD (p = 0.053) but when the analysis was limited to CHD patients having DM, a significant association (p = 0.03) was observed. The blood levels of MDA and GSSG were higher while that of GSH was significantly lower in the cases than the controls (p stress biomarkers MDA and GSH and GSSG levels. As the SNP rs10911021 showed significant association with oxidative stress parameters and these parameters showed an increased oxidative stress in the CHD subjects, it can be concluded that the SNP may have contributed to increasing the risk of heart disease in the diabetic subjects by increasing the oxidative stress.
Ortiz-Estevez, Maria; Bengtsson, Henrik; Rubio, Angel
Current algorithms for estimating DNA copy numbers (CNs) borrow concepts from gene expression analysis methods. However, single nucleotide polymorphism (SNP) arrays have special characteristics that, if taken into account, can improve the overall performance. For example, cross hybridization between alleles occurs in SNP probe pairs. In addition, most of the current CN methods are focused on total CNs, while it has been shown that allele-specific CNs are of paramount importance for some studies. Therefore, we have developed a summarization method that estimates high-quality allele-specific CNs. The proposed method estimates the allele-specific DNA CNs for all Affymetrix SNP arrays dealing directly with the cross hybridization between probes within SNP probesets. This algorithm outperforms (or at least it performs as well as) other state-of-the-art algorithms for computing DNA CNs. It better discerns an aberration from a normal state and it also gives more precise allele-specific CNs. The method is available in the open-source R package ACNE, which also includes an add on to the aroma.affymetrix framework (http://www.aroma-project.org/).
Santoro, Stephanie L; Hashimoto, Sayaka; McKinney, Aimee; Mihalic Mosher, Theresa; Pyatt, Robert; Reshmi, Shalini C; Astbury, Caroline; Hickey, Scott E
Maternal uniparental disomy (UPD) 15 is one of the molecular causes of Prader-Willi syndrome (PWS), a multisystem disorder which presents with neonatal hypotonia and feeding difficulty. Current diagnostic algorithms differ regarding the use of SNP microarray to detect PWS. We retrospectively examined the frequency with which SNP microarray could identify regions of homozygosity (ROH) in patients with PWS. We determined that 7/12 (58%) patients with previously confirmed PWS by methylation analysis and microsatellite-positive UPD studies had ROH (>10 Mb) by SNP microarray. Additional assessment of 5,000 clinical microarrays, performed from 2013 to present, determined that only a single case of ROH for chromosome 15 was not caused by an imprinting disorder or identity by descent. We observed that ROH for chromosome 15 is rarely incidental and strongly associated with hypotonic infants having features of PWS. Although UPD microsatellite studies remain essential to definitively establish the presence of UPD, SNP microarray has important utility in the timely diagnostic algorithm for PWS. © 2017 S. Karger AG, Basel.
Muñoz, Irene; Henriques, Dora; Johnston, J. Spencer
Beekeeping activities, especially queen trading, have shaped the distribution of honey bee (Apis mellifera) subspecies in Europe, and have resulted in extensive introductions of two eastern European C-lineage subspecies (A. m. ligustica and A. m. carnica) into the native range of the M-lineage A. m....... mellifera subspecies in Western Europe. As a consequence, replacement and gene flow between native and commercial populations have occurred at varying levels across western European populations. Genetic identification and introgression analysis using molecular markers is an important tool for management...... be exploited to create a reduced panel containing the most ancestry-informative markers (AIMs) for those purposes with very little loss of information. The objective of this study was to design reduced panels of AIMs to verify the origin of A. m. mellifera individuals and to provide accurate estimates...
Poole, Amy; Roessli, Bertrand
This paper presents a new single crystal neutron scattering data analysis program, MuFit. The program can be used to refine spherical neutron polarimetry (SNP) data as well as nuclear and magnetic single crystal diffraction data. The article presents a brief overview of the SNP process and the core mathematics used to calculate the SNP matrices and gives an example of data that has been fitted using the MuFit package, which is available on request from the author Bertrand Roessli.
Abstract Background The recent development of new high-throughput technologies for SNP genotyping has opened the possibility of taking a genome-wide linkage approach to the search for new candidate genes involved in hereditary diseases. The two major breast cancer susceptibility genes BRCA1 and BRCA2 are involved in 30% of hereditary breast cancer cases, but the discovery of additional breast cancer predisposition genes for the non-BRCA1/2 breast cancer families has so far been unsuccessful. Results In order to evaluate the power improvement provided by using SNP markers in a real situation, we have performed a whole genome screen of 19 non-BRCA1/2 breast cancer families using 4720 genomewide SNPs with Illumina technology (Illumina's Linkage III Panel), with an average distance of 615 Kb/SNP. We identified six regions on chromosomes 2, 3, 4, 7, 11 and 14 as candidates to contain genes involved in breast cancer susceptibility, and additional fine mapping genotyping using microsatellite markers around linkage peaks confirmed five of them, excluding the region on chromosome 3. These results were consistent in analyses that excluded SNPs in high linkage disequilibrium. The results were compared with those obtained previously using a 10 cM microsatellite scan (STR-GWS) and we found lower or not significant linkage signals with STR-GWS data compared to SNP data in all cases. Conclusion Our results show the power increase that SNPs can supply in linkage studies.
Houston, Ross D; Taggart, John B; Cézard, Timothé; Bekaert, Michaël; Lowe, Natalie R; Downing, Alison; Talbot, Richard; Bishop, Stephen C; Archibald, Alan L; Bron, James E; Penman, David J; Davassi, Alessandro; Brew, Fiona; Tinch, Alan E; Gharbi, Karim; Hamilton, Alastair
Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture
Miyata, Hirofumi; Itoh, Rika; Sato, Fumio; Takebe, Naoya; Hada, Tetsuro; Tozaki, Teruaki
Variants of the Myostatin gene have been shown to have an influence on muscle hypertrophy phenotypes in a wide range of mammalian species. Recently, Thoroughbred horses with a C-allele at the g.66493737C/T single-nucleotide polymorphism (SNP) have been reported to be suited to short-distance racing. In this study, we examined the effect of the Myostatin SNP on muscle fiber properties in young Thoroughbred horses during a training period. To investigate the effect of the Myostatin SNP on muscle fiber before training, the expression of several mRNAs was relatively quantified in biopsy samples from the middle gluteal muscle of 27 untrained male Thoroughbred horses (1.5 years old) using real-time RT-PCR analysis. The remaining muscle samples were used for immunohistochemical analysis to determine the population and area of each fiber type. All measurements were re-evaluated in biopsy samples of the same horses after a 5-month period of conventional training. Although the expression of Myostatin mRNA decreased in all SNP genotypes, a significant decrease was found only in the C/C genotype after training. Meanwhile, expression of VEGFa, PGC1α, and SDHa mRNAs, which relate to the biogenesis of mitochondria and capillaries, was significantly higher (54-82%) in the T/T than in the C/C genotype after training. It is suggested that hypertrophy of muscle fiber is directly associated with a decrease in Myostatin mRNA expression in the C/C genotype, and that increased expression of VEGFa, PGC1α, and SDHa in the T/T genotype might be indirectly caused by the Myostatin SNP.
Tea [Camellia sinensis (L.) O Kuntze] is an economically important crop cultivated in more than 50 countries. Production and marketing of premium specialty tea products provides opportunities for tea growers, the tea industry and consumers. Rapid market segmentation in the tea industry has resulted ...
Background With the availability of large-scale genome-wide association study (GWAS) data, choosing an optimal set of SNPs for disease susceptibility prediction is a challenging task. This study aimed to use single nucleotide polymorphisms (SNPs) to predict psoriasis by searching GWAS data. Methods In total, we had 2,798 samples and 451,724 SNPs. The process of searching for a set of SNPs to predict susceptibility to psoriasis consisted of two steps. The first was to search for the top 1,000 SNPs with high accuracy for prediction of psoriasis in the GWAS dataset. The second was to search for an optimal SNP subset for predicting psoriasis. The sequential information bottleneck (sIB) method was compared with classical linear discriminant analysis (LDA) for classification performance. Results The best test harmonic mean of sensitivity and specificity for predicting psoriasis by sIB was 0.674 (95% CI: 0.650-0.698), while only 0.520 (95% CI: 0.472-0.524) was reported for predicting disease by LDA. Our results indicate that the new classifier sIB performs better than LDA in the study. Conclusions The fact that a small set of SNPs can predict disease status with average accuracy of 68% makes it possible to use SNP data for psoriasis prediction.
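The performance measure reported in this abstract, the harmonic mean of sensitivity and specificity, can be sketched as follows. This is an illustrative helper computed from confusion-matrix counts; the function name and the example counts are assumptions, not data from the study.

```python
def harmonic_mean_se_sp(tp: int, fn: int, tn: int, fp: int) -> float:
    """Harmonic mean of sensitivity and specificity from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # true positive rate among cases
    specificity = tn / (tn + fp)  # true negative rate among controls
    if sensitivity + specificity == 0:
        return 0.0
    return 2 * sensitivity * specificity / (sensitivity + specificity)


# Hypothetical counts: sensitivity 0.8, specificity 0.6.
example = harmonic_mean_se_sp(tp=80, fn=20, tn=60, fp=40)
```

Unlike plain accuracy, this measure stays low when either sensitivity or specificity is poor, which matters when cases and controls are imbalanced.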
Imen Rekik Hakim; Naziha Grati Kammoun; Emna Makhloufi; Ahmed Rebaï
Single Nucleotide Polymorphisms (SNPs) have become the most widely used markers in many current genetic applications. Here we report the discovery of nine new SNPs in olives by direct partial sequencing of two genes (OEX and OEW) in sixteen Tunisian cultivars. The SNP markers were then used to genotype 24 olive cultivars and assess the level of genetic diversity. Power of discrimination of SNP markers was then compared to that of microsatellites (SSRs). A combination of SSR and SNP markers wa...
Fu, Ou-Yang; Chang, Hsueh-Wei; Lin, Yu-Da; Chuang, Li-Yeh; Hou, Ming-Feng; Yang, Cheng-Hong
In association studies, the combined effects of single nucleotide polymorphism (SNP)-SNP interactions and the problem of imbalanced data between cases and controls are frequently ignored. In the present study, we used an improved multifactor dimensionality reduction (MDR) approach namely MDR-ER to detect the high order SNP‑SNP interaction in an imbalanced breast cancer data set containing seven SNPs of chemokine CXCL12/CXCR4 pathway genes. Most individual SNPs were not significantly associated with breast cancer. After MDR‑ER analysis, six significant SNP‑SNP interaction models with seven genes (highest cross‑validation consistency, 10; classification error rates, 41.3‑21.0; and prediction error rates, 47.4‑55.3) were identified. CD4 and VEGFA genes were associated in a 2‑loci interaction model (classification error rate, 41.3; prediction error rate, 47.5; odds ratio (OR), 2.069; 95% bootstrap CI, 1.40‑2.90; P=1.71E‑04) and it also appeared in all the best 2‑7‑loci models. When the loci number increased, the classification error rates and P‑values decreased. The powers in 2‑7‑loci in all models were >0.9. The minimum classification error rate of the MDR‑ER‑generated model was shown with the 7‑loci interaction model (classification error rate, 21.0; OR=15.282; 95% bootstrap CI, 9.54‑23.87; P=4.03E‑31). In the epistasis network analysis, the overall effect with breast cancer susceptibility was identified and the SNP order of impact on breast cancer was identified as follows: CD4 = VEGFA > KITLG > CXCL12 > CCR7 = MMP2 > CXCR4. In conclusion, the MDR‑ER can effectively and correctly identify the best SNP‑SNP interaction models in an imbalanced data set for breast cancer cases.
Shah, Nameeta; Teplitsky, Michael V.; Pennacchio, Len A.; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L.
Recent advances in sequencing technologies promise better diagnostics for many diseases as well as better understanding of evolution of microbial populations. Single Nucleotide Polymorphisms (SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it is possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease and then screen for causative mutations. In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples makes possible more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at http://genome.lbl.gov/vista/snpvista.
Maslow, Bat-Sheva L; Budinetz, Tara; Sueldo, Carolina; Anspach, Erica; Engmann, Lawrence; Benadiva, Claudio; Nulsen, John C
To compare the analysis of chromosome number from paraffin-embedded products of conception using single-nucleotide polymorphism (SNP) microarray with the recommended screening for the evaluation of couples presenting with recurrent pregnancy loss who do not have previous fetal cytogenetic data. We performed a retrospective cohort study including all women who presented for a new evaluation of recurrent pregnancy loss over a 2-year period (January 1, 2012, to December 31, 2013). All participants had at least two documented first-trimester losses and both the recommended screening tests and SNP microarray performed on at least one paraffin-embedded products of conception sample. Single-nucleotide polymorphism microarray identifies all 24 chromosomes (22 autosomes, X, and Y). Forty-two women with a total of 178 losses were included in the study. Paraffin-embedded products of conception from 62 losses were sent for SNP microarray. Single-nucleotide polymorphism microarray successfully diagnosed fetal chromosome number in 71% (44/62) of samples, of which 43% (19/44) were euploid and 57% (25/44) were noneuploid. Seven of 42 (17%) participants had abnormalities on recurrent pregnancy loss screening. The per-person detection rate for a cause of pregnancy loss was significantly higher in the SNP microarray (0.50; 95% confidence interval [CI] 0.36-0.64) compared with recurrent pregnancy loss evaluation (0.17; 95% CI 0.08-0.31) (P=.002). Participants with one or more euploid loss identified on paraffin-embedded products of conception were significantly more likely to have an abnormality on recurrent pregnancy loss screening than those with only noneuploid results (P=.028). The significance remained when controlling for age, number of losses, number of samples, and total pregnancies. These results suggest that SNP microarray testing of paraffin-embedded products of conception is a valuable tool for the evaluation of recurrent pregnancy loss in patients without prior fetal
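The per-person detection rates above are proportions with 95% confidence intervals. The abstract does not say which interval method was used, so the sketch below uses the Wilson score interval as an assumption. The counts 21/42 and 7/42 are inferred from the reported rates (0.50 and 0.17 of 42 participants), not quoted from the study.

```python
import math
from typing import Tuple


def wilson_interval(successes: int, n: int, z: float = 1.959964) -> Tuple[float, float]:
    """Wilson score interval for a binomial proportion (z defaults to the 95% level)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half_width, centre + half_width


# Inferred counts: 21 of 42 (SNP microarray) and 7 of 42 (standard screening).
microarray_ci = wilson_interval(21, 42)
screening_ci = wilson_interval(7, 42)
```

Under these assumptions the intervals round to 0.36-0.64 and 0.08-0.31, which agrees with the reported values, although that agreement does not establish which method the authors applied.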
Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Bellato, Cláudia M; Motilal, Lambert; Zhang, Dapeng
Cacao (Theobroma cacao L.), the source of cocoa, is an economically important tropical crop. One problem with the premium cacao market is contamination with off-types adulterating raw premium material. Accurate determination of the genetic identity of single cacao beans is essential for ensuring cocoa authentication. Using nanofluidic single nucleotide polymorphism (SNP) genotyping with 48 SNP markers, we generated SNP fingerprints for small quantities of DNA extracted from the seed coat of single cacao beans. On the basis of the SNP profiles, we identified an assumed adulterant variety, which was unambiguously distinguished from the authentic beans by multilocus matching. Assignment tests based on both Bayesian clustering analysis and allele frequency clearly separated all 30 authentic samples from the non-authentic samples. Distance-based principal coordinate analysis further supported these results. The nanofluidic SNP protocol, together with forensic statistical tools, is sufficiently robust to establish authentication and to verify gourmet cacao varieties. This method shows significant potential for practical application.
Kaya, Hilal Betul; Cetin, Oznur; Kaya, Hulya; Sahin, Mustafa; Sefer, Filiz; Kahraman, Abdullah; Tanyolac, Bahattin
Background The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. Methodology/Principal Findings Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. Conclusions/Significance This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. High levels of
Cingolani, Pablo; Platts, Adrian; Wang, Le Lily; Coon, Melissa; Nguyen, Tung; Wang, Luan; Land, Susan J; Lu, Xiangyi; Ruden, Douglas M
We describe a new computer program, SnpEff, for rapidly categorizing the effects of variants in genome sequences. Once a genome is sequenced, SnpEff annotates variants based on their genomic locations and predicts coding effects. Annotated genomic locations include intronic, untranslated region, upstream, downstream, splice site, or intergenic regions. Coding effects such as synonymous or non-synonymous amino acid replacement, start codon gains or losses, stop codon gains or losses, or frame shifts can be predicted. Here the use of SnpEff is illustrated by annotating ~356,660 candidate SNPs in ~117 Mb unique sequences, representing a substitution rate of ~1/305 nucleotides, between the Drosophila melanogaster w(1118); iso-2; iso-3 strain and the reference y(1); cn(1) bw(1) sp(1) strain. We show that ~15,842 SNPs are synonymous and ~4,467 SNPs are non-synonymous (N/S ~0.28). The remaining SNPs are in other categories, such as stop codon gains (38 SNPs), stop codon losses (8 SNPs), and start codon gains (297 SNPs) in the 5'UTR. We found, as expected, that the SNP frequency is proportional to the recombination frequency (i.e., highest in the middle of chromosome arms). We also found that start-gain or stop-lost SNPs in Drosophila melanogaster often result in additions of N-terminal or C-terminal amino acids that are conserved in other Drosophila species. It appears that the 5' and 3' UTRs are reservoirs for genetic variation that changes the termini of proteins during evolution of the Drosophila genus. As genome sequencing is becoming inexpensive and routine, SnpEff enables rapid analyses of whole-genome sequencing data to be performed by an individual laboratory.
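To illustrate what predicting a coding effect involves, the sketch below classifies a single-base change within one codon using the standard genetic code. It is not SnpEff's implementation or API. The function name, the 0-based position convention and the returned labels are assumptions made for this example, and it ignores start codons, strand and transcript structure.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)}


def classify_coding_snp(ref_codon: str, pos: int, alt_base: str) -> str:
    """Classify a single-base substitution at 0-based position `pos` of a codon."""
    ref_codon = ref_codon.upper()
    alt_base = alt_base.upper()
    if len(ref_codon) != 3 or any(b not in BASES for b in ref_codon):
        raise ValueError("ref_codon must be three DNA bases")
    if pos not in (0, 1, 2) or alt_base not in BASES:
        raise ValueError("pos must be 0-2 and alt_base a DNA base")
    if ref_codon[pos] == alt_base:
        raise ValueError("alt_base equals the reference base")
    alt_codon = ref_codon[:pos] + alt_base + ref_codon[pos + 1:]
    ref_aa, alt_aa = CODON_TABLE[ref_codon], CODON_TABLE[alt_codon]
    if ref_aa == alt_aa:
        return "synonymous"
    if alt_aa == "*":
        return "stop_gained"
    if ref_aa == "*":
        return "stop_lost"
    return "non_synonymous"
```

For example, changing GAA to GAG leaves glutamate unchanged (synonymous), whereas GAA to TAA introduces a stop codon.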
Chuang, Li-Yeh; Lin, Yu-Da; Chang, Hsueh-Wei; Yang, Cheng-Hong
Possible single nucleotide polymorphism (SNP) interactions in breast cancer are usually not investigated in genome-wide association studies. Previously, we proposed a particle swarm optimization (PSO) method to compute these kinds of SNP interactions. However, this PSO is not guaranteed to find the best result in every run, especially when high-dimensional data is investigated for SNP-SNP interactions. In this study, we propose an IPSO algorithm to improve the reliability of PSO for the identification of the best protective SNP barcodes (SNP combinations and genotypes with maximum difference between cases and controls) associated with breast cancer. SNP barcodes containing different numbers of SNPs were computed. The top five SNP barcode results are retained for computing the next SNP barcode with a one-SNP increase for each processing step. Based on the simulated data for 23 SNPs of six steroid hormone metabolism and signalling-related genes, the performance of our proposed IPSO algorithm is evaluated. Among 23 SNPs, 13 SNPs displayed significant odds ratio (OR) values (1.268 to 0.848; pPSO algorithm, two to four SNPs show significantly decreasing OR values (0.84 to 0.77; pPSO. The interquartile ranges of the boxplot, as well as the upper and lower hinges for each n-SNP barcode (n = 3∼10), are narrower in IPSO than in PSO, suggesting that IPSO is highly reliable for SNP barcode identification. Overall, the proposed IPSO algorithm is robust and provides exact identification of the best protective SNP barcodes for breast cancer.
Molecular genetic analysis of laser-microdissected cells is a method for selecting pure, uncontaminated DNA or RNA as starting material. Our study focuses on the pre-PCR (polymerase chain reaction) techniques for amplifying DNA from a single cell (a leukocyte) isolated from human blood by laser microdissection, and aims to optimize the yield of DNA extracted from this cell so that it can be amplified without errors and support reliable genetic analyses. The study allowed us to shorten cell lysis so that genomic amplification by PEP (primer extension preamplification) can be performed directly after lysis on the same day, to improve the quality of genomic amplification, and to eliminate purification of the PEP product, a step that carries a risk of contamination and of loss of genetic material through handling. This approach showed that combining at least 3 STR (short tandem repeat) markers for the genetic analysis of a single cell improves the efficiency and accuracy of PCR and minimizes allele loss (allele drop-out; ADO). The protocol can be applied at large scale and is an effective means of genetic testing for molecular diagnosis from isolated single cells (cancerous or fetal).
Genotyping with large numbers of molecular markers is now an indispensable tool within plant genetics and breeding. Especially through the identification of large numbers of single nucleotide polymorphism (SNP) markers using the novel high-throughput sequencing technologies, it is now possible to reliably identify many ...
Oct 15, 2012 ... Genotyping with large numbers of molecular markers is now an indispensable tool within plant genetics and breeding. Especially through the identification of large numbers of single nucleotide polymorphism (SNP) markers using the novel high-throughput sequencing technologies, it is now possible to ...
Børsting, Claus; Sanchez Sanchez, Juan Jose; Morling, Niels
We describe a single nucleotide polymorphism (SNP) typing protocol developed for the NanoChip electronic microarray. The NanoChip array consists of 100 electrodes covered by a thin hydrogel layer containing streptavidin. An electric current can be applied to one, several, or all electrodes...
and/or palate phenotypes. Tan et al. (2013) identified a de novo 2.3 Mb microdeletion of 1q32.2 involving the Van der Woude Syndrome locus using a high resolution single nucleotide polymorphism (SNP) array. In our study, we used the CytoScan™ HD array platform provided by the Affymetrix Corporation to analyze 33 ...
Single-nucleotide polymorphisms (SNPs) associated with complex disorders can create, destroy, or modify protein coding sites. Single amino acid substitutions in the insulin receptor (INSR) are the most common forms of genetic variation that account for various diseases like Donohue syndrome or Leprechaunism, Rabson-Mendenhall syndrome, and type A insulin resistance. We analyzed the deleterious nonsynonymous SNPs (nsSNPs) in the INSR gene based on different computational methods. Analysis of INSR was initiated with PROVEAN followed by PolyPhen and I-Mutant servers to investigate the effects of 57 nsSNPs retrieved from the database of SNP (dbSNP). A total of 18 mutations that were found to exert damaging effects on the INSR protein structure and function were chosen for further analysis. Among these mutations, our computational analysis suggested that 13 nsSNPs decreased protein stability and might have resulted in loss of function. Therefore, the probability of their involvement in disease predisposition increases. In the absence of adequate prior reports on the possible deleterious effects of nsSNPs, we have systematically analyzed and characterized the functional variants in the coding region that can alter the expression and function of the INSR gene. In silico characterization of nsSNPs affecting INSR gene function can aid in better understanding of genetic differences in disease susceptibility.
Schneider, Katharina; Kulosa, Dagmar; Soerensen, Thomas Rosleff; Möhring, Silke; Heine, Martin; Durstewitz, Gregor; Polley, Andreas; Weber, Eberhard; Jamsari; Lein, Jens; Hohmann, Uwe; Tahiro, Emma; Weisshaar, Bernd; Schulz, Britta; Koch, Georg; Jung, Christian; Ganal, Martin
A panel of 13 sugar beet lines and one genotype each of the Beta vulgaris cultivars red beet and Swiss chard, and B. vulgaris ssp. maritima were used to identify polymorphisms in alignments of genomic DNA sequences derived from 315 EST- and 43 non-coding RFLP-derived loci. In sugar beet lines, loci of expressed genes showed an average SNP frequency of 1/72 bp, 1 in 58 bp in non-coding sequences, increasing to 1/47 bp upon the addition of the remaining genotypes. Within analysed DNA fragments, alleles at different SNP positions displayed linkage disequilibrium indicative of haplotype structures. On average 2.7 haplotypes were found in sugar beet lines, and haplotype conservation in expressed genes appeared to exceed 500 bp in length. Seven different genotyping techniques including SNP detection by MALDI-TOF mass spectrometry, pyrosequencing and fluorescence scanning of labelled nucleotides were employed to perform 712 segregation analyses for 538 markers in three F(2) populations. Functions were predicted for 492 mapped sequences. Genetic maps comprised 305 loci covering 599.8 cM in population K1, 241 loci distributed over 636.6 cM in population D2, and 166 loci over 507.1 cM in population K2, respectively. Based on 156 markers common to more than one population an integrated map was constructed with 524 loci covering 664.3 cM. For 377 loci the genome positions of the most similar sequences from A. thaliana were identified, but little evidence for previously presented ancestral genome structures was found.
Bachlava, Eleni; Taylor, Christopher A.; Tang, Shunxue; Bowers, John E.; Mandel, Jennifer R.; Burke, John M.; Knapp, Steven J.
Recent advances in next-generation DNA sequencing technologies have made possible the development of high-throughput SNP genotyping platforms that allow for the simultaneous interrogation of thousands of single-nucleotide polymorphisms (SNPs). Such resources have the potential to facilitate the rapid development of high-density genetic maps, and to enable genome-wide association studies as well as molecular breeding approaches in a variety of taxa. Herein, we describe the development of a SNP genotyping resource for use in sunflower (Helianthus annuus L.). This work involved the development of a reference transcriptome assembly for sunflower, the discovery of thousands of high quality SNPs based on the generation and analysis of ca. 6 Gb of transcriptome re-sequencing data derived from multiple genotypes, the selection of 10,640 SNPs for inclusion in the genotyping array, and the use of the resulting array to screen a diverse panel of sunflower accessions as well as related wild species. The results of this work revealed a high frequency of polymorphic SNPs and relatively high level of cross-species transferability. Indeed, greater than 95% of successful SNP assays revealed polymorphism, and more than 90% of these assays could be successfully transferred to related wild species. Analysis of the polymorphism data revealed patterns of genetic differentiation that were largely congruent with the evolutionary history of sunflower, though the large number of markers allowed for finer resolution than has previously been possible. PMID:22238659
Genetic characterization of Polish ccRCC patients: somatic mutation analysis of PBRM1, BAP1 and KDM5C, genomic SNP array analysis in tumor biopsy and preliminary results of chromosome aberrations analysis in plasma cell free DNA.
Kluzek, Katarzyna; Srebniak, Malgorzata I; Majer, Weronika; Ida, Agnieszka; Milecki, Tomasz; Huminska, Kinga; van der Helm, Robert M; Silesian, Adrian; Wrzesinski, Tomasz M; Wojciechowicz, Jacek; Beverloo, Berna H; Kwias, Zbigniew; Bluyssen, Hans A R; Wesoly, Joanna
Mutation analysis and cytogenetic testing in clear cell renal cell carcinoma (ccRCC) are not yet implemented in routine diagnostics of ccRCC. We characterized the chromosomal alterations in 83 ccRCC tumors from Polish patients using a whole genome SNP genotyping assay. Moreover, the utility of next generation sequencing of cell free DNA (cfDNA) in patients' plasma as a potential tool for non-invasive cytogenetic analysis was tested. Additionally, tumor specific somatic mutations in PBRM1, BAP1 and KDM5C were determined. We confirmed a correlation between deletions at 9p and higher tumor size, and between deletion of chromosome 20 and the survival time. In Fuhrman grade 1, only aberrations of 3p and 8p deletion, gain of 5q and 13q and gains of chromosome 7 and 16 were present. The number of aberrations increased with Fuhrman grade; all chromosomes displayed cytogenetic changes in G3 and G4. ccRCC specific chromosome aberrations were observed in cfDNA, although discrepancies were found between cfDNA and tumor samples. In total 12 common and 94 rare variants were detected in PBRM1, BAP1 and KDM5C, with four potentially pathogenic variants. We observed markedly lower mutation load in PBRM1. Cytogenetic analysis of cfDNA may allow more accurate diagnosis of tumor aberrations and therefore the correlation between the chromosome aberrations in cfDNA and clinical outcome should be studied in larger cohorts. Functional studies of BAP1, KDM5C and PBRM1 mutations in a large, independent sample set would be necessary for the assessment of their prognostic and diagnostic potential.
Wu, Peipei; Xie, Jingzhong; Hu, Jinghuang; Qiu, Dan; Liu, Zhiyong; Li, Jingting; Li, Miaomiao; Zhang, Hongjun; Yang, Li; Liu, Hongwei; Zhou, Yang; Zhang, Zhongjun; Li, Hongjie
Powdery mildew resistance gene Pm4b, originating from Triticum persicum, is effective against the prevalent Blumeria graminis f. sp. tritici (Bgt) isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification of Pm4b during the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F2:3 mapping population (237 families) derived from a pair of isogenic lines VPM1/7∗Bainong 3217 F4 (carrying Pm4b) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F2:3 families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. Four Pm4b-linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking the Pm4b gene. Three SSR markers, Xics13, Xics43, and Xics76, were incorporated in the new genetic linkage map, which located Pm4b in a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship with Brachypodium distachyon chromosome 5, rice chromosome 4, and sorghum chromosome 6. Seven genes associated with disease
Safari, Roghaiyeh; Salimi, Reza; Tunca, Zeliha; Ozerdem, Aysegul; Ceylan, Deniz; Sakizli, Meral
Calcium signaling is important for synaptic plasticity, generation of brain rhythms, regulating neuronal excitability, data processing and cognition. Impairment in calcium homeostasis contributes to the development of psychiatric disorders such as bipolar disorder (BP). MCU is the most important calcium transporter in the mitochondrial inner membrane, responsible for influx of Ca2+. MICU1 is linked with MCU and has two canonical EF hands that are vital for its activity and regulate MCU-mediated Ca2+ influx. In the current study, we aimed to investigate the role of genetic alteration of the EF hand calcium binding motifs of MICU1 in the development of BP. We examined patients with BP, first degree relatives of these patients and healthy volunteers for mutations and polymorphisms in the EF hand calcium binding motifs of MICU1. The results showed no SNP/mutation in BP patients, in healthy subjects or in first degree relatives. Additionally, alignment of the EF hand calcium binding regions among species (Gallus-gallus, Canis-lupus-familiaris, Bos-taurus, Mus-musculus, Rattus-norvegicus, Pan-troglodytes, Homo-sapiens and Danio-rerio) showed exactly the same amino acids (DLNGDGEVDMEE and DCDGNGELSNKE), except in one of the calcium binding domains of Danio-rerio, where there was only one difference: leucine instead of methionine. Our results showed that the SNP on EF-hand Ca2+ binding domains of the MICU1 gene had no effect on the phenotypic characteristics of BP patients.
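The cross-species comparison of the two EF-hand motifs amounts to a position-by-position check of aligned, equal-length sequences. A minimal sketch follows. The two reference motifs are those quoted in the abstract. The zebrafish variant is not given there: the string below is a hypothetical reconstruction obtained by replacing the only methionine with leucine, and should be treated as an assumption.

```python
from typing import List, Tuple

EF_HAND_1 = "DLNGDGEVDMEE"  # motif quoted in the abstract
EF_HAND_2 = "DCDGNGELSNKE"  # motif quoted in the abstract
# Hypothetical: inferred from "leucine instead of methionine", not a quoted sequence.
ZEBRAFISH_EF_HAND_1 = "DLNGDGEVDLEE"


def motif_differences(reference: str, other: str) -> List[Tuple[int, str, str]]:
    """Return (1-based position, reference residue, other residue) for each mismatch."""
    if len(reference) != len(other):
        raise ValueError("motifs must be pre-aligned to equal length")
    return [
        (i + 1, r, o)
        for i, (r, o) in enumerate(zip(reference, other))
        if r != o
    ]
```

With the assumed zebrafish string, `motif_differences(EF_HAND_1, ZEBRAFISH_EF_HAND_1)` reports a single M-to-L difference, while identical motifs yield an empty list.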
Nicholas A. Tinker
Recognizing a need in cultivated hexaploid oat (Avena sativa L.) for a reliable set of reference single nucleotide polymorphisms (SNPs), we have developed a 6000 (6K) BeadChip design containing 257 Infinium I and 5486 Infinium II designs corresponding to 5743 SNPs. Of those, 4975 SNPs yielded successful assays after array manufacturing. These SNPs were discovered based on a variety of bioinformatics pipelines in complementary DNA (cDNA) and genomic DNA originating from 20 or more diverse oat cultivars. The array was validated in 1100 samples from six recombinant inbred line (RIL) mapping populations and sets of diverse oat cultivars and breeding lines, and provided approximately 3500 discernible Mendelian polymorphisms. Here, we present an annotation of these SNPs, including methods of discovery, gene identification and orthology, population-genetic characteristics, and tentative positions on an oat consensus map. We also evaluate a new cluster-based method of calling SNPs. The SNP design sequences are made publicly available, and the full SNP genotyping platform is available for commercial purchase from an independent third party.
Background SNP (Single Nucleotide Polymorphism) markers are rapidly becoming the markers of choice for applications in breeding because of next generation sequencing (NGS) technology developments. For SNP development by NGS technologies, correct assembly of the huge amounts of sequence data generated is essential. Little is known about assemblers' performance, especially when dealing with highly heterogeneous species that show a high genome complexity, or about the possible consequences of differences in assemblies for SNP retrieval. This study tested two assemblers (CAP3 and CLC) on 454 data from four lily genotypes and compared results with respect to SNP retrieval. Results CAP3 assembly resulted in higher numbers of contigs, lower numbers of reads per contig, and shorter average read lengths compared to CLC. Blast comparisons showed that CAP3 contigs were highly redundant. In contrast, CLC in rare cases combined paralogs in one contig. Redundant and chimeric contigs may lead to erroneous SNPs. Filtering for redundancy can be done by blasting selected SNP markers to the contigs and discarding all the SNP markers that show more than one blast hit. Results on chimeric contigs showed that only four out of 2,421 SNP markers were selected from chimeric contigs. Conclusion In practice, CLC performs better in assembling highly heterogeneous genome sequences compared to CAP3, and consequently SNP retrieval is more efficient. Additionally, a simple flow scheme is suggested for SNP marker retrieval that can be valid for all non-model species.
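The redundancy filter described in this abstract (discard SNP markers with more than one BLAST hit) can be sketched as below. This is an illustration rather than the authors' flow scheme. It assumes BLAST tabular output in which the first two tab-separated columns are the query (marker) and subject (contig) identifiers, and it counts distinct contigs rather than alignment lines, which is a design choice made here.

```python
import csv
from collections import defaultdict
from typing import Dict, Iterable, Set


def markers_with_single_hit(blast_rows: Iterable[str]) -> Set[str]:
    """Return marker IDs that hit exactly one distinct contig.

    Each row is one line of tab-separated BLAST output whose first two
    columns are the query ID and the subject ID. Markers without any hit
    do not appear in the input and are therefore not returned.
    """
    hits: Dict[str, Set[str]] = defaultdict(set)
    for row in csv.reader(blast_rows, delimiter="\t"):
        if len(row) < 2 or row[0].startswith("#"):
            continue  # skip blank, malformed or comment lines
        hits[row[0]].add(row[1])
    return {marker for marker, contigs in hits.items() if len(contigs) == 1}
```

Because the function accepts any iterable of lines, an open file handle on the BLAST output can be passed directly.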
Ha, Bo-Keun; Phillips, Daniel V.; Boerma, H. Roger
Asian soybean rust (ASR), caused by Phakopsora pachyrhizi Syd., has the potential to become a serious threat to soybean, Glycine max (L.) Merr., production in the USA. A novel rust resistance gene, Rpp?(Hyuuga), from the Japanese soybean cultivar Hyuuga has been identified and mapped to soybean chromosome 6 (Gm06). Our objectives were to fine-map the Rpp?(Hyuuga) gene and develop a high-throughput single nucleotide polymorphism (SNP) assay to detect this ASR resistance gene. The integration of recombination events from two different soybean populations and the ASR reaction data indicates that the Rpp?(Hyuuga) locus is located in a region of approximately 371 kb between STS70887 and STS70923 on chromosome Gm06. A set of 32 ancestral genotypes, which is predicted to contain 95% of the alleles present in current elite North American breeding populations, and the sources of the previously reported ASR resistance genes (Rpp1, Rpp2, Rpp3, Rpp4, Rpp5, and rpp5) were genotyped with five SNP markers. We developed a SimpleProbe assay based on melting curve analysis for SNP06-44058, which is tightly linked to the Rpp?(Hyuuga) gene. This SNP assay can differentiate plants/lines that are homozygous/homogeneous or heterozygous/heterogeneous for the resistant and susceptible alleles at the Rpp?(Hyuuga) locus. PMID:20532750
Huertas, Pablo; Cruz-García, Andrés
Homologous recombination is initiated by the so-called DNA end resection, the 5'-3' nucleolytic degradation of a single strand of the DNA at each side of the break. The presence of resected DNA is an obligatory step for homologous recombination. Moreover, the amount of resected DNA modulates the prevalence of different recombination pathways. In different model organisms, there are several published ways to visualize and measure, in more or less detail, the amount of DNA resected. In human cells, however, technical constraints have hampered the study of resection at high resolution. Some information might be gathered from the study of endonuclease-created DSBs, in which the resection of breaks at known sites can be followed by PCR or ChIP. In this chapter, we describe in detail a novel assay to study DNA end resection in breaks located at unknown positions. Here, we use ionizing radiation to induce double-strand breaks, but the same approach can be used to monitor resection induced by different DNA damaging agents. By modifying the DNA-combing technique, used for high-resolution replication analyses, we can measure resection progression at the level of individual DNA fibers. Thus, we named the method Single Molecule Analysis of Resection Tracks (SMART). We use human cells in culture as a model system, but in principle the same approach should be feasible in any model organism, provided the DNA isolation part of the protocol is adjusted accordingly.
Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios
Background Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. Methods The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. Conclusions The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. Significance The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food. PMID:21283808
Cancers often involve the synergistic effects of gene–gene interactions, but identifying these interactions remains challenging. Here, we present an odds ratio-based genetic algorithm (OR-GA) that is able to solve the problems associated with the simultaneous analysis of multiple independent single nucleotide polymorphisms (SNPs) that are associated with oral cancer. Interactions among four SNPs (rs1799782, rs2040639, rs861539, and rs2075685), belonging respectively to four genes (XRCC1, XRCC2, XRCC3, and XRCC4), were tested in this study. The GA decomposes the SNP sets into different SNP combinations with their corresponding genotypes (called SNP barcodes). The GA can effectively identify a specific SNP barcode that has an optimized fitness value and uses this to calculate the difference between the case and control groups. The SNP barcodes with a low fitness value are naturally removed from the population. Using two to four SNPs, the best SNP barcodes with maximum differences in occurrence between the case and control groups were generated by the GA. Subsequently, the OR provides a quantitative measure of the multiple SNP synergies between the oral cancer and control groups by calculating the risk related to the best SNP barcodes and others. When these were compared to their corresponding non-SNP barcodes, the estimated ORs for oral cancer were found to be greater than 1 [approx. 1.72–2.23; confidence intervals (CIs): 0.94–5.30, p < 0.03–0.07] for various specific SNP barcodes with two to four SNPs. In conclusion, the proposed OR-GA method successfully generates SNP barcodes, which allow oral cancer risk to be evaluated, and in the process the OR-GA method identifies possible SNP–SNP interactions.
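The odds ratio step can be illustrated with a small sketch. It assumes a plain 2×2 table (cases and controls, each split by whether they carry a given SNP barcode) and a Woolf (log-based) confidence interval; the abstract does not say which interval method the authors used, and the function name and counts below are invented for illustration only.

```python
import math


def barcode_odds_ratio(case_with, case_without, control_with, control_without, z=1.96):
    """Odds ratio and approximate 95% CI (Woolf method) for one SNP barcode."""
    counts = (case_with, case_without, control_with, control_without)
    if min(counts) <= 0:
        raise ValueError("all four cells must be positive for this simple estimate")
    odds_ratio = (case_with * control_without) / (case_without * control_with)
    # Standard error of log(OR) is the root of the summed reciprocal cell counts.
    se_log_or = math.sqrt(sum(1.0 / c for c in counts))
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, not taken from the study.
or_value, ci_low, ci_high = barcode_odds_ratio(40, 60, 25, 75)
print(f"OR = {or_value:.2f}, 95% CI = {ci_low:.2f}-{ci_high:.2f}")
```

An interval whose lower bound falls below 1, as in the lower end of the range quoted in the abstract, means the elevated risk is not clearly separated from no effect for that barcode.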
Background Rhesus macaques (Macaca mulatta) are the primate most used for biomedical research, but phenotypic differences between Indian-origin and Chinese rhesus macaques have encouraged genetic methods for identifying genetic differences between these two populations. The completion of the rhesus genome has led to the identification of many single nucleotide polymorphisms (SNPs) in this species. These single nucleotide polymorphisms have many advantages over the short tandem repeat (STR) loci currently used to assay genetic variation. However, the number of currently identified polymorphisms is too small for whole genome analysis or studies of quantitative trait loci. To that end, we tested a combination of methods to identify large numbers of high-confidence SNPs, and screen those with high minor allele frequencies (MAF). Results By testing our previously reported single nucleotide polymorphisms, we identified a subset of high-confidence, high-MAF polymorphisms. Resequencing revealed a large number of regionally specific SNPs not identified through a single pyrosequencing run. By resequencing a pooled sample of four individuals, we reliably identified loci with a MAF of at least 12.5%. Finally, we found that when applied to a larger, geographically variable sample of rhesus, a large proportion of our loci were variable in both populations, and very few loci were ancestry informative. Despite this fact, the SNP loci were more effective at discriminating Indian and Chinese rhesus than STR loci. Conclusion Pyrosequencing and pooled resequencing are viable methods for the identification of high-MAF SNP loci in rhesus macaques. These SNP loci are appropriate for screening both the inter- and intra-population genetic variation.
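The 12.5% threshold is consistent with simple allele counting in a pool: four diploid individuals carry eight chromosome copies, so a variant present on a single copy has a frequency of 1/8. The sketch below only illustrates that arithmetic; the function name is hypothetical, and the abstract does not state that the threshold was derived this way.

```python
def min_pooled_allele_frequency(n_individuals, ploidy=2):
    """Smallest non-zero allele frequency possible in a pool of individuals."""
    if n_individuals <= 0 or ploidy <= 0:
        raise ValueError("n_individuals and ploidy must be positive")
    # One variant copy among all chromosome copies in the pool.
    return 1 / (n_individuals * ploidy)


print(f"{min_pooled_allele_frequency(4):.1%}")
```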
Wang, Difei; Song, Lei; Singh, Varun; Rao, Shruti; An, Lin; Madhavan, Subha
One of the long-standing challenges in biology is to understand how non-synonymous single nucleotide polymorphisms (nsSNPs) change protein structure and further affect their function. While it is impractical to solve all the mutated protein structures experimentally, it is quite feasible to model the mutated structures in silico. Toward this goal, we built a publicly available structure database resource (SNP2Structure, https://apps.icbi.georgetown.edu/snp2structure) focusing on missense mutations, msSNP. Compared with web portals with similar aims, SNP2Structure has the following major advantages. First, our portal offers direct comparison of two related 3D structures. Second, the protein models include all interacting molecules in the original PDB structures, so users are able to determine regions of potential interaction changes when a protein mutation occurs. Third, the mutated structures are available to download locally for further structural and functional analysis. Fourth, we used the Jsmol package to display the protein structures, which avoids system compatibility issues. SNP2Structure provides reliable, high quality mapping of nsSNPs to 3D protein structures, enabling researchers to explore the likely functional impact of human disease-causing mutations.
Ponomarenko, Petr; Rasskazov, Dmitry; Suslov, Valentin; Sharypova, Ekaterina; Savinkova, Ludmila; Podkolodnaya, Olga; Podkolodny, Nikolay L; Tverdokhleb, Natalya N; Chadaeva, Irina; Ponomarenko, Mikhail; Kolchanov, Nikolay
Variations in the human genome (e.g., single nucleotide polymorphisms, SNPs) may be associated with hereditary diseases, their complications, comorbidities, and drug responses. Using the Web service SNP_TATA_Comparator presented in our previous paper, here we analyzed the immediate surroundings of known SNP markers of diseases and identified several candidate SNP markers that can significantly change the affinity of TATA-binding protein for human gene promoters, with circadian consequences. For example, rs572527200 may be related to asthma, where symptoms are circadian (worse at night), and rs367732974 may be associated with heart attacks that are characterized by a circadian preference (early morning). By the same method, we analyzed the 90 bp proximal promoter region of each protein-coding transcript of each human gene of the circadian clock core. This analysis yielded 53 candidate SNP markers, such as rs181985043 (susceptibility to acute Q fever in male patients), rs192518038 (higher risk of a heart attack in patients with diabetes), and rs374778785 (emphysema and lung cancer in smokers). If they are properly validated according to clinical standards, these candidate SNP markers may turn out to be useful for physicians (to select optimal treatment for each patient) and for the general population (to choose a lifestyle preventing possible circadian complications of diseases).
Ba, Hengxing; Jia, Boyin; Wang, Guiwu; Yang, Yifeng; Kedem, Gilead; Li, Chunyi
Sika deer are an economically valuable species owing to their use in traditional Chinese medicine, particularly their velvet antlers. Sika deer in northeast China are mostly farmed in enclosures. Therefore, genetic management of farmed sika deer would benefit from detailed knowledge of their genetic diversity. In this study, we generated over 1.45 billion high-quality paired-end reads (288 Gbp) across 42 unrelated individuals using double-digest restriction site-associated DNA sequencing (ddRAD-seq). A total of 96,188 (29.63%) putative biallelic SNP loci were identified with an average sequencing depth of 23×. Based on the analysis, we found that the majority of the loci had a deficit of heterozygotes (F_IS > 0) and low values of H_obs, which could be due to inbreeding and Wahlund effects. We also developed a collection of high-quality SNP probes that will likely be useful in a variety of applications in genotyping for cervid species in the future. Copyright © 2017 Ba et al.
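The heterozygote deficit reported here is usually expressed through the inbreeding coefficient F_IS = 1 − H_obs / H_exp, where H_exp is the heterozygosity expected under Hardy-Weinberg proportions. The sketch below computes it for a single biallelic locus from genotype counts. It is a textbook formula rather than the authors' software; the function name and the genotype counts are illustrative assumptions, and no small-sample correction is applied.

```python
def f_is(n_aa, n_ab, n_bb):
    """Per-locus F_IS = 1 - H_obs / H_exp for a biallelic SNP."""
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)      # frequency of allele A
    h_exp = 2 * p * (1 - p)              # expected heterozygosity under HWE
    if h_exp == 0:
        raise ValueError("locus is monomorphic; F_IS is undefined")
    h_obs = n_ab / n                     # observed heterozygosity
    return 1 - h_obs / h_exp


# Hypothetical counts for 42 individuals: fewer heterozygotes than expected.
print(round(f_is(20, 10, 12), 3))
```

A positive value indicates fewer heterozygotes than expected, which is the pattern the abstract attributes to inbreeding and Wahlund effects.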
Cabezas José A
Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough to provide sufficient information content for genetic identification in grapevine, allowing for a good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker.
In this study, we detected four single nucleotide polymorphisms (SNPs) at the PRKAG3 gene (DQ082736) in 267 beef cattle. The SNP marker association analysis indicated that the SNP marker T2885C was significantly associated with the tenderness trait. Animals with the TT genotype had lower Warner-Bratzler shear force ...
Genome-wide association studies (GWASs) have identified low-penetrance common variants (i.e., single nucleotide polymorphisms, SNPs) associated with breast cancer susceptibility. Although GWASs are primarily focused on single-locus effects, gene-gene interactions (i.e., epistasis) are also assumed to contribute to the genetic risks for complex diseases including breast cancer. While it has been hypothesized that moderately ranked (P value based) weak single-locus effects in GWASs could potentially harbor valuable information for evaluating epistasis, we lack systematic efforts to investigate SNPs showing consistent associations with weak statistical significance across independent discovery and replication stages. The objectives of this study were (i) to select SNPs showing single-locus effects with weak statistical significance for breast cancer in a GWAS and/or candidate-gene studies; (ii) to replicate these SNPs in an independent set of breast cancer cases and controls; and (iii) to explore their potential SNP-SNP interactions contributing to breast cancer susceptibility. A total of 17 SNPs related to DNA repair, modification and metabolism pathway genes were selected since these pathways offer a priori knowledge for potential epistatic interactions and an overall role in breast carcinogenesis. The study design included predominantly Caucasian women (2,795 cases and 4,505 controls) from Alberta, Canada. We observed two two-way SNP-SNP interactions (APEX1-rs1130409 and RPAP1-rs2297381; MLH1-rs1799977 and MDM2-rs769412) in logistic regression that conferred elevated risks for breast cancer (P_interaction < 7.3 × 10^-3). Logic regression identified an interaction involving four SNPs (MBD2-rs4041245, MLH1-rs1799977, MDM2-rs769412, BRCA2-rs1799943) (P_permutation = 2.4 × 10^-3). SNPs involved in SNP-SNP interactions also showed single-locus effects with weak statistical significance, while BRCA2-rs1799943 showed stronger statistical significance (P
Guo, Liyuan; Wang, Jing
Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation databases rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of the analysis scope from SNP-related regulatory elements to include regulatory element-target gene pairs (E-G pairs), so that it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content, and a new network search module is provided in rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (iii) The types of regulatory elements were modified and enriched. To the best of our knowledge, the updated rSNPBase 3.0 is the first data tool that supports SNP functional analysis from a regulatory network perspective; it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Combined array-comparative genomic hybridization and single-nucleotide polymorphism-loss of heterozygosity analysis reveals complex changes and multiple forms of chromosomal instability in colorectal cancers
Gaasenbeek, Michelle; Howarth, Kimberley; Rowan, Andrew J
(CGH) for copy number changes and single-nucleotide polymorphism (SNP) microarrays for allelic loss (LOH). Many array-based CGH changes were not found by LOH because they did not cause true reduction-to-homozygosity. Conversely, many regions of SNP-LOH occurred in the absence of copy number change...
Masuyama, Kotoka; Shojo, Hideki; Nakanishi, Hiroaki; Inokuchi, Shota; Adachi, Noboru
Sex determination is important in archeology and anthropology for the study of past societies, cultures, and human activities. Sex determination is also one of the most important components of individual identification in criminal investigations. We developed a new method of sex determination by detecting a single-nucleotide polymorphism in the amelogenin gene using amplified product-length polymorphisms in combination with sex-determining region Y analysis. We particularly focused on the most common types of postmortem DNA damage in ancient and forensic samples: fragmentation and nucleotide modification resulting from deamination. Amplicon size was designed to be less than 60 bp to make the method more useful for analyzing degraded DNA samples. All DNA samples collected from eight Japanese individuals (four male, four female) were evaluated correctly using our method. The detection limit for accurate sex determination was determined to be 20 pg of DNA. We compared our new method with commercial short tandem repeat analysis kits using DNA samples artificially fragmented by ultraviolet irradiation. Our novel method was the most robust for highly fragmented DNA samples. To deal with allelic dropout resulting from deamination, we adopted “bidirectional analysis,” which analyzed samples from both sense and antisense strands. This new method was applied to 14 Jomon individuals (3500-year-old bone samples) whose sex had been identified morphologically. We could correctly identify the sex of 11 out of 14 individuals. These results show that our method is reliable for the sex determination of highly degenerated samples. PMID:28052096
Longfils, M; Schuster, E; Lorén, N; Särkkä, A; Rudemo, M
As a complement to the standard RICS method of analysing Raster Image Correlation Spectroscopy images with estimation of the image correlation function, we introduce the method SPRIA, Single Particle Raster Image Analysis. Here, we start by identifying individual particles and estimate the diffusion coefficient for each particle by a maximum likelihood method. Averaging over the particles gives a diffusion coefficient estimate for the whole image. In examples both with simulated and experimental data, we show that the new method gives accurate estimates. It also gives directly standard error estimates. The method should be possible to extend to study heterogeneous materials and systems of particles with varying diffusion coefficient, as demonstrated in a simple simulation example. A requirement for applying the SPRIA method is that the particle concentration is low enough so that we can identify the individual particles. We also describe a bootstrap method for estimating the standard error of standard RICS. © 2016 The Authors Journal of Microscopy © 2016 Royal Microscopical Society.
Knowledge of the intracellular distribution of biologically relevant metals is important for understanding their mechanisms of action in cells, whether in physiological, toxicological or pathological processes. However, the direct detection of trace metals in single cells is a challenging task that requires sophisticated analytical developments. The combination of micro-PIXE with RBS and STIM (Scanning Transmission Ion Microscopy) allows the quantitative determination of trace metal content within sub-cellular compartments. The application of STIM analysis provides high spatial resolution imaging (< 200 nm) and excellent mass sensitivity (< 0.1 ng). Application of the STIM-PIXE-RBS methodology is absolutely needed when organic mass loss appears during PIXE-RBS irradiation. This combination of STIM-PIXE-RBS provides fully quantitative determination of trace element content, expressed in μg/g, which is a quite unique capability for micro-PIXE compared to other micro-analytical methods such as electron and synchrotron x-ray fluorescence. Examples of micro-PIXE studies for sub-cellular imaging of trace elements in various fields of interest will be presented: in the patho-physiology of trace elements involved in neurodegenerative diseases such as Parkinson's disease, and in the toxicology of metals such as cobalt.
Ismail, A. A.; Daud, R.; Amin, N. A. M.; Majid, M. S. Abdul; Basaruddin, K. S.; Junoh, A. K.
Cortical bone is considered a composite material, as it is made up of different phases such as osteons, interstitial bone and cement lines. Bone is known for its function of providing support in vertebrates. Bone also has a special feature that enables it to resist fracture under certain conditions. Many researchers are interested in understanding the fracture mechanics of bone because of its special microstructure. The objective of this study was to develop a numerical model of a single osteon using the Finite Element Method to determine the stress distribution and predict the location of microcrack initiation. A finite element stress and strain analysis was carried out for an osteon under radial loading. With radial loading, high stress values were predicted at the innermost lamella of the osteon. These findings suggest that stress accumulates in the lamella near the Haversian canal. Microcracks in the osteon may therefore start in the lamella near the canal, which correlates with data in the literature.
Yu, So Young; Ryu, Han Kyoung; Park, Hee Jung; Choi, Young Ju; Huh, Kap Bum; Kim, Wha Young
Single nucleotide polymorphism (SNP) in the adiponectin gene has been associated with insulin resistance, diabetes, and cardiovascular disease (CVD). This study was performed to investigate the association of SNP 276G>T in the adiponectin gene with CVD risk factors in Korean type 2 diabetes mellitus (DM) patients. The subjects were 351 type 2 DM patients who visited a DM clinic in Seoul; patients with known CVD were excluded. The adiponectin SNP 276G>T was analyzed and dietary intakes were assess...
Routtu, Jarkko; Hall, Matthew D; Albere, Brian; Beisel, Christian; Bergeron, R Daniel; Chaturvedi, Anurag; Choi, Jeong-Hyeon; Colbourne, John; De Meester, Luc; Stephens, Melissa T; Stelzer, Claus-Peter; Solorzano, Eleanne; Thomas, W Kelley; Pfrender, Michael E; Ebert, Dieter
Although Daphnia is increasingly recognized as a model for ecological genomics and biomedical research, there is, as of yet, no high-resolution genetic map for the genus. Such a map would provide an important tool for mapping phenotypes and assembling the genome. Here we estimate the genome size of Daphnia magna and describe the construction of an SNP array based linkage map. We then test the suitability of the map for life history and behavioural trait mapping. The two parent genotypes used to produce the map derived from D. magna populations with and without fish predation, respectively, and are therefore expected to show divergent behaviour and life-histories. Using flow cytometry we estimated the genome size of D. magna to be about 238 Mb. We developed an SNP array tailored to type SNPs in a D. magna F2 panel and used it to construct a D. magna linkage map, which included 1,324 informative markers. The map produced ten linkage groups ranging from 108.9 to 203.6 cM, with an average distance between markers of 1.13 cM and a total map length of 1,483.6 cM (Kosambi corrected). The physical length per cM is estimated to be 160 kb. Mapping infertility genes, life history traits and behavioural traits on this map revealed several significant QTL peaks and showed a complex pattern of underlying genetics, with different traits showing strongly different genetic architectures. The new linkage map of D. magna constructed here allowed us to characterize genetic differences among parent genotypes from populations with ecological differences. The QTL effect plots are partially consistent with our expectation of local adaptation under contrasting predation regimes. Furthermore, the new genetic map will be an important tool for the Daphnia research community and will contribute to the physical map of the D. magna genome project and the further mapping of phenotypic traits. The clones used to produce the linkage map are maintained in a stock collection and can be used for
BACKGROUND: Possible single nucleotide polymorphism (SNP) interactions in breast cancer are usually not investigated in genome-wide association studies. Previously, we proposed a particle swarm optimization (PSO) method to compute these kinds of SNP interactions. However, this PSO is not guaranteed to find the best result in every run, especially when high-dimensional data is investigated for SNP-SNP interactions. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we propose the IPSO algorithm to improve the reliability of PSO for the identification of the best protective SNP barcodes (SNP combinations and genotypes with maximum difference between cases and controls) associated with breast cancer. SNP barcodes containing different numbers of SNPs were computed. The top five SNP barcode results are retained for computing the next SNP barcode with a one-SNP increase for each processing step. Based on the simulated data for 23 SNPs of six steroid hormone metabolism and signalling-related genes, the performance of our proposed IPSO algorithm is evaluated. Among 23 SNPs, 13 SNPs displayed significant odds ratio (OR) values (1.268 to 0.848; p<0.05) for breast cancer. Based on the IPSO algorithm, the joint effect in terms of SNP barcodes with two to seven SNPs shows significantly decreasing OR values (0.84 to 0.57; p<0.05 to 0.001). Using the PSO algorithm, two to four SNPs show significantly decreasing OR values (0.84 to 0.77; p<0.05 to 0.001). Based on the results of 20 simulations, medians of the maximum differences for each SNP barcode generated by IPSO are higher than by PSO. The interquartile ranges of the boxplot, as well as the upper and lower hinges for each n-SNP barcode (n = 3–10), are narrower in IPSO than in PSO, suggesting that IPSO is highly reliable for SNP barcode identification. CONCLUSIONS/SIGNIFICANCE: Overall, the proposed IPSO algorithm is robust in providing exact identification of the best protective SNP barcodes for breast cancer.
Abstract Single Nucleotide Polymorphism (SNP) is a genetic variant with a frequency of >1% in a large population. Haplotypes, a combination of a set of SNPs/alleles that appear as "associated blocks" on one chromosome, tend to be inherited together by the next generation and can be used as genetic markers to trace particular diseases. This article aims to explain the application of SNP analysis in the diagnosis of syndromes related to genetic disorders. Previous studies showed that syndromes caused by UPD (uniparental disomy) or autosomal recessive disorders resulting from consanguineous marriage can be identified by SNP array through analysis of blocks of homozygosity in a chromosome. Another advantage of SNP array is its ability to detect low-level mosaicism that is not detected by conventional cytogenetic examination. SNP arrays are currently even being trialled in IVF to obtain healthy babies, by detecting the presence or absence of single disease-causing genes in IVF embryos before the embryos are implanted in the uterus. SNP analysis by SNP array has many advantages over other SNP testing methods and is expected to be widely used in molecular genetic diagnostics in the future.
Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr
Background: RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver...... constructed for the Polish Red, Polish HF, and Hereford breeds, respectively. Using a combination of stringent parameters of a minimum depth of ≥10 mapping reads that support the polymorphic nucleotide base and 100% SNP ratio, 4,368, 3,780 and 3,800 SNP records were detected in the Polish Red, Polish HF......, and Hereford breeds, respectively. The SNP detections using RNA-seq data were successfully validated by the Kompetitive allele-specific PCR (KASP™) SNP genotyping assay. The comprehensive QTL/CG analysis of 110 QTL/CG with RNA-seq data identified 20 monomorphic SNP hit loci (CARTPT, GAD1, GDF5, GHRH, GHRL, GRB10...
McClure, Matthew C; McCarthy, John; Flynn, Paul; McClure, Jennifer C; Dair, Emma; O'Connell, D K; Kearney, John F
A major use of genetic data is parentage verification and identification, as inaccurate pedigrees negatively affect genetic gain. Since 2012 the international standard for single nucleotide polymorphism (SNP) verification in Bos taurus cattle has been the ISAG SNP panels. While these ISAG panels provide an increased level of parentage accuracy over microsatellite markers (MS), they can validate the wrong parent at ≤1% misconcordance rate levels, indicating that more SNP are needed if a more accurate pedigree is required. With rapidly increasing numbers of cattle being genotyped in Ireland, representing 61 B. taurus breeds from a wide range of farm types (beef/dairy, AI/pedigree/commercial, purebred/crossbred, and large to small herd size), the Irish Cattle Breeding Federation (ICBF) analyzed different SNP densities and determined that at a minimum ≥500 SNP are needed to consistently predict only one set of parents at a ≤1% misconcordance rate. For parentage validation and prediction ICBF uses 800 SNP (ICBF800) selected based on SNP clustering quality, ISAG200 inclusion, call rate (CR), and minor allele frequency (MAF) in the Irish cattle population. Large datasets require sample and SNP quality control (QC). Most publications only deal with SNP QC via CR, MAF, parent-progeny conflicts, and Hardy-Weinberg deviation, but not sample QC. We report here parentage, SNP QC, and genomic sample QC pipelines to deal with the unique challenges of >1 million genotypes from a national herd, such as SNP genotype errors from mis-tagging of animals, lab errors, farm errors, and multiple other issues that can arise. We divide the pipeline into two parts: a Genotype QC and an Animal QC pipeline. The Genotype QC identifies samples with low call rate, missing or mixed genotype classes (no BB genotype or ABTG alleles present), and low genotype frequencies. The Animal QC handles situations where the genotype might not belong to the listed individual by identifying: >1 non
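The call rate (CR) and minor allele frequency (MAF) filters named in the abstract above can be illustrated with a minimal Python sketch. It is not the ICBF pipeline: the genotype coding ('AA', 'AB', 'BB', with None for a no-call), the function names, and the threshold defaults are all assumptions chosen for illustration.

```python
from collections import Counter

def snp_call_rate(genotypes):
    """Fraction of samples with a called genotype; None marks a no-call (assumed coding)."""
    if not genotypes:
        return 0.0
    called = sum(1 for g in genotypes if g is not None)
    return called / len(genotypes)

def minor_allele_frequency(genotypes):
    """MAF of a biallelic SNP coded 'AA', 'AB' or 'BB'; no-calls are skipped."""
    allele_counts = Counter(
        allele for g in genotypes if g is not None for allele in g
    )
    total = sum(allele_counts.values())
    if total == 0:
        return 0.0
    freq_a = allele_counts.get("A", 0) / total
    return min(freq_a, 1.0 - freq_a)

def passes_snp_qc(genotypes, min_call_rate=0.90, min_maf=0.01):
    """Hypothetical thresholds; the abstract does not state the cut-offs used."""
    return (snp_call_rate(genotypes) >= min_call_rate
            and minor_allele_frequency(genotypes) >= min_maf)
```

For example, `snp_call_rate(["AA", "AB", "BB", None])` is 0.75, so this SNP would fail the assumed 0.90 call-rate cut-off even though its two alleles are equally frequent.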
Kenny, Elaine M
Screening large numbers of target regions in multiple DNA samples for sequence variation is an important application of next-generation sequencing but an efficient method to enrich the samples in parallel has yet to be reported. We describe an advanced method that combines DNA samples using indexes or barcodes prior to target enrichment to facilitate this type of experiment. Sequencing libraries for multiple individual DNA samples, each incorporating a unique 6-bp index, are combined in equal quantities, enriched using a single in-solution target enrichment assay and sequenced in a single reaction. Sequence reads are parsed based on the index, allowing sequence analysis of individual samples. We show that the use of indexed samples does not impact on the efficiency of the enrichment reaction. For three- and nine-indexed HapMap DNA samples, the method was found to be highly accurate for SNP identification. Even with sequence coverage as low as 8x, 99% of sequence SNP calls were concordant with known genotypes. Within a single experiment, this method can sequence the exonic regions of hundreds of genes in tens of samples for sequence and structural variation using as little as 1 μg of input DNA per sample.
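The step in which sequence reads are parsed by index can be sketched as follows. This is an illustrative simplification, not the authors' software: it assumes the 6-bp index is the first six bases of each read and requires an exact match, and the function and variable names are hypothetical.

```python
from collections import defaultdict

INDEX_LENGTH = 6  # the abstract describes a unique 6-bp index per sample

def demultiplex(reads, index_to_sample):
    """Group reads by their leading index and strip the index from each read.

    Assumption: the index occupies the first INDEX_LENGTH bases of the read.
    Reads whose index is not in index_to_sample are collected under None.
    """
    bins = defaultdict(list)
    for read in reads:
        index, insert = read[:INDEX_LENGTH], read[INDEX_LENGTH:]
        bins[index_to_sample.get(index)].append(insert)
    return dict(bins)
```

Exact matching keeps the sketch short; a production tool would normally tolerate a sequencing error in the index, which is a design decision outside what the abstract specifies.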
Sluga, Davor; Curk, Tomaz; Zupan, Blaz; Lotric, Uros
The extent of data in a typical genome-wide association study (GWAS) poses considerable computational challenges to software tools for gene-gene interaction discovery. Exhaustive evaluation of all interactions among hundreds of thousands to millions of single nucleotide polymorphisms (SNPs) may require weeks or even months of computation. Massively parallel hardware within modern Graphics Processing Units (GPUs) and Many Integrated Core (MIC) coprocessors can shorten the run time considerably. While the utility of GPU-based implementations in bioinformatics has been well studied, the MIC architecture has been introduced only recently and may provide a number of comparative advantages that have yet to be explored and tested. We have developed a heterogeneous, GPU and Intel MIC-accelerated software module for SNP-SNP interaction discovery to replace the previously single-threaded computational core in the interactive web-based data exploration program SNPsyn. We report on differences between these two modern massively parallel architectures and their software environments. Their use resulted in execution times an order of magnitude shorter than those of the single-threaded CPU implementation. The GPU implementation on a single Nvidia Tesla K20 runs twice as fast as that for the MIC architecture-based Xeon Phi P5110 coprocessor, but also requires considerably more programming effort. General purpose GPUs are a mature platform with large amounts of computing power capable of tackling inherently parallel problems, but can prove demanding for the programmer. On the other hand, the new MIC architecture, albeit lacking in performance, reduces the programming effort and makes up for it with a more general architecture suitable for a wider range of problems.
Pootakham, Wirulda; Shearman, Jeremy R; Ruang-Areerate, Panthita; Sonthirod, Chutima; Sangsrakru, Duangjai; Jomchai, Nukoon; Yoocha, Thippawan; Triwitayakorn, Kanokporn; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke
Cassava (Manihot esculenta Crantz) is one of the most important crop species being the main source of dietary energy in several countries. Marker-assisted selection has become an essential tool in plant breeding. Single nucleotide polymorphism (SNP) discovery via transcriptome sequencing is an attractive strategy for genome complexity reduction in organisms with large genomes. We sequenced the transcriptome of 16 cassava accessions using the Illumina HiSeq platform and identified 675,559 EST-derived SNP markers. A subset of those markers was subsequently genotyped by capture-based targeted enrichment sequencing in 100 F1 progeny segregating for starch viscosity phenotypes. A total of 2,110 non-redundant SNP markers were used to construct a genetic map. This map encompasses 1,785 cM and consists of 19 linkage groups. A major quantitative trait locus (QTL) controlling starch pasting properties was identified and shown to coincide with the QTL previously reported for this trait. With a high-density SNP-based linkage map presented here, we also uncovered a novel QTL associated with starch pasting time on LG 10.
Johnston, Susan E; Orell, Panu; Pritchard, Victoria L; Kent, Matthew P; Lien, Sigbjørn; Niemelä, Eero; Erkinaro, Jaakko; Primmer, Craig R
Delaying sexual maturation can lead to larger body size and higher reproductive success, but carries an increased risk of death before reproducing. Classical life history theory predicts that trade-offs between reproductive success and survival should lead to the evolution of an optimal strategy in a given population. However, variation in mating strategies generally persists, and in general, there remains a poor understanding of genetic and physiological mechanisms underlying this variation. One extreme case of this is in the Atlantic salmon (Salmo salar), which can show variation in the age at which they return from their marine migration to spawn (i.e. their 'sea age'). This results in large size differences between strategies, with direct implications for individual fitness. Here, we used an Illumina Infinium SNP array to identify regions of the genome associated with variation in sea age in a large population of Atlantic salmon in Northern Europe, implementing individual-based genome-wide association studies (GWAS) and population-based FST outlier analyses. We identified several regions of the genome which vary in association with phenotype and/or selection between sea ages, with nearby genes having functions related to muscle development, metabolism, immune response and mate choice. In addition, we found that individuals of different sea ages belong to different, yet sympatric populations in this system, indicating that reproductive isolation may be driven by divergence between stable strategies. Overall, this study demonstrates how genome-wide methodologies can be integrated with samples collected from wild, structured populations to understand their ecology and evolution in a natural context. © 2014 John Wiley & Sons Ltd.
Imen Rekik Hakim
Single nucleotide polymorphisms (SNPs) have become the most widely used markers in many current genetic applications. Here we report the discovery of nine new SNPs in olives by direct partial sequencing of two genes (OEX and OEW) in sixteen Tunisian cultivars. The SNP markers were then used to genotype 24 olive cultivars and assess the level of genetic diversity. The power of discrimination of the SNP markers was then compared to that of microsatellites (SSRs). Finally, a combination of SSR and SNP markers was proposed that can be used for cultivar identification at the juvenile stage or for oil traceability.
Groeneveld, Eildert; Lichtenberg, Helmut
The fast development of high throughput genotyping has opened up new possibilities in genetics while at the same time producing considerable data handling issues. TheSNPpit is a database system for managing large amounts of multi panel SNP genotype data from any genotyping platform. With an increasing rate of genotyping in areas like animal and plant breeding as well as human genetics, hundreds of thousands of individuals already need to be managed. While the common database design with one row per SNP can manage hundreds of samples, this approach becomes progressively slower as the size of the data sets increases until it finally fails completely once tens or even hundreds of thousands of individuals need to be managed. TheSNPpit has implemented three ideas to accommodate such large scale experiments: highly compressed vector storage in a relational database, set based data manipulation, and a very fast export written in C, with Perl as the base for the framework and PostgreSQL as the database backend. Its novel subset system allows the creation of named subsets based on the filtering of SNP (based on major allele frequency, no-calls, and chromosomes) and manually applied sample and SNP lists at negligible storage costs, thus avoiding the issue of proliferating file copies. The named subsets are exported for downstream analysis. PLINK ped and map files are processed as inputs and outputs. TheSNPpit allows management of different panel sizes in the same population of individuals when higher density panels replace previous lower density versions, as occurs in animal and plant breeding programs. A completely generalized procedure allows storage of phenotypes. TheSNPpit only occupies 2 bits for storing a single SNP, implying a capacity of 4 million SNPs per 1 MB of disk storage. To investigate performance scaling, a database with more than 18.5 million samples has been created with 3.4 trillion SNPs from 12 panels ranging from 1000 through 20 million SNPs resulting in a
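The storage figure quoted above (2 bits per SNP, hence four genotypes per byte and 4 million SNPs per 1 MB) can be made concrete with a small Python sketch. It illustrates 2-bit packing in general and is not TheSNPpit's actual on-disk format; the numeric coding (0, 1, 2 for the three genotype classes and 3 for a no-call) and the function names are assumptions.

```python
def pack_genotypes(codes):
    """Pack genotype codes (integers 0..3, an assumed coding) four to a byte."""
    packed = bytearray((len(codes) + 3) // 4)
    for i, code in enumerate(codes):
        if not 0 <= code <= 3:
            raise ValueError(f"genotype code out of range: {code}")
        # Each code occupies 2 bits; position within the byte is i % 4.
        packed[i // 4] |= code << (2 * (i % 4))
    return bytes(packed)

def unpack_genotypes(packed, n_snps):
    """Recover the first n_snps genotype codes from a packed byte string."""
    return [(packed[i // 4] >> (2 * (i % 4))) & 0b11 for i in range(n_snps)]
```

With this layout, 4,000 genotype codes occupy exactly 1,000 bytes, which scales to the 4 million SNPs per megabyte stated in the abstract.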
Background: Single nucleotide polymorphisms (SNPs) are considered one of the underlying causes of male infertility. Proper sperm chromatin packaging, which involves replacement of histones with protamines, has a profound effect on male fertility. Over 20 SNPs have been reported for protamines 1 and 2. Materials and Methods: The aim of this study was to evaluate the frequency of two previously reported SNPs using a polymerase chain reaction (PCR)-restriction fragment length polymorphism (RFLP) approach in 35, 96 and 177 normal, oligozoospermic and azoospermic individuals, respectively. These SNPs are: 1. A base pair substitution (G at position 197 instead of T) in the protamine type 1 open reading frame (ORF) including the untranslated region, which causes an Arg residue to change to a Ser residue in a highly conserved region. 2. A cytidine-to-thymidine change at position 248 of the protamine type 2 ORF, which causes a nonsense point mutation. Results: The two SNPs were not present in the studied population, so we conclude that these SNPs cannot serve as molecular markers for male infertility diagnosis. Conclusion: The results of our study reveal that in a selected Iranian population, the SNPs G197T and C248T are completely absent and are not associated with male infertility, and therefore these SNPs may not represent a molecular marker for genetic diagnosis of male infertility.
This document provides the mission analysis for the Initial Single-Shell Tank Retrieval System task, which supports the Single-Shell Tank Waste Retrieval Program in its commitment to remove waste from single-shell tanks for treatment and final closure
Reibel, Florence; Chauffour, Aurélie; Brossier, Florence; Jarlier, Vincent; Cambau, Emmanuelle; Aubry, Alexandra
Between 20 and 30 bacteriologically confirmed cases of leprosy are diagnosed each year at the French National Reference Center for mycobacteria. Patients are mainly immigrants from various endemic countries or living in French overseas territories. We aimed at expanding data regarding the geographical distribution of the SNP genotypes of the M. leprae isolates from these patients. Skin biopsies were obtained from 71 leprosy patients diagnosed between January 2009 and December 2013. Data regarding age, sex and place of birth and residence were also collected. Diagnosis of leprosy was confirmed by microscopic detection of acid-fast bacilli and/or amplification by PCR of the M. leprae-specific RLEP region. Single nucleotide polymorphisms (SNP), present in the M. leprae genome at positions 14 676, 1 642 875 and 2 935 685, were determined with an efficiency of 94% (67/71). Almost all patients were from countries other than France where leprosy is still prevalent (n = 31) or from French overseas territories (n = 36) where leprosy is not totally eradicated, while only a minority (n = 4) was born in metropolitan France but have lived in other countries. SNP type 1 was predominant (n = 33), followed by type 3 (n = 17), type 4 (n = 11) and type 2 (n = 6). SNP types were concordant with those previously reported as prevalent in the patients' countries of birth. SNP types found in patients born in countries other than France (Comoros, Haiti, Benin, Congo, Sri Lanka) and French overseas territories (French Polynesia, Mayotte and La Réunion) not covered by previous work correlated well with geographical location and history of human settlements. The phylogenic analysis of M. leprae strains isolated in France strongly suggests that French leprosy cases are caused by SNP types that are (a) concordant with the geographic origin or residence of the patients (non-French countries, French overseas territories, metropolitan France) or (b) more likely random in regions where diverse
Brown, Allan F; Yousef, Gad G; Reid, Robert W; Chebrolu, Kranthi K; Thomas, Aswathy; Krueger, Christopher; Jeffery, Elizabeth; Jackson, Eric; Juvik, John A
The identification of genetic factors influencing the accumulation of individual glucosinolates in broccoli florets provides novel insight into the regulation of glucosinolate levels in Brassica vegetables and will accelerate the development of vegetables with glucosinolate profiles tailored to promote human health. Quantitative trait loci analysis of glucosinolate (GSL) variability was conducted with a B. oleracea (broccoli) mapping population, saturated with single nucleotide polymorphism markers from a high-density array designed for rapeseed (Brassica napus). In 4 years of analysis, 14 QTLs were associated with the accumulation of aliphatic, indolic, or aromatic GSLs in floret tissue. The accumulation of 3-carbon aliphatic GSLs (2-propenyl and 3-methylsulfinylpropyl) was primarily associated with a single QTL on C05, but common regulation of 4-carbon aliphatic GSLs was not observed. A single locus on C09, associated with up to 40 % of the phenotypic variability of 2-hydroxy-3-butenyl GSL over multiple years, was not associated with the variability of precursor compounds. Similarly, QTLs on C02, C04, and C09 were associated with 4-methylsulfinylbutyl GSL concentration over multiple years but were not significantly associated with downstream compounds. Genome-specific SNP markers were used to identify candidate genes that co-localized to marker intervals and previously sequenced Brassica oleracea BAC clones containing known GSL genes (GSL-ALK, GSL-PRO, and GSL-ELONG) were aligned to the genomic sequence, providing support that at least three of our 14 QTLs likely correspond to previously identified GSL loci. The results demonstrate that previously identified loci do not fully explain GSL variation in broccoli. The identification of additional genetic factors influencing the accumulation of GSL in broccoli florets provides novel insight into the regulation of GSL levels in Brassicaceae and will accelerate development of vegetables with modified or enhanced GSL
Núñez-Acuña, Gustavo; Aguilar-Espinoza, Andrea; Chávez-Mardones, Jacqueline; Gallardo-Escárate, Cristian
Ubiquitin-conjugated E2 enzyme (UBE2) is one of the main components of the proteasome degradation cascade. Previous studies have shown an increase in expression levels in individuals challenged with pathogens such as viruses and bacteria. This study aimed to characterize the immune response of the UBE2 gene in the gastropod Concholepas concholepas through expression analysis and single nucleotide polymorphism (SNP) discovery. Hence, UBE2 was identified from a cDNA library by 454 pyrosequencing, while SNP identification and validation were performed using de novo assembly and high resolution melting analysis. Challenge trials with Vibrio anguillarum were carried out to evaluate the relative transcript abundance of the UBE2 gene from two to thirty-three hours post-treatment. The results showed a partial UBE2 sequence of 889 base pairs (bp) with a partial coding region of 291 bp. SNP variation (A/C) was observed at the 546th position. Individuals challenged with V. anguillarum showed overexpression of the UBE2 gene, with expression significantly higher in homozygous AA individuals than in homozygous CC or heterozygous (A/C) individuals. This study contributes useful information relating to the UBE2 gene and its association with innate immune response in marine invertebrates. Copyright © 2012 Elsevier Ltd. All rights reserved.
Stone, Benjamin; Dennis, Simon; Kwantes, Peter J
The focus of this paper is two-fold. First, similarities generated from six semantic models were compared to human ratings of paragraph similarity on two datasets: 23 World Entertainment News Network paragraphs and 50 ABC newswire paragraphs. Contrary to findings on smaller textual units such as word associations (Griffiths, Tenenbaum, & Steyvers, 2007), our results suggest that when single paragraphs are compared, simple nonreductive models (word overlap and vector space) can provide better similarity estimates than more complex models (LSA, Topic Model, SpNMF, and CSM). Second, various methods of corpus creation were explored to facilitate the semantic models' similarity estimates. Removing numeric and single characters, and truncating document length, improved performance. Automated construction of smaller Wikipedia-based corpora proved to be very effective, even improving upon the performance of corpora that had been chosen for the domain. Model performance was further improved by augmenting corpora with dataset paragraphs. Copyright © 2010 Cognitive Science Society, Inc.
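The two simple nonreductive models named above, word overlap and vector space, can be sketched in a few lines of Python. The exact formulations used in the paper are not given in the abstract, so this sketch makes assumptions: word overlap is computed as the Jaccard index over word sets, the vector space model as cosine similarity over raw term counts, and the tokenizer drops numeric and single-character tokens to mirror the preprocessing the abstract mentions.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase alphabetic tokens of two or more letters (assumed preprocessing)."""
    return re.findall(r"[a-z]{2,}", text.lower())

def word_overlap(a, b):
    """Jaccard index over the word sets of two paragraphs (one possible overlap measure)."""
    set_a, set_b = set(tokenize(a)), set(tokenize(b))
    if not set_a or not set_b:
        return 0.0
    return len(set_a & set_b) / len(set_a | set_b)

def cosine_similarity(a, b):
    """Cosine of the angle between raw term-count vectors of two paragraphs."""
    vec_a, vec_b = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(vec_a[t] * vec_b[t] for t in vec_a.keys() & vec_b.keys())
    norm = (math.sqrt(sum(c * c for c in vec_a.values()))
            * math.sqrt(sum(c * c for c in vec_b.values())))
    return dot / norm if norm else 0.0
```

Both functions need no background corpus, which is what distinguishes them from the reductive models (LSA, Topic Model) listed in the abstract.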
Arkova, Olga V; Ponomarenko, Mikhail P; Rasskazov, Dmitry A; Drachkova, Irina A; Arshinova, Tatjana V; Ponomarenko, Petr M; Savinkova, Ludmila K; Kolchanov, Nikolay A
Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e., significant differences of personal genomes of patients from the reference human genome) can help physicians to improve treatment. Clinical research can validate SNP markers via genotyping of patients and demonstration that SNP alleles are significantly more frequent in patients than in healthy people. The search for biomedical SNP markers of interest can be accelerated by computer-based analysis of hundreds of millions of SNPs in the 1000 Genomes project because of selection of the most meaningful candidate SNP markers and elimination of neutral SNPs. We cross-validated the output of two computer-based methods: DNA sequence analysis using Web service SNP_TATA_Comparator and keyword search for articles on comorbidities of obesity. Near the sites binding to TATA-binding protein (TBP) in human gene promoters, we found 22 obesity-related candidate SNP markers, including rs10895068 (male breast cancer in obesity); rs35036378 (reduced risk of obesity after ovariectomy); rs201739205 (reduced risk of obesity-related cancers due to weight loss by diet/exercise in obese postmenopausal women); rs183433761 (obesity resistance during a high-fat diet); rs367732974 and rs549591993 (both: cardiovascular complications in obese patients with type 2 diabetes mellitus); rs200487063 and rs34104384 (both: obesity-caused hypertension); rs35518301, rs72661131, and rs562962093 (all: obesity); and rs397509430, rs33980857, rs34598529, rs33931746, rs33981098, rs34500389, rs63750953, rs281864525, rs35518301, and rs34166473 (all: chronic inflammation in comorbidities of obesity). Using an electrophoretic mobility shift assay under nonequilibrium conditions, we empirically validated the
Tongtawee, Taweesak; Dechsukhum, Chavaboon; Leeanansaksiri, Wilairat; Kaewpitoon, Soraya; Kaewpitoon, Natthawut; Loyd, Ryan A; Matrakool, Likit; Panpimanmas, Sukij
Helicobacter pylori plays an important role in gastric cancer, which has a relatively low incidence in Thailand. MDM2 is a major negative regulator of p53, the key tumor suppressor involved in tumorigenesis of the majority of human cancers. Whether its expression might explain the relative lack of gastric cancer in Thailand was assessed here. This single-center study was conducted in the northeast region of Thailand. Gastric mucosa from 100 patients with Helicobacter pylori-associated gastritis was analyzed for MDM2 SNP309 using real-time PCR hybridization (LightCycler) probes. Among the 100 Helicobacter pylori-associated gastritis cases, the frequency of the SNP309 T/T homozygous genotype was 78%, with SNP309 G/T heterozygous found in 19% and SNP309 G/G homozygous in 3%. The results show SNP309 T/T and SNP309 G/T to be rather common in the Thai population. Our study indicates that the MDM2 SNP309 G/G homozygous genotype might be a risk factor for gastric cancer in Thailand, and the fact that it is infrequent could explain to some extent the low incidence of gastric cancer in the Thai population.
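The genotype percentages reported above translate directly into allele frequencies, since each patient carries two alleles. The short Python sketch below works through that arithmetic for the 100 cases (78 T/T, 19 G/T, 3 G/G); the function name is hypothetical and the calculation is a standard allele count, not an analysis taken from the study.

```python
def allele_frequencies(n_tt, n_gt, n_gg):
    """Return (T frequency, G frequency) from genotype counts at a biallelic SNP."""
    total_alleles = 2 * (n_tt + n_gt + n_gg)
    if total_alleles == 0:
        raise ValueError("no genotypes supplied")
    # Homozygotes contribute two copies of an allele, heterozygotes one of each.
    freq_t = (2 * n_tt + n_gt) / total_alleles
    freq_g = (2 * n_gg + n_gt) / total_alleles
    return freq_t, freq_g

# Genotype counts from the abstract: 78% T/T, 19% G/T, 3% G/G of 100 patients.
freq_t, freq_g = allele_frequencies(78, 19, 3)
```

Here the T allele accounts for 175 of 200 alleles (0.875) and the G allele for 25 of 200 (0.125), which quantifies how infrequent the G allele is in this sample.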
Chen, Xin; Wu, Qiong; Sun, Ruimin; Zhang, Louxin
The discovery of single-nucleotide polymorphisms (SNPs) has important implications in a variety of genetic studies on human diseases and biological functions. One valuable approach proposed for SNP discovery is based on base-specific cleavage and mass spectrometry. However, it is still very challenging to achieve the full potential of this SNP discovery approach. In this study, we formulate two new combinatorial optimization problems. While both problems are aimed at reconstructing the sample sequence that would attain the minimum number of SNPs, they search over different candidate sequence spaces. The first problem, denoted as SNP - MSP, limits its search to sequences whose in silico predicted mass spectra have all their signals contained in the measured mass spectra. In contrast, the second problem, denoted as SNP - MSQ, limits its search to sequences whose in silico predicted mass spectra instead contain all the signals of the measured mass spectra. We present an exact dynamic programming algorithm for solving the SNP - MSP problem and also show that the SNP - MSQ problem is NP-hard by a reduction from a restricted variation of the 3-partition problem. We believe that an efficient solution to either problem above could offer a seamless integration of information in four complementary base-specific cleavage reactions, thereby improving the capability of the underlying biotechnology for sensitive and accurate SNP discovery.
Bekkevold, Dorte; Limborg, Morten; Helyar, Sarah
Atlantic herring (Clupea harengus) exhibit biocomplexity, with widespread, geographically explicit populations that perform long‐range migration to common feeding and wintering areas, where they are exploited by fisheries. This means that exploited stocks do not describe discrete units, thereby complicating stock assessment and management. It is therefore of management interest to trace individual population migration patterns and contributions to fisheries. To underpin management and to develop a validated tool for traceability of individuals from mixed‐stock samples we applied single nucleotide polymorphism (SNP) markers in Northeast Atlantic herring population samples. Marker panels were targeted to include gene‐associated loci to maximize statistical resolution. Application of 281 SNP markers to samples representing different levels of stock complexity showed that the regional origin of individual...
stimuli is available in the literature, and since this study deals with single responses, an assumption had to be made before the information in the... [Figure 4: caption and axis labels unrecoverable] Figure 5. The effect of unit... differences in their average evoked responses. These responses will also lend themselves to detection based on information from multiple electrodes because
Watson-Haigh Nathan S
Background: Whole genome association studies using highly dense single nucleotide polymorphisms (SNPs) are a set of methods to identify DNA markers associated with variation in a particular complex trait of interest. One of the main outcomes from these studies is a subset of statistically significant SNPs. Finding the potential biological functions of such SNPs can be an important step towards further use in human and agricultural populations (e.g., for identifying genes related to susceptibility to complex diseases or genes playing key roles in development or performance). The current challenge is that the information holding the clues to SNP functions is distributed across many different databases. Efficient bioinformatics tools are therefore needed to seamlessly integrate up-to-date functional information on SNPs. Many web services have arisen to meet the challenge but most work only within the framework of human medical research. Although we acknowledge the importance of human research, we identify a need for SNP annotation tools for other organisms. Description: We introduce an R package called FunctSNP, which is the user interface to custom built species-specific databases. The local relational databases contain SNP data together with functional annotations extracted from online resources. FunctSNP provides a unified bioinformatics resource to link SNPs with functional knowledge (e.g., genes, pathways, ontologies). We also introduce dbAutoMaker, a suite of Perl scripts, which can be scheduled to run periodically to automatically create/update the customised SNP databases. We illustrate the use of FunctSNP with a livestock example, but the approach and software tools presented here can be applied also to human and other organisms. Conclusions: Finding the potential functional significance of SNPs is important when further using the outcomes from whole genome association studies. FunctSNP is unique in that it is the only R
Doran Anthony G
Background: Single nucleotide polymorphisms (SNPs) are the most abundant genetic variant found in vertebrates and invertebrates. SNP discovery has become a highly automated, robust and relatively inexpensive process allowing the identification of many thousands of mutations for model and non-model organisms. Annotating large numbers of SNPs can be a difficult and complex process. Many tools available are optimised for use with organisms densely sampled for SNPs, such as humans. There are currently few tools available that are species non-specific or support non-model organism data. Results: Here we present SNPdat, a high throughput analysis tool that can provide a comprehensive annotation of both novel and known SNPs for any organism with a draft sequence and annotation. Using a dataset of 4,566 SNPs identified in cattle using high-throughput DNA sequencing we demonstrate the annotations performed and the statistics that can be generated by SNPdat. Conclusions: SNPdat provides users with a simple tool for annotation of genomes that are either not supported by other tools or have a small number of annotated SNPs available. SNPdat can also be used to analyse datasets from organisms which are densely sampled for SNPs. As a command line tool it can easily be incorporated into existing SNP discovery pipelines and fills a niche for analyses involving non-model organisms that are not supported by many available SNP annotation tools. SNPdat will be of great interest to scientists involved in SNP discovery and analysis projects, particularly those with limited bioinformatics experience.
Dannhauser, D; Romeo, G; Causa, F; De Santo, I; Netti, P A
A straightforward way to measure separated micrometric sized particles in microfluidic flow is reported. The light scattering profile (LSP) of each single particle is fully characterized by using a CMOS-camera based small angle light scattering (SALS) apparatus, ranging from 2° up to 30°. To ensure controlled particle passage through the incident laser, a viscoelastic 3D alignment effect by viscoelastic induced particle migration has been implemented in a simple and cost-effective microfluidic device. Different polystyrene particle sizes are measured in microfluidic flows and the obtained scattering signatures are matched with the Lorenz-Mie based scattering theory. The results confirm the possibility of using this apparatus for real multiplex particle analyses in microfluidic particle flows.
Kelemen, Arpad; Vasilakos, Athanasios V; Liang, Yulan
Comprehensive evaluation of common genetic variations through association of single-nucleotide polymorphism (SNP) structure with common complex disease in the genome-wide scale is currently a hot area in human genome research due to the recent development of the Human Genome Project and HapMap Project. Computational science, which includes computational intelligence (CI), has recently become the third method of scientific enquiry besides theory and experimentation. There have been fast growing interests in developing and applying CI in disease mapping using SNP and haplotype data. Some of the recent studies have demonstrated the promise and importance of CI for common complex diseases in genomic association study using SNP/haplotype data, especially for tackling challenges, such as gene-gene and gene-environment interactions, and the notorious "curse of dimensionality" problem. This review provides coverage of recent developments of CI approaches for complex diseases in genetic association study with SNP/haplotype data.
In this study the benefit of metabolome level analysis for the prediction of genetic value of three traditional milk traits was investigated. Our proposed approach consists of three steps: First, milk metabolite profiles are used to predict three traditional milk traits of 1,305 Holstein cows. Two regression methods, both enabling variable selection, are applied to identify important milk metabolites in this step. Second, the prediction of these important milk metabolites from single nucleotide polymorphisms (SNPs) enables the detection of SNPs with significant genetic effects. Finally, these SNPs are used to predict milk traits. The observed precision of predicted genetic values was compared to the results observed for the classical genotype-phenotype prediction using all SNPs or a reduced SNP subset (reduced classical approach). To enable a comparison between SNP subsets, a special invariable evaluation design was implemented. SNPs close to or within known quantitative trait loci (QTL) were determined. This enabled us to determine if detected important SNP subsets were enriched in these regions. The results show that our approach can lead to genetic value prediction but requires less than 1% of the total number of SNPs (40,317). Significantly more important SNPs in known QTL regions were detected using our approach compared to the reduced classical approach. In conclusion, our approach allows a deeper insight into the associations between the different levels of the genotype-phenotype map (genotype-metabolome, metabolome-phenotype, genotype-phenotype).
Gonçalves da Silva, Anders; Barendse, William; Kijas, James W; Barris, Wes C; McWilliam, Sean; Bunch, Rowan J; McCullough, Russell; Harrison, Blair; Hoelzel, A Rus; England, Phillip R
Single nucleotide polymorphisms (SNPs) have become the marker of choice for genetic studies in organisms of conservation, commercial or biological interest. Most SNP discovery projects in nonmodel organisms apply a strategy for identifying putative SNPs based on filtering rules that account for random sequencing errors. Here, we analyse data used to develop 4723 novel SNPs for the commercially important deep-sea fish, orange roughy (Hoplostethus atlanticus), to assess the impact of not accounting for systematic sequencing errors when filtering identified polymorphisms during SNP discovery. We used SAMtools to identify polymorphisms in a Velvet assembly of genomic DNA sequence data from seven individuals. The resulting set of polymorphisms was filtered to minimize 'bycatch', that is, polymorphisms caused by sequencing or assembly error. An Illumina Infinium SNP chip was used to genotype a final set of 7714 polymorphisms across 1734 individuals. Five predictors were examined for their effect on the probability of obtaining an assayable SNP: depth of coverage, number of reads that support a variant, polymorphism type (e.g. A/C), strand-bias and Illumina SNP probe design score. Our results indicate that filtering out systematic sequencing errors could substantially improve the efficiency of SNP discovery. We show that BLASTX can be used as an efficient tool to identify single-copy genomic regions in the absence of a reference genome. The results have implications for research aiming to identify assayable SNPs and build SNP genotyping assays for nonmodel organisms. © 2014 John Wiley & Sons Ltd.
Although a large number of single nucleotide polymorphisms (SNPs) have been identified from the bovine genome-sequencing project, few of these have been validated at large in Bos indicus breeds. We have genotyped 192 animals, representing 5 cattle populations of Ethiopia, with the Illumina Bovine 8K SNP BeadChip. These include 1 Sanga (Danakil), 3 zebu (Borana, Arsi and Ambo), and 1 zebu × Sanga intermediate (Horro) breeds. The Hanwoo (Bos taurus) was included for comparison purposes. Analysis of 7,045 SNP markers revealed that the mean minor allele frequency (MAF) was 0.23, 0.22, 0.21, 0.21, 0.23, and 0.29 for Ambo, Arsi, Borana, Danakil, Horro, and Hanwoo, respectively. Significant differences of MAF were observed between the indigenous Ethiopian cattle populations and Hanwoo breed (p < 0.001). Across the Ethiopian cattle populations, a common variant MAF (≥0.10 and ≤0.5) accounted for an overall estimated 73.79% of the 7,045 SNPs. The Hanwoo displayed a higher proportion of common variant SNPs (90%). Investigation within Ethiopian cattle populations showed that on average, 16.64% of the markers were monomorphic, but in the Hanwoo breed, only 6% of the markers were monomorphic. Across the sampled Ethiopian cattle populations, the mean observed and expected heterozygosities were 0.314 and 0.313, respectively. The level of SNP variation identified in this particular study highlights that these markers can be potentially used for genetic studies in African cattle breeds.
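The minor allele frequency and the common-variant band (MAF between 0.10 and 0.5) mentioned in the abstract above can be illustrated with a minimal sketch. It assumes biallelic SNPs and genotypes encoded as two-letter strings; the function names and the sample data are hypothetical and are not taken from the study.

```python
from collections import Counter


def minor_allele_frequency(genotypes):
    """Return the MAF of one biallelic SNP from genotype strings such as 'AG'."""
    # Count every allele across all diploid genotype calls.
    counts = Counter(allele for g in genotypes for allele in g)
    if len(counts) < 2:
        return 0.0  # monomorphic marker (or no data): no minor allele
    return min(counts.values()) / sum(counts.values())


def is_common_variant(maf, low=0.10, high=0.5):
    """True when the MAF falls in the common-variant band quoted in the abstract."""
    return low <= maf <= high


# Hypothetical genotype calls for one SNP in five animals.
sample = ["AA", "AG", "GG", "AG", "AA"]
maf = minor_allele_frequency(sample)
print(maf, is_common_variant(maf))
```

Averaging this value over all markers in a population would give a mean MAF of the kind reported per breed; markers with a MAF of 0 are the monomorphic ones.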
Altinagac, Emre; Taskin, Selen; Kizil, Huseyin
Impedance analysis of single cells is presented in this paper. Following the separation of a target cell type by dielectrophoresis in our previous work, this paper focuses on capturing the cells as a single array and performing impedance analysis to point out the signature difference between each cell type. Lab-on-a-chip devices having a titanium interdigitated electrode layer on a glass substrate and a PDMS microchannel are fabricated to capture each cell in a single form and perform impedance analysis. HCT116 (Homo sapiens colorectal carcinoma) and HEK293 (human embryonic kidney) cells are used in our experiments.
Using relational databases to manage SNP datasets is a very useful technique that has significant advantages over alternative methods, including the ability to leverage the power of relational databases to perform data validation, and the use of the powerful SQL query language to export data. SNPpy is a Python program which uses the PostgreSQL database and the SQLAlchemy Python library to automate SNP data management. This chapter shows how to use SNPpy to store and manage large datasets.
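The two advantages named above, validation by the database and export through SQL, can be sketched as follows. This is not SNPpy and does not use its API: the sketch swaps in Python's standard-library sqlite3 module in place of PostgreSQL and SQLAlchemy so that it runs without a server, and the table names, column names and marker identifiers are hypothetical.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical two-table SNP schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")  # enforce references to the snp table
    conn.executescript(
        """
        CREATE TABLE snp (
            snp_id TEXT PRIMARY KEY,
            chrom  TEXT NOT NULL,
            pos    INTEGER NOT NULL CHECK (pos > 0)
        );
        CREATE TABLE genotype (
            sample_id TEXT NOT NULL,
            snp_id    TEXT NOT NULL REFERENCES snp(snp_id),
            gt        TEXT NOT NULL CHECK (gt IN ('AA', 'AB', 'BB', 'NC')),
            PRIMARY KEY (sample_id, snp_id)
        );
        """
    )
    conn.execute("INSERT INTO snp VALUES ('snp_1', '1', 12345)")
    conn.executemany(
        "INSERT INTO genotype VALUES (?, ?, ?)",
        [("s1", "snp_1", "AA"), ("s2", "snp_1", "AB"), ("s3", "snp_1", "AA")],
    )
    conn.commit()
    return conn


def genotype_counts(conn):
    """Export per-SNP genotype counts with a plain SQL query."""
    query = (
        "SELECT snp_id, gt, COUNT(*) FROM genotype "
        "GROUP BY snp_id, gt ORDER BY snp_id, gt"
    )
    return conn.execute(query).fetchall()


conn = build_demo_db()
print(genotype_counts(conn))
```

With this schema, an insert carrying an unknown genotype code or an unregistered marker is rejected by the database itself (sqlite3.IntegrityError), which is the kind of validation the chapter attributes to the relational approach.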
Background: Allergic rhinitis is a common disease whose genetic basis is incompletely explained. We report an integrated genomic analysis of allergic rhinitis. Methods: We performed genome wide association studies (GWAS) of allergic rhinitis in 5633 ethnically diverse North American subjects. Next, we profiled gene expression in disease-relevant tissue (peripheral blood CD4+ lymphocytes) collected from subjects who had been genotyped. We then integrated the GWAS and gene expression data using expression single nucleotide polymorphism (eSNP), coexpression network, and pathway approaches to identify the biologic relevance of our GWAS. Results: GWAS revealed ethnicity-specific findings, with 4 genome-wide significant loci among Latinos and 1 genome-wide significant locus in the GWAS meta-analysis across ethnic groups. To identify biologic context for these results, we constructed a coexpression network to define modules of genes with similar patterns of CD4+ gene expression (coexpression modules) that could serve as constructs of broader gene expression. Six of the 22 GWAS loci with P-value ≤ 1 × 10⁻⁶ tagged one particular coexpression module (4.0-fold enrichment, P-value 0.0029), and this module also had the greatest enrichment (3.4-fold enrichment, P-value 2.6 × 10⁻²⁴) for allergic rhinitis-associated eSNPs (genetic variants associated with both gene expression and allergic rhinitis). The integrated GWAS, coexpression network, and eSNP results therefore supported this coexpression module as an allergic rhinitis module. Pathway analysis revealed that the module was enriched for mitochondrial pathways (8.6-fold enrichment, P-value 4.5 × 10⁻⁷²). Conclusions: Our results highlight mitochondrial pathways as a target for further investigation of allergic rhinitis mechanism and treatment. Our integrated approach can be applied to provide biologic context for GWAS of other diseases. PMID:25085501
Nathan A Baird
Single nucleotide polymorphism (SNP) discovery and genotyping are essential to genetic mapping. There remains a need for a simple, inexpensive platform that allows high-density SNP discovery and genotyping in large populations. Here we describe the sequencing of restriction-site associated DNA (RAD) tags, which identified more than 13,000 SNPs, and mapped three traits in two model organisms, using less than half the capacity of one Illumina sequencing run. We demonstrated that different marker densities can be attained by choice of restriction enzyme. Furthermore, we developed a barcoding system for sample multiplexing and fine mapped the genetic basis of lateral plate armor loss in threespine stickleback by identifying recombinant breakpoints in F2 individuals. Barcoding also facilitated mapping of a second trait, a reduction of pelvic structure, by in silico re-sorting of individuals. To further demonstrate the ease of the RAD sequencing approach we identified polymorphic markers and mapped an induced mutation in Neurospora crassa. Sequencing of RAD markers is an integrated platform for SNP discovery and genotyping. This approach should be widely applicable to genetic mapping in a variety of organisms.
Background: Expression quantitative trait loci (eQTL) databases represent a valuable resource to link disease-associated SNPs to specific candidate genes whose gene expression is significantly modulated by the SNP under investigation. We previously identified signal inhibitory receptor on leukocytes-1 (SIRL-1) as a powerful regulator of human innate immune cell function. While it is constitutively highly expressed on neutrophils, on monocytes the SIRL-1 surface expression varies strongly between individuals. The underlying mechanism of regulation, its genetic control as well as potential clinical implications had not been explored yet. Methods: Whole blood eQTL data of a Chinese cohort was used to identify SNPs regulating the expression of VSTM1, the gene encoding SIRL-1. The genotype effect was validated by flow cytometry (cell surface expression), correlated with electrophoretic mobility shift assay (EMSA), chromatin immunoprecipitation (ChIP) and bisulfite sequencing (C-methylation), and its functional impact was studied via the inhibition of reactive oxygen species (ROS). Results: We found a significant association of a single CpG-SNP, rs612529T/C, located in the promoter of VSTM1. Through flow cytometry analysis we confirmed that primarily in the monocytes the protein level of SIRL-1 is strongly associated with genotype of this SNP. In monocytes, the T allele of this SNP facilitates binding of the transcription factors YY1 and PU.1, of which the latter has been recently shown to act as docking site for modifiers of DNA methylation. In line with this notion rs612529T associates with a complete demethylation of the VSTM1 promoter correlating with the allele-specific upregulation of SIRL-1 expression. In monocytes, this upregulation strongly impacts the IgA-induced production of ROS by these cells. Through targeted association analysis we found a significant Meta P value of 1.14 × 10⁻⁶ for rs612529 for association to atopic dermatitis (AD
Hwang, Michael T; Landon, Preston B; Lee, Joon; Choi, Duyoung; Mo, Alexander H; Glinsky, Gennadi; Lal, Ratnesh
Single-nucleotide polymorphisms (SNPs) in a gene sequence are markers for a variety of human diseases. Detection of SNPs with high specificity and sensitivity is essential for effective practical implementation of personalized medicine. Current DNA sequencing, including SNP detection, primarily uses enzyme-based methods or fluorophore-labeled assays that are time-consuming, need laboratory-scale settings, and are expensive. Previously reported electrical charge-based SNP detectors have insufficient specificity and accuracy, limiting their effectiveness. Here, we demonstrate the use of a DNA strand displacement-based probe on a graphene field effect transistor (FET) for high-specificity, single-nucleotide mismatch detection. The single mismatch was detected by measuring strand displacement-induced resistance (and hence current) change and Dirac point shift in a graphene FET. SNP detection in large double-helix DNA strands (e.g., 47 nt) minimizes false-positive results. Our electrical sensor-based SNP detection technology, without labeling and without apparent cross-hybridization artifacts, would allow fast, sensitive, and portable SNP detection with single-nucleotide resolution. The technology will have a wide range of applications in digital and implantable biosensors and high-throughput DNA genotyping, with transformative implications for personalized medicine.
Sindhu, Anoop; Ramsay, Larissa; Sanderson, Lacey-Anne; Stonehouse, Robert; Li, Rong; Condie, Janet; Shunmugam, Arun S K; Liu, Yong; Jha, Ambuj B; Diapari, Marwan; Burstin, Judith; Aubert, Gregoire; Tar'an, Bunyamin; Bett, Kirstin E; Warkentin, Thomas D; Sharpe, Andrew G
Gene-based SNPs were identified and mapped in pea using five recombinant inbred line populations segregating for traits of agronomic importance. Pea (Pisum sativum L.) is one of the world's oldest domesticated crops and has been a model system in plant biology and genetics since the work of Gregor Mendel. Pea is the second most widely grown pulse crop in the world following common bean. The importance of pea as a food crop is growing due to its combination of moderate protein concentration, slowly digestible starch, high dietary fiber concentration, and its richness in micronutrients; however, pea has lagged behind other major crops in harnessing recent advances in molecular biology, genomics and bioinformatics, partly due to its large genome size with a large proportion of repetitive sequence, and to the relatively limited investment in research in this crop globally. The objective of this research was the development of a genome-wide transcriptome-based pea single-nucleotide polymorphism (SNP) marker platform using next-generation sequencing technology. A total of 1,536 polymorphic SNP loci selected from over 20,000 non-redundant SNPs identified using deep transcriptome sequencing of eight diverse Pisum accessions were used for genotyping in five RIL populations using an Illumina GoldenGate assay. The first high-density pea SNP map defining all seven linkage groups was generated by integrating with previously published anchor markers. Syntenic relationships of this map with the model legume Medicago truncatula and lentil (Lens culinaris Medik.) maps were established. The genic SNP map establishes a foundation for future molecular breeding efforts by enabling both the identification and tracking of introgression of genomic regions harbouring QTLs related to agronomic and seed quality traits.
Katanforoush, Ali; Sadeghi, Mehdi; Pezeshk, Hamid; Elahi, Elahe
Global partitioning based on pairwise associations of SNPs has not previously been used to define haplotype blocks within genomes. Here, we define an association index based on LD between SNP pairs. We use the Fisher's exact test to assess the statistical significance of the LD estimator. By this test, each SNP pair is characterized as associated, independent, or not-statistically-significant. We set limits on the maximum acceptable proportion of independent pairs within all blocks and search for the partitioning with maximal proportion of associated SNP pairs. Essentially, this model is reduced to a constrained optimization problem, the solution of which is obtained by iterating a dynamic programming algorithm. In comparison with other methods, our algorithm reports blocks of larger average size. Nevertheless, the haplotype diversity within the blocks is captured by a small number of tagSNPs. Resampling HapMap haplotypes under a block-based model of recombination showed that our algorithm is robust in reproducing the same partitioning for recombinant samples. Our algorithm performed better than previously reported models in a case-control association study aimed at mapping a single locus trait, based on simulation results that were evaluated by a block-based statistical test. Compared to methods of haplotype block partitioning, we performed best on detection of recombination hotspots. Our proposed method divides chromosomes into the regions within which allelic associations of SNP pairs are maximized. This approach presents a native design for dimension reduction in genome-wide association studies. Our results show that the pairwise allelic association of SNPs can describe various features of genomic variation, in particular recombination hotspots.
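The abstract above relies on Fisher's exact test to judge whether a pair of SNPs is associated. A minimal sketch of that test on a 2×2 table of haplotype counts follows, written with the standard library only. The authors' association index, their significance thresholds and their dynamic programming partitioning are not given in the abstract and are not reproduced here; the function name and the example counts are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]].

    For a SNP pair, the cells could hold counts of the four two-locus
    haplotypes (for example AB, Ab, aB, ab).
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell
        # when the row and column totals are held fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Sum all tables that are no more probable than the observed one;
    # the small tolerance guards against floating-point ties.
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))


# Hypothetical haplotype counts: complete association versus none.
print(fisher_exact_two_sided(10, 0, 0, 10))
print(fisher_exact_two_sided(5, 5, 5, 5))
```

A small p-value alone would not separate "associated" from "independent" pairs in the sense of the abstract; that classification also depends on the LD estimator, which this sketch leaves out.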
Excoffier, Laurent; Dupanloup, Isabelle; Huerta-Sánchez, Emilia; Sousa, Vitor C.; Foll, Matthieu
We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with , the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets. PMID:24204310
Loya Méndez, Yolanda; Reyes Leal, Gilberto; Sánchez González, Adriana; Portillo Reyes, Verónica; Reyes Ruvalcaba, David; Bojórquez Rangel, Guillermo
Type 2 diabetes mellitus (DM) is a common pathology with multifactorial etiology whose exact genetic bases remain unknown. Some studies suggest that single nucleotide polymorphisms (SNPs) in the CAPN10 gene (locus 2q37.3) could be associated with the development of this disease, including the insertion/deletion polymorphism SNP-19 (2R→3R). The present study determined the association between SNP-19 and the risk of developing type 2 DM in the population of Ciudad Juarez. For this study 107 participants were selected: 43 type 2 diabetics (cases) and 64 non-diabetics with no family history of type 2 DM in first-degree relatives (controls). Anthropometric measurements were taken, along with biochemical profiles of serum lipids, lipoproteins and glucose. SNP-19 was genotyped using DNA from peripheral blood lymphocytes, polymerase chain reaction (PCR), and electrophoretic analysis in agarose gels. Once the genotypic and allelic frequencies were obtained, the Hardy-Weinberg equilibrium test (GenAlEx 6.4) was also performed. The χ² analysis identified genotypic differences between cases and controls, with a higher frequency of the homozygous 3R genotype of SNP-19 in the case group (0.418) than in the control group (0.265). An association was also observed between the 2R/3R genotype and elevated weight, body mass index, and waist and hip circumferences, but only in the diabetic group (P ≤ 0.05). The findings in this study suggest that SNP-19 in CAPN10 may participate in the development of type 2 DM in the studied population. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
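The Hardy-Weinberg equilibrium check mentioned above can be sketched as a chi-square goodness-of-fit statistic for one biallelic marker. The study itself used GenAlEx 6.4; the function below is only an illustration of the underlying arithmetic, and the genotype counts are invented rather than taken from the paper.

```python
def hardy_weinberg_chi2(n_aa, n_ab, n_bb):
    """Chi-square statistic comparing observed genotype counts with
    Hardy-Weinberg expectations for a biallelic marker (1 degree of freedom)."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1 - p                        # frequency of allele B
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


# Invented counts for the 2R/2R, 2R/3R and 3R/3R genotypes.
chi2 = hardy_weinberg_chi2(25, 50, 25)
# With 1 degree of freedom, a statistic above 3.841 rejects equilibrium at the 5% level.
print(chi2, chi2 > 3.841)
```

The statistic is usually computed in the control group, since a departure from equilibrium there can point to genotyping error or population structure.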
Hasegawa, Takako; Futagami, Michiko; Kim, Hey-Sook; Matsuda, Akira; Wataya, Yusuke
We investigated single nucleotide polymorphisms (SNPs) in the uck2 gene, which encodes an enzyme that metabolizes 3'-ethynylcytidine (ECyd), a newly synthesized antitumor ribonucleoside analog, and their association with the drug response to ECyd. We analyzed the exon-intron junctions and exon regions, where changes can directly alter the gene product qualitatively, in ECyd-sensitive and ECyd-resistant human cancer cell lines. As a result, cSNP and sSNP were detected in exon 4. In the promoter region, 3 SNPs were detected. Our data may provide important knowledge for when ECyd is applied clinically.
Nijveen, Harm; van Kaauwen, Martijn; Esselink, Danny G; Hoegen, Brechtje; Vosman, Ben
QualitySNPng is a new software tool for the detection and interactive visualization of single-nucleotide polymorphisms (SNPs). It uses a haplotype-based strategy to identify reliable SNPs; it is optimized for the analysis of current RNA-seq data; but it can also be used on genomic DNA sequences derived from next-generation sequencing experiments. QualitySNPng does not require a sequenced reference genome and delivers reliable SNPs for di- as well as polyploid species. The tool features a user-friendly interface, multiple filtering options to handle typical sequencing errors, support for SAM and ACE files and interactive visualization. QualitySNPng produces high-quality SNP information that can be used directly in genotyping by sequencing approaches for application in QTL and genome-wide association mapping as well as to populate SNP arrays. The software can be used as a stand-alone application with a graphical user interface or as part of a pipeline system like Galaxy. Versions for Windows, Mac OS X and Linux, as well as the source code, are available from http://www.bioinformatics.nl/QualitySNPng.
The purpose of the Single-Shell Tank (SST) Interim Stabilization Risk Analysis is to provide a cost and schedule risk analysis of HNF-2358, Rev. 1, Single-Shell Tank Interim Stabilization Project Plan (Project Plan) (Ross et al. 1998). The analysis compares the required cost profile by fiscal year (Section 4.2) and revised schedule completion date (Section 4.5) to the Project Plan. The analysis also evaluates the executability of the Project Plan and recommends a path forward for risk mitigation
Torskarska, M; Marshall, T; Kowalczyk, R
The European bison (Bison bonasus) has recovered successfully after a severe bottleneck about 90 years ago. Pedigree analysis indicates that over 80% of the genes in the contemporary population descend from just 2 founder individuals and the pedigree-based inbreeding coefficient averages almost 0...
Chagné, D.; Crowhurst, R.N.; Troggio, M.; Davey, M.W.; Gilmore, B.; Lawley, C.; Vanderzande, S.; Hellens, R.P.; Kumar, S.; Cestaro, A.; Velasco, R.; Main, D.; Rees, J.D.; Iezzoni, A.F.; Mockler, T.; Wilhelm, L.; Weg, van de W.E.; Gardiner, S.E.; Bassil, N.; Peace, C.
As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide
Didion, John P; Buus, Ryan J; Naghashfar, Zohreh; Threadgill, David W; Morse, Herbert C; de Villena, Fernando Pardo-Manuel
Misidentified and contaminated cell lines have plagued the biological research community for decades. Some repositories and journals have heeded calls for mandatory authentication of human cell lines, yet misidentification of mouse cell lines has received little publicity despite their importance in sponsored research. Short tandem repeat (STR) profiling is the standard authentication method, but it may fail to distinguish cell lines derived from the same inbred strain of mice. Additionally, STR profiling does not reveal karyotypic changes that occur in some high-passage lines and may have functional consequences. Single nucleotide polymorphism (SNP) profiling has been suggested as a more accurate and versatile alternative to STR profiling; however, a high-throughput method for SNP-based authentication of mouse cell lines has not been described. We have developed computational methods (Cell Line Authentication by SNP Profiling, CLASP) for cell line authentication and copy number analysis based on a cost-efficient SNP array, and we provide a reference database of commonly used mouse strains and cell lines. We show that CLASP readily discriminates among cell lines of diverse taxonomic origins, including multiple cell lines derived from a single inbred strain, intercross or wild caught mouse. CLASP is also capable of detecting contaminants present at concentrations as low as 5%. Of the 99 cell lines we tested, 15 exhibited substantial divergence from the reported genetic background. In all cases, we were able to distinguish whether the authentication failure was due to misidentification (one cell line, Ba/F3), the presence of multiple strain backgrounds (five cell lines), contamination by other cells and/or the presence of aneuploid chromosomes (nine cell lines). Misidentification and contamination of mouse cell lines is potentially as widespread as it is in human cell culture. This may have substantial implications for studies that are dependent on the
Lee, Phil Hyoun; Jung, Jae-Yoon; Shatkay, Hagit
Selecting a representative set of single nucleotide polymorphism (SNP) markers for facilitating association studies is an important step to uncover the genetic basis of human disease. Tag SNP selection and functional SNP selection are the two main approaches for addressing the SNP selection problem. However, little has been done so far to effectively combine these distinct and possibly competing approaches. Here, we present a new multiobjective optimization framework for identifying SNPs that both are informative for tagging and have functional significance (FS). Our selection algorithm is based on the notion of Pareto optimality, which has been extensively used for addressing multiobjective optimization problems in game theory, economics, and engineering. We applied our method to 34 disease-susceptibility genes for lung cancer and compared the performance with that of other systems which support both tag SNP selection and functional SNP selection methods. The comparison shows that our algorithm always finds a subset of SNPs that improves upon the subset selected by other state-of-the-art systems with respect to both selection objectives.
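The notion of Pareto optimality mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' selection algorithm: the candidate names and the two scores (`tagging`, `functional`) are hypothetical placeholders, and the code only shows how non-dominated candidates are picked when two objectives are both to be maximized.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    name: str
    tagging: float     # hypothetical tagging-informativeness score (higher is better)
    functional: float  # hypothetical functional-significance score (higher is better)


def dominates(a: Candidate, b: Candidate) -> bool:
    """a dominates b if it is at least as good on both objectives and strictly better on one."""
    at_least_as_good = a.tagging >= b.tagging and a.functional >= b.functional
    strictly_better = a.tagging > b.tagging or a.functional > b.functional
    return at_least_as_good and strictly_better


def pareto_front(cands: List[Candidate]) -> List[Candidate]:
    """Keep every candidate that no other candidate dominates."""
    return [c for c in cands if not any(dominates(o, c) for o in cands)]


# Made-up scores, for illustration only.
candidates = [
    Candidate("set_A", 0.90, 0.20),
    Candidate("set_B", 0.70, 0.60),
    Candidate("set_C", 0.60, 0.50),  # worse than set_B on both objectives
    Candidate("set_D", 0.30, 0.95),
]
front = pareto_front(candidates)
print([c.name for c in front])
```

In this toy input, `set_C` is dropped because `set_B` is better on both scores, while the remaining three represent different trade-offs between the two objectives.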
Kurokawa, Yusuke; Noda, Tomonori; Yamagata, Yoshiyuki; Angeles-Shim, Rosalyn; Sunohara, Hidehiko; Uehara, Kanako; Furuta, Tomoyuki; Nagai, Keisuke; Jena, Kshirod Kumar; Yasui, Hideshi; Yoshimura, Atsushi; Ashikari, Motoyuki; Doi, Kazuyuki
DNA marker-assisted selection (MAS) has become an indispensable component of breeding. Single nucleotide polymorphisms (SNPs) are the most frequent polymorphism in the rice genome. However, SNP markers are not readily employed in MAS because of limitations in genotyping platforms. Here the authors report a Golden Gate SNP array that targets specific genes controlling yield-related traits and biotic stress resistance in rice. As a first step, the SNP genotypes were surveyed in 31 parental varieties using the Affymetrix Rice 44K SNP microarray. The haplotype information for 16 target genes was then converted to the Golden Gate platform with 143-plex markers. Haplotypes for the 14 useful alleles are unique and can discriminate among all other varieties. The genotyping consistency between the Affymetrix microarray and the Golden Gate array was 92.8%, and the accuracy of the Golden Gate array was confirmed in 3 F2 segregating populations. The concept of haplotype-based selection using the constructed SNP array was proven. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd. All rights reserved.
Ai-Min Gong,1,2,* Xin-Yuan Li,2,* Yi-Qiang Xie,1 Zhan-Dong Jia,3 Yuan-Xin Li,4 Yong-Yan Zou,5 Chang-Qing Xu,2,* Zhen-Yu Wang2,* 1Department of Internal Medicine of Traditional Chinese Medicine, Hainan Medical University, Hainan, 2Department of Pathophysiology, Harbin Medical University, Harbin, 3Department of Nephrology, Ningbo Tradition Chinese Medicine Hospital affiliated to Zhejiang Chinese Medical University, Ningbo, 4The Fifth Department of Acupuncture, 5Department of Nephrology, Jining Tradition Chinese Medicine Hospital, Jining, People’s Republic of China *These authors contributed equally to this work Purpose: The association between CD14 -159C/T polymorphism and the susceptibility to gastric cancer (GC) has been reported. However, the results were inconclusive. In the present study, a case–control study and a meta-analysis were performed to assess the possible association between -159C/T in the CD14 gene and GC risk. Patients and methods: Relevant studies were searched in several databases including PubMed, Web of Science, EMBASE, Chinese National Knowledge Infrastructure database, and Wanfang database (last search was performed on December 30, 2015). In addition, a case–control study involving 164 GC cases and 169 controls was also performed in the analysis. Statistical analysis was performed by the software Revman5.3. Results: A total of ten published studies and the present case–control study involving 2,844 GC and 3,983 controls were included for the meta-analysis. The analysis result indicated that the T allele of CD14 -159C/T polymorphism did not confer risk for GC (in our study: [P=0.93]; in the meta-analysis: T vs 2N odds ratio =1.28 and 95% confidence interval (CI) =0.95–1.24, [P=0.24]). However, we found a significant association in the recessive model (in our study: TT vs TC+CC [P=0.04]; in the meta-analysis: TT vs TC+CC odds ratio =1.12 and 95% CI =1.01–1.26, [P=0.04]). Furthermore, a subgroup analysis by ethnicity
Single nucleotide polymorphisms (SNPs) have been increasingly utilized to investigate somatic genetic abnormalities in premalignancy and cancer. LOH is a common alteration observed during cancer development, and SNP assays have been used to identify LOH at specific chromosomal regions. The design of such studies requires consideration of the resolution for detecting LOH throughout the genome and identification of the number and location of SNPs required to detect genetic alterations in specific genomic regions. Our study evaluated SNP distribution patterns and used probability models, Monte Carlo simulation, and real human subject genotype data to investigate the relationships between the number of SNPs, SNP HET rates, and the sensitivity (resolution) for detecting LOH. We report that variances of SNP heterozygosity rate in dbSNP are high for a large proportion of SNPs. Two statistical methods proposed for directly inferring SNP heterozygosity rates require much smaller sample sizes (intermediate sizes) and are feasible for practical use in SNP selection or verification. Using HapMap data, we showed that a region of LOH greater than 200 kb can be reliably detected, with losses smaller than 50 kb having a substantially lower detection probability when using all SNPs currently in the HapMap database. Higher densities of SNPs may exist in certain local chromosomal regions that provide some opportunities for reliably detecting LOH of segment sizes smaller than 50 kb. These results suggest that the interpretation of the results from genome-wide scans for LOH using commercial arrays needs to consider the relationships among inter-SNP distance, detection probability, and sample size for a specific study. New experimental designs for LOH studies would also benefit from considering the power of detection and sample sizes required to accomplish the proposed aims.
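The relationship between the number of SNPs in a region, their heterozygosity rate, and the chance of detecting LOH can be sketched with a simple probability model and a matching Monte Carlo check. This is an illustration under strong assumptions, not the study's model: SNPs are treated as independent with one shared heterozygosity rate, and LOH is counted as detectable when at least `min_informative` SNPs (a hypothetical threshold) are heterozygous in the normal sample.

```python
import random
from math import comb


def p_detect_exact(n_snps: int, het_rate: float, min_informative: int = 1) -> float:
    """Binomial probability that at least `min_informative` of n_snps are heterozygous."""
    p_fewer = sum(
        comb(n_snps, k) * het_rate**k * (1.0 - het_rate) ** (n_snps - k)
        for k in range(min_informative)
    )
    return 1.0 - p_fewer


def p_detect_mc(n_snps: int, het_rate: float, min_informative: int = 1,
                trials: int = 20000, seed: int = 0) -> float:
    """Monte Carlo estimate of the same probability (independence assumed)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        informative = sum(rng.random() < het_rate for _ in range(n_snps))
        if informative >= min_informative:
            hits += 1
    return hits / trials


# Illustrative values only: 10 SNPs in the region, 30% heterozygosity rate.
exact = p_detect_exact(10, 0.3, min_informative=2)
simulated = p_detect_mc(10, 0.3, min_informative=2)
print(f"exact={exact:.4f} simulated={simulated:.4f}")
```

Under this simplified model, fewer SNPs or lower heterozygosity rates reduce the detection probability, which is consistent with the abstract's point that short LOH segments are harder to detect.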
Prokhorenko, Igor A.; Astakhova, Irina V.; Momynaliev, Kuvat T.
Excimer formation is a unique feature of some fluorescent dyes (e.g., pyrene) which can be used for probing the proximity of biomolecules. Pyrene excimer fluorescence has previously been used for homogeneous detection of single nucleotide polymorphism (SNP) on DNA. 1-Phenylethynylpyrene (1-1-PEPy......), a photostable pyrene derivative with redshifted fluorescence, is able to form excimers (emission maximum about 500-510 nm) and is well suited for nucleic acid labeling. We have shown the utility of 1-1-PEPy in the excimer-forming DNA probes for detection of 2144A/G and 2143A/G transitions, and 2143A...
J. Genet. 89, 247–251] hypothesized to act as a superoxide dismutase (Wong et al. 2000a). About 85% cases of human prion diseases have been found to ... 2003). The putative functionality of the SNP is yet to be unearthed. Although the copper-binding ability and the level of superoxide dismutase activity do not seem.
Yalcin, Dicle; Hakguder, Zeynep M; Otu, Hasan H
Individual cells within the same population show various degrees of heterogeneity, which may be better handled with single-cell analysis to address biological and clinical questions. Single-cell analysis is especially important in developmental biology as subtle spatial and temporal differences in cells have significant associations with cell fate decisions during differentiation and with the description of a particular state of a cell exhibiting an aberrant phenotype. Biotechnological advances, especially in the area of microfluidics, have led to a robust, massively parallel and multi-dimensional capturing, sorting, and lysis of single-cells and amplification of related macromolecules, which have enabled the use of imaging and omics techniques on single cells. There have been improvements in computational single-cell image analysis in developmental biology regarding feature extraction, segmentation, image enhancement and machine learning, handling limitations of optical resolution to gain new perspectives from the raw microscopy images. Omics approaches, such as transcriptomics, genomics and epigenomics, targeting gene and small RNA expression, single nucleotide and structural variations and methylation and histone modifications, rely heavily on high-throughput sequencing technologies. Although there are well-established bioinformatics methods for analysis of sequence data, there are limited bioinformatics approaches which address experimental design, sample size considerations, amplification bias, normalization, differential expression, coverage, clustering and classification issues, specifically applied at the single-cell level. In this review, we summarize biological and technological advancements, discuss challenges faced in the aforementioned data acquisition and analysis issues and present future prospects for application of single-cell analyses to developmental biology. © The Author 2015. Published by Oxford University Press on behalf of the European
Shavrukov, Yuri; Zhumalin, Aibek; Serikbay, Dauren; Botayeva, Makpal; Otemisova, Ainur; Absattarova, Aiman; Sereda, Grigoriy; Sereda, Sergey; Shvidchenko, Vladimir; Turbekova, Arysgul; Jatayev, Satyvaldy; Lopato, Sergiy; Soole, Kathleen; Langridge, Peter
A panel of 89 local commercial cultivars of bread wheat was tested in field trials in the dry conditions of Northern Kazakhstan. Two distinct groups of cultivars (six cultivars in each group), which had the highest and the lowest grain yield under drought were selected for further experiments. A dehydration test conducted on detached leaves indicated a strong association between rates of water loss in plants from the first group with highest grain yield production in the dry environment relative to the second group. Modern high-throughput Amplifluor Single Nucleotide Polymorphism (SNP) technology was applied to study allelic variations in a series of drought-responsive genes using 19 SNP markers. Genotyping of an SNP in the TaDREB5 (DREB2-type) gene using the Amplifluor SNP marker KATU48 revealed clear allele distribution across the entire panel of wheat accessions, and distinguished between the two groups of cultivars with high and low yield under drought. Significant differences in expression levels of TaDREB5 were revealed by qRT-PCR. Most wheat plants from the first group of cultivars with high grain yield showed slight up-regulation in the TaDREB5 transcript in dehydrated leaves. In contrast, expression of TaDREB5 in plants from the second group of cultivars with low grain yield was significantly down-regulated. It was found that SNPs did not alter the amino acid sequence of TaDREB5 protein. Thus, a possible explanation is that alternative splicing and up-stream regulation of TaDREB5 may be affected by SNP, but these hypotheses require additional analysis (and will be the focus of future studies). PMID:27917186
This Mission Analysis Report was prepared to provide the foundation for the Single-Shell Tank (SST) Retrieval Program, a new program responsible for waste removal for the SSTs. The SST Retrieval Program is integrated with other Tank Waste Remediation System activities that provide the management, technical, and operations elements associated with planning and execution of SST and SST Farm retrieval and closure. This Mission Analysis Report provides the basis and strategy for developing a program plan for SST retrieval. This Mission Analysis Report responds to a US Department of Energy request for an alternative single-shell tank retrieval approach (Taylor 1997).
Zhang Yongming; Wu Ruisheng; Li Bin
The single-chip computer analysis system for liquid fluorescence is an intelligent analytic instrument, which is based on the principle that a liquid containing hydrocarbons emits several characteristic fluorescences when irradiated by strong light. Besides a single-chip computer, the system makes use of the keyboard and the calculation and printing functions of a CASIO printing calculator. It combines optics, mechanics and electronics in one unit, and is small, light and practical, so it can be used for surface water sample analysis in oil fields and impurity analysis of other materials.
Abad, E.; Juarros, A.; Retolaza, A.
Stretching single DNA molecules by confinement in nanofluidic channels has attracted great interest during the last few years as a DNA analysis tool. We have designed and fabricated a sealed micro/nanofluidic device for DNA stretching applications, based on the use of the high throughput Nano......Imprint Lithography (NIL) technology combined with a conventional anodic bonding of the silicon base and Pyrex cover. Using this chip, we have performed single molecule imaging on a bench-top fluorescent microscope system. Lambda phage DNA was used as a model sample to characterize the chip. Single molecules of λ...... a method to determining DNA size. The results of this work prove that the developed fabrication process is a good alternative for the fabrication of single molecule DNA biochips and it allows developing a variety of innovative bio/chemical sensors based on single-molecule DNA sequencing devices....
Mycobacterium ulcerans (M. ulcerans), the causative agent of the devastating skin disease Buruli ulcer (BU), is characterized by an extremely low level of genetic diversity. Recently, we have reported the first discrimination of closely related M. ulcerans variants in the BU endemic Densu River Valley of Ghana. In the study, real-time PCR-based single nucleotide polymorphism (SNP) typing at 89 predefined loci revealed the presence of ten M. ulcerans haplotypes circulating in the BU endemic region. Here we describe the development of temperature-switch PCR (TSP) assays that allow distinguishing these haplotypes by conventional agarose gel-based analysis of the PCR products. After validation of the accuracy of typing results, the TSP assays were successfully established in a reference laboratory in Ghana. Development of the cost-effective and rapid TSP-based genetic fingerprinting method will thus allow investigating the spread of M. ulcerans clones by regular genetic monitoring in BU endemic countries.
During cancer progression, specific genomic aberrations arise that can determine the scope of the disease and can be used as predictive or prognostic markers. The detection of specific gene amplifications or deletions in single blood-borne or disseminated tumour cells that may give rise to the development of metastases is of great clinical interest but technically challenging. In this study, we present a method for quantitative high-resolution genomic analysis of single cells. Cells were isolated under permanent microscopic control followed by high-fidelity whole genome amplification and subsequent analyses by fine tiling array-CGH and qPCR. The assay was applied to single breast cancer cells to analyze the chromosomal region centred on the therapeutically relevant EGFR gene. This method allows precise quantitative analysis of copy number variations in single cell diagnostics.
Aung H. Naing
Longevity of cut flowers of many gerbera cultivars (Gerbera jamesonii) is typically short because of stem bending; hence, stem bending that occurs during the early vase life period is a major problem in gerbera. Here, we investigated the effects of sodium nitroprusside (SNP) on the delay of stem bending in the gerbera cultivars, Alliance, Rosalin, and Bintang, by examining relative fresh weight, bacterial density in the vase solution, transcriptional analysis of a lignin biosynthesis gene, antioxidant activity, and xylem blockage. All three gerbera cultivars responded to SNP by delaying stem bending, compared to the controls; however, the responses were dose- and cultivar-dependent. Among the treatments, SNP at 20 mg L-1 was the best to delay stem bending in Alliance, while dosages of 10 and 5 mg L-1 were the best for Rosalin and Bintang, respectively. However, stem bending in Alliance and Rosalin was faster than in Bintang, indicating a discrepancy influenced by genotype. According to our analysis of the role of SNP in the delay of stem bending, the results revealed that SNP treatment inhibited bacterial growth and xylem blockage, enhanced expression levels of a lignin biosynthesis gene, and maintained antioxidant activities. Therefore, it is suggested that the cause of stem bending is associated with the above-mentioned parameters and SNP is involved in the mechanism that delays stem bending in the different gerbera cultivars.
Single-cell analysis has become an established method to study cell heterogeneity and for rare cell characterization. Despite the high cost and technical constraints, applications are increasing every year in all fields of biology. Following the trend, there is a tremendous development of tools for single-cell analysis, especially in the RNA sequencing field. Every improvement increases sensitivity and throughput. Collecting a large amount of data also stimulates the development of new approaches for bioinformatic analysis and interpretation. However, the essential requirement for any analysis is the collection of single cells of high quality. The single-cell isolation must be fast, effective, and gentle to maintain the native expression profiles. Classical methods for single-cell isolation are micromanipulation, microdissection, and fluorescence-activated cell sorting (FACS). In the last decade several new and highly efficient approaches have been developed, which not just supplement but may fully replace the traditional ones. These new techniques are based on microfluidic chips, droplets, micro-well plates, and automatic collection of cells using capillaries, magnets, an electric field, or a punching probe. In this review we summarize the current methods and developments in this field. We discuss the advantages of the different commercially available platforms and their applicability, and also provide remarks on future developments.
Yu, Jianxin A.; Castranova, Daniel; Pham, Van N.; Weinstein, Brant M.
Vessel formation has been extensively studied at the tissue level, but the difficulty in imaging the endothelium with cellular resolution has hampered study of the morphogenesis and behavior of endothelial cells (ECs) in vivo. We are using endothelial-specific transgenes and high-resolution imaging to examine single ECs in zebrafish. By generating mosaics with transgenes that simultaneously mark endothelial nuclei and membranes we are able to definitively identify and study the morphology and behavior of individual ECs during vessel sprouting and lumen formation. Using these methods, we show that developing trunk vessels are composed of ECs of varying morphology, and that single-cell analysis can be used to quantitate alterations in morphology and dynamics in ECs that are defective in proper guidance and patterning. Finally, we use single-cell analysis of intersegmental vessels undergoing lumen formation to demonstrate the coexistence of seamless transcellular lumens and single or multicellular enclosed lumens with autocellular or intercellular junctions, suggesting that heterogeneous mechanisms contribute to vascular lumen formation in vivo. The tools that we have developed for single EC analysis should facilitate further rigorous qualitative and quantitative analysis of EC morphology and behavior in vivo. PMID:26253401
Nab Raj Roshyara
Modern analysis of high-dimensional SNP data requires a number of biometrical and statistical methods such as pre-processing, analysis of population structure, association analysis and genotype imputation. Software used for these purposes often relies on specific and incompatible input and output data formats. Therefore extensive data management including multiple format conversions is necessary during analyses. In order to support fast and efficient management and bio-statistical quality control of high-dimensional SNP data, we developed the publicly available software fcGENE using the C++ object-oriented programming language. This software simplifies and automates the use of different existing analysis packages, especially during the workflow of genotype imputations and corresponding analyses. fcGENE transforms SNP data and imputation results into different formats required for a large variety of analysis packages such as PLINK, SNPTEST, HAPLOVIEW, EIGENSOFT, GenABEL and tools used for genotype imputation such as MaCH, IMPUTE, BEAGLE and others. Data management tasks like merging, splitting, extracting SNP and pedigree information can be performed. fcGENE also supports a number of bio-statistical quality control processes and quality based filtering processes at SNP- and sample-wise level. The tool also generates templates of commands required to run specific software packages, especially those required for genotype imputation. We demonstrate the functionality of fcGENE by example workflows of SNP data analyses and provide a comprehensive manual of commands, options and applications. We have developed a user-friendly open-source software fcGENE, which comprehensively supports SNP data management, quality control and analysis workflows. Download statistics and corresponding feedback indicate that the software is highly recognised and extensively applied by the scientific community.
Majolo, Fernanda; Oliveira Paludo, Francis Jackson de; Ponzoni, Aline; Graebin, Pietra; Dias, Fernando Suparregui; Alho, Clarice Sampaio
During critical illness and sepsis there is severe antioxidant depletion, and this scenario raises the critically ill patient's mortality risk. Glutathione peroxidase (GPx) is one of the first endogenous antioxidant defense enzymes, and it works cooperatively with superoxide dismutase (SOD) and catalase (CAT) to detoxify free radicals from the cellular environment. Genetic studies are important to understand the complexity of human oxidative stress and how the organism responds to an extreme situation such as critical care conditions. Previous studies with a GPx1 single nucleotide polymorphism (593C>T SNP; rs1050450; protein variant in GPx1: Pro198Leu) showed that 593T carriers and 593TT homozygotes present a higher risk of developing different diseases. We assessed the relationship of the genotype distribution of GPx1 SNP in critically ill patients with their conditions (organ dysfunction, sepsis, and septic shock) and their outcome. We monitored 626 critically ill patients daily from the ICU (intensive care unit) admission to their discharge from hospital, or death. Our study revealed a significant association between 593TT GPx1 genotype and mortality; the mortality rate was higher in homozygous 593TT GPx1 (N=94) when compared with the group of subjects with genotypes 593CT or 593CC GPx1 (N=532) (52% vs. 38%, P=0.009; OR=1.79; 95% CI=1.13-2.85). Evaluating the subgroup of 293 ICU patients with sepsis, a pooled analysis including two genetic variants GPx1 and SOD2 (47C>T SNP, rs4880; protein variant in MnSOD: Ala-9Val) showed a significant difference in relation to progression to septic shock. The frequency of septic shock among septic patients with 593T GPx1 and 47C SOD2 alleles (N=122) was higher when compared with septic patients carrying other settings of genotypes (N=174) (78% vs. 66%; P=0.028; OR=1.81; 95% CI=1.03-3.18). Accepting the previously reported functional effects of these two SNPs on GPx1 and SOD2 gene expressions and, consequently, on GPx1 and MnSOD enzyme
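The mortality comparison reported in this abstract (52% of 94 vs. 38% of 532) can be checked approximately with a standard odds ratio and Woolf (log) confidence interval. The sketch below is not the authors' analysis: the cell counts are reconstructed from the rounded percentages, which is an assumption, so the result only approximates the published OR and interval.

```python
from math import exp, log, sqrt
from typing import Tuple


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96) -> Tuple[float, float, float]:
    """Odds ratio and Woolf confidence interval for a 2x2 table.

    a, b: events and non-events in the exposed group
    c, d: events and non-events in the comparison group
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = exp(log(odds_ratio) - z * se_log_or)
    upper = exp(log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Counts reconstructed from rounded percentages (assumption, not source data).
n_tt, n_other = 94, 532
deaths_tt = round(0.52 * n_tt)        # about 49 deaths among 593TT homozygotes
deaths_other = round(0.38 * n_other)  # about 202 deaths among 593CT/593CC

or_est, ci_low, ci_high = odds_ratio_ci(
    deaths_tt, n_tt - deaths_tt, deaths_other, n_other - deaths_other
)
print(f"OR={or_est:.2f}, 95% CI={ci_low:.2f}-{ci_high:.2f}")
```

Small differences from the published values are expected, since the exact counts and the interval method used in the study are not given in the abstract.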
Safa, Ahmad Hosseini; Harandi, Majid Fasihi; Tajaddini, Mohammadhasan; Rostami-Nejad, Mohammad; Mohtashami-Pour, Mehdi; Pestehchian, Nader
High-resolution melting (HRM) is a reliable and sensitive scanning method to detect variation in DNA sequences. We used this method to better understand the epidemiology and transmission of Echinococcus granulosus. We tested the use of HRM to discriminate the genotypes of E. granulosus and E. canadensis. One hundred forty-one hydatid cysts were collected from slaughtered animals in different parts of Isfahan, Iran in 2013. After DNA extraction, the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene was amplified using PCR coupled with the HRM curve. The result of HRM analysis using partial sequences of the cox1 gene revealed that 93, 35, and 2 isolates were identified as G1, G3, and G6 genotypes, respectively. A single nucleotide polymorphism (SNP) was found in locus 9867 of the cox1 gene. This is a critical locus for the differentiation between the G6 and G7 genotypes. In the phylogenetic tree, the sample with a SNP was located between the G6 and G7 genotypes, which suggests that this isolate has a G6/G7 genotype. The HRM analysis developed in the present study provides a powerful technique for molecular and epidemiological studies on echinococcosis in humans and animals.
Restriction-site associated DNA sequencing (RAD-seq) and related methods are revolutionizing the field of population genomics in non-model organisms as they allow generating an unprecedented number of single nucleotide polymorphisms (SNPs) even when no genomic information is available. Yet, RAD-seq data analyses rely on assumptions about the nature and number of nucleotide variants present in a single locus, the choice of which may lead to an under- or overestimated number of SNPs and/or to incorrectly called genotypes. Using the Atlantic mackerel (Scomber scombrus L.) and a close relative, the Atlantic chub mackerel (Scomber colias), as a case study, here we explore the sensitivity of population structure inferences to two crucial aspects in RAD-seq data analysis: the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides insights into the effects of alternative RAD-seq data analysis strategies on population structure inferences that are directly applicable to other species.
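The effect of the maximum number of mismatches allowed when merging reads into a locus can be shown with a toy example. The sketch below is a deliberately naive greedy grouping by Hamming distance to the first read of each locus; it is not the algorithm of any particular RAD-seq pipeline, and the reads are made up.

```python
from typing import List


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length reads."""
    if len(a) != len(b):
        raise ValueError("reads must have equal length")
    return sum(x != y for x, y in zip(a, b))


def merge_into_loci(reads: List[str], max_mismatches: int) -> List[List[str]]:
    """Greedily assign each read to the first locus whose seed read is close enough."""
    loci: List[List[str]] = []
    for read in reads:
        for locus in loci:
            if hamming(read, locus[0]) <= max_mismatches:
                locus.append(read)
                break
        else:
            loci.append([read])  # no locus was close enough: start a new one
    return loci


# Made-up reads, for illustration only.
reads = ["ACGTACGT", "ACGTACGA", "ACGTTCGA", "TTTTACGT"]
for m in (1, 2, 3):
    print(m, len(merge_into_loci(reads, m)))
```

A strict threshold splits similar reads into separate loci (fewer SNPs per locus), while a permissive one merges divergent reads into a single locus, which mirrors the under- and overestimation risk described in the abstract.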
Linda E. Kippner
Single-cell analysis has the potential to provide us with a host of new knowledge about biological systems, but it comes with the challenge of correctly interpreting the biological information. While emerging techniques have made it possible to measure inter-cellular variability at the transcriptome level, no consensus yet exists on the most appropriate method of data analysis of such single cell data. Methods for analysis of transcriptional data at the population level are well established but are not well suited to single cell analysis due to their dependence on population averages. In order to address this question, we have systematically tested combinations of methods for primary data analysis on single cell transcription data generated from two types of primary immune cells, neutrophils and T lymphocytes. Cells were obtained from healthy individuals, and single cell transcript expression data was obtained by a combination of single cell sorting and nanoscale quantitative real time PCR (qRT-PCR) for markers of cell type, intracellular signaling, and immune functionality. Gene expression analysis was focused on hierarchical clustering to determine the existence of cellular subgroups within the populations. Nine combinations of criteria for data exclusion and normalization were tested and evaluated. Bimodality in gene expression indicated the presence of cellular subgroups which were also revealed by data clustering. We observed evidence for two clearly defined cellular subtypes in the neutrophil populations and at least two in the T lymphocyte populations. When normalizing the data by different methods, we observed varying outcomes with corresponding interpretations of the biological characteristics of the cell populations. Normalization of the data by linear standardization, taking into account technical effects such as plate effects, resulted in interpretations that most closely matched biological expectations. Single cell transcription
critical. 5. REFERENCES Almasy, L, Blangero, J. (2009) “Human QTL linkage mapping.” Genetica 136:333-340. Amos, CI. (2007) “Successful...quantitative trait loci.” Genetica 136:237-243. Ward, JH, Hook, ME. “A Hierarchical Grouping Procedure Applied to a Problem of Grouping Profiles
Cowpea is a legume widely grown in Africa, North, Central and South America, and Asia. The Cowpea plant growth habits consist of erect, semi-prostrate, and prostrate types. Developing a cultivar while considering plant growth habit is essential within a breeding program since the need for a particul...
Jain, Manish; Kalsi, Amanpreet Kaur
The present study evaluated the role of SNP microarray in 101 cases of clinically suspected FISH negative (noninformative/normal) 22q11.2 microdeletion syndrome. SNP microarray was carried out using 300 K HumanCytoSNP-12 BeadChip array or CytoScan 750 K array. SNP microarray identified 8 cases of 22q11.2 microdeletions and/or microduplications in addition to cases of chromosomal abnormalities and other pathogenic/likely pathogenic CNVs. Clinically suspected specific deletions (22q11.2) were detectable in approximately 8% of cases by SNP microarray, mostly from FISH noninformative cases. This study also identified several LOH/AOH loci with known and well-defined UPD (uniparental disomy) disorders. In conclusion, this study suggests more strict clinical criteria for FISH analysis. However, if clinical criteria are few or doubtful, in particular newborn/neonate in intensive care, SNP microarray should be the first screening test to be ordered. FISH is ideal test for detecting mosaicism, screening family members, and prenatal diagnosis in proven families. PMID:27051557
Halder, Ashutosh; Jain, Manish; Kalsi, Amanpreet Kaur
The present study evaluated the role of SNP microarray in 101 cases of clinically suspected FISH negative (noninformative/normal) 22q11.2 microdeletion syndrome. SNP microarray was carried out using 300 K HumanCytoSNP-12 BeadChip array or CytoScan 750 K array. SNP microarray identified 8 cases of 22q11.2 microdeletions and/or microduplications in addition to cases of chromosomal abnormalities and other pathogenic/likely pathogenic CNVs. Clinically suspected specific deletions (22q11.2) were detectable in approximately 8% of cases by SNP microarray, mostly from FISH noninformative cases. This study also identified several LOH/AOH loci with known and well-defined UPD (uniparental disomy) disorders. In conclusion, this study suggests more strict clinical criteria for FISH analysis. However, if clinical criteria are few or doubtful, in particular newborn/neonate in intensive care, SNP microarray should be the first screening test to be ordered. FISH is ideal test for detecting mosaicism, screening family members, and prenatal diagnosis in proven families.
Bers, van N.E.M.; Crooijmans, R.P.M.A.; Groenen, M.A.M.; Dibbits, B.W.; Komen, J.
We have generated a unique resource consisting of nearly 175 000 short contig sequences and 3569 SNP markers from the widely cultured GIFT (Genetically Improved Farmed Tilapia) strain of Nile tilapia (Oreochromis niloticus). In total, 384 SNPs were selected to monitor the wider applicability of the
Relich, Peter Kristopher, II
Single molecule experiments are a set of experiments designed specifically to study the properties of individual molecules. It has only been in the last three decades where single molecule experiments have been applied to the life sciences; where they have been successfully implemented in systems biology for probing the behaviors of sub-cellular mechanisms. The advent and growth of super-resolution techniques in single molecule experiments has made the fundamental behaviors of light and the associated nano-probes a necessary concern amongst life scientists wishing to advance the state of human knowledge in biology. This dissertation disseminates some of the practices learned in experimental live cell microscopy. The topic of single particle tracking is addressed here in a format that is designed for the physicist who embarks upon single molecule studies. Specifically, the focus is on the necessary procedures to generate single particle tracking analysis techniques that can be implemented to answer biological questions. These analysis techniques range from designing and testing a particle tracking algorithm to inferring model parameters once an image has been processed. The intellectual contributions of the author include the techniques in diffusion estimation, localization filtering, and trajectory associations for tracking which will all be discussed in detail in later chapters. The author of this thesis has also contributed to the software development of automated gain calibration, live cell particle simulations, and various single particle tracking packages. Future work includes further evaluation of this laboratory's single particle tracking software, entropy based approaches towards hypothesis validations, and the uncertainty quantification of gain calibration.
This book highlights the current state of the art in single cell analysis, an area that involves many fields of science – from clinical hematology, functional analysis and drug screening, to platelet and microparticle analysis, marine biology and fundamental cancer research. This book brings together an eclectic group of current applications, all of which have a significant impact on our current state of knowledge. The authors of these chapters are all pioneering researchers in the field of single cell analysis. The book will not only appeal to those readers more focused on clinical applications, but also those interested in highly technical aspects of the technologies. All of the technologies identified utilize unique applications of photon detection systems.
Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus
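The locus-averaged Shannon entropy mentioned in this abstract can be illustrated with a minimal Python sketch. The abstract does not give the exact formula, so the version below assumes biallelic SNPs, entropy measured in bits, and a plain average over loci; the function names are hypothetical, and the adjustment for uniformity of the SNP distribution is not reproduced.

```python
import math

def locus_entropy(p):
    """Shannon entropy (in bits) of one biallelic SNP with allele frequency p."""
    h = 0.0
    for q in (p, 1.0 - p):
        if q > 0.0:  # treat 0 * log(0) as 0
            h -= q * math.log2(q)
    return h

def locus_averaged_entropy(freqs):
    """Plain average of per-locus entropies over a candidate SNP set (assumed form)."""
    return sum(locus_entropy(p) for p in freqs) / len(freqs)

# Hypothetical allele frequencies for three candidate SNPs.
candidate_freqs = [0.5, 0.2, 0.05]
average_information = locus_averaged_entropy(candidate_freqs)
```

Under these assumptions a SNP with allele frequency 0.5 contributes the maximum of 1 bit and a monomorphic SNP contributes nothing, which is why an entropy-based objective tends to favour informative markers.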
Ørum, H.; Nielsen, P.E.; Egholm, M.
A novel method that allows direct analysis of single base mutation by the polymerase chain reaction (PCR) is described. The method utilizes the finding that PNAs (peptide nucleic acids) recognize and bind to their complementary nucleic acid sequences with higher thermal stability and specificity...
Jul 9, 2014 ... Amino acids analysis during lactic acid fermentation by single strain cultures of lactobacilli and mixed culture starter made from them. KiBeom Lee1*, Ho-Jin Kim1 and Sang-Kyu Park2. 1Bio Center Technopark, 7-50 Songdo, Yeonsu-Gu, Incheon 406-840, Republic of Korea. 2Nambu University, Chumdan ...
Hofland, B.; van gent, Marcel
A method to quantify, analyse, and present the settlement of single-layer concrete armour layers of coastal structures is presented. The use of the image processing technique for settlement analysis is discussed based on various modelling studies performed over the years. The accuracy of the
CrazyEgg provides an in depth look at visitor behavior on one page. While you can use GA to do trend analysis of your web area, CrazyEgg helps diagnose the design of a single Web page by visually displaying all visitor clicks during a specified time.
Zhang, Jianhui; Jin, Xia; Fang, Shumei; Li, Yan; Wang, Rui; Guo, Wei; Wang, Na; Wang, Yimin; Wen, Denggui; Wei, Lizhen; Kuang, Gang; Dong, Zhiming
The matrix metalloproteinases (MMPs), a family of proteolytic enzymes that degrade different components of the extracellular matrix, play important roles in tumor development and invasion. A single adenine insertion/deletion polymorphism (6A/5A) in the MMP3 promoter region causes transcriptional elevation. The aim of this study was to assess the effects of this single nucleotide polymorphism (SNP) on the development and clinical staging of esophageal squamous cell carcinoma (ESCC) and gastric cardiac adenocarcinoma (GCA). The MMP3 SNP was genotyped by polymerase chain reaction-restriction fragment length polymorphism analysis in 417 cancer patients (234 ESCC and 183 GCA) and 350 controls in north China. The overall distribution of the MMP3 SNP in ESCC and GCA patients was not significantly different from that in healthy controls. However, smoking individuals with the 5A/5A or 5A/6A genotype were significantly more common in ESCC patients than in controls (37.5 versus 23.3%, χ² = 5.13, P = 0.02). Thus, smokers with at least one 5A allele had a significantly increased risk of ESCC, compared with 6A homozygotes (age and sex adjusted OR = 1.95, 95% CI = 1.08-3.53). The significant difference in the SNP distribution between ESCC patients, GCA patients and controls was not observed when stratified by family history of upper gastrointestinal cancer. In addition, the frequency of the 5A/5A + 5A/6A genotypes in ESCC patients with and without lymphatic metastasis was significantly different (45.8 versus 27.8%, χ² = 4.56, P = 0.03). Therefore, patients with at least one 5A allele were significantly more prone to lymphatic metastasis of ESCC. In contrast, no significant difference in the SNP distribution between patients with and without lymphatic metastasis was observed in GCA. The present study suggests that the MMP3 promoter SNP might be associated with a risk of development and lymphatic metastasis in ESCC but not in GCA.
Erben, Philipp; Nowak, Daniel; Sauer, Christian; Ströbel, Philipp; Hofmann, Wolf-Karsten; Hofheinz, Ralf-Dieter; Hohenberger, Peter; Kasper, Bernd
Desmoid tumors are neoplastic proliferations of connective tissues. The mutation status of the gene coding for catenin (cadherin-associated protein) beta 1 (CTNNB1) and trisomy 8 on the chromosomal level have been described to have prognostic relevance. In order to elucidate new molecular mechanisms underlying these tumors, we carried out a molecular analysis with a genome-wide human high-density single-nucleotide polymorphism (SNP) array, in 9 patients. Single samples showed numerical aberrations on chromosomes (Chrs) 20 and 6 with either trisomy 20 or monosomy 6. No trisomy 8 could be detected. Recurrent heterozygous deletions were found in Chr 5q (including the APC gene locus, n = 3) and Chr 8p23 (n = 4, containing coding regions for the potential tumor suppressor gene CSMD1). This novel deletion in 8p23 showed an association with local recurrence. In addition, structural chromosomal changes (gain of Chrs 8 and 20) were found in a minority of cases. The genomic alteration affecting the candidate gene CSMD1 could be important in the development of desmoid tumors.
Santos, Carla; Phillips, Christopher; Fondevila, Manuel; Daniel, Runa; van Oorschot, Roland A H; Burchard, Esteban G; Schanfield, Moses S; Souto, Luis; Uacyisrael, Jolame; Via, Marc; Carracedo, Ángel; Lareu, Maria V
The analysis of human population variation is an area of considerable interest in the forensic, medical genetics and anthropological fields. Several forensic single nucleotide polymorphism (SNP) assays provide ancestry-informative genotypes in sensitive tests designed to work with limited DNA samples, including a 34-SNP multiplex differentiating African, European and East Asian ancestries. Although assays capable of differentiating Oceanian ancestry at a global scale have become available, this study describes markers compiled specifically for differentiation of Oceanian populations. A sensitive multiplex assay, termed Pacifiplex, was developed and optimized in a small-scale test applicable to forensic analyses. The Pacifiplex assay comprises 29 ancestry-informative marker SNPs (AIM-SNPs) selected to complement the 34-plex test, that in a combined set distinguish Africans, Europeans, East Asians and Oceanians. Nine Pacific region study populations were genotyped with both SNP assays, then compared to four reference population groups from the HGDP-CEPH human diversity panel. STRUCTURE analyses estimated population cluster membership proportions that aligned with the patterns of variation suggested for each study population's currently inferred demographic histories. Aboriginal Taiwanese and Philippine samples indicated high East Asian ancestry components, Papua New Guinean and Aboriginal Australians samples were predominantly Oceanian, while other populations displayed cluster patterns explained by the distribution of divergence amongst Melanesians, Polynesians and Micronesians. Genotype data from Pacifiplex and 34-plex tests is particularly well suited to analysis of Australian Aboriginal populations and when combined with Y and mitochondrial DNA variation will provide a powerful set of markers for ancestry inference applied to modern Australian demographic profiles. On a broader geographic scale, Pacifiplex adds highly informative data for inferring the ancestry
This paper examines whether an abbreviated panel of 37 single nucleotide polymorphisms (SNPs) has the same power as a larger and more expensive panel of 95 SNPs to assign ancestry of honeybees (Apis mellifera) to three ancestral lineages. We selected 37 SNPs from the original 95 SNP panel using alle...
Nielsen, Rasmus; Korneliussen, Thorfinn Sand; Albrechtsen, Anders
is calculated using a dynamic programming algorithm and numerically optimized using analytical derivatives. We then use a bayesian method for estimating the sample allele frequency in a single site, and show how the method can be used for genotype calling and SNP calling. We also show how the method can...
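As a rough illustration of how an allele frequency estimate can feed into genotype calling, the sketch below combines per-genotype likelihoods with a prior and calls the genotype with the highest posterior. The record above is truncated, so this is not the authors' implementation: the Hardy-Weinberg prior, the function names, and the example numbers are all assumptions made for illustration.

```python
def genotype_posteriors(likelihoods, freq):
    """Posterior over genotypes 0, 1, 2 (copies of the alternate allele).

    likelihoods: P(read data | genotype) for genotypes 0, 1, 2.
    freq: estimated alternate-allele frequency, used to build an
          assumed Hardy-Weinberg prior; must lie strictly between 0 and 1.
    """
    prior = [(1 - freq) ** 2, 2 * freq * (1 - freq), freq ** 2]
    joint = [l * p for l, p in zip(likelihoods, prior)]
    total = sum(joint)
    return [j / total for j in joint]

def call_genotype(likelihoods, freq):
    """Return (most probable genotype, its posterior probability)."""
    post = genotype_posteriors(likelihoods, freq)
    best = max(range(3), key=lambda g: post[g])
    return best, post[best]

# Hypothetical likelihoods that mildly favour the heterozygote.
likelihoods = [0.1, 0.8, 0.1]
common_call, _ = call_genotype(likelihoods, freq=0.5)   # common variant
rare_call, _ = call_genotype(likelihoods, freq=0.01)    # rare variant
```

With the same likelihoods the call depends on the frequency: the heterozygote is called for the common variant, while the prior for the rare variant pulls the call back to the reference homozygote.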
Array- or chip-based single nucleotide polymorphism (SNP) markers are widely used in genomic studies because of their abundance in a genome and cost less per data point compared to older marker technologies. Genotyping by sequencing (GBS), a relatively newer approach of genotyping, suggests equal or...
Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ~4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification pr...
Balomenos, Athanasios D; Tsakanikas, Panagiotis; Aspridou, Zafiro; Tampakaki, Anastasia P; Koutsoumanis, Konstantinos P; Manolakos, Elias S
Time-lapse microscopy is an essential tool for capturing and correlating bacterial morphology and gene expression dynamics at single-cell resolution. However state-of-the-art computational methods are limited in terms of the complexity of cell movies that they can analyze and lack of automation. The proposed Bacterial image analysis driven Single Cell Analytics (BaSCA) computational pipeline addresses these limitations thus enabling high throughput systems microbiology. BaSCA can segment and track multiple bacterial colonies and single-cells, as they grow and divide over time (cell segmentation and lineage tree construction) to give rise to dense communities with thousands of interacting cells in the field of view. It combines advanced image processing and machine learning methods to deliver very accurate bacterial cell segmentation and tracking (F-measure over 95%) even when processing images of imperfect quality with several overcrowded colonies in the field of view. In addition, BaSCA extracts on the fly a plethora of single-cell properties, which get organized into a database summarizing the analysis of the cell movie. We present alternative ways to analyze and visually explore the spatiotemporal evolution of single-cell properties in order to understand trends and epigenetic effects across cell generations. The robustness of BaSCA is demonstrated across different imaging modalities and microscopy types. BaSCA can be used to analyze accurately and efficiently cell movies both at a high resolution (single-cell level) and at a large scale (communities with many dense colonies) as needed to shed light on e.g. how bacterial community effects and epigenetic information transfer play a role on important phenomena for human health, such as biofilm formation, persisters' emergence etc. Moreover, it enables studying the role of single-cell stochasticity without losing sight of community effects that may drive it.
Huang, Hailiang; Tata, Sandeep; Prill, Robert J
Computational workloads for genome-wide association studies (GWAS) are growing in scale and complexity outpacing the capabilities of single-threaded software designed for personal computers. The BlueSNP R package implements GWAS statistical tests in the R programming language and executes the calculations across computer clusters configured with Apache Hadoop, a de facto standard framework for distributed data processing using the MapReduce formalism. BlueSNP makes computationally intensive analyses, such as estimating empirical p-values via data permutation, and searching for expression quantitative trait loci over thousands of genes, feasible for large genotype-phenotype datasets. http://github.com/ibm-bioinformatics/bluesnp
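The empirical p-value via data permutation mentioned above can be sketched on a single machine as follows. This is not the BlueSNP API and involves no Hadoop; the test statistic (absolute difference in mean phenotype between carriers and non-carriers), the function names, and the toy data are assumptions chosen only to show the idea.

```python
import random

def mean_difference(genotypes, phenotypes):
    """Absolute difference in mean phenotype between carriers and non-carriers."""
    carriers = [y for g, y in zip(genotypes, phenotypes) if g > 0]
    others = [y for g, y in zip(genotypes, phenotypes) if g == 0]
    return abs(sum(carriers) / len(carriers) - sum(others) / len(others))

def empirical_p_value(genotypes, phenotypes, n_perm=1000, seed=0):
    """Permutation p-value: shuffle phenotypes to break any association."""
    rng = random.Random(seed)
    observed = mean_difference(genotypes, phenotypes)
    shuffled = list(phenotypes)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if mean_difference(genotypes, shuffled) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)

# Hypothetical toy data: carriers (genotype 1) have higher phenotype values.
genotypes = [0, 0, 0, 0, 1, 1, 1, 1]
phenotypes = [1.0, 1.1, 0.9, 1.0, 2.0, 2.1, 1.9, 2.0]
p_value = empirical_p_value(genotypes, phenotypes)
```

Each permutation is independent of the others, which is the property that makes this kind of analysis a natural fit for distribution across a cluster.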
The Markov processes defined by random and loop-based schemes for single spin flip attempts in Monte Carlo simulations of the 2D Ising model are investigated, by explicitly constructing their transition matrices. Their analysis reveals that loops over all lattice sites using a Metropolis-type single spin flip probability often do not define ergodic Markov chains, and have distorted dynamical properties even if they are ergodic. The transition matrices also enable a comparison of the dynamics of random versus loop spin selection and Glauber versus Metropolis probabilities
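The two single spin flip probabilities compared in this abstract can be written down in a short sketch. It assumes the standard textbook forms (Metropolis: min(1, exp(-beta * dE)); Glauber: 1 / (1 + exp(beta * dE))) on a square lattice with periodic boundaries and nearest-neighbour coupling; the function names and the lattice representation are illustrative and not taken from the paper.

```python
import math

def delta_energy(spins, i, j, coupling=1.0):
    """Energy change from flipping spin (i, j) on an n x n periodic lattice."""
    n = len(spins)
    s = spins[i][j]
    neighbours = (spins[(i - 1) % n][j] + spins[(i + 1) % n][j]
                  + spins[i][(j - 1) % n] + spins[i][(j + 1) % n])
    return 2.0 * coupling * s * neighbours

def metropolis_probability(d_e, beta):
    """Metropolis rule: always accept moves that do not raise the energy."""
    return 1.0 if d_e <= 0 else math.exp(-beta * d_e)

def glauber_probability(d_e, beta):
    """Glauber (heat-bath) rule: acceptance is always strictly below 1."""
    return 1.0 / (1.0 + math.exp(beta * d_e))

# Fully aligned 3 x 3 lattice: flipping any spin costs 8 units of energy.
lattice = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
cost = delta_energy(lattice, 1, 1)
```

One visible difference is the zero-cost move: the Metropolis rule accepts it with probability 1, whereas the Glauber rule accepts it with probability 1/2. Deterministic acceptance of this kind may be relevant to the ergodicity problems reported for loop-based updates.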
Yousefi, Soheil; Abbassi-Daloii, Tooba; Kraaijenbrink, Thirsa; Vermaat, Martijn; Mei, Hailiang; van 't Hof, Peter; van Iterson, Maarten; Zhernakova, Daria V; Claringbould, Annique; Franke, Lude; 't Hart, Leen M; Slieker, Roderick C; van der Heijden, Amber; de Knijff, Peter; 't Hoen, Peter A C
BACKGROUND: SNP panels that uniquely identify an individual are useful for genetic and forensic research. Previously recommended SNP panels are based on DNA profiles and mostly contain intragenic SNPs. With the increasing interest in RNA expression profiles, we aimed for establishing a SNP panel for
Martinez, Pierre; Kimberley, Christopher; Birkbak, Nicolai Juul
Intra-tumour genetic heterogeneity (ITH) fosters drug resistance and is a critical hurdle to clinical treatment. ITH can be well-measured using multi-region sampling but this is costly and challenging to implement. There is therefore a need for tools to estimate ITH in individual samples, using...... standard genomic data such as SNP-arrays, that could be implemented routinely. We designed two novel scores S and R, respectively based on the Shannon diversity index and Ripley's L statistic of spatial homogeneity, to quantify ITH in single SNP-array samples. We created in-silico and in-vitro mixtures...... sequencing data but heterogeneity in the fraction of tumour cells present across samples hampered accurate quantification. The prognostic potential of both scores was moderate but significantly predictive of survival in several tumour types (corrected p = 0.03). Our work thus shows how individual SNP...
Welter, Danielle; MacArthur, Jacqueline; Morales, Joannella; Burdett, Tony; Hall, Peggy; Junkins, Heather; Klemm, Alan; Flicek, Paul; Manolio, Teri; Hindorff, Lucia; Parkinson, Helen
The National Human Genome Research Institute (NHGRI) Catalog of Published Genome-Wide Association Studies (GWAS) Catalog provides a publicly available manually curated collection of published GWAS assaying at least 100,000 single-nucleotide polymorphisms (SNPs) and all SNP-trait associations with P < 1 × 10^-5. The Catalog includes 1751 curated publications of 11 912 SNPs. In addition to the SNP-trait association data, the Catalog also publishes a quarterly diagram of all SNP-trait associations mapped to the SNPs' chromosomal locations. The Catalog can be accessed via a tabular web interface, via a dynamic visualization on the human karyotype, as a downloadable tab-delimited file and as an OWL knowledge base. This article presents a number of recent improvements to the Catalog, including novel ways for users to interact with the Catalog and changes to the curation infrastructure.
Motivation: A review of the available single nucleotide polymorphism (SNP) calling procedures for Illumina high-throughput sequencing (HTS) platform data reveals that most rely mainly on base-calling and mapping qualities as sources of error when calling SNPs. Thus, errors not involved in base-calling or alignment, such as those in genomic sample preparation, are not accounted for. Results: A novel method of consensus and SNP calling, Genotype Model Selection (GeMS), is given which accounts for the errors that occur during the preparation of the genomic sample. Simulations and real data analyses indicate that GeMS has the best performance balance of sensitivity and positive predictive value among the tested SNP callers. © The Author 2012. Published by Oxford University Press. All rights reserved.
Sandesh K Shrestha
Corynespora cassiicola attacks diverse agriculturally important plants, including soybean and cotton, in the US. It is a re-emerging pathogen on cotton in the southeastern US. Whole genome sequences of four cotton isolates and one soybean isolate from Tennessee were used to develop single nucleotide polymorphism markers for cotton isolates. Cotton isolates had little diversity at the genome level and very little differentiation from the soybean isolate. Analysis of 75 isolates from cotton and soybean, using targeted sequencing of 22 polymorphic SNP sites, revealed eight multi-locus genotypes, and it appears that a single clonal lineage predominates across the southeastern region. The cotton and soybean genome sequences were significantly different from the public reference genome derived from a rubber isolate and the utility of these novel resources will be discussed.
Erica L Carpenter
Our understanding of the diversity of cells that escape the primary tumor and seed micrometastases remains rudimentary, and approaches for studying circulating and disseminated tumor cells have been limited by low throughput and sensitivity, reliance on single parameter sorting, and a focus on enumeration rather than phenotypic and genetic characterization. Here we utilize a highly sensitive microfluidic and dielectrophoretic approach for the isolation and genetic analysis of individual tumor cells. We employed fluorescence labeling to isolate 208 single cells from spiking experiments conducted with 11 cell lines, including 8 neuroblastoma cell lines, and achieved a capture sensitivity of 1 tumor cell per 10^6 white blood cells. Sample fixation or freezing had no detectable effect on cell capture. Point mutations were accurately detected in the whole genome amplification product of captured single tumor cells but not in negative control white blood cells. We applied this approach to capture 144 single tumor cells from 10 bone marrow samples from patients suffering from neuroblastoma. In this pediatric malignancy, high-risk patients often exhibit wide-spread hematogenous metastasis, but access to primary tumor can be difficult or impossible. Here we used flow-based sorting to pre-enrich samples with tumor involvement below 0.02%. For all patients for whom a mutation in the Anaplastic Lymphoma Kinase gene had already been detected in their primary tumor, the same mutation was detected in single cells from their marrow. These findings demonstrate a novel, non-invasive, and adaptable method for the capture and genetic analysis of single tumor cells from cancer patients.
Liu, Er-Ping; Wang, Yan; He, Xiao-Hui; Guan, Jun-Jie; Wang, Jin; Qin, Zheng-Hong; Sun, Wan-Ping
Point-of-care genetic analysis may require polymerase chain reaction (PCR) to be carried out on whole blood. However, human blood contains natural inhibitors of PCR such as hemoglobin, immunoglobulin G, lactoferrin, and proteases, as well as anticoagulant agents, including EDTA and heparin that can reduce whole blood PCR efficiency. Our purpose was to develop a highly specific, direct whole blood single-nucleotide polymorphism (SNP) analysis method based on allele-specific (AS) PCR that is mediated by Pfu DNA polymerase and phosphorothioate-modified AS primers. At high Mg(2+) concentrations, Pfu DNA polymerase efficiently amplified genomic DNA in a reaction solution containing up to 14% whole blood. Among the three anticoagulants tested, Pfu DNA polymerase showed the highest activity with sodium citrate. Meanwhile, Triton X-100 and betaine inhibited Pfu DNA polymerase activity in whole blood PCR, whereas trehalose had virtually no effect. These findings provided for the development of a low-cost, simple, and fast direct whole blood genotyping method that uses Pfu DNA polymerase combined with phosphorothioate AS primers for CYP2C9*3 and VKORC1(-1639) loci. With its high DNA amplification efficiency and tolerance of various blood conditions, Pfu DNA polymerase can be used in clinical laboratories to analyze SNPs in whole blood samples.
Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron
As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718
Hu, Jian-cheng; Tan, Ke; Cheng, De-hua; Li, Lu-yun; Lu, Guang-xiu; Tan, Yue-qiu
To determine the karyotype of a boy suspected to have Cri du Chat syndrome with severe clinical manifestations, and to assess the recurrence risk for his family. High-resolution GTG banding was performed to analyze the patient and his parents. Fluorescence in situ hybridization (FISH) with Cri du Chat syndrome region probe as well as subregional probes mapped to 5pter, 5qter, 18pter, 18qter, and whole chromosome painting probe 18 was performed to analyze the patient and his parents. In addition, single nucleotide polymorphism-based arrays (SNP-Array) analysis with Affymetrix GeneChip Genome-wide Human SNP Nsp/Sty 6.0 were also performed to analyze the patient. Karyotype analysis indicated that the patient has carried a terminal deletion in 5p. FISH with Cri du Chat syndrome region probe confirmed that D5S23 and D5S721 loci are deleted. SNP-Array has detected a 15 Mb deletion at 5p and a 2 Mb duplication at 18p. FISH with 5p subtelomeric probes and 18p subtelomeric probe further confirmed that the derivative chromosome 5 has derived from a translocation between 5p and 18p, which has given rise to a 46,XY,der(5)t(5;18)(p15.1;p11.31)dn karyotype. A de novo 5p partial deletion in conjunction with a cryptic 18p duplication has been detected in a boy featuring Cri-du-Chat syndrome. His parents, both with negative findings, have a low recurrence risk. For its ability to detect chromosomal imbalance, SNP-Array has a great value for counseling of similar patients and assessment of recurrence risks.
Sasaki, Yoshi K.; Allen, Steve; Mizuno, Koki; Whitehead, Victor; Wilk, Kenneth E.
A computer scheme for extracting singularities has been developed and applied to single Doppler radar wind data. The scheme is planned for use in real-time wind and singularity analysis and forecasting. The method, known as Doppler Operational Variational Extraction of Singularities is outlined, focusing on the principle of local symmetry. Results are presented from the application of the scheme to a storm-generated gust front in Oklahoma on May 28, 1987.
Arepalli, Sivram; Nikolaev, Pavel; Gorelik, Olga
An improved protocol for thermogravimetric analysis (TGA) of samples of single-wall carbon nanotube (SWCNT) material has been developed to increase the degree of consistency among results so that meaningful comparisons can be made among different samples. This improved TGA protocol is suitable for incorporation into the protocol for characterization of carbon nanotube material. In most cases, TGA of carbon nanotube materials is performed in gas mixtures that contain oxygen at various concentrations. The improved protocol is summarized.
Background: Urinary bladder carcinoma stage T1 is an unpredictable disease that in some cases has a good prognosis with only local or no recurrence, but in others can appear as a more aggressive tumor with progression to more advanced stages. The aim here was to investigate stage T1 tumors regarding MDM2 promoter SNP309 polymorphism, mutations in the p53 gene, and expression of p53 and p16 measured by immunohistochemistry, and subsequently relate these changes to tumor recurrence and progression. We examined a cohort of patients with primary stage T1 urothelial carcinoma of the bladder and their tumors. Methods: After re-evaluation of the original slides and exclusions, the study population comprised 141 patients, all with primary stage T1 urothelial carcinoma of the bladder. The hospital records were screened for clinical parameters and information concerning presence of histologically proven recurrence and progression. The paraffin-embedded tumor material was evaluated by immunohistochemistry. Any mutations found in the p53 gene were studied by single-strand conformation analysis and Sanger sequencing. The MDM2 SNP309 polymorphism was investigated by pyrosequencing. Multivariate analyses concerning association with prognosis were performed, and Kaplan-Meier analysis was conducted for a combination of changes and time to progression. Results: Of the 141 patients, 82 had at least one MDM2 SNP309 G allele, and 53 had a mutation in the p53 gene, but neither of those anomalies was associated with a worse prognosis. A mutation in the p53 gene was associated with immunohistochemically visualized p53 protein expression at a cut-off value of 50%. In the group with p53 mutation, Kaplan-Meier analysis showed a higher rate of progression and shorter time to progression in patients with immunohistochemically abnormal p16 expression compared to those with normal p16 expression (p = 0.038). Conclusions: MDM2 SNP309 promoter polymorphism and mutations in
Chirikov-Zorin, I.E.; Fedorko, I.; Sykora, I.; Tokar, S.; Menzione, A.
A deconvolution method based on a sophisticated photomultiplier response function was used to analyse the compact metal package photomultiplier spectra taken in single photoelectron mode. The spectra taken by Hamamatsu R5600 and R5900 photomultipliers have been analysed. The detailed analysis shows that the method appropriately describes the process of charge multiplication in these photomultipliers in a wide range of working regimes and that the deconvoluted parameters are established with about 1% accuracy. The method can be used for a detailed analysis of photomultiplier noise and for calibration purposes.
Karisa, B K; Thomson, J; Wang, Z; Stothard, P; Moore, S S; Plastow, G S
The candidate gene approach was used to identify genes associated with residual feed intake (RFI) in beef steers. The approach uses prior knowledge of gene functions to predict their biological role in the variation observed in a trait. It is suited to identifying genes associated with complex traits where each gene has a relatively small effect. First, positional candidate genes were identified within the genomic positions of previously reported QTL associated with component traits related to RFI such as dry matter intake (DMI), growth, feed conversion ratio (FCR), average daily gain (ADG), and energy balance. Secondly, the positional candidate genes were prioritized into functional candidate genes according to their biological functions and their relationship with the biological processes associated with RFI including carbohydrate, fat and protein metabolism, thermoregulation, immunity and muscle activity. Single nucleotide polymorphisms (SNPs) located within the functional candidate genes were identified using mRNA sequences and prioritized into functional classes such as non-synonymous (nsSNP), synonymous (sSNP) or intronic SNP. A total of 117 nsSNP were considered as functional SNP and genotyped in steers at the University of Alberta ranch in Kinsella. Multiple marker association analysis in ASReml was performed using RFI data obtained from 531 beef steers. Twenty-five SNP were significantly associated with RFI (P < 0.05), accounting for 19.7% of the phenotypic variation. Using the SIFT program to predict the effect of the SNP on the function of the corresponding protein, 3 of the 25 SNP were predicted to cause a significant effect on protein function (P < 0.05). One of the 3 SNP was located in the GHR gene and was also associated with a significant effect on the tertiary structure of the GHR protein (P < 0.05) as modeled using SWISSModel software. Least squares means for each genotype were estimated and an over-dominance effect was observed for the SNP located in the
Cronin, Matthew A; Rincon, Gonzalo; Meredith, Robert W; MacNeil, Michael D; Islas-Trejo, Alma; Cánovas, Angela; Medrano, Juan F
We assessed the relationships of polar bears (Ursus maritimus), brown bears (U. arctos), and black bears (U. americanus) with high throughput genomic sequencing data with an average coverage of 25× for each species. A total of 1.4 billion 100-bp paired-end reads were assembled using the polar bear and annotated giant panda (Ailuropoda melanoleuca) genome sequences as references. We identified 13.8 million single nucleotide polymorphisms (SNP) in the 3 species aligned to the polar bear genome. These data indicate that polar bears and brown bears share more SNP with each other than either does with black bears. Concatenation and coalescence-based analysis of consensus sequences of approximately 1 million base pairs of ultraconserved elements in the nuclear genome resulted in a phylogeny with black bears as the sister group to brown and polar bears, and all brown bears are in a separate clade from polar bears. Genotypes for 162 SNP loci of 336 bears from Alaska and Montana showed that the species are genetically differentiated and there is geographic population structure of brown and black bears but not polar bears.
Chadaeva, Irina V; Ponomarenko, Petr M; Rasskazov, Dmitry A; Sharypova, Ekaterina B; Kashina, Elena V; Zhechev, Dmitry A; Drachkova, Irina A; Arkova, Olga V; Savinkova, Ludmila K; Ponomarenko, Mikhail P; Kolchanov, Nikolay A; Osadchuk, Ludmila V; Osadchuk, Alexandr V
The progress of medicine, science, technology, education, and culture improves, year by year, quality of life and life expectancy of the populace. The modern human has a chance to further improve the quality and duration of his/her life and the lives of his/her loved ones by bringing their lifestyle in line with their sequenced individual genomes. With this in mind, one of genome-based developments at the junction of personalized medicine and bioinformatics will be considered in this work, where we used two Web services: (i) SNP_TATA_Comparator to search for alleles with a single nucleotide polymorphism (SNP) that alters the affinity of TATA-binding protein (TBP) for the TATA boxes of human gene promoters and (ii) PubMed to look for retrospective clinical reviews on changes in physiological indicators of reproductive potential in carriers of these alleles. A total of 126 SNP markers of female reproductive potential, capable of altering the affinity of TBP for gene promoters, were found using the two above-mentioned Web services. For example, 10 candidate SNP markers of thrombosis (e.g., rs563763767) can cause overproduction of coagulation inducers. In pregnant women, Hughes syndrome provokes thrombosis with a fatal outcome although this syndrome can be diagnosed and eliminated even at the earliest stages of its development. Thus, in women carrying any of the above SNPs, preventive treatment of this syndrome before a planned pregnancy can reduce the risk of death. Similarly, seven SNP markers predicted here (e.g., rs774688955) can elevate the risk of myocardial infarction. In line with Bowles' lifespan theory, women carrying any of these SNPs may modify their lifestyle to improve their longevity if they can take under advisement that risks of myocardial infarction increase with age of the mother, total number of pregnancies, in multiple pregnancies, pregnancies under the age of 20, hypertension, preeclampsia, menstrual cycle irregularity, and in women smokers
Liu, Zongzhi; Li, Ao; Schulz, Vincent; Chen, Min; Tuck, David
Genotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance including loss-of-heterozygosity (LOH) beyond that obtained from the total DNA signal available from array comparative genomic hybridization (aCGH) platforms. Several algorithms based on hidden Markov models (HMMs) have been designed to detect copy number changes and copy-neutral LOH making use of the allele information on SNP arrays. However heterogeneity in clinical samples, due to stromal contamination and somatic alterations, complicates analysis and interpretation of these data. We have developed MixHMM, a novel hidden Markov model using hidden states based on chromosomal structural aberrations. MixHMM allows CNV detection for copy numbers up to 7 and allows more complete and accurate description of other forms of allelic imbalance, such as increased copy number LOH or imbalanced amplifications. MixHMM also incorporates a novel sample mixing model that allows detection of tumor CNV events in heterogeneous tumor samples, where cancer cells are mixed with a proportion of stromal cells. We validate MixHMM and demonstrate its advantages with simulated samples, clinical tumor samples and a dilution series of mixed samples. We have shown that the CNVs of cancer cells in a tumor sample contaminated with up to 80% of stromal cells can be detected accurately using Illumina BeadChip and MixHMM. The MixHMM is available as a Python package provided with some other useful tools at http://genecube.med.yale.edu:8080/MixHMM.
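To make the idea of a sample mixing model concrete, the following minimal sketch computes the B-allele frequency that would be expected when tumor cells are diluted with stromal cells. It is a generic textbook mixture calculation, not the MixHMM implementation: the function name, its parameters, and the assumption that stromal cells are diploid and heterozygous at the SNP are all illustrative choices.

```python
def expected_baf(tumor_fraction, tumor_copies, tumor_b_copies):
    """Expected B-allele frequency at a heterozygous SNP in a mixed sample.

    Assumption (illustrative, not taken from MixHMM): stromal cells are
    diploid and carry exactly one B allele at this SNP.
    """
    if not 0.0 <= tumor_fraction <= 1.0:
        raise ValueError("tumor_fraction must be between 0 and 1")
    if not 0 <= tumor_b_copies <= tumor_copies:
        raise ValueError("tumor_b_copies must be between 0 and tumor_copies")
    stroma_fraction = 1.0 - tumor_fraction
    # B-allele copies and total copies contributed by each cell population.
    b_signal = tumor_fraction * tumor_b_copies + stroma_fraction * 1
    total_signal = tumor_fraction * tumor_copies + stroma_fraction * 2
    if total_signal == 0:
        return None  # pure tumor with a homozygous deletion: no signal
    return b_signal / total_signal


# Copy-neutral LOH (two copies, both B) in a sample with 80% stromal cells.
print(expected_baf(0.2, 2, 2))
```

Under these assumptions, even a complete loss of heterozygosity shifts the expected frequency only slightly away from 0.5 at 80% stromal content, which illustrates why heterogeneous samples complicate interpretation.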
Majumder, M.K.; Pandya, N.D.; Kaushik, B.K.; Manhas, S.K.
Carbon nanotubes (CNTs) can be considered an emerging interconnect material in the current nanoscale regime. They are more promising than other interconnect materials such as Al or Cu because of their robustness to electromigration. This research paper aims to address the crosstalk-related issues (signal integrity) in interconnect lines. Different analytical models of single- (SWCNT), double- (DWCNT), and multiwalled CNTs (MWCNT) are studied to analyze the crosstalk delay at global interconnect lengths. A capacitively coupled three-line bus architecture employing a CMOS driver is used for accurate estimation of crosstalk delay. Each line in the bus architecture is represented with the equivalent RLC models of single and bundled SWCNT, DWCNT, and MWCNT interconnects. Crosstalk delay is observed at the middle line (victim) when it switches in the opposite direction with respect to the other two lines (aggressors). Using the data predicted by ITRS 2012, a comparative analysis on the basis of crosstalk delay is performed for bundled SWCNT/DWCNT and single MWCNT interconnects. It is observed that the overall crosstalk delay is improved by 40.92% and 21.37% for single MWCNT in comparison to bundled SWCNT and bundled DWCNT interconnects, respectively.
Fast growth is an important and highly desired trait, which affects the profitability of food animal production, with feed costs accounting for the largest proportion of production costs. Traditional phenotype-based selection is typically used to select for growth traits; however, genetic improvement is slow over generations. Single nucleotide polymorphisms (SNPs) explain 90% of the genetic differences between individuals; therefore, they are most suitable for genetic evaluation and strategies that employ molecular genetics for selective breeding. SNPs found within or near a coding sequence are of particular interest because they are more likely to alter the biological function of a protein. We aimed to use SNPs to identify markers and genes associated with genetic variation in growth. RNA-Seq whole-transcriptome analysis of pooled cDNA samples from a population of rainbow trout selected for improved growth versus unselected genetic cohorts (10 fish from 1 full-sib family each) identified SNP markers associated with growth rate. The allelic imbalances (the ratio between the allele frequencies of the fast growing sample and that of the slow growing sample) were considered at scores >5.0 as an amplification and <0.2 as loss of heterozygosity. A subset of SNPs (n = 54) was validated and evaluated for association with growth traits in 778 individuals of a three-generation parent/offspring panel representing 40 families. Twenty-two SNP markers and one mitochondrial haplotype were significantly associated with growth traits. Polymorphism of 48 of the markers was confirmed in other commercially important aquaculture stocks. Many markers were clustered into genes of metabolic energy production pathways and are suitable candidates for genetic selection. The study demonstrates that RNA-Seq at low sequence coverage of divergent populations is a fast and effective means of identifying SNPs, with allelic imbalances between phenotypes. This technique is
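The allelic imbalance score described in this abstract can be sketched as a small classifier. This is only an illustration of the stated thresholds (ratio above 5.0 treated as amplification, below 0.2 as loss of heterozygosity); the function name, the label strings, and the handling of a zero denominator are assumptions rather than details from the study.

```python
def classify_allelic_imbalance(freq_fast, freq_slow,
                               amp_cutoff=5.0, loh_cutoff=0.2):
    """Classify a SNP by the ratio of allele frequencies (fast / slow).

    Cut-offs follow the abstract; returning "undefined" when the slow
    growing sample has frequency zero is an assumption made here.
    """
    if freq_slow == 0:
        return "undefined"
    ratio = freq_fast / freq_slow
    if ratio > amp_cutoff:
        return "amplification"
    if ratio < loh_cutoff:
        return "loss_of_heterozygosity"
    return "balanced"


# Hypothetical allele frequencies for three SNPs.
for fast, slow in [(0.9, 0.1), (0.05, 0.5), (0.5, 0.5)]:
    print(fast, slow, classify_allelic_imbalance(fast, slow))
```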
Long, Nanye; Gianola, Daniel; Rosa, Guilherme J M; Weigel, Kent A; Kranis, Andreas; González-Recio, Oscar
A challenge when predicting total genetic values for complex quantitative traits is that an unknown number of quantitative trait loci may affect phenotypes via cryptic interactions. If markers are available, assuming that their effects on phenotypes are additive may lead to poor predictive ability. Non-parametric radial basis function (RBF) regression, which does not assume a particular form of the genotype-phenotype relationship, was investigated here by simulation and analysis of body weight and food conversion rate data in broilers. The simulation included a toy example in which an arbitrary non-linear genotype-phenotype relationship was assumed, and five different scenarios representing different broad sense heritability levels (0.1, 0.25, 0.5, 0.75 and 0.9) were created. In addition, a whole genome simulation was carried out, in which three different gene action modes (pure additive, additive+dominance and pure epistasis) were considered. In all analyses, a training set was used to fit the model and a testing set was used to evaluate predictive performance. The latter was measured by correlation and predictive mean-squared error (PMSE) on the testing data. For comparison, a linear additive model known as Bayes A was used as benchmark. Two RBF models with single nucleotide polymorphism (SNP)-specific (RBF I) and common (RBF II) weights were examined. Results indicated that, in the presence of complex genotype-phenotype relationships (i.e. non-linearity and non-additivity), RBF outperformed Bayes A in predicting total genetic values using SNP markers. Extension of Bayes A to include all additive, dominance and epistatic effects could improve its prediction accuracy. RBF I was generally better than RBF II, and was able to identify relevant SNPs in the toy example.
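The two evaluation measures named in this abstract, correlation and predictive mean-squared error on a testing set, can be sketched in a few lines. The Gaussian form of the basis function with one weight per SNP is an assumption about what "SNP-specific weights" might look like; the abstract does not give the kernel, and all function names here are hypothetical.

```python
import math


def gaussian_rbf(genotype, center, weights):
    """Gaussian radial basis function with one weight per SNP.

    Assumed form: exp(-sum_k w_k * (x_k - c_k)^2). Using the same value
    for every w_k corresponds to a single common weight.
    """
    dist = sum(w * (x - c) ** 2 for x, c, w in zip(genotype, center, weights))
    return math.exp(-dist)


def pmse(observed, predicted):
    """Predictive mean-squared error on the testing data."""
    return sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)


def pearson(x, y):
    """Pearson correlation between observed and predicted values."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical testing-set phenotypes and model predictions.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 1.5, 3.5, 3.5]
print(pmse(observed, predicted), pearson(observed, predicted))
```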
Sanchez, J J; Børsting, C; Balogh, K; Berger, B; Bogus, M; Butler, J M; Carracedo, A; Court, D Syndercombe; Dixon, L A; Filipović, B; Fondevila, M; Gill, P; Harrison, C D; Hohoff, C; Huel, R; Ludes, B; Parson, W; Parsons, T J; Petkovski, E; Phillips, C; Schmitter, H; Schneider, P M; Vallone, P M; Morling, N
We report the results of an inter-laboratory exercise on typing of autosomal single nucleotide polymorphisms (SNP) for forensic genetic investigations in crime cases. The European DNA Profiling Group (EDNAP), a working group under the International Society for Forensic Genetics (ISFG), organised the exercise. A total of 11 European and one US forensic genetic laboratories tested a subset of a 52 SNP-multiplex PCR kit developed by the SNPforID consortium. The 52 SNP-multiplex kit amplifies 52 DNA fragments with 52 autosomal SNP loci in one multiplex PCR. The 52 SNPs are detected in two separate single base extension (SBE) multiplex reactions with 29 and 23 SNPs, respectively, using SNaPshot kit, capillary electrophoresis and multicolour fluorescence detection. For practical reasons, only the 29 SBE multiplex reaction was carried out by the participating laboratories. A total of 11 bloodstains on FTA cards including a sample of poor quality and a negative control were sent to the laboratories together with the essential reagents for the initial multiplex PCR and the multiplex SBE reaction. The total SNP locus dropout rate was 2.8% and more than 50% of the dropouts were observed with the poor quality sample. The overall rate of discrepant SNP allele assignments was 2.0%. Two laboratories reported 60% of all the discrepancies. Two laboratories reported all 29 SNP alleles in all 10 positive samples correctly. The results of the collaborative exercise were surprisingly good and demonstrate that SNP typing with SBE, capillary electrophoresis and multicolour detection methods can be developed for forensic genetics.
Marghitoiu, Liliana; Liu, Jian; Lee, Hans; Perez, Lourdes; Fujimori, Kiyoshi; Ronk, Michael; Hammond, Matthew R; Nunn, Heather; Lower, Asher; Rogers, Gary; Nashed-Samuel, Yasser
Studies of the extractable profiles of bioprocessing components have become an integral part of drug development efforts to minimize possible compromise in process performance, decrease in drug product quality, and potential safety risk to patients due to the possibility of small molecules leaching out from the components. In this study, an effective extraction solvent system was developed to evaluate the organic extractable profiles of single-use bioprocess equipment, which has been gaining increasing popularity in the biopharmaceutical industry because of its many advantages over traditional stainless steel-based bioreactors and other fluid mixing and storage vessels. The chosen extraction conditions were intended to represent aggressive conditions relative to the application of single-use bags in biopharmaceutical manufacture, in which aqueous-based systems are largely utilized. Those extraction conditions, along with a non-targeted analytical strategy, allowed for the generation and identification of an array of extractable compounds; a total of 53 organic compounds were identified from four types of commercially available single-use bags, the majority of which are degradation products of polymer additives. The success of this overall extractables analysis strategy was reflected partially by the effectiveness in the extraction and identification of a compound that was later found to be highly detrimental to mammalian cell growth. The usage of single-use bioreactors has been increasing in the biopharmaceutical industry because of the appealing advantages that it promises regarding cleaning, sterilization, operational flexibility, and so on, during manufacturing of biologics. However, compared to their conventional counterparts based mainly on stainless steel, single-use bioreactors are more susceptible to potential problems associated with compound leaching into the bioprocessing fluid. As a result, extractable profiling of the single-use system has become
Dong, Hee-Jin; Cho, Seongbeom; Boxrud, David; Rankin, Shelly; Downe, Francis; Lovchik, Judith; Gibson, Jim; Erdman, Matt; Saeed, A Mahdi
In 2007, a nationwide Salmonella Tennessee outbreak occurred via contaminated peanut butter. Here, we developed a single-nucleotide polymorphism (SNP)-typing method for S. Tennessee to determine the clonal subtypes of S. Tennessee that were associated with the peanut butter outbreak. One hundred seventy-six S. Tennessee isolates from various sources, including humans, animals, food, and the environment, were analyzed by using the SNP technique. Eighty-four representative SNP markers were selected by comparing the sequences of three representative S. Tennessee strains with different multi-locus sequence typing and variable number tandem repeats from our collection. The set of eighty-four SNP markers showed 100% typeability for the 176 strains, with the nucleotide diversity ranging from 0.011 to 0.107 (mean = 0.049 ± 0.018, median = 0.044) for each marker. Among the four clades and nine subtypes generated by the SNP typing, subtype 1, which comprised 142 S. Tennessee strains, was the most predominant. The dominance of single-strain clones in subtype 1 revealed that S. Tennessee is highly clonal regardless of outbreak-association, source, or period of isolation, suggesting the presence of an S. Tennessee strain prototype. Notably, a minimum 18 SNP set was able to determine clonal S. Tennessee strains with similar discrimination power, potentially allowing more rapid and economic strain genotyping for both outbreaks and sporadic cases. The SNP-typing method described here might aid the investigation of the epidemiology and microevolution of pathogenic bacteria by discriminating between outbreak-related and sporadic clinical cases. In addition, this approach enables us to understand the population structure of the bacterial subtypes involved in the outbreak.
Wagner Mark C
Background: Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation. Results: A method and SPR Opt (SNP and PCR-RFLP Optimization) software to perform a comprehensive, whole-genome analysis to forensically discriminate multiple sequences are presented. Tools for the optimization of forensic typing using Single Nucleotide Polymorphism (SNP) and PCR-Restriction Fragment Length Polymorphism (PCR-RFLP) analyses across multiple isolate sequences of a species are described. The PCR-RFLP analysis includes prediction and selection of optimal primers and restriction enzymes to enable maximum isolate discrimination based on sequence information. SPR Opt calculates all SNP or PCR-RFLP variations present in the sequences, groups them into haplotypes according to their co-segregation across those sequences, and performs combinatoric analyses to determine which sets of haplotypes provide maximal discrimination among all the input sequences. Those set combinations requiring that membership in the fewest haplotypes be queried (i.e. the fewest assays be performed) are found. These analyses highlight variable regions based on existing sequence data. These markers may be heterogeneous among unsequenced isolates as well, and thus may be useful for characterizing the relationships among unsequenced as well as sequenced isolates. The predictions are multi-locus. Analyses of mumps and SARS viruses are summarized. Phylogenetic trees created based on SNPs, PCR-RFLPs, and full genomes are compared for SARS virus, illustrating that purported phylogenies based only on SNP or PCR-RFLP variations do not match those based on multiple sequence alignment of the full genomes. Conclusion: This is the first software to optimize the selection of forensic markers to maximize information gained from the fewest assays, accepting whole or partial genome sequence data as input. As
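The grouping and combinatoric steps described in this abstract can be illustrated with a small brute-force sketch. It is not the SPR Opt code: it assumes pre-aligned sequences of equal length, treats every variable column as a SNP, and searches exhaustively, which is only practical for small inputs. All function names are hypothetical.

```python
from itertools import combinations


def group_cosegregating_sites(seqs):
    """Group variable alignment columns that split the sequences identically."""
    groups = {}
    for pos in range(len(seqs[0])):
        column = [s[pos] for s in seqs]
        if len(set(column)) < 2:
            continue  # invariant column carries no discriminating information
        # Relabel alleles by order of first appearance so that columns with
        # the same partition of sequences share one pattern.
        labels = {}
        pattern = tuple(labels.setdefault(base, len(labels)) for base in column)
        groups.setdefault(pattern, []).append(pos)
    return groups


def smallest_discriminating_sets(groups, n_seqs):
    """Return all smallest sets of patterns that tell every sequence apart."""
    patterns = list(groups)
    for size in range(1, len(patterns) + 1):
        hits = []
        for combo in combinations(patterns, size):
            profiles = {tuple(p[i] for p in combo) for i in range(n_seqs)}
            if len(profiles) == n_seqs:
                hits.append(combo)
        if hits:
            return hits
    return []  # the sequences cannot be fully discriminated


# Hypothetical aligned isolates; columns 2 and 4 co-segregate.
isolates = ["AAAAG", "ACAAG", "AACAT", "ACCAT"]
site_groups = group_cosegregating_sites(isolates)
print(site_groups)
print(smallest_discriminating_sets(site_groups, len(isolates)))
```

Each returned pattern stands for a whole group of co-segregating sites, so assaying any one site from each chosen group would give the same discrimination in this toy model.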
Shindo, Hiroyuki; Chigira, Hiroshi; Nagaoka, Tomoyo; Inoue, Masato; Kamatani, Naoyuki
The method of statistical haplotype inference is an indispensable technique in the field of medical science. The authors previously reported Hardy-Weinberg equilibrium-based haplotype inference that could manage single nucleotide polymorphism (SNP) data. We recently extended the method to cover copy number variation (CNV) data. Haplotype inference from mixed data is important because SNPs and CNVs are occasionally in linkage disequilibrium. The idea underlying the proposed method is simple, but the algorithm for it needs to be quite elaborate to reduce the calculation cost. Consequently, we have focused on the details of the algorithm in this study. Although the main advantage of the method is accuracy, in that it does not use any approximation, its main disadvantage is still the calculation cost, which is sometimes intractable for large data sets with missing values.
Background: Single Nucleotide Polymorphisms (SNPs) can be used as genetic markers for applications such as genetic diversity studies or genetic mapping. New technologies now allow genotyping hundreds to thousands of SNPs in a single reaction. In order to evaluate the potential of these technologies in pea, we selected a custom 384-SNP set using SNPs discovered in Pisum through the resequencing of gene fragments in different genotypes and by compiling genomic sequence data present in databases. We then designed an Illumina GoldenGate assay to genotype both a Pisum germplasm collection and a genetic mapping population with the SNP set. Results: We obtained clear allelic data for more than 92% of the SNPs (356 out of 384). Interestingly, the technique was successful for all the genotypes present in the germplasm collection, including those from species or subspecies different from the P. sativum ssp sativum used to generate sequences. By genotyping the mapping population with the SNP set, we obtained a genetic map and map positions for 37 new gene markers. Conclusion: Our results show that the Illumina GoldenGate assay can be used successfully for high-throughput SNP genotyping of diverse germplasm in pea. This genotyping approach will simplify genotyping procedures for association mapping or diversity studies and open new perspectives in legume genomics.
Shavrukov, Yuri; Suchecki, Radoslaw; Eliby, Serik; Abugalieva, Aigul; Kenebayev, Serik; Langridge, Peter
New SNP marker platforms offer the opportunity to investigate the relationships between wheat cultivars from different regions and assess the mechanism and processes that have led to adaptation to particular production environments. Wheat breeding has a long history in Kazakhstan and the aim of this study was to explore the relationship between key varieties from Kazakhstan and germplasm from breeding programs for other regions. The study revealed 5,898 polymorphic markers amongst ten cultivars, of which 2,730 were mapped in the consensus genetic map. Mapped SNP markers were distributed almost equally across the A and B genomes, with between 279 and 484 markers assigned to each chromosome. Marker coverage was approximately 10-fold lower in the D genome. There were 863 SNP markers identified as unique to specific cultivars, and clusters of these markers (regions containing more than three closely mapped unique SNPs) showed specific patterns on the consensus genetic map for each cultivar. Significant intra-varietal genetic polymorphism was identified in three cultivars (Tzelinnaya 3C, Kazakhstanskaya rannespelaya and Kazakhstanskaya 15). Phylogenetic analysis based on inter-varietal polymorphism showed that the very old cultivar Erythrospermum 841 was the most genetically distinct from the other nine cultivars from Kazakhstan, falling in a clade together with the American cultivar Sonora and genotypes from Central and South Asia. The modern cultivar Kazakhstanskaya 19 also fell into a separate clade, together with the American cultivar Thatcher. The remaining eight cultivars shared a single sub-clade but were categorised into four clusters. The accumulated data for SNP marker polymorphisms amongst bread wheat genotypes from Kazakhstan may be used for studying genetic diversity in bread wheat, with potential application for marker-assisted selection and the preparation of a set of genotype-specific markers.
Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs.
Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A V S K; Varshney, Rajeev K
Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly require working knowledge of the command line interface, massive computational resources and expertise, which makes them daunting for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in the standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for members of the plant genetics and breeding community with no computational expertise who want to discover SNPs and utilize them in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in the Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone
Zouk, Hana; Marchand, Luc; Li, Quan; Polychronakos, Constantin
The Thr allele at the Thr946Ala non-synonymous single-nucleotide polymorphism (nsSNP) in the IFIH1 gene confers risk for type 1 diabetes (T1D). IFIH1 binds viral double-stranded RNA (dsRNA), inducing a type I interferon (IFN) response. Reports of this nsSNP's role in IFIH1 expression regulation have produced conflicting results, and a study evaluating transfected Thr946Ala protein alleles in an artificial system overexpressing IFIH1 showed that the SNP does not affect IFIH1 function. In this study, we examine the effects of the Thr946Ala polymorphism on IFN-α response in a cell line that endogenously expresses physiological levels of IFIH1. Eleven lymphoblastoid cell lines (LCLs) homozygous for the major predisposing allele (Thr/Thr) and 6 LCLs homozygous for the minor protective allele (Ala/Ala) were electroporated with the viral dsRNA mimic, poly I:C, in three independent experiments. Media were collected 24 hours later and measured for IFN-α production by ELISA. Basal IFN response is minimal in mock-transfected cells from both genotypes and increases by about 8-fold in cells treated with poly I:C. LCLs with the Ala/Ala genotype have slightly higher IFN-α levels than their Thr/Thr counterparts, but this did not reach statistical significance because of the large variability of the IFN response, due mostly to two high outliers (biological, not technical). A larger sample size would be needed to determine whether the Thr946Ala SNP affects the poly I:C-driven IFN-α response. Additionally, the possibility that this nsSNP recognizes viral dsRNA specificities cannot be ruled out. Thus, the mechanism of the observed association of this SNP with T1D remains to be determined.
Hakovirta, Janetta R; Prezioso, Samantha; Hodge, David; Pillai, Segaran P; Weigel, Linda M
Analysis of 16S rRNA genes is important for phylogenetic classification of known and novel bacterial genera and species and for detection of uncultivable bacteria. PCR amplification of 16S rRNA genes with universal primers produces a mixture of amplicons from all rRNA operons in the genome, and the sequence data generally yield a consensus sequence. Here we describe valuable data that are missing from consensus sequences, variable effects on sequence data generated from nonidentical 16S rRNA amplicons, and the appearance of data displayed by different software programs. These effects are illustrated by analysis of 16S rRNA genes from 50 strains of the Bacillus cereus group, i.e., Bacillus anthracis, Bacillus cereus, Bacillus mycoides, and Bacillus thuringiensis. These species have 11 to 14 rRNA operons, and sequence variability occurs among the multiple 16S rRNA genes. A single nucleotide polymorphism (SNP) previously reported to be specific to B. anthracis was detected in some B. cereus strains. However, a different SNP, at position 1139, was identified as being specific to B. anthracis, which is a biothreat agent with high mortality rates. Compared with visual analysis of the electropherograms, basecaller software frequently missed gene sequence variations or could not identify variant bases due to overlapping basecalls. Accurate detection of 16S rRNA gene sequences that include intragenomic variations can improve discrimination among closely related species, improve the utility of 16S rRNA databases, and facilitate rapid bacterial identification by targeted DNA sequence analysis or by whole-genome sequencing performed by clinical or reference laboratories. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Cao, Chan; Long, Yi-Tao
Nanopore sensing is developing into a powerful single-molecule approach to investigate the features of biomolecules that are not accessible by studying ensemble systems. When a target molecule is transported through a nanopore, the ions occupying the pore are excluded, resulting in an electrical signal from the intermittent ionic blockade event. By statistical analysis of the amplitudes, duration, frequencies, and shapes of the blockade events, many properties of the target molecule can be obtained in real time at the single-molecule level, including its size, conformation, structure, charge, geometry, and interactions with other molecules. With the development of the use of α-hemolysin to characterize individual polynucleotides, nanopore technology has attracted a wide range of research interest in the fields of biology, physics, chemistry, and nanoscience. As a powerful single-molecule analytical method, nanopore technology has been applied for the detection of various biomolecules, including oligonucleotides, peptides, oligosaccharides, organic molecules, and disease-related proteins. In this Account, we highlight recent developments of biological nanopores in DNA-based sensing and in studying the conformational structures of DNA and RNA. Furthermore, we introduce the application of biological nanopores to investigate the conformations of peptides affected by charge, length, and dipole moment and to study disease-related proteins' structures and aggregation transitions influenced by an inhibitor, a promoter, or an applied voltage. To improve the sensing ability of biological nanopores and further extend their application to a wider range of molecular sensing, we focus on exploring novel biological nanopores, such as aerolysin and Stable Protein 1. Aerolysin exhibits an especially high sensitivity for the detection of single oligonucleotides both in current separation and duration. Finally, to facilitate the use of nanopore measurements and statistical analysis
Recent availability of large-scale genomic resources enables us to conduct so-called genome-wide association studies (GWAS) and genomic prediction (GP) studies, particularly with next-generation sequencing (NGS) data. The effectiveness of GWAS and GP depends not only on their mathematical models but also on the quality and quantity of the variants employed in the analysis. In NGS single nucleotide polymorphism (SNP) calling, conventional tools require more reads to achieve higher SNP sensitivity and accuracy. In this study, we aimed to develop a tool, Heap, that enables robustly sensitive and accurate calling of SNPs, particularly with low-coverage NGS data, which must be aligned to the reference genome sequences in advance. To reduce false positive SNPs, Heap determines genotypes and calls SNPs at each site, except for sites at both ends of reads or sites containing a minor allele supported by only one read. Performance comparison with existing tools showed that Heap achieved the highest F-scores with low-coverage (7X) restriction-site associated DNA sequencing reads of sorghum and rice individuals. This will facilitate cost-effective GWAS and GP studies in this NGS era. Code and documentation of Heap are freely available from https://github.com/meiji-bioinf/heap (29 March 2017, date last accessed) and from our web site (http://bioinf.mind.meiji.ac.jp/lab/en/tools.html; 29 March 2017, date last accessed).
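The two exclusion rules described for Heap can be illustrated with a minimal Python sketch. This is not Heap's actual code: the per-site data layout (tuples of base, position in read and read length), the function name `call_site` and the `end_margin` parameter are assumptions introduced here purely for illustration, and genotype determination itself is left out.

```python
from collections import Counter

def call_site(observations, ref_base, end_margin=1):
    """Return the non-reference evidence at one site, or None if the site is skipped.

    observations: list of (base, pos_in_read, read_length) tuples (hypothetical layout).
    end_margin: number of bases at each read end to ignore (assumed parameter).
    """
    # Rule 1: ignore bases that sit at the ends of reads.
    kept = [base for base, pos, length in observations
            if end_margin <= pos < length - end_margin]
    if not kept:
        return None
    counts = Counter(kept)
    # Rule 2: skip the site if the minor allele is supported by only one read.
    if len(counts) > 1 and min(counts.values()) < 2:
        return None
    alleles = set(counts)
    # An empty set means only the reference base was seen (no SNP).
    return set() if alleles == {ref_base} else alleles
```

A site with two reads supporting A and two supporting G against reference A would be reported as `{"A", "G"}`, while the same site with a single G read would be skipped.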
In this paper, we use replica analysis to investigate the influence of correlation among the return rates of assets on the solution of the portfolio optimization problem. We consider the behavior of an optimal solution for the case where the return rate is described with a single-factor model and compare the findings obtained from our proposed methods with correlated return rates with those obtained with independent return rates. We then analytically assess the increase in the investment risk when correlation is included. Furthermore, we also compare our approach with analytical procedures for minimizing the investment risk from operations research.
Optofluidics is a field with important applications in areas such as biotechnology, chemical synthesis and analytical chemistry. Optofluidic devices integrate optical elements into microfluidic devices in ways that increase the portability and sensitivity of analysis for diagnostic or screening purposes. In these devices, fluids give nanoscale photonic devices a fine adaptability, mobility and accessibility that could not otherwise be realized using conventional devices. This review describes several cases in which optical or microfluidic approaches are used to trap single cells in the proximity of an integrated optical sensor for analysis.
Santoso, Yusdi; Torella, Joseph P; Kapanidis, Achillefs N
Probability distribution analysis (PDA) is a recently developed statistical tool for predicting the shapes of single-molecule fluorescence resonance energy transfer (smFRET) histograms, which allows the identification of single or multiple static molecular species within a single histogram. We used a generalized PDA method to predict the shapes of FRET histograms for molecules interconverting dynamically between multiple states. This method is tested on a series of model systems, including both static DNA fragments and dynamic DNA hairpins. By fitting the shape of this expected distribution to experimental data, the timescale of hairpin conformational fluctuations can be recovered, in good agreement with earlier published results obtained using different techniques. This method is also applied to studying the conformational fluctuations in the unliganded Klenow fragment (KF) of Escherichia coli DNA polymerase I, which allows both confirmation of the consistency of a simple, two-state kinetic model with the observed smFRET distribution of unliganded KF and extraction of a millisecond fluctuation timescale, in good agreement with rates reported elsewhere. We expect this method to be useful in extracting rates from processes exhibiting dynamic FRET, and in hypothesis-testing models of conformational dynamics against experimental data.
Wang, Ying; Yao, Fujun; Kang, Xiao-feng
Nanopore technology, as the simplest and most inexpensive single-molecule tool, is being intensively developed. In nanopore stochastic sensing, KCl and NaCl have traditionally been employed as pore-filled electrolytes for recording the change of ion conductance in nanopores triggered by analyte translocation through the pore. However, some challenges limit its further advance. Here we used tetramethylammonium (TMA) chloride, instead of KCl, as a novel analysis system for nanopores. Some unique nanopore characteristics were observed: (1) The stability of the planar lipid bilayer for embedding the protein pores was elevated at least 6 times. (2) The TMA-Cl system could effectively reduce the noise of single-channel recording. (3) It was easy to control the insertion of protein pores into the lipid bilayer, and the formed single nanopore could last for a sufficiently long time. (4) TMA-Cl could be used as a DNA speed bump in the nanopore to slow DNA translocation speed. (5) The capacity of the nanopore capture of DNA (capture rate) increased significantly and simultaneously increased the translocation time of DNA in the pore. (6) The TMA-filled nanopore could discriminate between various polynucleotides.
Background In recent years, the high-throughput, non-invasive Raman spectrometry technique has matured as an effective approach to identification of individual cells by species, even in complex, mixed populations. Raman profiling is an appealing optical microscopic method to achieve this. To fully utilize Raman profiling for single-cell analysis, an extensive understanding of Raman spectra is necessary to answer questions such as which filtering methodologies are effective for pre-processing of Raman spectra, what strains can be distinguished by Raman spectra, and what features serve best as Raman-based biomarkers for single cells. Results In this work, we have proposed an approach called rDisc to discretize the original Raman spectrum into only a few (usually fewer than 20) representative peaks (Raman shifts). The approach has advantages in removing noise and condensing the original spectrum. In particular, effective signal processing procedures were designed to eliminate noise, utilising wavelet transform denoising, baseline correction, and signal normalization. In the discretizing process, representative peaks were selected to significantly decrease the Raman data size. More importantly, the selected peaks are chosen as suitable to serve as key biological markers to differentiate species and other cellular features. Additionally, the classification performance of discretized spectra was found to be comparable to that of the full spectrum, which has more than 1000 Raman shifts. Overall, the discretized spectrum needs about 5% of the storage space of a full spectrum and the processing speed is considerably faster. This makes rDisc clearly superior to other methods for single-cell classification.
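The idea of reducing a spectrum to a handful of representative peaks can be sketched in Python as follows. The abstract does not give rDisc's actual selection criteria, so this sketch assumes the simplest possible rule (keep the most intense local maxima) and omits the denoising, baseline correction and normalization steps; the function name and parameters are hypothetical.

```python
def discretize_spectrum(shifts, intensities, max_peaks=20):
    """Keep the max_peaks most intense local maxima of a spectrum.

    shifts, intensities: equal-length sequences (Raman shift, intensity).
    Returns a list of (shift, intensity) pairs sorted by shift.
    Illustrative only; not the published rDisc procedure.
    """
    peaks = []
    for i in range(1, len(intensities) - 1):
        # A local maximum rises from the left and does not rise to the right.
        if intensities[i] > intensities[i - 1] and intensities[i] >= intensities[i + 1]:
            peaks.append((intensities[i], shifts[i]))
    # Strongest peaks first, then keep only the top max_peaks.
    peaks.sort(reverse=True)
    top = peaks[:max_peaks]
    return sorted((shift, intensity) for intensity, shift in top)
```

Storing only these pairs rather than every measured shift is what makes the discretized representation so much smaller than the full spectrum.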
Milman, Nils; Nielsen, Ole Haagen; Hviid, Thomas Vauvert F
... with Crohn's disease. OBJECTIVES: To evaluate whether ethnic Danes with sarcoidosis have an increased frequency of CARD15 mutations compared to healthy control subjects. METHODS: Genotyping for CARD15 mutations R702W, G908R, and L1007fsinsC, also designated single nucleotide polymorphism (SNP) SNP8, SNP12 and SNP13, respectively, was performed by capillary electrophoresis single-strand conformation polymorphism in 53 patients with histologically verified sarcoidosis and in 103 healthy controls. RESULTS: The frequencies of CARD15 mutations in sarcoidosis patients were: SNP8, 4/106 chromosomes (3.8%); SNP12, 2/106 chromosomes (1.9%); SNP13, 2/106 chromosomes (1.9%); SNP8+SNP12+SNP13, 8/106 chromosomes (7.6%). All 8 patients were heterozygous. The frequencies in controls were: SNP8, 9/206 chromosomes (4.4%); SNP12, 2/206 chromosomes (1.0%); SNP13, 4/206 chromosomes (1.9%); SNP8+SNP12+SNP13, 15...
Daniilidis, N.; Gerber, S.; Bolloten, G.; Ramm, M.; Ransford, A.; Ulin-Avila, E.; Talukdar, I.; Häffner, H.
We use a single-ion electric-field noise sensor in combination with in situ surface treatment and analysis tools, to investigate the relationship between electric-field noise from metal surfaces in vacuum and the composition of the surface. These experiments are performed in a setup that integrates ion trapping capabilities with surface analysis tools. We find that treatment of an aluminum-copper surface with energetic argon ions significantly reduces the level of room-temperature electric-field noise, but the surface does not need to be atomically clean to show noise levels comparable to those of the best cryogenic traps. The noise levels after treatment are low enough to allow fault-tolerant trapped-ion quantum information processing on a microfabricated surface trap at room temperature.
Muhammad Ali Shah
This paper reports on the design of a single-structure triaxial MEMS capacitive gyroscope that is capable of measuring the three angular velocities with a single drive. A Z-shaped beam supporting the folded coupling spring has been applied to suppress the unwanted mode and decrease the stress effect at the spring ends. The unique coupling spring changes the driving motion, which reduces slide film damping in the driving mode. This reduction can lead to higher sensor performance with less stringent requirements on vacuum level, which decreases the cost of fabrication. Simulation analysis has been performed in COMSOL Multiphysics and Matlab Simulink to finalize the design for fabrication. Finite element analysis gives driving, x-sensing, z-sensing, and y-sensing mode frequencies of 13.30 kHz, 13.40 kHz, 13.47 kHz, and 13.51 kHz, respectively.
Background The local environment of single nucleotide polymorphisms (SNPs) contains abundant genetic information for the study of mechanisms of mutation, genome evolution, and causes of diseases. Recent studies revealed that neighboring-nucleotide biases on SNPs were strong and that the genome-wide bias patterns could be represented by a small subset of the total SNPs. Estimating the effective SNP size, that is, the number of SNPs sufficient to represent the bias patterns observed in the whole SNP data, remains unsolved. Results To estimate the effective SNP size, we developed a novel statistical method, SNPKS, which considers both statistical and biological significance. SNPKS consists of two major steps: obtaining an initial effective size by the Kolmogorov-Smirnov test (KS test) and finding an intermediate effective size by interval evaluation. The SNPKS algorithm was implemented in computer programs and applied to real SNP data. The effective SNP size was estimated to be 38,200, 39,300, 38,000, and 38,700 in the human, chimpanzee, dog, and mouse genomes, respectively, and 39,100, 39,600, 39,200, and 42,200 in human intergenic, genic, intronic, and CpG island regions, respectively. Conclusion SNPKS is the first statistical method to estimate the effective SNP size. It runs efficiently and greatly outperforms the algorithm implemented in SNPNB. The application of SNPKS to real SNP data revealed similarly small effective SNP sizes (38,000 to 42,200) in the human, chimpanzee, dog, and mouse genomes as well as in human genomic regions. The findings suggest a strong influence of genetic factors across vertebrate genomes.
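The Kolmogorov-Smirnov test named in the abstract above compares two empirical distributions through the largest gap between their cumulative distribution functions. The Python sketch below computes only that two-sample statistic; how SNPKS chooses the distributions to compare, sets thresholds, or performs the interval evaluation is not described in the abstract, so none of that is modeled here, and the function name is hypothetical.

```python
from bisect import bisect_right

def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic D = max |F_a(x) - F_b(x)|.

    Generic illustration of the KS statistic, not the SNPKS implementation.
    """
    a = sorted(sample_a)
    b = sorted(sample_b)
    d = 0.0
    # The empirical CDFs only change at observed values, so checking
    # every observed value is enough to find the maximum gap.
    for x in set(a) | set(b):
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d
```

One plausible use, stated here only as an assumption, would be to grow a random SNP subsample until its statistic against the full data set falls below a chosen threshold.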
Zhang, Xiuqing; Xu, Zhangyang; Pei, Haisheng; Chen, Zhou; Tan, Xiaoyan; Hu, Jing; Yang, Bin; Sun, Junshe
Ganoderma lucidum is a typical polypore fungus used for traditional Chinese medical purposes. The taxonomic delimitation of Ganoderma lucidum is still debated. In this study, we sequenced seven internal transcribed spacer (ITS) sequences of Ganoderma lucidum strains and annotated the ITS1 and ITS2 regions. Phylogenetic analysis of ITS1 differentiated the strains into three geographic groups. Groups 1-3 originated from Europe, tropical Asia, and eastern Asia, respectively. ITS2, in contrast, could only differentiate the strains into two groups, in which Group 2, originating from tropical Asia, gathered with Groups 1 and 3, originating from Europe and eastern Asia. Determination of the secondary structures of the ITS1 sequences showed that these three groups exhibited similar structures, with a conserved central core and differing helices. Compared with Group 2, Groups 1 and 3 of the ITS2 sequences shared similar structures, with a difference in helix 4. Large-scale evaluation of ITS1 and ITS2 both showed that the majority of subgroups in the same group shared similar structures. Further Weblogo analysis of ITS1 sequences revealed two main variable regions located in helix 2, in which C/T or A/G substitutions frequently occurred, and ITS1 exhibited more nucleotide variance than ITS2. ITS1 multi-alignment of seven spawn strains and culture tests indicated that a single-nucleotide polymorphism (SNP) site at position 180 correlated with strain antagonism. The HZ, TK and 203 fusion strains of Ganoderma lucidum had a T at position 180, whereas other strains exhibiting antagonism, including DB, RB, JQ, and YS, had a C. Taken together, compared with the ITS2 region, the ITS1 region could differentiate Ganoderma lucidum into three geographic origins based on phylogenetic analysis and secondary structure prediction. In addition, a SNP in ITS1 could delineate Ganoderma lucidum strains at the intraspecific level. These findings will be implemented to improve species quality control in the
Mikkelsen, Martin; Rockenbauer, Eszter; Sørensen, Erik
Mitochondrial DNA (mtDNA) is maternally inherited without recombination events and has a high copy number, which makes mtDNA analysis feasible even when genomic DNA is sparse or degraded. Here, we present a SNP typing assay with 33 previously described mtDNA coding region SNPs for haplogroup...
Prakash, Siddharth; Guo, Dongchuan; Maslen, Cheryl L; Silberbach, Michael; Milewicz, Dianna; Bondy, Carolyn A
Turner syndrome is a developmental disorder caused by partial or complete monosomy for the X chromosome in 1 in 2,500 females. We hypothesized that single-nucleotide polymorphism (SNP) array genotyping could provide superior resolution in comparison to metaphase karyotype analysis to facilitate genotype-phenotype correlations. We genotyped 187 Turner syndrome patients with 733,000 SNP marker arrays. All cases met diagnostic criteria for Turner syndrome based on karyotypes (60%) or characteristic physical features. The SNP array results confirmed the diagnosis of Turner syndrome in 100% of cases. We identified a single X chromosome (45,X) in 113 cases. In 58 additional cases (31%), other mosaic cell lines were present, including isochromosomes (16%), rings (5%), and Xp deletions (8%). The remaining cases were mosaic for monosomy X and normal male or female cell lines. Array-based models of X chromosome structure were compatible with karyotypes in 104 of 116 comparable cases (90%). We found that the SNP array data did not detect X-autosome translocations (three cases) but did identify two derivative Y chromosomes and 13 large copy-number variants that were not detected by karyotyping. Our study is the first systematic comparison between the two methods and supports the utility of SNP array genotyping to address clinical and research questions in Turner syndrome.
Bourret, Vincent; Kent, Matthew P; Primmer, Craig R; Vasemägi, Anti; Karlsson, Sten; Hindar, Kjetil; McGinnity, Philip; Verspoor, Eric; Bernatchez, Louis; Lien, Sigbjørn
Atlantic salmon (Salmo salar) is one of the most extensively studied fish species in the world due to its significance in aquaculture, fisheries and ongoing conservation efforts to protect declining populations. Yet, limited genomic resources have hampered our understanding of genetic architecture in the species and the genetic basis of adaptation to the wide range of natural and artificial environments it occupies. In this study, we describe the development of a medium-density Atlantic salmon single nucleotide polymorphism (SNP) array based on expressed sequence tags (ESTs) and genomic sequencing. The array was used in the most extensive assessment of population genetic structure performed to date in this species. A total of 6176 informative SNPs were successfully genotyped in 38 anadromous and freshwater wild populations distributed across the species natural range. Principal component analysis clearly differentiated European and North American populations, and within Europe, three major regional genetic groups were identified for the first time in a single analysis. We assessed the potential for the array to disentangle neutral and putative adaptive divergence of SNP allele frequencies across populations and among regional groups. In Europe, secondary contact zones were identified between major clusters where endogenous and exogenous barriers could be associated, rendering the interpretation of environmental influence on potentially adaptive divergence equivocal. A small number of markers highly divergent in allele frequencies (outliers) were observed between (multiple) freshwater and anadromous populations, between northern and southern latitudes, and when comparing Baltic populations to all others. We also discuss the potential future applications of the SNP array for conservation, management and aquaculture. © 2012 Blackwell Publishing Ltd.
Light scattering by single spherical homogeneous particles in the diameter range 1 to 20 μm and relative refractive index 1.20 is measured. Particle size of narrowly dispersed populations is determined and a multi-modal dispersion of five components is completely analyzed. A 360° light scattering photometer for analysis of single particles has been designed and developed. A fluid stream containing single particles intersects a focused laser beam at the primary focal point of an ellipsoidal reflector ring. The light scattered at angles θ = 2.5° to 177.5° at φ = 0° and 180° is reflected onto a circular array of photodiodes. The ellipsoidal reflector is situated in a chamber filled with fluid matching that of the stream to minimize refracting and reflecting interfaces. The detector array consists of 60 photodiodes, each subtending 3° in scattering angle on 6° centers around 360°. 32 measurements on individual particles can be acquired at rates of 500 particles per second. The intensity and angular distribution of light scattered by spherical particles are indicative of size and relative refractive index. Calculations, using Lorenz-Mie theory, of differential scattering patterns integrated over angle corresponding to the detector geometry determined the instrument response to particle size. From this the expected resolution and experimental procedures are determined. Ultimately, the photometer will be utilized for identification and discrimination of biological cells based on the sensitivity of light scattering to size, shape, refractive index differences, internal granularity, and other internal morphology. This study has demonstrated the utility of the photometer and indicates potential for application to light scattering studies of biological cells.
le Gac, Severine; de Boer, Hans L.; Wijnperle, Daniël; Meuleman, W.; Carlen, Edwin; van den Berg, Albert; Kim, Tae Song; Lee, Yoon-Sik; Chung, Taek-Dong; Jeon, Noo Li; Lee, Sang-Hoon; Suh, Kahp-Yang; Choo, Jaebum; Kim, Yong-Kweon
We report here a novel and easily scalable microfluidic platform for the parallel analysis of hundreds of individual cells, with controlled single cell trapping, followed by their lysis and subsequent retrieval of the cellular content for on-chip analysis. The device consists of a main channel and
Mortonson, Michael J.; Peiris, Hiranya V.; Easther, Richard
Future astrophysical data sets promise to strengthen constraints on models of inflation, and extracting these constraints requires methods and tools commensurate with the quality of the data. In this paper we describe ModeCode, a new, publicly available code that computes the primordial scalar and tensor power spectra for single-field inflationary models. ModeCode solves the inflationary mode equations numerically, avoiding the slow roll approximation. It is interfaced with CAMB and CosmoMC to compute cosmic microwave background angular power spectra and perform likelihood analysis and parameter estimation. ModeCode is easily extendable to additional models of inflation, and future updates will include Bayesian model comparison. Errors from ModeCode contribute negligibly to the error budget for analyses of data from Planck or other next generation experiments. We constrain representative single-field models (φ^n with n = 2/3, 1, 2, and 4, natural inflation, and "hilltop" inflation) using current data, and provide forecasts for Planck. From current data, we obtain weak but nontrivial limits on the post-inflationary physics, which is a significant source of uncertainty in the predictions of inflationary models, while we find that Planck will dramatically improve these constraints. In particular, Planck will link the inflationary dynamics with the post-inflationary growth of the horizon, and thus begin to probe the "primordial dark ages" between TeV and grand unified theory scale energies.
Ludwig, Frank [Institut für Elektrische Messtechnik und Grundlagen der Elektrotechnik, TU Braunschweig, Braunschweig (Germany)]; Balceris, Christoph; Viereck, Thilo [Institut für Elektrische Messtechnik und Grundlagen der Elektrotechnik, TU Braunschweig, Braunschweig (Germany)]; Posth, Oliver; Steinhoff, Uwe [Physikalisch-Technische Bundesanstalt, Berlin (Germany)]; Gavilan, Helena; Costo, Rocio [Instituto de Ciencia de Materiales de Madrid, ICMM/CSIC, Madrid (Spain)]; Zeng, Lunjie; Olsson, Eva [Department of Applied Physics, Chalmers University of Technology, Göteborg (Sweden)]; Jonasson, Christian; Johansson, Christer [ACREO Swedish ICT AB, Göteborg (Sweden)]
Single-core iron-oxide nanoparticles with nominal core diameters of 14 nm and 19 nm were analyzed with a variety of non-magnetic and magnetic analysis techniques, including transmission electron microscopy (TEM), dynamic light scattering (DLS), static magnetization vs. magnetic field (M-H) measurements, ac susceptibility (ACS) and magnetorelaxometry (MRX). From the experimental data, distributions of core and hydrodynamic sizes are derived. Except for TEM, where a number-weighted distribution is directly obtained, models have to be applied in order to determine size distributions from the measurand. It was found that the mean core diameters determined from TEM, M-H, ACS and MRX measurements agree well although they are based on different models (Langevin function, Brownian and Néel relaxation times). Especially for the sample with large cores, particle interaction effects come into play, causing agglomerates which were detected in DLS, ACS and MRX measurements. We observed that the number and size of agglomerates can be minimized by sufficiently strong dilution of the suspension. - Highlights: • Investigation of size parameters of single-core magnetic nanoparticles with nominal core diameters of 14 nm and 19 nm utilizing different magnetic and non-magnetic methods • Hydrodynamic size determined from ac susceptibility measurements is consistent with the DLS findings • Core size determined from static magnetization curves, MRX and ACS data agrees with results from TEM although the estimation is based on different models (Langevin function, Brownian and Néel relaxation times).
Comparison of semi-automated commercial rep-PCR fingerprinting, spoligotyping, 12-locus MIRU-VNTR typing and single nucleotide polymorphism analysis of the embB gene as molecular typing tools for Mycobacterium bovis.
Armas, Federica; Camperio, Cristina; Coltella, Luana; Selvaggini, Serena; Boniotti, Maria Beatrice; Pacciarini, Maria Lodovica; Di Marco Lo Presti, Vincenzo; Marianelli, Cinzia
Highly discriminatory genotyping strategies are essential in molecular epidemiological studies of tuberculosis. In this study we evaluated, for the first time, the efficacy of the repetitive sequence-based PCR (rep-PCR) DiversiLab Mycobacterium typing kit over spoligotyping, 12-locus mycobacterial interspersed repetitive unit-variable number tandem repeat (MIRU-VNTR) typing and embB single nucleotide polymorphism (SNP) analysis for Mycobacterium bovis typing. A total of 49 M. bovis animal isolates were used. DNA was extracted and genomic DNA was amplified using the DiversiLab Mycobacterium typing kit. The amplified fragments were separated and detected using a microfluidics chip with Agilent 2100. The resulting rep-PCR-based DNA fingerprints were uploaded to and analysed using web-based DiversiLab software through Pearson's correlation coefficient. Rep-PCR DiversiLab grouped M. bovis isolates into ten different clusters. Most isolates sharing identical spoligotype, MIRU-VNTR profile or embB gene polymorphism were grouped into different rep-PCR clusters. Rep-PCR DiversiLab displayed greater discriminatory power than spoligotyping and embB SNP analysis but a lower resolution power than the 12-locus MIRU-VNTR analysis. MIRU-VNTR confirmed that it is superior to the other PCR-based methods tested here. In combination with spoligotyping and 12-locus MIRU-VNTR analysis, rep-PCR improved the discriminatory power for M. bovis typing.
King, S.F.; Nimai Singh, N.
We perform a renormalisation group (RG) analysis of neutrino masses and mixing angles in the see-saw mechanism in the minimal supersymmetric standard model with three right-handed neutrinos, including the effects of the heavy neutrino thresholds. We focus on the case that one of the right-handed neutrinos provides the dominant contribution to the 23 block of the light Majorana matrix, causing its determinant to approximately vanish and giving an automatic neutrino mass hierarchy, so-called single right-handed neutrino dominance, which may arise from a U(1) family symmetry. In these models radiative corrections can increase atmospheric and solar neutrino mixing by up to about 10% and 5%, respectively, and may help to achieve bi-maximal mixing. Significantly, we find that the radiative corrections over the heavy neutrino threshold region are at least as important as those usually considered from the lightest right-handed neutrino down to low energies.
Hui, S; Zak, S H
The problem of robust stability and convergence of learning parameters of adaptation algorithms in a noisy environment for the single perceptron is addressed. The case in which the same input pattern is presented in the adaptation cycle is analyzed. The algorithm proposed is of the Widrow-Hoff type. It is concluded that this algorithm is robust. However, the weight vectors do not necessarily converge in the presence of measurement noise. A modified version of this algorithm in which the reduction factors are allowed to vary with time is proposed, and it is shown that this algorithm is robust and that the weight vectors converge in the presence of bounded noise. Only deterministic-type arguments are used in the analysis. An ultimate bound on the error in terms of a convex combination of the initial error and the bound on the noise is obtained.
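The adaptation scheme summarized above can be illustrated with a minimal sketch. It is not the authors' algorithm: the function name `widrow_hoff`, the normalised form of the update, and the decay schedule `mu0 / (k + 1)` are assumptions chosen only to show a Widrow-Hoff type rule with a constant versus a time-varying reduction factor when one input pattern is presented repeatedly under bounded noise.

```python
import random


def widrow_hoff(x, target, w0, steps, noise_bound, mu0=0.5, decay=True, seed=0):
    """Repeatedly present one input pattern x and adapt the weight vector.

    mu is the reduction (step-size) factor. With decay=True it shrinks as
    mu0 / (k + 1), a hypothetical schedule used purely for illustration.
    Returns the final weights and the list of weights after every step.
    """
    rng = random.Random(seed)
    w = list(w0)
    norm_sq = sum(xi * xi for xi in x)
    history = []
    for k in range(steps):
        mu = mu0 / (k + 1) if decay else mu0
        # Bounded measurement noise added to the desired output.
        noise = rng.uniform(-noise_bound, noise_bound)
        error = (target + noise) - sum(wi * xi for wi, xi in zip(w, x))
        # Normalised Widrow-Hoff (LMS) update along the input direction.
        w = [wi + mu * error * xi / norm_sq for wi, xi in zip(w, x)]
        history.append(list(w))
    return w, history


if __name__ == "__main__":
    x, target = [1.0, 2.0], 3.0
    for decay in (False, True):
        w, _ = widrow_hoff(x, target, [0.0, 0.0], 200, 0.1, decay=decay)
        output = sum(wi * xi for wi, xi in zip(w, x))
        print("decay" if decay else "constant", round(output, 4))
```

In this sketch, for a reduction factor between 0 and 1 the output error after each step is a convex combination of the previous error and the current noise sample, so it stays bounded. With a constant factor the weights keep moving with the noise, whereas a shrinking factor makes successive updates progressively smaller.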
Schulz, Kenneth R; Danna, Erika A; Krutzik, Peter O; Nolan, Garry P
This protocol describes methods for monitoring intracellular phosphorylation-dependent signaling events on a single-cell basis. This approach measures cell signaling by treating cells with exogenous stimuli, fixing cells with formaldehyde, permeabilizing with methanol, and then staining with phospho-specific antibodies. Thus, cell signaling states can be determined as a measure of how cells interact with their environment. This method has applications in clinical research as well as mechanistic studies of basic biology. In clinical research, diagnostic or drug efficacy information can be retrieved by discovering how a disease affects the ability of cells to respond to growth factors. Basic scientists can use this technique to analyze signaling events in cell lines and human or murine primary cells, including rare populations, like B1 cells or stem cells. This technique has broad applications bringing standard biochemical analysis into primary cells in order to garner valuable information about signaling events in physiologic settings. © 2012 by John Wiley & Sons, Inc.
Goyal, Gaurav; Mulero, Rafael; Ali, Jamel; Darvish, Armin; Kim, Min Jun
This paper describes microparticle and bacterial translocation studies using low aspect ratio solid-state micropores. Micropores, 5 μm in diameter, were fabricated in 200 nm thick free-standing silicon nitride membranes, resulting in pores with an extremely low aspect ratio, nominally 0.04. For microparticle translocation experiments, sulfonated polystyrene microparticles and magnetic microbeads in size range of 1-4 μm were used. Using the microparticle translocation characteristics, we find that particle translocations result in a change only in the pore's geometrical resistance while the access resistance remains constant. Furthermore, we demonstrate the ability of our micropore to probe high-resolution shape information of translocating analytes using concatenated magnetic microspheres. Distinct current drop peaks were observed for each microsphere of the multibead architecture. For bacterial translocation experiments, nonflagellated Escherichia coli (strain HCB 5) and wild type flagellated Salmonella typhimurium (strain SJW1103) were used. Distinct current signatures for the two bacteria were obtained and this difference in translocation behavior was attributed to different surface protein distributions on the bacteria. Our findings may help in developing low aspect ratio pores for high-resolution microparticle characterization and single-cell analysis. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
hormone receptor PV pemphigus vulgaris 3 Abstract: Purpose: A single nucleotide polymorphism (SNP) in PERP (rs2484067, G>A) has been...assembly and maintenance of epithelial integrity 6,7. PERP is also implicated in the pathogenesis of pemphigus vulgaris, an autoimmune disease with...Beaudry VG, et al: Loss of the desmosomal protein perp enhances the phenotypic effects of pemphigus vulgaris autoantibodies. J Invest Dermatol 129:1710-8
Clinical significance of previously cryptic copy number alterations and loss of heterozygosity in pediatric acute myeloid leukemia and myelodysplastic syndrome determined using combined array comparative genomic hybridization plus single-nucleotide polymorphism microarray analyses.
Koh, Kyung-Nam; Lee, Jin Ok; Seo, Eul Ju; Lee, Seong Wook; Suh, Jin Kyung; Im, Ho Joon; Seo, Jong Jin
The combined array comparative genomic hybridization plus single-nucleotide polymorphism microarray (CGH+SNP microarray) platform can simultaneously detect copy number alterations (CNA) and copy-neutral loss of heterozygosity (LOH). Eighteen children with acute myeloid leukemia (AML) (n=15) or myelodysplastic syndrome (MDS) (n=3) were studied using CGH+SNP microarray to evaluate the clinical significance of submicroscopic chromosomal aberrations. CGH+SNP microarray revealed CNAs at 14 regions in 9 patients, while metaphase cytogenetic (MC) analysis detected CNAs in 11 regions in 8 patients. Using CGH+SNP microarray, LOHs>10 Mb involving terminal regions or the whole chromosome were detected in 3 of 18 patients (17%). CGH+SNP microarray revealed cryptic LOHs with or without CNAs in 3 of 5 patients with normal karyotypes. CGH+SNP microarray detected additional cryptic CNAs (n=2) and LOHs (n=5) in 6 of 13 patients with abnormal MC. In total, 9 patients demonstrated additional aberrations, including CNAs (n=3) and/or LOHs (n=8). Three of 15 patients with AML and terminal LOH>10 Mb demonstrated a significantly inferior relapse-free survival rate (P=0.041). This study demonstrates that CGH+SNP microarray can simultaneously detect previously cryptic CNAs and LOH, which may demonstrate prognostic implications.
Wolter, Cynthia A.; Gore, Brian F.
The current day flight deck operational environment consists of a two-person Captain/First Officer crew. A concept of operations (ConOps) to reduce the commercial cockpit to a single pilot from the current two pilot crew is termed Single Pilot Operations (SPO). This concept has been under study by researchers in the Flight Deck Display Research Laboratory (FDDRL) at the National Aeronautics and Space Administration's (NASA) Ames (Johnson, Comerford, Lachter, Battiste, Feary, and Mogford, 2012) and researchers from Langley Research Centers (Schutte et al., 2007). Transitioning from a two pilot crew to a single pilot crew will undoubtedly require changes in operational procedures, crew coordination, use of automation, and in how the roles and responsibilities of the flight deck and ATC are conceptualized in order to maintain the high levels of safety expected of the US National Airspace System. These modifications will affect the roles and the subsequent tasks that are required of the various operators in the NextGen environment. The current report outlines the process taken to identify and document the tasks required by the crew according to a number of operational scenarios studied by the FDDRL between the years 2012-2014. A baseline task decomposition has been refined to represent the tasks consistent with a new set of entities, tasks, roles, and responsibilities being explored by the FDDRL as the move is made towards SPO. Information from Subject Matter Expert interviews, participation in FDDRL experimental design meetings, and study observation was used to populate and refine task sets that were developed as part of the SPO task analyses. The task analysis is based upon the proposed ConOps for the third FDDRL SPO study. This experiment possessed nine different entities operating in six scenarios using a variety of SPO-related automation and procedural activities required to guide safe and efficient aircraft operations. The task analysis presents the roles and
How to Use SNP_TATA_Comparator to Find a Significant Change in Gene Expression Caused by the Regulatory SNP of This Gene's Promoter via a Change in Affinity of the TATA-Binding Protein for This Promoter.
Ponomarenko, Mikhail; Rasskazov, Dmitry; Arkova, Olga; Ponomarenko, Petr; Suslov, Valentin; Savinkova, Ludmila; Kolchanov, Nikolay
The use of biomedical SNP markers of diseases can improve effectiveness of treatment. Genotyping of patients with subsequent searching for SNPs more frequent than in norm is the only commonly accepted method for identification of SNP markers within the framework of translational research. The bioinformatics applications aimed at millions of unannotated SNPs of the "1000 Genomes" can make this search for SNP markers more focused and less expensive. We used our Web service involving Fisher's Z-score for candidate SNP markers to find a significant change in a gene's expression. Here we analyzed the change caused by SNPs in the gene's promoter via a change in affinity of the TATA-binding protein for this promoter. We provide examples and discuss how to use this bioinformatics application in the course of practical analysis of unannotated SNPs from the "1000 Genomes" project. Using known biomedical SNP markers, we identified 17 novel candidate SNP markers nearby: rs549858786 (rheumatoid arthritis); rs72661131 (cardiovascular events in rheumatoid arthritis); rs562962093 (stroke); rs563558831 (cyclophosphamide bioactivation); rs55878706 (malaria resistance, leukopenia), rs572527200 (asthma, systemic sclerosis, and psoriasis), rs371045754 (hemophilia B), rs587745372 (cardiovascular events); rs372329931, rs200209906, rs367732974, and rs549591993 (all four: cancer); rs17231520 and rs569033466 (both: atherosclerosis); rs63750953, rs281864525, and rs34166473 (all three: malaria resistance, thalassemia).
Ponomarenko, Mikhail P.; Arkova, Olga; Rasskazov, Dmitry; Ponomarenko, Petr; Savinkova, Ludmila; Kolchanov, Nikolay
Some variations of human genome [for example, single nucleotide polymorphisms (SNPs)] are markers of hereditary diseases and drug responses. Analysis of them can help to improve treatment. Computer-based analysis of millions of SNPs in the 1000 Genomes project makes a search for SNP markers more targeted. Here, we combined two computer-based approaches: DNA sequence analysis and keyword search in databases. In the binding sites for TATA-binding protein (TBP) in human gene promoters, we found candidate SNP markers of gender-biased autoimmune diseases, including rs1143627 [cachexia in rheumatoid arthritis (double prevalence among women)]; rs11557611 [demyelinating diseases (thrice more prevalent among young white women than among non-white individuals)]; rs17231520 and rs569033466 [both: atherosclerosis comorbid with related diseases (double prevalence among women)]; rs563763767 [Hughes syndrome-related thrombosis (lethal during pregnancy)]; rs2814778 [autoimmune diseases (excluding multiple sclerosis and rheumatoid arthritis) underlying hypergammaglobulinemia in women]; rs72661131 and rs562962093 (both: preterm delivery in pregnant diabetic women); and rs35518301, rs34166473, rs34500389, rs33981098, rs33980857, rs397509430, rs34598529, rs33931746, rs281864525, and rs63750953 (all: autoimmune diseases underlying hypergammaglobulinemia in women). Validation of these predicted candidate SNP markers using the clinical standards may advance personalized medicine. PMID:27092142
Børsting, Claus; Pereira, Vania; Andersen, Jeppe Dyrberg
Single nucleotide polymorphisms (SNPs) are the most frequent DNA sequence variations in the genome. They have been studied extensively in the last decade with various purposes in mind. In this chapter, we will discuss the advantages and disadvantages of using SNPs for human identification and briefly describe the methods that are preferred for SNP typing in forensic genetics. In addition, we will illustrate how SNPs can be used as investigative leads in the police investigation by discussing the use of ancestry informative markers and forensic DNA phenotyping. Modern DNA sequencing technologies (also called next generation sequencing or NGS) have the potential to completely transform forensic genetic investigations as we know them today. Here, we will make a short introduction to NGS and explain how NGS may combine analysis of the traditional forensic genetic markers with analysis...
Background: Single nucleotide polymorphisms (SNPs) are the most common source of genetic variation in eukaryotic species and have become an important marker for genetic studies. The mosquito Anopheles funestus is one of the major malaria vectors in Africa and yet, prior to this study, no SNPs have been described for this species. Here we report a genome-wide set of SNP markers for use in genetic studies on this important human disease vector. Results: DNA fragments from 50 genes were amplified and sequenced from 21 specimens of An. funestus. A third of specimens were field collected in Malawi, a third from a colony of Mozambican origin and a third from a colony of Angolan origin. A total of 494 SNPs including 303 within the coding regions of genes and 5 indels were identified. The physical positions of these SNPs in the genome are known. There were on average 7 SNPs per kilobase, similar to that observed in An. gambiae and Drosophila melanogaster. Transitions outnumbered transversions, at a ratio of 2:1. The increased frequency of transition substitutions in coding regions is likely due to the structure of the genetic code and selective constraints. Synonymous sites within coding regions showed a higher polymorphism rate than non-coding introns or 3' and 5' flanking DNA, with most of the substitutions in coding regions being observed at the 3rd codon position. A positive correlation in the level of polymorphism was observed between coding and non-coding regions within a gene. By genotyping a subset of 30 SNPs, we confirmed the validity of the SNPs identified during this study. Conclusion: This set of SNP markers represents a useful tool for genetic studies in An. funestus, and will be useful in identifying candidate genes that affect diverse ranges of phenotypes that impact on vector control, such as insecticide resistance, mosquito behavior and vector competence.
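The transition-to-transversion ratio reported above can be computed with a short sketch such as the following. The function names and the toy SNP list are illustrative assumptions and are not taken from the study; only the standard definition is used, namely that a transition swaps two purines (A, G) or two pyrimidines (C, T), and every other substitution is a transversion.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def classify_snp(ref, alt):
    """Return 'transition' or 'transversion' for a biallelic substitution."""
    ref, alt = ref.upper(), alt.upper()
    if ref not in BASES or alt not in BASES or ref == alt:
        raise ValueError(f"not a valid substitution: {ref}>{alt}")
    pair = {ref, alt}
    # Same chemical class on both sides means a transition.
    if pair <= PURINES or pair <= PYRIMIDINES:
        return "transition"
    return "transversion"


def ts_tv_ratio(snps):
    """Ratio of transitions to transversions over (ref, alt) pairs."""
    counts = {"transition": 0, "transversion": 0}
    for ref, alt in snps:
        counts[classify_snp(ref, alt)] += 1
    if counts["transversion"] == 0:
        raise ValueError("no transversions; ratio is undefined")
    return counts["transition"] / counts["transversion"]


if __name__ == "__main__":
    # Hypothetical toy data: four transitions and two transversions.
    toy = [("A", "G"), ("C", "T"), ("G", "A"), ("T", "C"), ("A", "C"), ("G", "T")]
    print(ts_tv_ratio(toy))
```

Because there are four possible transitions and eight possible transversions, purely random substitution would give a ratio near 0.5, which is why an observed ratio of about 2:1 is usually read as a bias toward transitions.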
salmon (Hayes et al. 2007). However, in A. japonicus, only 13 SNPs have thus far been developed and characterized (Sun et al. 2010). Here we report 15 polymorphic SNPs to enrich the list of currently available genetic markers of A. japonicus. Materials and methods.
DNA extraction and quantification. Leaf samples ... to this protocol, 24 µl of water (molecular biology grade), 1 µl of ... DNA once quantified was stored in the freezer and dilutions made accordingly when needed. The SNP assay. SNP assays were performed by the 5' nuclease (TaqMan) assay (Holland et al., 1991; Livak ...
Jan 3, 2012 ... al., 2009; HUGO et al., 2009). This is also the first SNP assay to detect individual ethnic origin in East Asia using Sequenom MassARRAY SNP typing technology. It offered a molecular tool to help us in forensic investigations and anthropology studies. Forensic scientists have developed a series of STR and.
Schrangl, Lukas; Göhring, Janett; Schütz, Gerhard J.
Single molecule Förster resonance energy transfer (smFRET) is a popular tool to study biological systems that undergo topological transitions on the nanometer scale. smFRET experiments typically require recording of long smFRET trajectories and subsequent statistical analysis to extract parameters such as the states' lifetimes. Alternatively, analysis of probability distributions exploits the shapes of smFRET distributions at well chosen exposure times and hence works without the acquisition of time traces. Here, we describe a variant that utilizes statistical tests to compare experimental datasets with Monte Carlo simulations. For a given model, parameters are varied to cover the full realistic parameter space. As output, the method yields p-values which quantify the likelihood for each parameter setting to be consistent with the experimental data. The method provides suitable results even if the actual lifetimes differ by an order of magnitude. We also demonstrated the robustness of the method to inaccurately determined input parameters. As proof of concept, the new method was applied to the determination of transition rate constants for Holliday junctions.
Ben Ali, Sina-Elisabeth; Madi, Zita Erika; Hochegger, Rupert; Quist, David; Prewein, Bernhard; Haslberger, Alexander G.; Brandes, Christian
Genetic mutations must be avoided during the production and use of seeds. In the European Union (EU), Directive 2001/18/EC requires any DNA construct introduced via transformation to be stable. Establishing genetic stability is critical for the approval of genetically modified organisms (GMOs). In this study, genetic stability of two GMOs was examined using high resolution melting (HRM) analysis and real-time polymerase chain reaction (PCR) employing Scorpion primers for amplification. The genetic variability of the transgenic insert and that of the flanking regions in a single oilseed rape variety (GT73) and a stacked maize (MON88017 × MON810) was studied. The GT73 and the 5' region of MON810 showed no instabilities in the examined regions. However, two out of 100 analyzed samples carried a heterozygous point mutation in the 3' region of MON810 in the stacked variety. These results were verified by direct sequencing of the amplified PCR products as well as by sequencing of cloned PCR fragments. The occurrence of the mutation suggests that the 5' region is more suitable than the 3' region for the quantification of MON810. The identification of the single nucleotide polymorphism (SNP) in a stacked event is in contrast to the results of earlier studies of the same MON810 region in a single event where no DNA polymorphism was found. PMID:25365178
Hassany, Mohamed; Gamal, Amany; Zaki, Nashwa; Eysa, Basem
Pegylated interferon (PegIFN) is used in the treatment of chronic hepatitis C virus (HCV) patients especially in resource limited countries. Treatment with PegIFN stimulates the expression of a number of host genes encoding enzymes with antiviral activities, including myxovirus resistance gene-A (MxA gene). MxA gene was found to have a single nucleotide polymorphism (SNP) at position -88 in the promoter region that affects the expression of MxA gene protein and was suggested to affect the treatment outcome. The aim of the work was to assess the relation between the SNP in the MxA gene and its impact on treatment of chronic HCV patients with PegIFN and ribavirin. We therefore genotyped the biallelic G/T SNP in the promoter region of MxA gene at position -88 from the transcription start site by restriction fragment length polymorphism (RFLP) in 70 chronic HCV genotype 4 interferon naive Egyptians and 40 healthy controls. G allele was the prevalent one in both HCV patients group (105, 74.5%) and control group (66, 82.5%), while T allele was less expressed in patients group (36, 25.5%) and control group (14, 17.5%). There is no correlation between genotypes and response to IFN-alpha therapy: GG (OR: 0.958, 95% CI: 0.541 - 1.698, P = 0.884), GT (OR: 0.667, 95% CI: 0.188 - 2.362, P = 0.530), and TT (OR: 0.300, 95% CI: 0.083 - 1.090, P = 0.067). MxA nt-88 SNP did not affect the sustained virological response (SVR) rates after PegIFN and ribavirin combined treatment and did not act as a biological marker to potentially identify responders and non-responders to treatment. Our results call for additional large studies and/or meta-analysis of all currently available data to examine the role of MxA nt-88 SNP in predicting response to PegIFN and ribavirin in patients with IFN-alpha naive HCV genotype 4.
Saccone, Scott F [Washington University, St. Louis]; Chesler, Elissa J [ORNL]; Bierut, Laura J [Washington University, St. Louis]; Kalivas, Peter J [Medical College of South Carolina, Charleston]; Lerman, Caryn [University of Pennsylvania]; Saccone, Nancy L [Washington University, St. Louis]; Uhl, George R [Johns Hopkins University]; Li, Chuan-Yun [Peking University]; Philip, Vivek M [ORNL]; Edenberg, Howard [Indiana University]; Sherry, Steven [National Center for Biotechnology Information]; Feolo, Michael [National Center for Biotechnology Information]; Moyzis, Robert K [Johns Hopkins University]; Rutter, Joni L [National Institute of Drug Abuse]
Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.
Lakshmi K Matukumalli
The success of genome-wide association (GWA) studies for the detection of sequence variation affecting complex traits in human has spurred interest in the use of large-scale high-density single nucleotide polymorphism (SNP) genotyping for the identification of quantitative trait loci (QTL) and for marker-assisted selection in model and agricultural species. A cost-effective and efficient approach for the development of a custom genotyping assay interrogating 54,001 SNP loci to support GWA applications in cattle is described. A novel algorithm for achieving a compressed inter-marker interval distribution proved remarkably successful, with median interval of 37 kb and maximum predicted gap of <350 kb. The assay was tested on a panel of 576 animals from 21 cattle breeds and six outgroup species and revealed that from 39,765 to 46,492 SNP are polymorphic within individual breeds (average minor allele frequency (MAF) ranging from 0.24 to 0.27). The assay also identified 79 putative copy number variants in cattle. Utility for GWA was demonstrated by localizing known variation for coat color and the presence/absence of horns to their correct genomic locations. The combination of SNP selection and the novel spacing algorithm allows an efficient approach for the development of high-density genotyping platforms in species having full or even moderate quality draft sequence. Aspects of the approach can be exploited in species which lack an available genome sequence. The BovineSNP50 assay described here is commercially available from Illumina and provides a robust platform for mapping disease genes and QTL in cattle.
Background: In livestock species like the chicken, high throughput single nucleotide polymorphism (SNP) genotyping assays are increasingly being used for whole genome association studies and as a tool in breeding (referred to as genomic selection). To be of value in a wide variety of breeds and populations, the success rate of the SNP genotyping assay, the distribution of the SNP across the genome and the minor allele frequencies (MAF) of the SNPs used are extremely important. Results: We describe the design of a moderate density (60k) Illumina SNP BeadChip in chicken consisting of SNPs known to be segregating at high to medium minor allele frequencies (MAF) in the two major types of commercial chicken (broilers and layers). This was achieved by the identification of 352,303 SNPs with moderate to high MAF in 2 broiler and 2 layer lines using Illumina sequencing on reduced representation libraries. To further increase the utility of the chip, we also identified SNPs on sequences currently not covered by the chicken genome assembly (Gallus_gallus-2.1). This was achieved by 454 sequencing of the chicken genome at a depth of 12x and the identification of SNPs on 454-derived contigs not covered by the current chicken genome assembly. In total we added 790 SNPs that mapped to 454-derived contigs as well as 421 SNPs with a position on Chr_random of the current assembly. The SNP chip contains 57,636 SNPs of which 54,293 could be genotyped and were shown to be segregating in chicken populations. Our SNP identification procedure appeared to be highly reliable and the overall validation rate of the SNPs on the chip was 94%. We were able to map 328 SNPs derived from the 454 sequence contigs on the chicken genome. The majority of these SNPs map to chromosomes that are already represented in genome build Gallus_gallus-2.1.0. Twenty-eight SNPs were used to construct two new linkage groups most likely representing two micro-chromosomes not covered by the
Mitra, A K; Mukherjee, U K; Harding, T; Jang, J S; Stessman, H; Li, Y; Abyzov, A; Jen, J; Kumar, S; Rajkumar, V; Van Ness, B
Multiple myeloma (MM) is characterized by significant genetic diversity at subclonal levels that have a defining role in the heterogeneity of tumor progression, clinical aggressiveness and drug sensitivity. Although genome profiling studies have demonstrated heterogeneity in subclonal architecture that may ultimately lead to relapse, a gene expression-based prediction program that can identify, distinguish and quantify drug response in sub-populations within a bulk population of myeloma cells is lacking. In this study, we performed targeted transcriptome analysis on 528 pre-treatment single cells from 11 myeloma cell lines and 418 single cells from 8 drug-naïve MM patients, followed by intensive bioinformatics and statistical analysis for prediction of proteasome inhibitor sensitivity in individual cells. Using our previously reported drug response gene expression profile signature at the single-cell level, we developed an R Statistical analysis package available at https://github.com/bvnlabSCATTome, SCATTome (single-cell analysis of targeted transcriptome), that restructures the data obtained from Fluidigm single-cell quantitative real-time-PCR analysis run, filters missing data, performs scaling of filtered data, builds classification models and predicts drug response of individual cells based on targeted transcriptome using an assortment of machine learning methods. Application of SCATT should contribute to clinically relevant analysis of intratumor heterogeneity, and better inform drug choices based on subclonal cellular responses.
Bjelland, D W; Weigel, K A; Vukasinovic, N; Nkrumah, J D
The effects of increased pedigree inbreeding in dairy cattle populations have been well documented and result in a negative impact on profitability. Recent advances in genotyping technology have allowed researchers to move beyond pedigree analysis and study inbreeding at a molecular level. In this study, 5,853 animals were genotyped for 54,001 single nucleotide polymorphisms (SNP); 2,913 cows had phenotypic records including a single lactation for milk yield (from either lactation 1, 2, 3, or 4), reproductive performance, and linear type conformation. After removing SNP with poor call rates, low minor allele frequencies, and departure from Hardy-Weinberg equilibrium, 33,025 SNP remained for analyses. Three measures of genomic inbreeding were evaluated: percent homozygosity (FPH), inbreeding calculated from runs of homozygosity (FROH), and inbreeding derived from a genomic relationship matrix (FGRM). Average FPH was 60.5±1.1%, average FROH was 3.8±2.1%, and average FGRM was 20.8±2.3%, where animals with larger values for each of the genomic inbreeding indices were considered more inbred. Decreases in total milk yield to 205d postpartum of 53, 20, and 47kg per 1% increase in FPH, FROH, and FGRM, respectively, were observed. Increases in days open per 1% increase in FPH (1.76 d), FROH (1.72 d), and FGRM (1.06 d) were also noted, as well as increases in maternal calving difficulty (0.09, 0.03, and 0.04 on a 5-point scale for FPH, FROH, and FGRM, respectively). Several linear type traits, such as strength (-0.40, -0.11, and -0.19), rear legs rear view (-0.35, -0.16, and -0.14), front teat placement (0.35, 0.25, 0.18), and teat length (-0.24, -0.14, and -0.13) were also affected by increases in FPH, FROH, and FGRM, respectively. Overall, increases in each measure of genomic inbreeding in this study were associated with negative effects on production and reproductive ability in dairy cows. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc
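Two of the genomic inbreeding measures named above can be sketched in a few lines. This is a simplified illustration and not the study's pipeline. Several details are assumptions made for the example: genotypes are coded as 0, 1 or 2 copies of one allele with `None` for a missing call, runs of homozygosity are measured in SNP counts and not physical length in base pairs, and the function names and the `min_run` threshold are hypothetical.

```python
def percent_homozygosity(genotypes):
    """Percentage of called SNPs that are homozygous (codes 0 or 2)."""
    called = [g for g in genotypes if g is not None]
    if not called:
        raise ValueError("no called genotypes")
    homozygous = sum(1 for g in called if g in (0, 2))
    return 100.0 * homozygous / len(called)


def roh_percentage(genotypes, min_run=3):
    """Percentage of SNPs lying in runs of >= min_run consecutive homozygotes.

    A heterozygous or missing call ends the current run. Real analyses
    usually work with physical run lengths and tolerate a few
    heterozygous calls; this SNP-count version is a simplification.
    """
    if not genotypes:
        raise ValueError("empty genotype list")
    in_runs = 0
    run = 0
    # The trailing heterozygous sentinel closes a run at the end of the list.
    for g in list(genotypes) + [1]:
        if g in (0, 2):
            run += 1
        else:
            if run >= min_run:
                in_runs += run
            run = 0
    return 100.0 * in_runs / len(genotypes)


if __name__ == "__main__":
    animal = [0, 0, 2, 1, 2, None, 0, 0, 0, 0]  # hypothetical toy genotypes
    print(percent_homozygosity(animal), roh_percentage(animal))
```

The sketch also suggests why percent homozygosity runs much higher than the run-based measure in the figures quoted above: every homozygous call counts toward the former, while only long uninterrupted stretches count toward the latter.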
Ulgen, Ayse; Li, Wentian
We compared linkage analysis results for an alcoholism trait, ALDX1 (DSM-III-R and Feighner criteria), using a nonparametric linkage analysis method, which takes into account allele sharing among several affected persons, for both microsatellite and single-nucleotide polymorphism (SNP) markers (Affymetrix and Illumina) in the Collaborative Study on the Genetics of Alcoholism (COGA) dataset provided to participants at the Genetic Analysis Workshop 14 (GAW14). The two sets of linkage res...
Ro, Younghye; Choi, Woojae; Kim, Hoyung; Jang, Hojin; Lee, Hoseon; Lee, Yoonseok; Kim, Danil
Holstein calves weighing less than 20 kg at birth have been noted in Korea. Due to insufficient information, we raised small calves with age-matched normal birth weight Holstein calves and determined body weights before puberty. In addition, 3 single nucleotide polymorphisms (SNPs) of the growth hormone (GH) gene were analyzed. Up to 10 months of age, low birth weight calves were smaller than normal weight calves. In exon 5 of the GH gene, SNP genotype variation was detected in some small cal...
Sanchez, Juan Jose; Børsting, C; Balogh, K
We report the results of an inter-laboratory exercise on typing of autosomal single nucleotide polymorphisms (SNP) for forensic genetic investigations in crime cases. The European DNA Profiling Group (EDNAP), a working group under the International Society for Forensic Genetics (ISFG), organised...
Aubrey E Hill
Like many other ancient genes, the cystic fibrosis transmembrane conductance regulator (CFTR) has survived for hundreds of millions of years. In this report, we consider whether such prodigious longevity of an individual gene--as opposed to an entire genome or species--should be considered surprising in the face of eons of relentless DNA replication errors, mutagenesis, and other causes of sequence polymorphism. The conventions that modern human SNP patterns result either from purifying selection or random (neutral) drift were not well supported, since extant models account rather poorly for the known plasticity and function (or the established SNP distributions) found in a multitude of genes such as CFTR. Instead, our analysis can be taken as a polemic indicating that SNPs in CFTR and many other mammalian genes may have been generated--and continue to accrue--in a fundamentally more organized manner than would otherwise have been expected. The resulting viewpoint contradicts earlier claims of 'directional' or 'intelligent design-type' SNP formation, and has important implications regarding the pace of DNA adaptation, the genesis of conserved non-coding DNA, and the extent to which eukaryotic SNP formation should be viewed as adaptive.
Wang, Jun; Guo, Mao-zu; Wang, Chun-yu
Recent studies have shown that genetic variation is the basis of genome-wide disease association research. However, because genotyping a large number of single nucleotide polymorphisms (SNPs) is costly, it is essential to choose a small subset of informative SNPs (tagSNPs) that captures most of the variation in a population and can represent the remaining SNPs. Several methods have been proposed to find the minimum set of tagSNPs, but most still have disadvantages such as information loss and block-partition limits. This paper proposes a new hybrid method named CGTS, which combines ideas from clustering and graph algorithms to select tagSNPs from genotype data. The method aims to maximize the number of non-tag SNPs discarded from the given set. CGTS integrates information on LD association and genotype diversity using site graphs and discards redundant SNPs using an algorithm based on these graph structures. The clustering algorithm is used to reduce the running time of CGTS. The efficiency of the algorithm and the quality of its solutions are evaluated on biological data, and comparisons with three popular selection methods are presented. Our theoretical analysis and experimental results show that CGTS is not only more efficient than the other methods but also achieves higher accuracy in tagSNP selection.
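The general idea of discarding redundant SNPs that are in strong LD with a chosen tag can be sketched with a simple greedy selection. This is not the CGTS algorithm described above, whose details the abstract does not give; it is a generic greedy r² tagging sketch, and the 0.8 threshold, the 0/1/2 genotype coding, and the function names are illustrative assumptions.

```python
def r_squared(x, y):
    """Squared Pearson correlation between two genotype vectors coded 0/1/2."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:  # monomorphic SNP carries no LD information
        return 0.0
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy * sxy / (sxx * syy)


def greedy_tag_snps(genotypes, threshold=0.8):
    """Pick tags until every SNP is itself a tag or has r^2 >= threshold with one.

    genotypes: dict mapping SNP id -> list of 0/1/2 codes over the same individuals.
    threshold: hypothetical LD cutoff, not taken from the paper.
    """
    ids = list(genotypes)
    covers = {
        a: {b for b in ids
            if a == b or r_squared(genotypes[a], genotypes[b]) >= threshold}
        for a in ids
    }
    untagged = set(ids)
    tags = []
    while untagged:
        # Choose the SNP that represents the most still-untagged SNPs.
        best = max(ids, key=lambda s: len(covers[s] & untagged))
        tags.append(best)
        untagged -= covers[best]
    return tags
```

Every SNP left out of the returned list is represented by at least one tag at the chosen threshold, so the number of discarded SNPs is what such a method tries to maximize.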
Mitha, Faheem; Herodotou, Herodotos; Borisov, Nedyalko; Jiang, Chen; Yoder, Josh; Owzar, Kouros
Background: We describe SNPpy, a hybrid script database system using the Python SQLAlchemy library coupled with the PostgreSQL database to manage genotype data from Genome-Wide Association Studies (GWAS). This system makes it possible to merge study data with HapMap data and merge across studies for meta-analyses, including data filtering based on the values of phenotype and Single-Nucleotide Polymorphism (SNP) data. SNPpy and its dependencies are open source software. Results: The current version of SNPpy offers utility functions to import genotype and annotation data from two commercial platforms. We use these to import data from two GWAS studies and the HapMap Project. We then export these individual datasets to standard data format files that can be imported into statistical software for downstream analyses. Conclusions: By leveraging the power of relational databases, SNPpy offers integrated management and manipulation of genotype and phenotype data from GWAS studies. The analysis of these studies requires merging across GWAS datasets as well as patient and marker selection. To this end, SNPpy enables the user to filter the data and output the results as standardized GWAS file formats. It does low level and flexible data validation, including validation of patient data. SNPpy is a practical and extensible solution for investigators who seek to deploy central management of their GWAS data. PMID:22039405
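The kind of relational filtering described above, selecting genotypes by phenotype value and by marker, can be sketched as follows. This is not SNPpy's schema or API: the table names, column names, placeholder marker ids, and the use of the standard-library `sqlite3` module in place of PostgreSQL and SQLAlchemy are all assumptions made so that the example is self-contained.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical three-table GWAS layout."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE patient (id INTEGER PRIMARY KEY, phenotype TEXT);
        CREATE TABLE snp (id INTEGER PRIMARY KEY, rsid TEXT, chrom TEXT);
        CREATE TABLE genotype (
            patient_id INTEGER REFERENCES patient(id),
            snp_id INTEGER REFERENCES snp(id),
            calls TEXT
        );
    """)
    conn.executemany("INSERT INTO patient VALUES (?, ?)",
                     [(1, "case"), (2, "control"), (3, "case")])
    # Placeholder marker ids, not real dbSNP identifiers.
    conn.executemany("INSERT INTO snp VALUES (?, ?, ?)",
                     [(1, "rs_demo_1", "1"), (2, "rs_demo_2", "2")])
    conn.executemany("INSERT INTO genotype VALUES (?, ?, ?)",
                     [(1, 1, "AA"), (1, 2, "CT"), (2, 1, "AG"),
                      (3, 1, "AG"), (3, 2, "CC")])
    return conn


def genotypes_for(conn, phenotype, chrom):
    """Return (patient id, marker id, genotype call) rows matching both filters."""
    cursor = conn.execute(
        """
        SELECT p.id, s.rsid, g.calls
        FROM genotype AS g
        JOIN patient AS p ON p.id = g.patient_id
        JOIN snp AS s ON s.id = g.snp_id
        WHERE p.phenotype = ? AND s.chrom = ?
        ORDER BY p.id, s.rsid
        """,
        (phenotype, chrom),
    )
    return cursor.fetchall()
```

Keeping patients, markers, and genotype calls in separate joined tables is what lets patient selection and marker selection be expressed as ordinary `WHERE` clauses rather than as passes over flat files.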
Longfils, M; Röding, M; Altskär, A; Schuster, E; Lorén, N; Särkkä, A; Rudemo, M
Recently we complemented the raster image correlation spectroscopy (RICS) method of analysing raster images via estimation of the image correlation function with the method single particle raster image analysis (SPRIA). In SPRIA, individual particles are identified and the diffusion coefficient of each particle is estimated by a maximum likelihood method. In this paper, we extend the SPRIA method to analyse mixtures of particles with a finite set of diffusion coefficients in a homogeneous medium. In examples with simulated and experimental data with two and three different diffusion coefficients, we show that SPRIA gives accurate estimates of the diffusion coefficients and their proportions. A simple technique for finding the number of different diffusion coefficients is also suggested. Further, we study the use of RICS for mixtures with two different diffusion coefficients and investigate, by plotting level curves of the correlation function, how large the quotient between diffusion coefficients needs to be in order to allow discrimination between models with one and two diffusion coefficients. We also describe a minor correction (compared to published papers) of the RICS autocorrelation function. © 2017 The Authors Journal of Microscopy © 2017 Royal Microscopical Society.
Moses Frank Oduori
This paper sets out to perform a static force analysis of the single toggle jaw crusher mechanism and to obtain the force transmission characteristics of the mechanism. In order to obtain force transmission metrics that are characteristic of the structure of the mechanism, such influences as friction, dead weight, and inertia are considered to be extraneous and neglected. Equations are obtained by considering the balance of forces at the moving joints and appropriately relating these to the input torque and the output torque. A mechanical advantage, the corresponding transmitted torque, and the variations thereof, during the cycle of motion of the mechanism, are obtained. The mechanical advantage that characterizes the mechanism is calculated as the mean value over the active crushing stroke of the mechanism. The force transmission characteristics can be used as criteria for the comparison of different jaw crusher mechanism designs in order to select the most suitable design for a given application. The equations obtained can also be used in estimating the forces sustained by the components of the mechanism.
Background & Objective: Single nucleotide polymorphisms are a cause of genetic variation in living organisms. Single nucleotide polymorphisms can alter residues in the protein sequence. In this investigation, the relationship between prion protein gene polymorphisms and pathogenicity was studied. Material & Method: The amino acid sequence of the main isoform from the human prion protein gene (PRNP) was extracted from the UniProt database and evaluated by the FoldAmyloid and AmylPred servers. All non-synonymous single nucleotide polymorphisms (nsSNPs) from the SNP database (dbSNP) were further analyzed by bioinformatics servers including SIFT, PolyPhen-2, I-Mutant-3.0, PANTHER, SNPs & GO, PHD-SNP, Meta-SNP, and MutPred to determine the most damaging nsSNPs. Results: The results of the first structure analyses by the FoldAmyloid and AmylPred servers implied that regions including 5-15, 174-178, 180-184, 211-217, and 240-252 were the parts of the protein sequence most sensitive to amyloidosis. Screening all nsSNPs of the main protein isoform using bioinformatic servers revealed that substitution of aspartic acid with valine at position 178 (ID code: rs11538766) was the most deleterious nsSNP in the protein structure. Conclusion: Substitution of aspartic acid with valine at position 178 (D178V) was the most pathogenic mutation in the human prion protein gene. Analyses from the MutPred server also showed that an increase in beta-sheet content in the secondary structure was the main molecular mechanism behind prion protein aggregation.
Pappas, D J; Lizee, A; Paunic, V; Beutner, K R; Motyer, A; Vukcevic, D; Leslie, S; Biesiada, J; Meller, J; Taylor, K D; Zheng, X; Zhao, L P; Gourraud, P-A; Hollenbach, J A; Mack, S J; Maiers, M
Four single nucleotide polymorphism (SNP)-based human leukocyte antigen (HLA) imputation methods (e-HLA, HIBAG, HLA*IMP:02 and MAGPrediction) were trained using 1000 Genomes SNP and HLA genotypes and assessed for their ability to accurately impute molecular HLA-A, -B, -C and -DRB1 genotypes in the Human Genome Diversity Project cell panel. Imputation concordance was high (>89%) across all methods for both HLA-A and HLA-C, but HLA-B and HLA-DRB1 proved generally difficult to impute. Overall, <27.8% of subjects were correctly imputed for all HLA loci by any method. Concordance across all loci was not enhanced via the application of confidence thresholds; reliance on confidence scores across methods only led to noticeable improvement (+3.2%) for HLA-DRB1. As the HLA complex is highly relevant to the study of human health and disease, a standardized assessment of SNP-based HLA imputation methods is crucial for advancing genomic research. Considerable room remains for the improvement of HLA-B and especially HLA-DRB1 imputation methods, and no imputation method is as accurate as molecular genotyping. The application of large, ancestrally diverse HLA and SNP reference data sets and multiple imputation methods has the potential to make SNP-based HLA imputation methods a tractable option for determining HLA genotypes. The Pharmacogenomics Journal advance online publication, 25 April 2017; doi:10.1038/tpj.2017.7.
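The gap between high per-locus concordance and a low share of subjects imputed correctly at every locus follows from how the two figures are counted, which the sketch below illustrates. The definition used here (a subject counts as concordant at a locus only when both alleles match, regardless of order) is an assumption, as the abstract does not give the exact scoring rule; the subject ids and allele assignments are made-up demonstration data.

```python
def genotype_match(imputed, typed):
    """True when both alleles agree, ignoring the order in which they are listed."""
    return sorted(imputed) == sorted(typed)


def locus_concordance(imputed, typed, locus):
    """Fraction of subjects whose imputed genotype at one locus matches the typed one."""
    subjects = [s for s in typed if locus in typed[s]]
    hits = sum(genotype_match(imputed[s][locus], typed[s][locus]) for s in subjects)
    return hits / len(subjects)


def all_loci_concordance(imputed, typed, loci):
    """Fraction of subjects imputed correctly at every listed locus simultaneously."""
    hits = sum(
        all(genotype_match(imputed[s][locus], typed[s][locus]) for locus in loci)
        for s in typed
    )
    return hits / len(typed)


# Made-up demonstration data: two subjects, two loci.
typed = {
    "p1": {"A": ("A*01:01", "A*02:01"), "B": ("B*07:02", "B*08:01")},
    "p2": {"A": ("A*02:01", "A*03:01"), "B": ("B*07:02", "B*44:02")},
}
imputed = {
    "p1": {"A": ("A*02:01", "A*01:01"), "B": ("B*07:02", "B*08:01")},
    "p2": {"A": ("A*02:01", "A*03:01"), "B": ("B*07:02", "B*15:01")},
}
```

A subject must be right at every locus to count toward the all-loci figure, so a single difficult locus caps it no matter how well the other loci are imputed.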
Lin, Shaobin; Zhou, Yu; Zhou, Bingyi; Gu, Heng
To investigate the value of single nucleotide polymorphism array (SNP array) for the identification of de novo mutations in the DMD gene among fetuses. G-banded karyotyping and SNP array were performed on a fetus with intrauterine growth restriction but without family history of Duchenne/Becker muscular dystrophy (DMD/BMD). Multiplex ligation-dependent probe amplification (MLPA) was subsequently applied on amniocytes and maternal peripheral blood sample to detect DMD gene deletion/duplication mutations. Karyotyping of amniocytes showed a normal 46, XY karyotype. SNP array on amniocytes detected a 116 kb deletion (chrX: 32 455 741-32 571 504) at Xp21.1 with breakpoints at introns 16 and 30 respectively, encompassing exons 17-29 of the DMD gene. In addition, MLPA analysis of the DMD gene on amniocytes confirmed the deletion of exons 17 to 29 identified by SNP array. However, no deletion/duplication mutation was detected by MLPA in the mother. The de novo deletion of exons 17 to 29 of the DMD gene detected in the fetus may result in BMD or DMD. SNP array can improve the efficiency for detecting genomic disorders in fetuses with unidentified pathogenic genes, negative family history and nonspecific phenotypes.
Grün, Dominic; van Oudenaarden, Alexander
Recent advances in single-cell sequencing hold great potential for exploring biological systems with unprecedented resolution. Sequencing the genome of individual cells can reveal somatic mutations and allows the investigation of clonal dynamics. Single-cell transcriptome sequencing can elucidate
Børsting, Claus; Fordyce, Sarah L; Olofsson, Jill Katharina
The Ion Torrent™ HID SNP assay amplified 136 autosomal SNPs and 33 Y-chromosome markers in one PCR and the markers were subsequently typed using the Ion PGM™ second generation sequencing platform. A total of 51 of the autosomal SNPs were selected from the SNPforID panel that is routinely used...... in our ISO 17025 accredited laboratory. Concordance between the Ion Torrent™ HID SNP assay and the SNPforID assay was tested by typing 44 Iraqis twice with the Ion Torrent™ HID SNP assay. The same samples were previously typed with the SNPforID assay and the Y-chromosome haplogroups of the individuals...
Stangegaard, Michael; Tomas, Carmen; Hansen, Anders J.
Single nucleotide polymorphism genotyping provides a supplement for conventional short tandem repeats-based kits currently used for human identification. GenPlex (Applied Biosystems (AB), Foster City, CA) is an SNP-genotyping kit based on a multiplex of 48 informative, autosomal SNPs from...... of both partial and full plates. A total of 286 samples were analyzed in duplicates with the GenPlex reaction using the Biomek-3000. The results were compared with those obtained from the same samples using the SNaPshot(AB) single-base extension system. Full concordance of the results was obtained in all...
Yao, Haibo; Hruska, Zuzana; Kincaid, Russell; Ononye, Ambrose; Brown, Robert L.; Cleveland, Thomas E.
Aflatoxins are toxic secondary metabolites of the fungi Aspergillus flavus and Aspergillus parasiticus, among others. Aflatoxin contaminated corn is toxic to domestic animals when ingested in feed and is a known carcinogen associated with liver and lung cancer in humans. Consequently, aflatoxin levels in food and feed are regulated by the Food and Drug Administration (FDA) in the US, allowing 20 ppb (parts per billion) limits in food and 100 ppb in feed for interstate commerce. Currently, aflatoxin detection and quantification methods are based on analytical tests including thin-layer chromatography (TLC) and high performance liquid chromatography (HPLC). These analytical tests require the destruction of samples, and are costly and time consuming. Thus, the ability to detect aflatoxin in a rapid, nondestructive way is crucial to the grain industry, particularly to the corn industry. Hyperspectral imaging technology offers a non-invasive approach toward screening for food safety inspection and quality control based on its spectral signature. The focus of this paper is to classify aflatoxin contaminated single corn kernels using fluorescence hyperspectral imagery. Field inoculated corn kernels were used in the study. Contaminated and control kernels under long wavelength ultraviolet excitation were imaged using a visible near-infrared (VNIR) hyperspectral camera. The imaged kernels were chemically analyzed to provide reference information for image analysis. This paper describes a procedure to process corn kernels located in different images for statistical training and classification. Two classification algorithms, Maximum Likelihood and Binary Encoding, were used to classify each corn kernel into "control" or "contaminated" through pixel classification. The Binary Encoding approach performed slightly better, with an accuracy of 87% or 88% when 20 ppb or 100 ppb was used as the classification threshold, respectively.
Shrestha, Sandesh; Hu, Jian; Fryxell, Rebecca Trout; Mudge, Joann; Lamour, Kurt
Taro (Colocasia esculenta) is an important food crop, and taro leaf blight caused by Phytophthora colocasiae can significantly affect production. Our objectives were to develop single nucleotide polymorphism (SNP) markers for P. colocasiae and characterize populations in Hawaii (HI), Vietnam (VN) and Hainan Island, China (HIC). In total, 379 isolates were analyzed for mating type and multilocus SNP profiles including 214 from HI, 97 from VN and 68 from HIC. A total of 1152 single nucleotide variant (SNV) sites were identified via restriction site-associated DNA (RAD) sequencing of two field isolates. Genotyping with 27 SNPs revealed 41 multilocus SNP genotypes grouped into seven clonal lineages containing 2-232 members. Three clonal lineages were shared among countries. In addition, five SNP markers had a low incidence of loss of heterozygosity (LOH) during asexual laboratory growth. For HI and VN, >95% of isolates were the A2 mating type. On HIC, isolates within single clonal lineages had A1, A2 and A0 (neuter) isolates. The implications for the wide dispersal of clonal lineages are discussed. © 2014 by The Mycological Society of America.
Formation of macrophage-derived foam cells occurs as a result of leukocyte accumulation mediated through intercellular adhesion molecule 1 (ICAM-1), vascular cell adhesion molecule 1 (VCAM-1), and E-selectin, secreted by inflamed or damaged endothelium. The key molecule is ICAM-1, a member of the adhesion immunoglobulin superfamily whose gene maps to chromosome 19p13.2-p13.3. It codes for 505 amino acids and has five extracellular domains, including a binding site for circulating leukocytes (primarily monocytes), which recruits them to sites of inflammation and mediates tight adhesion to the vascular endothelium as an initial step in the above-mentioned pathogenesis. Hence, the objective of the current paper is to review genome-wide association (GWA) studies and summarize the current understanding of the clinical associations of functional single nucleotide polymorphisms (SNPs) of ICAM-1, to give clinicians and researchers better guidance on the merits and demerits of the current results and to direct them toward research on larger populations for a better perspective.
Imtiaz, Waqas A.; Khan, Yousaf; Qamar, Affaq; Khan, Jehanzeb; Khan, Noaman Ahmed
This paper investigates how to reduce the cost of wavelength division multiplexing passive optical network (WDMPON) by comparing the transmission performance of bidirectional single feeder fiber and dual feeder fiber. Comparison is performed on the basis of power budgeting and cost of both arrangements. Simulation results using Optisystem show that the performance of a single feeder fiber is almost equivalent to that of a dual feeder fiber. Therefore, the single feeder fiber WDM-PON can efficiently replace the dual feeder fiber WDM-PON with the minimum deterioration in system performance and reduction in cost.
Nunes, José de Ribamar da Silva; Liu, Shikai; Pértille, Fábio; Perazza, Caio Augusto; Villela, Priscilla Marqui Schmidt; de Almeida-Val, Vera Maria Fonseca; Hilsdorf, Alexandre Wagner Silva; Liu, Zhanjiang; Coutinho, Luiz Lehmann
Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping by sequencing method. In all, 68,584 SNPs were initially identified using a minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups, which allowed for functional SNP annotations. The large-scale SNP discovery obtained here allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs.
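The reported average marker interval can be checked from the map length and marker count given above. The short sketch below shows the calculation; the optional `n_groups` correction, which counts only intervals within linkage groups, is an assumption about one way the figure might be computed and is not stated in the abstract.

```python
def average_marker_interval(total_cm, n_markers, n_groups=0):
    """Mean spacing in cM; n_groups > 0 counts only within-group intervals."""
    return total_cm / (n_markers - n_groups)


# Figures taken from the abstract: 2,811 cM, 7,192 markers, 27 linkage groups.
simple = average_marker_interval(2811, 7192)
within_groups = average_marker_interval(2811, 7192, n_groups=27)
```

Both variants round to the 0.39 cM stated in the abstract, so the reported value does not depend on which convention was used.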
Wallenborn, M; Petters, O; Rudolf, D; Hantmann, H; Richter, M; Ahnert, P; Rohani, L; Smink, J J; Bulwin, G C; Krupp, W; Schulz, R M; Holland, H
In the development of cell-based medicinal products, it is crucial to guarantee that the application of such an advanced therapy medicinal product (ATMP) is safe for the patients. The consensus of the European regulatory authorities is: "In conclusion, on the basis of the state of art, conventional karyotyping can be considered a valuable and useful technique to analyse chromosomal stability during preclinical studies". 408 chondrocyte samples (84 monolayers and 324 spheroids) from six patients were analysed using trypsin-Giemsa staining, spectral karyotyping and fluorescence in situ hybridisation, to evaluate the genetic stability of an ATMP named Spherox®. Single nucleotide polymorphism (SNP) array analysis was performed on chondrocyte spheroids from five of the six donors. Applying this combination of techniques, the genetic analyses performed revealed no significant genetic instability until passage 3 in monolayer cells and interphase cells from spheroid cultures at different time points. Clonal occurrence of polyploid metaphases and endoreduplications were identified associated with prolonged cultivation time. Also, gonosomal losses were observed in chondrocyte spheroids, with increasing passage and duration of the differentiation phase. Interestingly, in one of the donors, chromosomal aberrations that are also described in extraskeletal myxoid chondrosarcoma were identified. The SNP array analysis exhibited chromosomal aberrations in two donors and copy neutral losses of heterozygosity regions in four donors. This study showed the necessity of combined genetic analyses at defined cultivation time points in quality studies within the field of cell therapy.
Liao, C; Fu, F; Li, R; Xie, G-E; Zhang, Y-L; Li, J; Li, D-Z
Chromosome microarray analysis (CMA) has proven to be a powerful tool in postnatal patients with intellectual disabilities, and it is increasingly used in prenatal diagnosis. However, its diagnostic capabilities in prenatal diagnosis vary, and clinical experiences have failed to establish a consensus regarding CMA indications, the design and resolution of microarrays, and the notification and interpretation of copy number variations (CNVs). We present our 5 years of clinical experience using whole-genome high-resolution single nucleotide polymorphism (SNP) arrays to investigate 446 fetuses that had structural malformations detected with ultrasound but for which standard karyotyping analysis showed normal karyotypes. CMA revealed genomic CNVs in 323 (72.4%) cases and clinically significant CNVs in 11.4% of the fetuses (51/446), including 2 cases of uniparental disomy (UPD) as well as 1 case of cryptic mosaic monosomy of chromosome X. Variants of unknown significance (VOUS) existed in 2.0% of the tested fetuses (9/446). Our results demonstrate the value of whole-genome high-resolution SNP arrays in fetuses with congenital malformations and give a higher detection rate of clinically significant genomic imbalance, especially for detecting UPD. Sufficient communication between technicians and genetic counselors, along with parental testing and comparison with data from in-house or international sources, can significantly reduce VOUS. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
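The percentages quoted above follow directly from the stated counts out of 446 fetuses, as this small check shows. The helper name `percent` is introduced here for illustration only.

```python
def percent(count, total):
    """Share of total expressed as a percentage, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


# Counts taken from the abstract, all out of 446 fetuses.
any_cnv = percent(323, 446)               # fetuses with any genomic CNV
clinically_significant = percent(51, 446)
unknown_significance = percent(9, 446)
```

The three values reproduce the 72.4%, 11.4%, and 2.0% reported in the abstract.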
Heinz Ruth A
Background: Association analysis is a powerful tool to identify gene loci that may contribute to phenotypic variation. This includes the estimation of nucleotide diversity, the assessment of linkage disequilibrium (LD) structure and the evaluation of selection processes. Trait mapping by allele association requires a high-density map, which could be obtained by the addition of Single Nucleotide Polymorphisms (SNPs) and short insertions and/or deletions (indels) to SSR and AFLP genetic maps. Nucleotide diversity analysis of randomly selected candidate regions is a promising approach for the success of association analysis and fine mapping in the sunflower genome. Moreover, knowledge of the distance over which LD persists, in agronomically meaningful sunflower accessions, is important to establish the density of markers and the experimental design for association analysis. Results: A set of 28 candidate genes related to biotic and abiotic stresses were studied in 19 sunflower inbred lines. A total of 14,348 bp of sequence alignment was analyzed per individual. On average, 1 SNP was found per 69 nucleotides and 38 indels were identified in the complete data set. The mean nucleotide polymorphism was moderate (θ = 0.0056), as expected for inbred materials. The number of haplotypes per region ranged from 1 to 9 (mean = 3.54 ± 1.88). Model-based population structure analysis allowed detection of admixed individuals within the set of accessions examined. Two putative gene pools were identified (G1 and G2), with a large proportion of the inbred lines being assigned to one of them (G1). Consistent with the absence of population sub-structuring, LD for G1 decayed more rapidly (r2 = 0.48 at 643 bp; trend line, pooled data) than the LD trend line for the entire set of 19 individuals (r2 = 0.64 for the same distance). Conclusion: Knowledge about the patterns of diversity and the genetic relationships between breeding materials could be an invaluable aid in crop
Bartsch, Philipp; Harsman, Anke; Wagner, Richard
The planar lipid bilayer technique is a powerful experimental approach for electrical single channel recordings of pore-forming membrane proteins in a chemically well-defined and easily modifiable environment. Here we provide a general survey of the basic materials and procedures required to set up a robust bilayer system and perform electrophysiological single channel recordings of reconstituted proteins suitable for the in-depth characterization of their functional properties.
Yip, Yum L; Scheib, Holger; Diemand, Alexander V; Gattiker, Alexandre; Famiglietti, Livia M; Gasteiger, Elisabeth; Bairoch, Amos
Missense mutation leading to single amino acid polymorphism (SAP) is the type of mutation most frequently related to human diseases. The Swiss-Prot protein knowledgebase records information on such mutations in various sections of a protein entry, namely in the "feature," "comment," and "reference" fields. To facilitate users in obtaining the most relevant information about each human SAP recorded in the knowledgebase, the Swiss-Prot Variant web pages were created to provide a summary of available sequence information, as well as additional structural information on each variant. In particular, the ModSNP database was set up to store information related to SAPs and to manage the modeling of SAPs onto protein structures via an automatic homology modeling pipeline. Currently, among the 16,566 human SAPs recorded in the Swiss-Prot knowledgebase (release 42.5, 21 November 2003), more than 25% have corresponding 3D-models. Of these variants, 47% are related to disease, 26% are polymorphisms, and 27% are not yet clearly classified. The ModSNP database is updated and the subsequent model construction pipeline is launched with each weekly Swiss-Prot release. Thus, the ModSNP database represents a valuable resource for the structural analysis of protein variation. The Swiss-Prot variant pages are accessible from the NiceProt view of a Swiss-Prot entry on the ExPASy server (www.expasy.org/), via a hyperlink created for the stable and unique identifier FTId of each human SAP. Copyright 2004 Wiley-Liss, Inc.
The objectives of the study were to classify SNPs according to their contribution to the feed conversion ratio and to indicate interactions between the most informative SNPs using entropy analysis. The records of 1296 pigs were included. Two selection criteria for molecular data were applied: call rate 0.95 and minor allele frequency 0.05. After this, 50 951 SNPs were included in the entropy analysis. For each SNP, entropy and conditional entropy were estimated. For interaction analyses, the most informative SNPs were selected. For each pair of SNPs, the mutual information was assessed. A majority of the loci studied showed relatively small contributions. The most informative SNPs are mainly located on chromosomes 1, 4, 7, 9 and 14, whereas important interactions between SNP pairs were detected on chromosomes 1, 14, 15 and 16. High mutual information was registered for SNPs located nearby.
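The three quantities named above, entropy, conditional entropy, and mutual information, can be computed from observed genotype counts as in the following minimal sketch. It uses plain frequency (plug-in) estimates; how the study estimated these quantities and how it discretized the feed conversion ratio are not stated in the abstract, so the coding of inputs as small categorical lists is an assumption.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy in bits of the empirical distribution of values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def conditional_entropy(target, given):
    """H(target | given): entropy of target within each class of given, weighted by class size."""
    n = len(target)
    groups = {}
    for t, g in zip(target, given):
        groups.setdefault(g, []).append(t)
    return sum(len(ts) / n * entropy(ts) for ts in groups.values())


def mutual_information(x, y):
    """I(x; y) = H(x) - H(x | y), in bits."""
    return entropy(x) - conditional_entropy(x, y)
```

A SNP is informative about a trait class when conditioning on its genotype lowers the trait's entropy, and two SNPs in strong linkage share high mutual information, which fits the observation that nearby SNPs scored highly.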
Ai, XianTao; Liang, YaJun; Wang, JunDuo; Zheng, JuYun; Gong, ZhaoLong; Guo, JiangPing; Li, XueYuan; Qu, YanYing
Cotton (Gossypium spp.) is the most important natural textile fiber crop, and Gossypium hirsutum L. is responsible for 90% of the annual cotton crop in the world. Information on cotton genetic diversity and population structure is essential for new breeding lines. In this study, we analyzed population structure and genetic diversity of 288 elite Gossypium hirsutum cultivar accessions collected from around the world, and especially from China, using genome-wide single nucleotide polymorphism (SNP) markers. The average polymorphism information content (PIC) was 0.25, indicating a relatively low degree of genetic diversity. Population structure analysis revealed extensive admixture and identified three subgroups. Phylogenetic analysis supported the subgroups identified by STRUCTURE. The results from both population structure and phylogenetic analysis were, for the most part, in agreement with pedigree information. Analysis of molecular variance revealed that a larger amount of variation was due to diversity within the groups. Establishment of genetic diversity and population structure from this study could be useful for genetic and genomic analysis and systematic utilization of the standing genetic variation in upland cotton.
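To put the reported average PIC of 0.25 in context, the sketch below computes PIC for a single locus from its allele frequencies. The abstract does not state which PIC formula was used; the commonly cited form 1 - Σp_i² - ΣΣ 2p_i²p_j² is assumed here, and the function name is hypothetical.

```python
def pic(freqs):
    """Polymorphism information content of one locus.

    Assumed form: 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2,
    where freqs holds the allele frequencies (summing to 1).
    """
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(
        2 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return 1 - homozygosity - cross_term


# A biallelic SNP reaches its maximum PIC when both alleles are equally frequent.
max_biallelic_pic = pic([0.5, 0.5])
```

Under this formula a biallelic marker cannot exceed a PIC of 0.375, so an average of 0.25 should be judged against that ceiling rather than against 1.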
Background: Single nucleotide polymorphisms (SNPs) have been associated with many aspects of human development and disease, and many non-coding SNPs associated with disease risk are presumed to affect gene regulation. We have previously shown that SNPs within transcription factor binding sites can affect transcription factor binding in an allele-specific and heritable manner. However, such analysis has relied on prior whole-genome genotypes provided by large external projects such as HapMap and the 1000 Genomes Project. This requirement limits the study of allele-specific effects of SNPs in primary patient samples from diseases of interest, where complete genotypes are not readily available. Results: In this study, we show that we are able to identify SNPs de novo and accurately from ChIP-seq data generated in the ENCODE Project. Our de novo identified SNPs from ChIP-seq data are highly concordant with published genotypes. Independent experimental verification of more than 100 sites estimates our false discovery rate at less than 5%. Analysis of transcription factor binding at de novo identified SNPs revealed widespread heritable allele-specific binding, confirming previous observations. SNPs identified from ChIP-seq datasets were significantly enriched for disease-associated variants, and we identified dozens of allele-specific binding events in non-coding regions that could distinguish between disease and normal haplotypes. Conclusions: Our approach combines SNP discovery, genotyping and allele-specific analysis, but is selectively focused on functional regulatory elements occupied by transcription factors or epigenetic marks, and will therefore be valuable for identifying the functional regulatory consequences of non-coding SNPs in primary disease samples.
Winham, Stacey J; Colby, Colin L; Freimuth, Robert R; Wang, Xin; de Andrade, Mariza; Huebner, Marianne; Biernacka, Joanna M
Identifying variants associated with complex human traits in high-dimensional data is a central goal of genome-wide association studies. However, complicated etiologies such as gene-gene interactions are ignored by the univariate analysis usually applied in these studies. Random Forests (RF) are a popular data-mining technique that can accommodate a large number of predictor variables and allow for complex models with interactions. RF analysis produces measures of variable importance that can be used to rank the predictor variables. Thus, single nucleotide polymorphism (SNP) analysis using RFs is gaining popularity as a potential filter approach that considers interactions in high-dimensional data. However, the impact of data dimensionality on the power of RF to identify interactions has not been thoroughly explored. We investigate the ability of rankings from variable importance measures to detect gene-gene interaction effects and their potential effectiveness as filters compared to p-values from univariate logistic regression, particularly as the data becomes increasingly high-dimensional. RF effectively identifies interactions in low dimensional data. As the total number of predictor variables increases, probability of detection declines more rapidly for interacting SNPs than for non-interacting SNPs, indicating that in high-dimensional data the RF variable importance measures are capturing marginal effects rather than capturing the effects of interactions. While RF remains a promising data-mining technique that extends univariate methods to condition on multiple variables simultaneously, RF variable importance measures fail to detect interaction effects in high-dimensional data in the absence of a strong marginal component, and therefore may not be useful as a filter technique that allows for interaction effects in genome-wide data.
Background Single nucleotide polymorphisms (SNPs) have been associated with many aspects of human development and disease, and many non-coding SNPs associated with disease risk are presumed to affect gene regulation. We have previously shown that SNPs within transcription factor binding sites can affect transcription factor binding in an allele-specific and heritable manner. However, such analysis has relied on prior whole-genome genotypes provided by large external projects such as HapMap and the 1000 Genomes Project. This requirement limits the study of allele-specific effects of SNPs in primary patient samples from diseases of interest, where complete genotypes are not readily available. Results In this study, we show that we are able to identify SNPs de novo and accurately from ChIP-seq data generated in the ENCODE Project. Our de novo identified SNPs from ChIP-seq data are highly concordant with published genotypes. Independent experimental verification of more than 100 sites estimates our false discovery rate at less than 5%. Analysis of transcription factor binding at de novo identified SNPs revealed widespread heritable allele-specific binding, confirming previous observations. SNPs identified from ChIP-seq datasets were significantly enriched for disease-associated variants, and we identified dozens of allele-specific binding events in non-coding regions that could distinguish between disease and normal haplotypes. Conclusions Our approach combines SNP discovery, genotyping and allele-specific analysis, but is selectively focused on functional regulatory elements occupied by transcription factors or epigenetic marks, and will therefore be valuable for identifying the functional regulatory consequences of non-coding SNPs in primary disease samples. PMID:22950704
Buzad, Francis A; Corne, Louis M; Brown, Thomas C; Fagin, Randy S; Hebert, April E; Kaczmarek, Charles A; Pack, Angie N; Payne, Thomas N
Single-incision surgery has gained in popularity, and the recent development of specialized robotic and laparoscopic instruments may remove some of the ergonomic and technical difficulties associated with this approach. However, questions of cost and efficiency remain. We prospectively collected perioperative outcome and efficiency (operative time, case volume) data for our single-site robotic cholecystectomy cases and retrospectively reviewed data for our single-incision laparoscopic cholecystectomy cases. There were no differences in patient characteristics or perioperative outcomes between the robotic (n = 20) and laparoscopic (n = 10) groups; operative times were equivalent (84.6 vs 85.5 min; p = 0.8737) and blood loss and complications were minimal. There was a higher robotic case volume, with an average of two robotic cases (range 1-4)/day vs one/day for laparoscopic cases (range 1-1; p = 0.0306). Streamlined instrument costs were essentially equivalent. Robotic single-site cholecystectomy is a safe, cost-effective alternative to single-incision laparoscopic cholecystectomy in a robot-existing model. Copyright © 2013 John Wiley & Sons, Ltd.
Mitchell, Cedar L; Andrianaivoarimanana, Voahangy; Colman, Rebecca E; Busch, Joseph; Hornstra-O'Neill, Heidie; Keim, Paul S; Wagner, David M; Rajerison, Minoarisoa; Birdsell, Dawn N
Genetic analysis of pathogenic organisms is a useful tool for linking human cases together and/or to potential environmental sources. The resulting data can also provide information on evolutionary patterns within a targeted species and phenotypic traits. However, the instruments often used to generate genotyping data, such as single nucleotide polymorphisms (SNPs), can be expensive and sometimes require advanced technologies to implement. This places many genotyping tools out of reach for laboratories that do not specialize in genetic studies and/or lack the requisite financial and technological resources. To address this issue, we developed a low cost and low tech genotyping system, termed agarose-MAMA, which combines traditional PCR and agarose gel electrophoresis to target phylogenetically informative SNPs. To demonstrate the utility of this approach for generating genotype data in a resource-constrained area (Madagascar), we designed an agarose-MAMA system targeting previously characterized SNPs within Yersinia pestis, the causative agent of plague. We then used this system to genetically type pathogenic strains of Y. pestis in a Malagasy laboratory not specialized in genetic studies, the Institut Pasteur de Madagascar (IPM). We conducted rigorous assay performance validations to assess potential variation introduced by differing research facilities, reagents, and personnel and found no difference in SNP genotyping results. These agarose-MAMA PCR assays are currently employed as an investigative tool at IPM, providing Malagasy researchers a means to improve the value of their plague epidemiological investigations by linking outbreaks to potential sources through genetic characterization of isolates and to improve understanding of disease ecology that may contribute to a long-term control effort. The success of our study demonstrates that the SNP-based genotyping capacity of laboratories in developing countries can be expanded with manageable financial cost for
Yousefi, Soheil; Abbassi-Daloii, Tooba; Kraaijenbrink, Thirsa; Vermaat, Martijn; Mei, Hailiang; van 't Hof, Peter; van Iterson, Maarten; Zhernakova, Daria V; Claringbould, Annique; Franke, Lude; 't Hart, Leen M; Slieker, Roderick C; van der Heijden, Amber; de Knijff, Peter; 't Hoen, Peter A C
SNP panels that uniquely identify an individual are useful for genetic and forensic research. Previously recommended SNP panels are based on DNA profiles and mostly contain intragenic SNPs. With the increasing interest in RNA expression profiles, we aimed to establish a SNP panel for both DNA- and RNA-based genotyping. To determine a small set of SNPs with maximally discriminative power, genotype calls were obtained from DNA and blood-derived RNA sequencing data belonging to healthy, geographically dispersed Dutch individuals. SNPs were selected based on criteria such as genotype call rate, minor allele frequency, Hardy-Weinberg equilibrium and linkage disequilibrium. A panel of 50 SNPs was sufficient to identify an individual uniquely: the probability of identity was 6.9 × 10⁻²⁰ when assuming no family relations and 1.2 × 10⁻¹⁰ when accounting for the presence of full sibs. The ability of the SNP panel to uniquely identify individuals at the DNA and RNA level was validated in an independent population dataset. The panel is applicable to individuals of European descent, with slightly lower power in non-Europeans. Whereas most of the genes containing the 50 SNPs are expressed in various tissues, our SNP panel needs optimization for tissues other than blood. This first DNA/RNA SNP panel will be useful to identify sample mix-ups in biomedical research and for assigning DNA and RNA stains in crime scenes to unique individuals.
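The order of magnitude of these probabilities can be checked with a minimal sketch. The abstract does not give its formulas, so the widely used per-locus expressions for unrelated individuals and for full sibs are assumed, together with Hardy-Weinberg proportions and independent loci. The allele frequencies below are hypothetical (every SNP at the ideal frequency of 0.5), not the panel's real values.

```python
from math import prod


def pi_unrelated(p):
    """Chance that two unrelated individuals share a genotype at a biallelic SNP."""
    q = 1 - p
    return p**4 + (2 * p * q) ** 2 + q**4


def pi_sibs(p):
    """Chance that two full sibs share a genotype at a biallelic SNP."""
    q = 1 - p
    s2 = p**2 + q**2  # sum of squared allele frequencies
    s4 = p**4 + q**4  # sum of fourth powers
    return 0.25 + 0.5 * s2 + 0.5 * s2**2 - 0.25 * s4


# Hypothetical panel: 50 independent SNPs, each with allele frequency 0.5.
freqs = [0.5] * 50
panel_pi = prod(pi_unrelated(p) for p in freqs)
panel_pi_sibs = prod(pi_sibs(p) for p in freqs)
```

With every SNP at frequency 0.5 the per-locus values are 0.375 and 0.59375, so the products over 50 loci represent the best case for a panel of that size. A real panel with less balanced allele frequencies would be expected to give somewhat larger probabilities, which is in line with the values reported in the abstract.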
Theunert, Christoph; Tang, Kun; Lachmann, Michael; Hu, Sile; Stoneking, Mark
Dense, genome-wide single-nucleotide polymorphism (SNP) data can be used to reconstruct the demographic history of human populations. However, demographic inferences from such data are complicated by recombination and ascertainment bias. We introduce two new statistics, allele frequency-identity by descent (AF-IBD) and allele frequency-identity by state (AF-IBS), that make use of linkage disequilibrium information and show defined relationships to the time of coalescence. These statistics, when conditioned on the derived allele frequency, are able to infer complex population size changes. Moreover, the AF-IBS statistic, which is based on genome-wide SNP data, is robust to varying ascertainment conditions. We constructed an efficient approximate Bayesian computation (ABC) pipeline based on AF-IBD and AF-IBS that can accurately estimate demographic parameters, even for fairly complex models. Finally, we applied this ABC approach to genome-wide SNP data and inferred the demographic histories of two human populations, Yoruba and French. Our results suggest a rather stable ancestral population size with a mild recent expansion for Yoruba, whereas the French seemingly experienced a long-lasting severe bottleneck followed by a drastic population growth. This approach should prove useful for new insights into populations, especially those with complex demographic histories.
Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos
Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Influenza virus (IFV) can evolve rapidly, leading to genetic drifts and shifts resulting in human and animal influenza epidemics and pandemics. The genetic shift that gave rise to the 2009 influenza A/H1N1 pandemic originated from a triple gene reassortment of avian, swine and human IFVs. More minor genetic alterations in genetic drift can lead to influenza drug resistance such as the H274Y mutation associated with oseltamivir resistance. Hence, a rapid tool to detect IFV mutations and the potential emergence of new virulent strains can better prepare us for seasonal influenza outbreaks as well as potential pandemics. Furthermore, identification of specific mutations by closely examining single nucleotide polymorphisms (SNPs) in IFV sequences is essential to classify potential genetic markers associated with potentially dangerous IFV phenotypes. In this study, we developed a novel R library called "SNPer" to analyze quantitative variants in SNPs among IFV subpopulations. The computational SNPer program was applied to three different subpopulations of published IFV genomic information. SNPer queried SNP data and grouped the SNPs into (1) universal SNPs, (2) likely common SNPs, and (3) unique SNPs. SNPer outperformed manual visualization in terms of time and labor. SNPer took only three seconds with no errors in SNP comparison events compared with 40 hours with errors using manual visualization. The SNPer tool can accelerate the capacity to capture new and potentially dangerous IFV strains to mitigate future influenza outbreaks.
Su, Jie; Jiang, Linlin; Wu, Jingna; Liu, Zhiyu; Wu, Yuping
The effect of water-soluble polysaccharides extracted from Sipunculus nudus (SNP) on the lifespan and immune damage repair of Drosophila melanogaster exposed to Cd (VI) was investigated. SNP increased superoxide dismutase (SOD), nitrogen monoxide (NO), glutathione peroxidase (GSH-Px) and total anti-oxidation competence (T-AOC) and decreased malondialdehyde (MDA) in D. melanogaster, demonstrating that SNP could attenuate oxidative damage in D. melanogaster exposed to Cd (VI). Real-time PCR and western blot analysis showed that SNP enhanced the gene expression of Diptericin, Drosomycin, Defensin and PGRP-LC and the protein levels of Toll, p-JNK and Relish, suggesting that SNP promotes immune damage repair in D. melanogaster exposed to Cd (VI). The increased levels of Indy, Parkin and AMPK indicated that SNP regulates longevity-related pathways through ageing-related molecules in D. melanogaster exposed to Cd (VI). These results suggest that SNP could also improve the lifespan of D. melanogaster exposed to Cd (VI).
Copy number variation (CNV) or single nucleotide polymorphism (SNP) is a useful genetic resource to aid in understanding complex phenotypes or disease susceptibility. Although thousands of CNVs and SNPs are currently available in the public databases, they are somewhat difficult to use for analyses without visualization tools. We developed a web-based tool called the VCS (visualization of CNV or SNP) to visualize the CNVs or SNPs detected. The VCS tool can help users to interpret the biological meaning of the numerical values of CNVs and SNPs. The VCS provides six visualization tools: (i) the enrichment of genome contents in CNV; (ii) the physical distribution of CNV or SNP on chromosomes; (iii) the distribution of log2 ratio of CNVs with criteria of interest; (iv) the number of CNV or SNP per binning unit; (v) the distribution of homozygosity of SNP genotype; and (vi) cytomap of genes within CNV or SNP region.
Background: Technological advances have led to a rapid increase in the availability of single nucleotide polymorphisms (SNPs) in a range of organisms, and there is a general optimism that SNPs will become the marker of choice for a range of evolutionary applications. Here, comparisons between 300 polymorphic SNPs and 14 short tandem repeats (STRs) were conducted on a data set consisting of approximately 500 Atlantic salmon arranged in 10 samples/populations. Results: Global FST ranged from 0.033 to 0.115 and from -0.002 to 0.316 for the 14 STR and 300 SNP loci, respectively. Global FST was similar among 28 linkage groups when averaging data from mapped SNPs. With the exception of selecting a panel of SNPs taking the locus displaying the highest global FST for each of the 28 linkage groups, which inflated estimation of genetic differentiation among the samples, inferred genetic relationships were highly similar between SNP and STR data sets and variants thereof. The best 15 SNPs (30 alleles) gave a similar level of self-assignment to the best 4 STR loci (83 alleles); however, addition of further STR loci did not lead to a notable increase in assignment, whereas addition of up to 100 SNP loci increased assignment. Conclusion: Whilst the optimal combinations of SNPs identified in this study are linked to the samples from which they were selected, this study demonstrates that identification of highly informative SNP loci from larger panels will provide researchers with a powerful approach to delineate genetic relationships at the individual and population levels.
Background: The domestic cat has offered enormous genomic potential in the veterinary description of over 250 hereditary disease models as well as the occurrence of several deadly feline viruses (feline leukemia virus, FeLV; feline coronavirus, FECV; feline immunodeficiency virus, FIV) that are homologues to human scourges (cancer, SARS, and AIDS, respectively). However, to realize this bio-medical potential, a high density single nucleotide polymorphism (SNP) map is required in order to accomplish disease and phenotype association discovery. Description: To remedy this, we generated 3,178,297 paired fosmid-end Sanger sequence reads from seven cats, and combined these data with the publicly available 2X cat whole genome sequence. All sequence reads were assembled together to form a 3X whole genome assembly allowing the discovery of over three million SNPs. To reduce potential false positive SNPs due to the low coverage assembly, a low upper-limit was placed on sequence coverage and a high lower-limit on the quality of the discrepant bases at a potential variant site. In all, domestic cats of different breeds (female Abyssinian, female American shorthair, male Cornish Rex, female European Burmese, female Persian, female Siamese and a male Ragdoll) and a female African wildcat were sequenced lightly. We report a total of 964 k common SNPs suitable for a domestic cat SNP genotyping array and an additional 900 k SNPs detected between African wildcat and domestic cat breeds. An empirical sampling of 94 discovered SNPs was tested in the sequenced cats, resulting in a SNP validation rate of 99%. Conclusions: These data provide a large collection of mapped feline SNPs across the cat genome that will allow for the development of SNP genotyping platforms for mapping feline diseases.
Background: The restriction enzyme-based Reduced Representation Library (RRL) method represents a relatively feasible and flexible strategy for Single Nucleotide Polymorphism (SNP) identification in different species. It has the remarkable advantage of reducing the complexity of the genome by orders of magnitude. However, a comprehensive evaluation of the actual efficacy of SNP identification by this method is still unavailable. Results: To evaluate the efficacy of the restriction enzyme-based RRL method, we selected the Tsp 45I enzyme, which covers a 266 Mb region flanking the enzyme recognition sites according to in silico simulation on the human reference genome. We then sequenced the YH RRL after Tsp 45I treatment and obtained reads of which 80.8% mapped to the target region with a 20-fold average coverage; about 96.8% of the target region was covered by at least one read, and 257 K SNPs were identified in the region using SOAPsnp software. Compared with whole genome resequencing data, we observed a false discovery rate (FDR) of 13.95% and a false negative rate (FNR) of 25.90%. The concordance rate of homozygous loci was over 99.8%, but that of heterozygous loci was only 92.56%. Repeat sequences and base quality proved to have a great effect on the accuracy of SNP calling, and SNPs in recognition sites contributed evidently to the high FNR and the low concordance rate of heterozygotes. Our results indicated that repeat masking and highly stringent filter criteria could significantly decrease both FDR and FNR. Conclusions: This study demonstrates that the restriction enzyme-based RRL method was effective for SNP identification. The results highlight the important role of bias and the method-derived defects of this method and emphasize the points that need special attention.
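The two error rates reported here can be made concrete with a short sketch. The abstract does not spell out its definitions, so the usual ones are assumed: FDR is the fraction of called SNPs absent from the reference call set, and FNR is the fraction of reference SNPs in the target region that were not called. The positions below are invented and the function name is hypothetical.

```python
def fdr_fnr(called, truth):
    """Return (FDR, FNR) for a set of called SNP sites against a reference set.

    Assumed definitions: FDR = FP / (TP + FP), FNR = FN / (TP + FN).
    Sites are compared as (chromosome, position) tuples.
    """
    called, truth = set(called), set(truth)
    tp = len(called & truth)  # called and present in the reference set
    fp = len(called - truth)  # called but absent from the reference set
    fn = len(truth - called)  # in the reference set but missed
    fdr = fp / (tp + fp) if called else 0.0
    fnr = fn / (tp + fn) if truth else 0.0
    return fdr, fnr


# Toy call sets (invented): RRL calls versus whole-genome resequencing calls.
rrl_calls = [("chr1", 100), ("chr1", 250), ("chr2", 40), ("chr2", 900)]
wgs_calls = [("chr1", 100), ("chr1", 250), ("chr2", 40), ("chr3", 7), ("chr3", 88)]

fdr, fnr = fdr_fnr(rrl_calls, wgs_calls)
```

In this toy example three sites agree, one call is unsupported and two reference sites are missed, giving an FDR of 0.25 and an FNR of 0.4. Comparing sites only by position ignores genotype, so the homozygote and heterozygote concordance rates in the abstract would need a separate comparison of the called genotypes.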
da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro
Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis, (the passion fruit's main bacterial pathogen that attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.
Hansen, Tine M; Graversen, Carina; Frøkjaer, Jens B
AIMS: The cortical response to nociceptive thermal stimuli recorded as contact heat evoked potentials (CHEPs) may be altered by morphine. However, previous studies have averaged CHEPs over multiple stimuli, which are confounded by jitter between sweeps. Thus, the aim was to assess single-sweep ch...
Tinoco, Ignacio Jr; Wen, Jin-Der
In the cell, proteins are synthesized by ribosomes in a multi-step process called translation. The ribosome translocates along the messenger RNA to read the codons that encode the amino acid sequence of a protein. Elongation factors, including EF-G and EF-Tu, are used to catalyze the process. Recently, we have shown that translation can be followed at the single-molecule level using optical tweezers; this technique allows us to study the kinetics of translation by measuring the lifetime the ribosome spends at each codon. Here, we analyze the data from single-molecule experiments and fit the data with simple kinetic models. We also simulate the translation kinetics based on a multi-step mechanism from ensemble kinetic measurements. The mean lifetimes from the simulation were consistent with our experimental single-molecule measurements. We found that the calculated lifetime distributions were fit in general by equations with up to five rate-determining steps. Two rate-determining steps were only obtained at low concentrations of elongation factors. These analyses can be used to design new single-molecule experiments to better understand the kinetics and mechanism of translation
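The link between lifetime distributions and the number of rate-determining steps can be sketched with a small simulation. This is not the authors' kinetic model: it assumes the simplest case, in which each codon dwell time is the sum of n sequential, irreversible steps with the same rate constant, so that the ratio mean²/variance of the dwell times estimates n. The rate and step counts are invented.

```python
import random
from statistics import mean, pvariance


def simulate_dwell_times(n_steps, rate, n_events, seed=0):
    """Draw dwell times as sums of n_steps exponential waiting times."""
    rng = random.Random(seed)
    return [
        sum(rng.expovariate(rate) for _ in range(n_steps))
        for _ in range(n_events)
    ]


def effective_steps(dwells):
    """Estimate the number of rate-determining steps as mean^2 / variance.

    For n identical sequential steps the mean is n / k and the variance is
    n / k^2, so this ratio equals n; with unequal rates it gives a lower bound.
    """
    m = mean(dwells)
    return m * m / pvariance(dwells)


# Hypothetical example: five equal steps with rate 2.0 per second.
dwells = simulate_dwell_times(n_steps=5, rate=2.0, n_events=20000)
n_estimate = effective_steps(dwells)
```

A single rate-determining step gives an exponential lifetime distribution with a ratio near 1, while additional steps narrow the distribution and raise the ratio. That is the sense in which a measured lifetime distribution constrains how many steps are rate-determining.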
Deshpande, VS; Needleman, A; Van der Giessen, E
The effect of loading conditions on the tensile stress versus strain response of micron-sized planar crystals with a single active slip system is investigated via finite and small deformation discrete dislocation plasticity analyses. When rotation of the tensile axis is prevented, lattice curvature
Zwerver, J.; Bredeweg, S. W.; Hof, A. L.
Background: The single-leg squat on a 25° decline board has been described as a clinical assessment tool and as a rehabilitation exercise for patients with patellar tendinopathy. Several assumptions have been made about its working mechanism on patellar load and patellofemoral forces, but these are
Bæk, Kristoffer Torbjørn; Svenningsen, Sine Lo; Eisen, Harvey
We have examined expression of the λcI operon in single cells via a rexgfp substitution. Although average fluorescence agreed with expectations for expression of λ-repressor, fluorescence fluctuated greatly from cell-to-cell. Fluctuations in repressor concentration are not predicted by previous m...
Carly A Conran
Several different approaches are available to clinicians for determining prostate cancer (PCa) risk. The clinical validity of various PCa risk assessment methods utilizing single nucleotide polymorphisms (SNPs) has been established; however, these SNP-based methods have not been compared. The objective of this study was to compare the three most commonly used SNP-based methods for PCa risk assessment. Participants were men (n = 1654) enrolled in a prospective study of PCa development. Genotypes of 59 PCa risk-associated SNPs were available in this cohort. Three methods of calculating SNP-based genetic risk scores (GRSs) were used to evaluate individual disease risk: risk allele count (GRS-RAC), weighted risk allele count (GRS-wRAC), and population-standardized genetic risk score (GRS-PS). Mean GRSs were calculated, and performances were compared using area under the receiver operating characteristic curve (AUC) and positive predictive value (PPV). All SNP-based methods were found to be independently associated with PCa (all P 0.05 for comparisons between the three methods, and all three SNP-based methods had a significantly higher AUC than family history (all P < 0.05). Results from this study suggest that while the three most commonly used SNP-based methods performed similarly in discriminating PCa from non-PCa at the population level, GRS-PS is the method of choice for risk assessment at the individual level because its value (where 1.0 represents average population risk) can be easily interpreted regardless of the number of risk-associated SNPs used in the calculation.
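The three scores named in this abstract can be sketched as follows. The abstract does not give formulas, so commonly used forms are assumed: GRS-RAC sums risk alleles, GRS-wRAC weights each risk allele by the log odds ratio, and GRS-PS multiplies per-SNP relative risks, each divided by the population-average risk under Hardy-Weinberg proportions. All odds ratios and frequencies below are invented, and the function names are hypothetical.

```python
from math import log, prod


def grs_rac(genotypes):
    """Risk allele count: total number of risk alleles (0, 1 or 2 per SNP)."""
    return sum(genotypes)


def grs_wrac(genotypes, odds_ratios):
    """Weighted risk allele count: risk alleles weighted by log(odds ratio)."""
    return sum(g * log(o) for g, o in zip(genotypes, odds_ratios))


def grs_ps(genotypes, odds_ratios, risk_freqs):
    """Population-standardized score: product of genotype risks relative to
    the population average, so that 1.0 means average population risk."""

    def relative_risk(g, o, f):
        # Population-average risk at this SNP under Hardy-Weinberg proportions.
        average = (1 - f) ** 2 + 2 * f * (1 - f) * o + f**2 * o**2
        return o**g / average

    return prod(
        relative_risk(g, o, f)
        for g, o, f in zip(genotypes, odds_ratios, risk_freqs)
    )


# Hypothetical individual genotyped at three SNPs (values invented).
genotypes = [0, 1, 2]
odds_ratios = [1.10, 1.25, 1.05]
risk_freqs = [0.30, 0.15, 0.60]

scores = (
    grs_rac(genotypes),
    grs_wrac(genotypes, odds_ratios),
    grs_ps(genotypes, odds_ratios, risk_freqs),
)
```

Under these assumed forms, the expected GRS-PS over the population is 1.0 at each SNP, which is why the score stays interpretable as the number of SNPs changes. The two count-based scores, by contrast, grow with panel size.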
Douglas Mark Ruden
This paper describes a new program SnpSift for filtering differential DNA sequence variants between two or more experimental genomes after genotoxic chemical exposure. Here, we illustrate how SnpSift can be used to identify candidate phenotype-relevant variants including single nucleotide polymorphisms (SNPs), multiple nucleotide polymorphisms (MNPs), and insertions and deletions (InDels) in mutant strains isolated from genome-wide chemical mutagenesis of Drosophila melanogaster. First, the genomes of two independently-isolated mutant fly strains that are allelic for a novel recessive male-sterile locus generated by genotoxic chemical exposure were sequenced using the Illumina next-generation DNA sequencer to obtain 20- to 29-fold coverage of the euchromatic sequences. The sequencing reads were processed and variants were called using standard bioinformatic tools. Next, SnpEff was used to annotate all sequence variants and their potential mutational effects on associated genes. Then, SnpSift was used to filter and select differential variants that potentially disrupt a common gene in the two allelic mutant strains. The potential causative DNA lesions were partially validated by capillary sequencing of PCR-amplified DNA in the genetic interval as defined by meiotic mapping and deletions that remove defined regions of the chromosome. Of the five candidate genes located in the genetic interval, the Pka-like gene CG12069 was found to carry a separate premature stop codon mutation in each of the two allelic mutants whereas the other four candidate genes within the interval have wild-type sequences. The Pka-like gene is therefore a strong candidate gene for the male-sterile locus. These results demonstrate that combining SnpEff and SnpSift can expedite the identification of candidate phenotype-causative mutations in chemically-mutagenized Drosophila strains. This technique can also be used to characterize the variety of mutations generated by genotoxic
Myasnikov, Alexander G. [IGBMC (Institute of Genetics and of Molecular and Cellular Biology), Department of Integrative Structural Biology, Centre National de la Recherche Scientifique (CNRS) UMR 7104/ Institut National de la Santé de la Recherche Médicale INSERM U964/ Université de Strasbourg, 1 rue Laurent Fries, 67404 Illkirch (France); Afonina, Zhanna A. [Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Moscow Region (Russian Federation); Klaholz, Bruno P., E-mail: email@example.com [IGBMC (Institute of Genetics and of Molecular and Cellular Biology), Department of Integrative Structural Biology, Centre National de la Recherche Scientifique (CNRS) UMR 7104/ Institut National de la Santé de la Recherche Médicale INSERM U964/ Université de Strasbourg, 1 rue Laurent Fries, 67404 Illkirch (France)
Cryo electron tomography (cryo-ET) can provide cellular and molecular structural information on various biological samples. However, the detailed interpretation of tomograms reconstructed from single-tilt data tends to suffer from low signal-to-noise ratio and artefacts caused by some systematically missing angular views. While these can be overcome by sub-tomogram averaging, they remain limiting for the analysis of unique structures. Double-tilt ET can improve the tomogram quality by acquiring a second tilt series after an in-plane rotation, but its usage is not yet widespread because it is considered technically demanding and it is rarely used under cryo conditions. Here we show that double-tilt cryo-ET improves the quality of 3D reconstructions so significantly that even single particle analysis can be envisaged despite the intrinsically low image contrast obtained from frozen-hydrated specimens. This is illustrated by the analysis of eukaryotic polyribosomes in which individual ribosomes were reconstructed using single-tilt, partial and full double-tilt geometries. The improved tomograms favour the faster convergence of iterative sub-tomogram averaging and allow a better 3D classification using multivariate statistical analysis. Our study of single particles and molecular assemblies within polysomes illustrates that the dual-axis approach is particularly useful for cryo applications of ET, both for unique objects and for structures that can be classified and averaged. - Highlights: ► Double-tilt cryo-ET improves 3D reconstructions thus making single particle analysis possible. ► Dual-axis cryo-ET data favour a faster convergence of iterative sub-tomogram averaging. ► Individual ribosomes were reconstructed from single-tilt, partial/ full double-tilt geometries. ► Double-tilt cryo-ET facilitates analysis of larger molecular assemblies such as in cell sections. ► Dual-axis cryo-ET is applicable to unique objects and to structures that can be
Background: It has been reported in the quantitative trait locus (QTL) literature that when testing for QTL location and effect, the statistical power supporting methodologies based on two markers and their estimated genetic map is higher than for the genetic map independent methodologies known as single marker analyses. Close examination of these reports reveals that the two marker approaches are more powerful than single marker analyses only in certain cases. Simulation studies are a commonly used tool to determine the behavior of test statistics under known conditions. We conducted a simulation study to assess the general behavior of an intersection test and a two marker test under a variety of conditions. The study was designed to reveal whether two marker tests are always more powerful than intersection tests, or whether there are cases when an intersection test may outperform the two marker approach. We present a reanalysis of a data set from a QTL study of ovariole number in Drosophila melanogaster. Results: Our simulation study results show that there are situations where the single marker intersection test equals or outperforms the two marker test. The intersection test and the two marker test identify overlapping regions in the reanalysis of the Drosophila melanogaster data. The region identified is consistent with a regression based interval mapping analysis. Conclusion: We find that the intersection test is appropriate for analysis of QTL data. This approach has the advantage of simplicity and for certain situations supplies equivalent or more powerful results than a comparable two marker test.
Foley William J
Background: There is little information about the DNA sequence variation within and between closely related plant species. The combination of re-sequencing technologies, large-scale DNA pools and availability of reference gene sequences allowed the extensive characterisation of single nucleotide polymorphisms (SNPs) in genes of four biosynthetic pathways leading to the formation of ecologically relevant secondary metabolites in Eucalyptus. With this approach the occurrence and patterns of SNP variation for a set of genes can be compared across different species from the same genus. Results: In a single GS-FLX run, we sequenced over 103 Mbp and assembled them to approximately 50 kbp of reference sequences. An average sequencing depth of 315 reads per nucleotide site was achieved for all four eucalypt species, Eucalyptus globulus, E. nitens, E. camaldulensis and E. loxophleba. We sequenced 23 genes from 1,764 individuals and discovered 8,631 SNPs across the species, with about 1.5 times as many SNPs per kbp in the introns compared to exons. The exons of the two closely related species (E. globulus and E. nitens) had similar numbers of SNPs at synonymous and non-synonymous sites. These species also had similar levels of SNP diversity, whereas E. camaldulensis and E. loxophleba had much higher SNP diversity. Neither the pathway nor the position in the pathway influenced gene diversity. The four species share between 20 and 43% of the SNPs in these genes. Conclusion: By using conservative statistical detection methods, we were confident about the validity of each SNP. With numerous individuals sampled over the geographical range of each species, we discovered one SNP in every 33 bp for E. nitens and one in every 31 bp in E. globulus. In contrast, the more distantly related species contained more SNPs: one in every 16 bp for E. camaldulensis and one in 17 bp for E. loxophleba, which is, to the best of our knowledge, the highest frequency of SNPs
María E. Fernández
During the last decade, microsatellites (short tandem repeats or STRs) have been successfully used for animal genetic identification, traceability and paternity, although in recent years single nucleotide polymorphisms (SNPs) have been increasingly used for this purpose. An efficient SNP identification system requires a marker set with enough power to identify individuals and their parents. Genetic diagnostics generally include the analysis of related animals. In this work, the degree of information provided by SNPs for a consanguineous herd of cattle was compared with that provided by STRs. Thirty-six closely related Angus cattle were genotyped for 18 STRs and 116 SNPs. Cumulative SNP exclusion power values (Q) for paternity and sample matching probability (MP) yielded values greater than 0.9998 and 4.32E-42, respectively. Generally 2-3 SNPs per STR were needed to obtain an equivalent Q value. The MP showed that 24 SNPs were equivalent to the ISAG (International Society for Animal Genetics) minimal recommended set of 12 STRs (MP ~ 10^-11). These results provide valuable genetic data that support the consensus SNP panel for bovine genetic identification developed by the Parentage Recording Working Group of ICAR (International Committee for Animal Recording).
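The matching probability figure quoted above can be illustrated with a short calculation. The following is a minimal sketch, not the authors' method: it assumes biallelic SNPs in Hardy-Weinberg equilibrium, independent loci, and unrelated individuals (which a consanguineous herd does not satisfy), and the function names and the allele frequency of 0.5 are hypothetical choices for illustration. Under those assumptions the per-locus probability that two animals share a genotype is the sum of squared genotype frequencies, and the panel value is the product over loci.

```python
from math import prod


def locus_match_probability(p: float) -> float:
    """Chance that two unrelated animals share a genotype at one biallelic SNP.

    Assumes Hardy-Weinberg proportions; p is the frequency of one allele.
    """
    q = 1.0 - p
    genotype_freqs = (p * p, 2 * p * q, q * q)  # AA, AB, BB
    return sum(f * f for f in genotype_freqs)


def combined_match_probability(allele_freqs) -> float:
    """Multiply per-locus values, assuming the SNPs are independent."""
    return prod(locus_match_probability(p) for p in allele_freqs)


# Hypothetical panel: 24 SNPs, each with allele frequency 0.5
# (the most informative case for a biallelic marker).
panel = [0.5] * 24
mp = combined_match_probability(panel)
print(f"Per-locus MP at p = 0.5: {locus_match_probability(0.5):.3f}")
print(f"Combined MP for 24 SNPs: {mp:.2e}")
```

With these idealized inputs the combined value falls in the 10^-11 range, the same order of magnitude as the figure reported for 24 SNPs; real panels with less balanced allele frequencies or related animals would give larger (less discriminating) values.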
Li Ping Zhou
Objective. To explore the genetic traits of Kidney-yang deficiency syndrome (KDS). Design. Twelve KDS subjects and three spouses from a typical KDS family were recruited. Their genomic DNA samples were genotyped by Affymetrix 100K single-nucleotide polymorphism (SNP) arrays. The linkage disequilibrium (LD) SNPs were generated using GeneChip DNA analysis software (GDAS, Affymetrix). Genes located within 100 bp of the flanks of LD SNPs were mined via GeneView. 29 exons of the doublecortin domain containing 5 (DCDC5), a representative gene within the flank of an LD SNP, were resequenced. Results. Five LD SNPs display midrange linkage with KDS. Two genes with established functions, DCDC5 and Leucyl-tRNA synthetase, were mined in the flanks of LD SNPs. Resequencing of DCDC5 revealed a nonsynonymous variation, in which 3764T/A was replaced by C/G. Accordingly, the Ser1172 was substituted by Pro1172. The S1172P substitution effect was evaluated as “possibly damaging” by PolyPhen. Conclusion. We have identified a genomic variation of DCDC5 based on the LD SNPs derived from a KDS family. DCDC5 and other genes surrounding these SNPs display some relationships with key symptoms of KDS.
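The link between the reported T-to-C change and the serine-to-proline substitution can be checked against the standard genetic code. The sketch below is an illustration only: the actual DCDC5 codon is not given in the abstract, so the code enumerates every serine codon and every single T-to-C change rather than assuming a specific one, and the helper name t_to_c_outcomes is hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def t_to_c_outcomes(aa_from: str = "S"):
    """List (codon, position, mutant codon, mutant amino acid) for every
    single T->C change in codons that encode aa_from (positions are 1-based)."""
    results = []
    for codon, aa in CODON_TABLE.items():
        if aa != aa_from:
            continue
        for pos, base in enumerate(codon):
            if base == "T":
                mutant = codon[:pos] + "C" + codon[pos + 1:]
                results.append((codon, pos + 1, mutant, CODON_TABLE[mutant]))
    return results


# Keep only the T->C changes that turn serine into proline.
ser_to_pro = [r for r in t_to_c_outcomes("S") if r[3] == "P"]
for codon, pos, mutant, aa in ser_to_pro:
    print(f"{codon} -> {mutant} (codon position {pos}): S -> {aa}")
```

Every serine-to-proline outcome in this enumeration comes from a T-to-C change at the first position of a TCN codon, which is consistent with a single-base T/A to C/G replacement producing S1172P.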
Courtney, D G; Moore, J E; Atkinson, S D; Maurizi, E; Allen, E H A; Pedrioli, D M L; McLean, W H I; Nesbit, M A; Moore, C B T
CRISPR/Cas9-based therapeutics hold the possibility for permanent treatment of genetic disease. The potency and specificity of this system have been used to target dominantly inherited conditions caused by heterozygous missense mutations through inclusion of the mutated base in the short-guide RNA (sgRNA) sequence. This research evaluates a novel approach for targeting heterozygous single-nucleotide polymorphisms (SNPs) using CRISPR/Cas9. We determined that a mutation within KRT12, which causes Meesmann's epithelial corneal dystrophy (MECD), leads to the occurrence of a novel protospacer adjacent motif (PAM). We designed an sgRNA complementary to the sequence adjacent to this SNP-derived PAM and evaluated its potency and allele specificity both in vitro and in vivo. This sgRNA was found to be highly effective at reducing the expression of mutant KRT12 mRNA and protein in vitro. To assess its activity in vivo, we injected a combined Cas9/sgRNA expression construct into the corneal stroma of a humanized MECD mouse model. Sequence analysis of corneal genomic DNA revealed non-homologous end-joining repair resulting in frame-shifting deletions within the mutant KRT12 allele. This study is the first to