Smith Andrew M
Background: Microarrays are an invaluable tool in many modern genomic studies. It is generally perceived that decreasing the size of microarray features leads to arrays with higher resolution (due to greater feature density), but this increase in resolution can compromise sensitivity. Results: We demonstrate that barcode microarrays with smaller features are equally capable of detecting variation in DNA barcode intensity when compared to larger feature sizes within a specific microarray platform. The barcodes used in this study are the well-characterized set derived from the Yeast KnockOut (YKO) collection used for screens of pooled yeast (Saccharomyces cerevisiae) deletion mutants. We treated these pools with the glycosylation inhibitor tunicamycin as a test compound. Three generations of barcode microarrays at 30, 8 and 5 μm feature sizes independently identified the primary target of tunicamycin to be ALG7. Conclusion: We show that the data obtained with 5 μm feature size is of comparable quality to the 30 μm size and propose that further shrinking of features could yield barcode microarrays with equal or greater resolving power and, more importantly, higher density.
International fish trade reached an import value of 62.8 billion Euro in 2006, of which the European Union accounted for 44.6%. Species identification is a key problem throughout the life cycle of fishes: from eggs and larvae to adults in fisheries research and control, as well as processed fish products in consumer protection. This study aims to evaluate the applicability of the three mitochondrial genes 16S rRNA (16S), cytochrome b (cyt b), and cytochrome oxidase subunit I (COI) for the identification of 50 European marine fish species by combining techniques of "DNA barcoding" and microarrays. In a DNA barcoding approach, neighbour joining (NJ) phylogenetic trees of 369 16S, 212 cyt b, and 447 COI sequences indicated that cyt b and COI are suitable for unambiguous identification, whereas 16S failed to discriminate closely related flatfish and gurnard species. In the course of probe design for DNA microarray development, each of the markers yielded a high number of potentially species-specific probes in silico, although many of them were rejected based on microarray hybridisation experiments. None of the markers provided probes to discriminate the sibling flatfish and gurnard species. However, since 16S probes were less negatively influenced by the "position of label" effect and showed the lowest rejection rate and the highest mean signal intensity, 16S is more suitable for DNA microarray probe design than cyt b and COI. The large portion of COI probes rejected after hybridisation experiments (>90%) renders the DNA barcoding marker rather unsuitable for this high-throughput technology. Based on these data, a DNA microarray containing 64 functional oligonucleotide probes for the identification of 30 out of the 50 fish species investigated was developed. It represents the next step towards an automated and easy-to-handle method to identify fish, ichthyoplankton, and fish products.
Michael A Cook
BACKGROUND: Molecular barcode arrays provide a powerful means to analyze cellular phenotypes in parallel through detection of short (20-60 base) unique sequence tags, or "barcodes", associated with each strain or clone in a collection. However, costs of current methods for microarray construction, whether by in situ oligonucleotide synthesis or ex situ coupling of modified oligonucleotides to the slide surface, are often prohibitive to large-scale analyses. METHODOLOGY/PRINCIPAL FINDINGS: Here we demonstrate that unmodified 20mer oligonucleotide probes printed on conventional surfaces show comparable hybridization signals to covalently linked 5'-amino-modified probes. As a test case, we undertook systematic cell size analysis of the budding yeast Saccharomyces cerevisiae genome-wide deletion collection by size separation of the deletion pool followed by determination of strain abundance in size fractions by barcode arrays. We demonstrate that the properties of a 13K unique feature spotted 20mer oligonucleotide barcode microarray compare favorably with an analogous covalently linked oligonucleotide array. Further, cell size profiles obtained with the size selection/barcode array approach recapitulate previous cell size measurements of individual deletion strains. Finally, through atomic force microscopy (AFM), we characterize the mechanism of hybridization to unmodified barcode probes on the slide surface. CONCLUSIONS/SIGNIFICANCE: These studies push the lower limit of probe size in genome-scale unmodified oligonucleotide microarray construction and demonstrate a versatile, cost-effective and reliable method for molecular barcode analysis.
Xu, Qikai; Schlabach, Michael R; Hannon, Gregory J; Elledge, Stephen J
DNA barcodes linked to genetic features greatly facilitate screening these features in pooled formats using microarray hybridization, and new tools are needed to design large sets of barcodes to allow construction of large barcoded mammalian libraries such as shRNA libraries. Here we report a framework for designing large sets of orthogonal barcode probes. We demonstrate the utility of this framework by designing 240,000 barcode probes and testing their performance by hybridization. From the test hybridizations, we also discovered new probe design rules that significantly reduce cross-hybridization after their introduction into the framework of the algorithm. These rules should improve the performance of DNA microarray probe designs for many applications.
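The abstract does not describe the design algorithm itself. Purely as an illustration of what "orthogonal" can mean for a barcode set, the sketch below greedily keeps candidate sequences whose Hamming distance to every sequence already kept is at least a threshold, after a simple GC-content filter. The function names, the 10-mer length, the GC window, and the distance threshold are all assumptions made for this example and are not taken from the authors' framework.

```python
import random

def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)

def pick_orthogonal(candidates, min_dist, gc_range=(0.4, 0.6)):
    """Greedy filter: keep a candidate only if its GC content is in range
    and it differs from every kept sequence in at least min_dist positions."""
    kept = []
    for seq in candidates:
        if not (gc_range[0] <= gc_fraction(seq) <= gc_range[1]):
            continue
        if all(hamming(seq, other) >= min_dist for other in kept):
            kept.append(seq)
    return kept

# Hypothetical candidate pool of random 10-mers (fixed seed for repeatability).
rng = random.Random(0)
pool = ["".join(rng.choice("ACGT") for _ in range(10)) for _ in range(2000)]
barcodes = pick_orthogonal(pool, min_dist=4)
```

Hamming distance is only a crude stand-in for cross-hybridization; a realistic design would also have to consider melting temperature and partial complementarity, which this sketch ignores.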
Dankolo, Muhammad Nasiru; Radzi, Nor Haizan Mohamed; Sallehuddin, Roselina; Mustaffa, Noorfa Haszlinna
Microarray systems enable experts to examine gene profiles at the molecular level using machine learning algorithms. This increases the potential for classification and diagnosis of many diseases at the gene expression level. However, numerous difficulties may affect the efficiency of machine learning algorithms, including the vast number of gene features in the original data. Many of these features may be unrelated to the intended analysis. Therefore, feature selection needs to be performed during data pre-processing. Many feature selection algorithms have been developed and applied to microarray data, including metaheuristic optimization algorithms. This paper discusses the application of metaheuristic algorithms for feature selection in microarray datasets. The study reveals that these algorithms have yielded interesting results with limited resources, thereby saving the computational expense of machine learning algorithms.
High dimensionality of microarray data sets may lead to low efficiency and overfitting. In this paper, a multiphase cooperative game theoretic feature selection approach is proposed for microarray data classification. In the first phase, due to high dimension of microarray data sets, the features are reduced using one of the two filter-based feature selection methods, namely, mutual information and Fisher ratio. In the second phase, Shapley index is used to evaluate the power of each feature. The main innovation of the proposed approach is to employ Qualitative Mutual Information (QMI) for this purpose. The idea of Qualitative Mutual Information causes the selected features to have more stability and this stability helps to deal with the problem of data imbalance and scarcity. In the third phase, a forward selection scheme is applied which uses a scoring function to weight each feature. The performance of the proposed method is compared with other popular feature selection algorithms such as Fisher ratio, minimum redundancy maximum relevance, and previous works on cooperative game based feature selection. The average classification accuracy on eleven microarray data sets shows that the proposed method improves both average accuracy and average stability compared to other approaches.
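To make the first-phase filter concrete, here is a minimal sketch of ranking features by a two-class Fisher ratio. The abstract does not give the exact formula; the common definition (mu0 - mu1)^2 / (var0 + var1) is assumed here, and the function names and toy data are hypothetical.

```python
from statistics import mean, pvariance

def fisher_ratio(values, labels):
    """Two-class Fisher ratio of one feature: (mu0 - mu1)^2 / (var0 + var1)."""
    a = [v for v, y in zip(values, labels) if y == 0]
    b = [v for v, y in zip(values, labels) if y == 1]
    denom = pvariance(a) + pvariance(b)
    if denom == 0:
        # Both classes are constant: perfectly separable or identical.
        return float("inf") if mean(a) != mean(b) else 0.0
    return (mean(a) - mean(b)) ** 2 / denom

def top_k_features(X, labels, k):
    """Rank the columns of X (a list of samples) by Fisher ratio, best first."""
    n_features = len(X[0])
    scores = [fisher_ratio([row[j] for row in X], labels) for j in range(n_features)]
    return sorted(range(n_features), key=lambda j: scores[j], reverse=True)[:k]

# Toy data: 4 samples, 3 features; feature 0 separates the classes best.
X = [[1.0, 5.0, 0.2],
     [1.2, 4.0, 0.9],
     [3.0, 5.5, 0.1],
     [3.1, 4.5, 0.8]]
y = [0, 0, 1, 1]
selected = top_k_features(X, y, k=1)
```

The later Shapley-index and forward-selection phases are not sketched here, since the abstract gives too little detail to reproduce them faithfully.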
Lan, Liang; Vucetic, Slobodan
A major challenge in microarray classification is that the number of features is typically orders of magnitude larger than the number of examples. In this paper, we propose a novel feature filter algorithm to select the feature subset with maximal discriminative power and minimal redundancy by solving a quadratic objective function with binary integer constraints. To improve the computational efficiency, the binary integer constraints are relaxed and a low-rank approximation to the quadratic term is applied. The proposed feature selection algorithm was extended to solve multi-task microarray classification problems. We compared the single-task version of the proposed feature selection algorithm with 9 existing feature selection methods on 4 benchmark microarray data sets. The empirical results show that the proposed method achieved the most accurate predictions overall. We also evaluated the multi-task version of the proposed algorithm on 8 multi-task microarray datasets. The multi-task feature selection algorithm resulted in significantly higher accuracy than when using the single-task feature selection methods.
Dec 8, 2015 ... mental stages was used to identify single feature polymorphisms (SFPs). ... on a high-density oligonucleotide expression array in which ... The sign (+/−) with SFPs indicates direction of polymorphism. In the (−) sign (i.e. ...
Meher, Prabina Kumar; Sahu, Tanmaya Kumar; Rao, A R
DNA barcoding is a molecular diagnostic method that allows automated and accurate identification of species based on a short and standardized fragment of DNA. To this end, an attempt has been made in this study to develop a computational approach for identifying a species by comparing its barcode with the barcode sequences of known species present in the reference library. Each barcode sequence was first mapped onto a numeric feature vector based on k-mer frequencies, and then the Random forest methodology was employed on the transformed dataset for species identification. The proposed approach outperformed similarity-based, tree-based, and diagnostic-based approaches and was found comparable with existing supervised learning based approaches in terms of species identification success rate, when compared on real and simulated datasets. Based on the proposed approach, an online web interface, SPIDBAR, has also been developed and made freely available at http://cabgrid.res.in:8080/spidbar/ for species identification by taxonomists.
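The mapping from a barcode to a k-mer frequency vector can be sketched as follows. This is a minimal illustration, not the SPIDBAR implementation: the choice k = 2, the handling of ambiguous bases, and the function name are assumptions.

```python
from itertools import product

def kmer_frequencies(seq: str, k: int = 2):
    """Map a DNA sequence to a vector of relative k-mer frequencies.

    The vector has 4**k entries in lexicographic k-mer order (AA, AC, AG, ...).
    """
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k].upper()
        if window in counts:  # skip windows with ambiguous bases such as N
            counts[window] += 1
            total += 1
    return [counts[m] / total if total else 0.0 for m in kmers]

vec = kmer_frequencies("ACGTACGTAC", k=2)
```

Vectors built this way for a reference library could then be passed to any random forest implementation; that classification step is left out here.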
Reinders Marcel JT
Background: Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. Results: We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state-of-the-art preprocessing methods, using a compendium consisting of eight different datasets, involving 1131 hybridizations, containing data from both one- and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical
Zena M Hira
Microarray databases are a large source of genetic data, which, upon proper analysis, could enhance our understanding of biology and medicine. Many microarray experiments have been designed to investigate the genetic mechanisms of cancer, and analytical approaches have been applied in order to classify different types of cancer or distinguish between cancerous and non-cancerous tissue. However, microarrays are high-dimensional datasets with high levels of noise and this causes problems when using machine learning methods. A popular approach to this problem is to search for a set of features that will simplify the structure and to some degree remove the noise from the data. The most widely used approach to feature extraction is principal component analysis (PCA), which assumes a multivariate Gaussian model of the data. More recently, non-linear methods have been investigated. Among these, manifold learning algorithms, for example Isomap, aim to project the data from a higher-dimensional space onto a lower-dimensional one. We have proposed a priori manifold learning for finding a manifold in which a representative set of microarray data is fused with relevant data taken from the KEGG pathway database. Once the manifold has been constructed, the raw microarray data is projected onto it and clustering and classification can take place. In contrast to earlier fusion-based methods, the prior knowledge from the KEGG databases is not used in, and does not bias, the classification process; it merely acts as an aid to find the best space in which to search the data. In our experiments we have found that using our new manifold method gives better classification results than using either PCA or conventional Isomap.
Annavarapu, Chandra Sekhara Rao; Dara, Suresh; Banka, Haider
Cancer investigations in microarray data play a major role in cancer analysis and treatment. Cancer microarray data consists of complex gene expression patterns of cancer. In this article, a Multi-Objective Binary Particle Swarm Optimization (MOBPSO) algorithm is proposed for analyzing cancer gene expression data. Because of the high dimensionality of the data, a fast heuristic-based pre-processing technique is employed to remove some of the crude domain features from the initial feature set. Since the pre-processed and reduced feature set is still high dimensional, the proposed MOBPSO algorithm is used for finding further feature subsets. The objective functions are modeled by optimizing two conflicting objectives, i.e., the cardinality of the feature subsets and the distinctive capability of the selected subsets. As these two objective functions are conflicting in nature, they are well suited to multi-objective modeling. The experiments are carried out on benchmark gene expression datasets, i.e., Colon, Lymphoma and Leukaemia, available in the literature. The performance of the selected feature subsets is assessed by their classification accuracy and validated using 10-fold cross-validation. A detailed comparative study is also made to show the improvement or competitiveness of the proposed algorithm. PMID:27822174
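The two conflicting objectives (small subset, high discriminative ability) mean that candidate subsets are compared by Pareto dominance rather than by a single score. The sketch below shows only that comparison, not the particle swarm search itself; representing each candidate as a (subset_size, accuracy) pair and the sample values are assumptions made for illustration.

```python
def dominates(a, b):
    """a and b are (subset_size, accuracy) pairs.

    a dominates b if it is no worse in both objectives (size not larger,
    accuracy not lower) and strictly better in at least one.
    """
    no_worse = a[0] <= b[0] and a[1] >= b[1]
    strictly_better = a[0] < b[0] or a[1] > b[1]
    return no_worse and strictly_better

def pareto_front(solutions):
    """Return the solutions that no other solution dominates."""
    return [s for s in solutions
            if not any(dominates(t, s) for t in solutions if t is not s)]

# Hypothetical candidate subsets: (number of genes, cross-validated accuracy).
candidates = [(5, 0.90), (10, 0.95), (10, 0.85), (20, 0.95), (3, 0.80)]
front = pareto_front(candidates)
```

Here (10, 0.85) is dominated by (5, 0.90) and (20, 0.95) by (10, 0.95), so three trade-off solutions remain for the user to choose from.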
Microarray data usually contain a large number of genes but a small number of samples. Feature subset selection for microarray data aims at reducing the number of genes so that useful information can be extracted from the samples. Reducing the dimension of data sets further helps in improving the computational efficiency of the learning model. In this paper, we propose a modified algorithm that uses tabu search as the local search procedure within a Greedy Randomized Adaptive Search Procedure (GRASP) for high-dimensional microarray data sets. The proposed Tabu-based Greedy Randomized Adaptive Search Procedure algorithm is named TGRASP. In TGRASP, a new parameter named Tabu Tenure has been introduced, and the existing parameters NumIter and size have been modified. We observed that different parameter settings affect the quality of the optimum. The second proposed algorithm, FFGRASP (Firefly Greedy Randomized Adaptive Search Procedure), uses a firefly optimization algorithm in the local search optimization phase of GRASP. The firefly algorithm is one of the powerful algorithms for optimization of multimodal applications. Experimental results show that the proposed TGRASP and FFGRASP algorithms are much better than the existing algorithm with respect to three performance parameters, viz. accuracy, run time, and number of selected features. We have also compared both approaches using a unified metric (Extended Adjusted Ratio of Ratios), which shows that TGRASP outperforms the existing approach on six out of nine cancer microarray datasets and FFGRASP performs better on seven out of nine datasets.
Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A
Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.
Most of the available feature selection techniques in the literature are classifier bound. That is, the selected group of features is tied to the performance of a specific classifier, as in wrapper and hybrid approaches. Our objective in this study is to select a set of generic features not tied to any classifier, based on the proposed framework. This framework uses attribute clustering and feature ranking techniques in a pipeline in order to remove redundant features. On each uncovered cluster, signal-to-noise ratio, t-statistics and significance analysis of microarray are independently applied to select the top-ranked features. Both filter and evolutionary wrapper approaches have been considered for feature selection, and the data set with selected features is given to an ensemble of predefined, statistically different classifiers. The class labels of the test data are determined using a majority voting technique. Moreover, with the aforesaid objectives, this paper focuses on obtaining a stable result out of various classification models. Further, a comparative analysis has been performed to study the classification accuracy and computational time of the current approach and evolutionary wrapper techniques. The analysis gives better insight into the features and further enhances classification accuracy with less computational time.
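The majority-voting step can be illustrated with a few lines of code. This is a sketch under the assumption that each classifier in the ensemble outputs one label per test sample; the function name and the toy predictions are hypothetical, and the abstract does not say how ties are resolved.

```python
from collections import Counter

def majority_vote(predictions):
    """Combine per-classifier predictions into one label per test sample.

    predictions is a list with one entry per classifier, each entry being
    that classifier's list of labels for the test samples. On a tie, the
    label that was seen first among the tied ones is returned.
    """
    voted = []
    for sample_votes in zip(*predictions):
        label, _ = Counter(sample_votes).most_common(1)[0]
        voted.append(label)
    return voted

# Three hypothetical classifiers voting on three test samples.
ensemble_output = majority_vote([
    ["A", "B", "A"],
    ["A", "B", "B"],
    ["B", "B", "A"],
])
```

Using an odd number of classifiers for a two-class problem avoids ties altogether.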
In the past, cancer classification by doctors and radiologists was based on morphological and clinical features and had limited diagnostic ability. The recent arrival of DNA microarray technology has made it possible to monitor thousands of gene expressions concurrently on a single chip, which has stimulated progress in cancer classification. In this paper, we propose a hybrid approach for microarray data classification based on nearest neighbor (KNN), naive Bayes, and support vector machine (SVM). Feature selection prior to classification plays a vital role, and a feature selection technique that combines discrete wavelet transform (DWT) and moving window technique (MWT) is used. The performance of the proposed method is compared with conventional classifiers such as support vector machine, nearest neighbor, and naive Bayes. Experiments have been conducted on both real and benchmark datasets, and the results indicate that the ensemble approach produces higher classification accuracy than conventional classifiers. The proposed method can serve as an automated system for the classification of cancer and can be applied by doctors in real cases, which would benefit the medical community. This work further reduces the misclassification of cancers, which is unacceptable in cancer detection.
Harris Lyndsay N
Background: Like microarray-based investigations, high-throughput proteomics techniques require machine learning algorithms to identify biomarkers that are informative for biological classification problems. Feature selection and classification algorithms need to be robust to noise and outliers in the data. Results: We developed a recursive support vector machine (R-SVM) algorithm to select important genes/biomarkers for the classification of noisy data. We compared its performance to a similar, state-of-the-art method (SVM recursive feature elimination, or SVM-RFE), paying special attention to the ability to recover the true informative genes/biomarkers and the robustness to outliers in the data. Simulation experiments show that an improvement of roughly 5% to 20% over SVM-RFE can be achieved with regard to these properties. The SVM-based methods are also compared with a conventional univariate method and their respective strengths and weaknesses are discussed. R-SVM was applied to two sets of SELDI-TOF-MS proteomics data, one from a human breast cancer study and the other from a study on rat liver cirrhosis. Important biomarkers found by the algorithm were validated by follow-up biological experiments. Conclusion: The proposed R-SVM method is suitable for analyzing noisy high-throughput proteomics and microarray data and it outperforms SVM-RFE in the robustness to noise and in the ability to recover informative features. The multivariate SVM-based method outperforms the univariate method in classification performance, but univariate methods can reveal more of the differentially expressed features, especially when there are correlations between the features.
Xu, Jiucheng; Mu, Huiyu; Wang, Yun; Huang, Fangzhou
The selection of feature genes with high recognition ability from gene expression profiles has gained great significance in biology. However, most of the existing methods have a high time complexity and poor classification performance. Motivated by this, an effective feature selection method, called supervised locally linear embedding and Spearman's rank correlation coefficient (SLLE-SC²), is proposed, which is based on the concept of locally linear embedding and correlation coefficient algorithms. Supervised locally linear embedding takes into account class label information and improves the classification performance. Furthermore, Spearman's rank correlation coefficient is used to remove the coexpression genes. The experimental results obtained on four public tumor microarray datasets illustrate that our method is valid and feasible.
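The redundancy-removal step based on Spearman's rank correlation can be sketched as below. The supervised locally linear embedding part is not shown, and the greedy keep-or-drop rule, the 0.9 threshold, and the gene names are assumptions for illustration; the abstract does not state how coexpressed genes are actually discarded.

```python
def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    r = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for t in range(i, j + 1):
            r[order[t]] = avg
        i = j + 1
    return r

def pearson(x, y):
    """Pearson correlation; assumes neither input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman's rho is the Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

def drop_coexpressed(genes, threshold=0.9):
    """Keep a gene only if |rho| with every gene already kept is below threshold."""
    kept = []
    for name, values in genes.items():
        if all(abs(spearman(values, genes[k])) < threshold for k in kept):
            kept.append(name)
    return kept

genes = {
    "g1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "g2": [2.0, 4.1, 6.3, 8.0, 50.0],  # monotone in g1, so rho is 1
    "g3": [3.0, 1.0, 4.0, 1.5, 2.0],
}
kept = drop_coexpressed(genes)
```

Because the coefficient works on ranks, g2 is flagged as redundant with g1 even though the relationship between their raw values is far from linear.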
High dimensionality and small sample sizes, and their inherent risk of overfitting, pose great challenges for constructing efficient classifiers in microarray data classification. Therefore a feature selection technique should be applied prior to data classification to enhance prediction performance. In general, filter methods can be considered as a principal or auxiliary selection mechanism because of their simplicity, scalability, and low computational complexity. However, a series of trivial examples show that filter methods result in less accurate performance because they ignore the dependencies between features. Although a few publications have tried to reveal the relationships among features with multivariate methods, these methods describe such relationships only linearly, and simple linear combinations restrict the improvement in performance. In this paper, we used a kernel method to discover inherent nonlinear correlations among features as well as between features and the target. Moreover, the number of orthogonal components was determined by kernel Fisher's linear discriminant analysis (FLDA) in a self-adaptive manner rather than by manual parameter settings. To demonstrate the effectiveness of our method, we performed several experiments and compared the results between our method and other competitive multivariate feature selectors. In our comparison, we used two classifiers (support vector machine, k-nearest neighbor) on two groups of datasets, namely two-class and multi-class datasets. Experimental results demonstrate that the performance of our method is better than that of the others, especially on three hard-to-classify datasets, namely Wang's Breast Cancer, Gordon's Lung Adenocarcinoma and Pomeroy's Medulloblastoma.
Shah, M; Marchand, M; Corbeil, J
One of the objectives of designing feature selection learning algorithms is to obtain classifiers that depend on a small number of attributes and have verifiable future performance guarantees. There are few, if any, approaches that successfully address the two goals simultaneously. To the best of our knowledge, such algorithms that give theoretical bounds on the future performance have not been proposed so far in the context of the classification of gene expression data. In this work, we investigate the premise of learning a conjunction (or disjunction) of decision stumps in Occam's Razor, Sample Compression, and PAC-Bayes learning settings for identifying a small subset of attributes that can be used to perform reliable classification tasks. We apply the proposed approaches for gene identification from DNA microarray data and compare our results to those of the well-known successful approaches proposed for the task. We show that our algorithm not only finds hypotheses with a much smaller number of genes while giving competitive classification accuracy but also having tight risk guarantees on future performance, unlike other approaches. The proposed approaches are general and extensible in terms of both designing novel algorithms and application to other domains.
Microarray data has a high dimension of variables, but available datasets usually have only a small number of samples, thereby making the study of such datasets interesting and challenging. In the task of analyzing microarray data for the purpose of, e.g., predicting gene-disease association, feature selection is very important because it provides a way to handle the high dimensionality by exploiting information redundancy induced by associations among genetic markers. Judicious feature selection in microarray data analysis can result in significant reduction of cost while maintaining or improving the classification or prediction accuracy of learning machines that are employed to sort out the datasets. In this paper, we propose a gene selection method called Recursive Feature Addition (RFA), which combines supervised learning and statistical similarity measures. We compare our method with the following gene selection methods: Support Vector Machine Recursive Feature Elimination (SVMRFE), Leave-One-Out Calculation Sequential Forward Selection (LOOCSFS), and Gradient based Leave-one-out Gene Selection (GLGS). To evaluate the performance of these gene selection methods, we employ several popular learning classifiers on the MicroArray Quality Control phase II on predictive modeling (MAQC-II) breast cancer dataset and the MAQC-II multiple myeloma dataset. Experimental results show that gene selection is strictly paired with learning classifier. Overall, our approach outperforms other compared methods. The biological functional analysis based on the MAQC-II breast cancer dataset convinced us to apply our method for phenotype prediction. Additionally, learning classifiers also play important roles in the classification of microarray data and our experimental results indicate that the Nearest Mean Scale Classifier (NMSC) is a good choice due to its prediction reliability and its stability across the three performance measurements: Testing accuracy, MCC values, and
Sontrop, H.M.J.; Moerland, P.D.; Van den Ham, R.; Reinders, M.J.T.; Verhaegh, W.F.J.
Background: Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for
Kuksa, Pavel; Pavlovic, Vladimir
In this work we consider barcode DNA analysis problems and address them using alternative, alignment-free methods and representations which model sequences as collections of short sequence fragments (features). The methods use fixed-length representations (spectrum) for barcode sequences to measure similarities or dissimilarities between sequences coming from the same or different species. The spectrum-based representation not only allows for accurate and computationally efficient species classification, but also opens the possibility of accurate clustering analysis of putative species barcodes and identification of critical within-barcode loci distinguishing barcodes of different sample groups. New alignment-free methods provide highly accurate and fast DNA barcode-based identification and classification of species with substantial improvements in accuracy and speed over state-of-the-art barcode analysis methods. We evaluate our methods on problems of species classification and identification using barcodes, important and relevant analytical tasks in many practical applications (adverse species movement monitoring, sampling surveys for unknown or pathogenic species identification, biodiversity assessment, etc.). On several benchmark barcode datasets, including ACG, Astraptes, Hesperiidae, Fish larvae, and Birds of North America, the proposed alignment-free methods considerably improve prediction accuracy compared to prior results. We also observe significant running time improvements over the state-of-the-art methods. Our results show that newly developed alignment-free methods for DNA barcoding can efficiently and with high accuracy identify specimens by examining only a few barcode features, resulting in increased scalability and interpretability of current computational approaches to barcoding.
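A spectrum representation and an alignment-free comparison can be sketched in a few lines. This is an illustration only: the choice k = 3, the cosine similarity, the nearest-reference rule, and the toy sequences and labels are assumptions, not the classifiers evaluated by the authors.

```python
from collections import Counter
from math import sqrt

def spectrum(seq, k=3):
    """k-mer count profile (the 'spectrum') of a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))

def cosine_similarity(s1, s2):
    """Cosine of the angle between two k-mer count profiles."""
    dot = sum(c * s2[m] for m, c in s1.items())  # missing k-mers count as 0
    n1 = sqrt(sum(c * c for c in s1.values()))
    n2 = sqrt(sum(c * c for c in s2.values()))
    return dot / (n1 * n2) if n1 and n2 else 0.0

def classify(query, references, k=3):
    """Return the label of the reference barcode with the most similar spectrum."""
    q = spectrum(query, k)
    best = max(references, key=lambda item: cosine_similarity(q, spectrum(item[0], k)))
    return best[1]

# Hypothetical reference barcodes with made-up species labels.
references = [
    ("ACGTACGTACGTACGT", "species_A"),
    ("TTGGCCAATTGGCCAA", "species_B"),
]
label = classify("ACGTACGAACGTACGT", references)
```

No alignment is computed at any point: the query differs from the first reference by a substitution, yet most of its 3-mers are still shared, which is what drives the match.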
Roger W Barrette
Several RT-PCR and genome sequencing strategies exist for the resolution of Foot-and-Mouth Disease virus (FMDV). While these approaches are relatively straightforward, they can be vulnerable to failure due to the unpredictable nature of FMDV genome sequence variations. Sequence independent single primer amplification (SISPA) followed by genotyping microarray offers an attractive unbiased approach to FMDV characterization. Here we describe a custom FMDV microarray and a companion feature and template-assisted assembler software (FAT-assembler) capable of resolving virus genome sequence using a moderate number of conserved microarray features. The results demonstrate that this approach may be used to rapidly characterize naturally occurring FMDV as well as an engineered chimeric strain of FMDV. The FAT-assembler, while applied to resolving FMDV genomes, represents a new bioinformatics approach that should be broadly applicable to interpreting microarray genotyping data for other viruses or target organisms.
Elena Purcaru; Cristian Toma
The paper presents a solution for encoding/decoding DNA information in 2D barcodes. The first part focuses on the existing techniques and symbologies in the 2D barcode field. The 2D barcode PDF417 is presented as a starting point. The adaptations and optimizations on PDF417 and on DataMatrix lead to the solution - DNA2DBC - DeoxyriboNucleic Acid Two Dimensional Barcode. The second part shows the DNA2DBC encoding/decoding process step by step. The conclusions enumerate the most important features ...
Park, Sungjin; Gildersleeve, Jeffrey C; Blixt, Klas Ola
In the last decade, carbohydrate microarrays have been core technologies for analyzing carbohydrate-mediated recognition events in a high-throughput fashion. A number of methods have been exploited for immobilizing glycans on the solid surface in a microarray format. This microarray...... of substrate specificities of glycosyltransferases. This review covers the construction of carbohydrate microarrays, detection methods of carbohydrate microarrays and their applications in biological and biomedical research....
Yang, Mingxing; Li, Xiumin; Li, Zhibin; Ou, Zhimin; Liu, Ming; Liu, Suhuan; Li, Xuejun; Yang, Shuyu
DNA microarray analysis is characterized by obtaining a large number of gene variables from a small number of observations. Cluster analysis is widely used to analyze DNA microarray data to make classification and diagnosis of disease. Because there are so many irrelevant and insignificant genes in a dataset, a feature selection approach must be employed in data analysis. The performance of cluster analysis of this high-throughput data depends on whether the feature selection approach chooses the most relevant genes associated with disease classes. Here we proposed a new method using multiple Orthogonal Partial Least Squares-Discriminant Analysis (mOPLS-DA) models and S-plots to select the most relevant genes to conduct three-class disease classification and prediction. We tested our method using Golub's leukemia microarray data. For three classes with subtypes, we proposed hierarchical orthogonal partial least squares-discriminant analysis (OPLS-DA) models and S-plots to select features for two main classes and their subtypes. For three classes in parallel, we employed three OPLS-DA models and S-plots to choose marker genes for each class. The power of feature selection to classify and predict three-class disease was evaluated using cluster analysis. Further, the general performance of our method was tested using four public datasets and compared with those of four other feature selection methods. The results revealed that our method effectively selected the most relevant features for disease classification and prediction, and its performance was better than that of the other methods.
Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey
Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.
We have previously developed a computational method for representing a genome as a barcode image, which makes various genomic features visually apparent. We have demonstrated that this visual capability has made some challenging genome analysis problems relatively easy to solve. We have applied this capability to a number of challenging problems, including (a) identification of horizontally transferred genes, (b) identification of genomic islands with special properties and (c) binning of metagenomic sequences, and achieved highly encouraging results. These application results inspired us to develop this barcode-based genome analysis server for public service, which supports the following capabilities: (a) calculation of the k-mer based barcode image for a provided DNA sequence; (b) detection of sequence fragments in a given genome with distinct barcodes from those of the majority of the genome; (c) clustering of provided DNA sequences into groups having similar barcodes; and (d) homology-based search using Blast against a genome database for any selected genomic regions deemed to have interesting barcodes. The barcode server provides a job management capability, allowing processing of a large number of analysis jobs for barcode-based comparative genome analyses. The barcode server is accessible at http://csbl1.bmb.uga.edu/Barcode.
Wisesty, Untari N.; Warastri, Riris S.; Puspitasari, Shinta Y.
Cancer is one of the major causes of morbidity and mortality worldwide. There is therefore a need for a system that can analyze microarray data derived from a patient's deoxyribonucleic acid (DNA) and identify whether that person has cancer. Microarray data, however, have thousands of attributes, which makes data processing challenging; this is often referred to as the curse of dimensionality. In this study, we built a system capable of detecting whether or not a patient has cancer. A Genetic Algorithm is used for feature selection and a Momentum Backpropagation Neural Network as the classification method, with data taken from the Kent Ridge Bio-medical Dataset. In system testing, the system detected leukemia and colon tumor with a best accuracy of 98.33% for the colon tumor data and 100% for the leukemia data. Using the Genetic Algorithm for feature selection improved system accuracy from 64.52% to 98.33% for the colon tumor data and from 65.28% to 100% for the leukemia data, and the use of a momentum parameter accelerated convergence during neural network training.
Pratheepa, Maria; Jalali, Sushil Kumar; Arokiaraj, Robinson Silvester; Venkatesan, Thiruvengadam; Nagesh, Mandadi; Panda, Madhusmita; Pattar, Sharath
The Insect Barcode Information System, called Insect Barcode Informática (IBIn), is an online database resource developed by the National Bureau of Agriculturally Important Insects, Bangalore. This database provides acquisition, storage, analysis and publication of DNA barcode records of agriculturally important insects for researchers, specifically in India and other countries. It bridges a gap in bioinformatics by integrating molecular, morphological and distribution details of agriculturally important insects. IBIn was developed using PHP/MySQL, following relational database management concepts. The database is based on a client-server architecture, in which many clients can access data simultaneously. IBIn is freely available online and is user-friendly. IBIn allows registered users to input new information and to search and view information related to DNA barcodes of agriculturally important insects. This paper describes the current status of insect barcoding in India and briefly introduces the IBIn database. http://www.nabg-nbaii.res.in/barcode.
Cross, Joseph; Garard, Helen; Currie, Tina
DNA barcoding is increasingly being introduced into biological science educational curricula worldwide. The technique has a number of features that make it ideal for science curricula and particularly for Project-Based Learning (PBL). This report outlines the development of a DNA barcoding project in an Australian TAFE college, which also combined…
Tapia, Elizabeth; Spetale, Flavio; Krsticevic, Flavia; Angelone, Laura; Bulacio, Pilar
For many parallel applications of Next-Generation Sequencing (NGS) technologies short barcodes able to accurately multiplex a large number of samples are demanded. To address these competitive requirements, the use of error-correcting codes is advised. Current barcoding systems are mostly built from short random error-correcting codes, a feature that strongly limits their multiplexing accuracy and experimental scalability. To overcome these problems on sequencing systems impaired by mismatch errors, the alternative use of binary BCH and pseudo-quaternary Hamming codes has been proposed. However, these codes either fail to provide a fine-scale with regard to size of barcodes (BCH) or have intrinsic poor error correcting abilities (Hamming). Here, the design of barcodes from shortened binary BCH codes and quaternary Low Density Parity Check (LDPC) codes is introduced. Simulation results show that although accurate barcoding systems of high multiplexing capacity can be obtained with any of these codes, using quaternary LDPC codes may be particularly advantageous due to the lower rates of read losses and undetected sample misidentification errors. Even at mismatch error rates of 10⁻² per base, 24-nt LDPC barcodes can be used to multiplex roughly 2000 samples with a sample misidentification error rate in the order of 10⁻⁹ at the expense of a rate of read losses just in the order of 10⁻⁶.
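The trade-off this abstract describes, between undetected sample misidentification and read losses, can be made concrete with a small sketch. The code below does not implement BCH or LDPC decoding; it is a brute-force nearest-barcode decoder over a toy barcode set, and the function names and example barcodes are assumptions introduced for illustration. It relies on the standard coding-theory fact that a code with minimum Hamming distance d can correct up to floor((d - 1) / 2) mismatches.

```python
def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def min_distance(barcodes):
    """Smallest pairwise Hamming distance within a barcode set."""
    return min(
        hamming(a, b)
        for i, a in enumerate(barcodes)
        for b in barcodes[i + 1:]
    )


def demultiplex(read, barcodes):
    """Return the barcode within the correctable radius of the read, else None.

    A None result corresponds to a read loss: the read is discarded rather
    than risking assignment to the wrong sample.
    """
    radius = (min_distance(barcodes) - 1) // 2  # mismatches guaranteed correctable
    best = min(barcodes, key=lambda b: hamming(read, b))
    return best if hamming(read, best) <= radius else None


# Toy 6-nt barcode set with pairwise distance 4, so one mismatch is correctable.
barcodes = ["AAAAAA", "AACCCC", "CCAACC"]
print(demultiplex("AAAATA", barcodes))  # one mismatch from AAAAAA
print(demultiplex("AACCAA", barcodes))  # two mismatches from two barcodes
```

Widening the accepted radius would rescue more reads at the cost of more misidentifications, which is the balance the simulation results above quantify for real codes.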
This is the first book to compare eight LDFs by different types of datasets, such as Fisher’s iris data, medical data with collinearities, Swiss banknote data that is a linearly separable data (LSD), student pass/fail determination using student attributes, 18 pass/fail determinations using exam scores, Japanese automobile data, and six microarray datasets (the datasets) that are LSD. We developed the 100-fold cross-validation for the small sample method (Method 1) instead of the LOO method. We proposed a simple model selection procedure to choose the best model having minimum M2 and Revised IP-OLDF based on MNM criterion was found to be better than other M2s in the above datasets. We compared two statistical LDFs and six MP-based LDFs. Those were Fisher’s LDF, logistic regression, three SVMs, Revised IP-OLDF, and another two OLDFs. Only a hard-margin SVM (H-SVM) and Revised IP-OLDF could discriminate LSD theoretically (Problem 2). We solved the defect of the generalized inverse matrices (Problem 3). For ...
Background: Technological advances are progressively increasing the application of genomics to a wider array of economically and ecologically important species. High-density maps enriched for transcribed genes facilitate the discovery of connections between genes and phenotypes. We report the construction of a high-density linkage map of expressed genes for the heterozygous genome of Eucalyptus using Single Feature Polymorphism (SFP) markers. Results: SFP discovery and mapping was achieved using pseudo-testcross screening and selective mapping to simultaneously optimize linkage mapping and microarray costs. SFP genotyping was carried out by hybridizing complementary RNA prepared from the xylem of 4.5-year-old trees to an SFP array containing 103,000 25-mer oligonucleotide probes representing 20,726 unigenes derived from a modest size expressed sequence tags collection. An SFP-mapping microarray with 43,777 selected candidate SFP probes representing 15,698 genes was subsequently designed and used to genotype SFPs in a larger subset of the segregating population drawn by selective mapping. A total of 1,845 genes were mapped, with 884 of them ordered with high likelihood support on a framework map anchored to 180 microsatellites with average density of 1.2 cM. Using more probes per unigene increased by two-fold the likelihood of detecting segregating SFPs, eventually resulting in more genes mapped. In silico validation showed that 87% of the SFPs map to the expected location on the 4.5X draft sequence of the Eucalyptus grandis genome. Conclusions: The Eucalyptus 1,845 gene map is the most highly enriched map for transcriptional information for any forest tree species to date. It represents a major improvement on the number of genes previously positioned on Eucalyptus maps and provides an initial glimpse at the gene space for this global tree genome. A general protocol is proposed to build high-density transcript linkage maps in less characterized plant species by SFP genotyping.
Lin, Chenxiang; Jungmann, Ralf; Leifer, Andrew M.; Li, Chao; Levner, Daniel; Church, George M.; Shih, William M.; Yin, Peng
The identification and differentiation of a large number of distinct molecular species with high temporal and spatial resolution is a major challenge in biomedical science. Fluorescence microscopy is a powerful tool, but its multiplexing ability is limited by the number of spectrally distinguishable fluorophores. Here, we used (deoxy)ribonucleic acid (DNA)-origami technology to construct submicrometre nanorods that act as fluorescent barcodes. We demonstrate that spatial control over the positioning of fluorophores on the surface of a stiff DNA nanorod can produce 216 distinct barcodes that can be decoded unambiguously using epifluorescence or total internal reflection fluorescence microscopy. Barcodes with higher spatial information density were demonstrated via the construction of super-resolution barcodes with features spaced by ~40 nm. One species of the barcodes was used to tag yeast surface receptors, which suggests their potential applications as in situ imaging probes for diverse biomolecular and cellular entities in their native environments.
Background: DNA barcoding is a key tool for assessing biodiversity in both taxonomic and environmental studies. Essential features of barcodes include their applicability to a wide spectrum of taxa and their ability to identify even closely related species. Several DNA regions have been proposed as barcodes, and the region selected strongly influences the output of a study. However, formal comparisons between barcodes have remained limited until now. Here we present a standard method for evaluating barcode quality, based on the use of a new bioinformatic tool that performs in silico PCR over large databases. We illustrate this approach by comparing the taxonomic coverage and the resolution of several DNA regions already proposed for the barcoding of vertebrates. To assess the relationship between in silico and in vitro PCR, we also developed specific primers amplifying different species of Felidae, and we tested them using both kinds of PCR. Results: Tests on specific primers confirmed the correspondence between in silico and in vitro PCR. Nevertheless, results of in silico and in vitro PCRs can be somewhat different, partly because tuning PCR conditions can increase the performance of primers with limited taxonomic coverage. The in silico evaluation of DNA barcodes showed a strong variation of taxonomic coverage (i.e., universality): barcodes based on highly degenerated primers and those corresponding to the conserved region of the Cyt-b showed the highest coverage. As expected, longer barcodes had a better resolution than shorter ones, which are however more convenient for ecological studies analysing environmental samples. Conclusions: In silico PCR could be used to improve the performance of a study, by allowing the preliminary comparison of several DNA regions in order to identify the most appropriate barcode depending on the study aims.
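The idea of in silico PCR can be sketched in a few lines. The code below is an illustrative simplification and not the bioinformatic tool described in the abstract: it searches a single template strand, tolerates a fixed number of mismatches per primer, ignores degenerate bases and annealing thermodynamics, and all names and the toy template are assumptions.

```python
def revcomp(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sites(template, primer, max_mismatches):
    """Start positions where the primer matches with at most max_mismatches."""
    n = len(primer)
    return [
        i
        for i in range(len(template) - n + 1)
        if sum(a != b for a, b in zip(template[i:i + n], primer)) <= max_mismatches
    ]


def in_silico_pcr(template, forward, reverse, max_mismatches=1, max_len=1000):
    """Return predicted amplicons (primer sites included) on the given strand."""
    # The reverse primer anneals to the opposite strand, so on this strand
    # its binding site reads as its reverse complement.
    rev_site = revcomp(reverse)
    amplicons = []
    for start in find_sites(template, forward, max_mismatches):
        for r in find_sites(template, rev_site, max_mismatches):
            end = r + len(rev_site)
            # Forward site must lie entirely upstream of the reverse site.
            if start + len(forward) <= r and end - start <= max_len:
                amplicons.append(template[start:end])
    return amplicons


# Toy template, invented for the example.
template = "TTT" + "ACGTAC" + "GGGGGGGG" + "CATTGA" + "AAA"
print(in_silico_pcr(template, "ACGTAC", "TCAATG", max_mismatches=0))
```

Running such a search over every sequence of a taxon in a reference database, and counting how many yield an amplicon, gives a rough estimate of the taxonomic coverage that the abstract compares across barcode regions.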
Zhang, Yi; Sun, Jiashu; Zou, Yu; Chen, Wenwen; Zhang, Wei; Xi, Jianzhong Jeff; Jiang, Xingyu
Multiplexed assay of analytes is of great importance for clinical diagnostics and other analytical applications. Barcode-based bioassays with the ability to encode and decode may realize this goal in a straightforward and consistent manner. We present here a microfluidic barcoded chip containing several sets of microchannels with different widths, imitating the commonly used barcode. A single barcoded microchip can carry out tens of individual protein/nucleic acid assays (encode) and immediately yield all assay results by a portable barcode reader or a smartphone (decode). The applicability of a barcoded microchip is demonstrated by human immunodeficiency virus (HIV) immunoassays for simultaneous detection of three targets (anti-gp41 antibody, anti-gp120 antibody, and anti-gp36 antibody) from six human serum samples. We can also determine seven pathogen-specific oligonucleotides by a single chip containing both positive and negative controls.
The Bar-Code Automated Waste Tracking System was designed to be a site-specific program with a general-purpose application for transportability to other facilities. The system is user-friendly, totally automated, and incorporates the use of a drive-up window that is close to the areas dealing in container preparation, delivery, pickup, and disposal. The system features "stop-and-go" operation rather than long, tedious, error-prone manual entry. The system is designed for automation but allows operators to concentrate on proper handling of waste while maintaining manual entry of data as a backup. A large wall plaque filled with bar-code labels is used to input specific details about any movement of waste.
There is an ongoing campaign to DNA barcode the world's >20 000 bee species. Recent revisions of Lasioglossum (Dialictus) (Hymenoptera: Halictidae) for Canada and the eastern United States were completed using integrative taxonomy. DNA barcode data from 110 species of L. (Dialictus) are examined for their value in identification and discovering additional taxonomic diversity. Specimen identification success was estimated using the best close match method. Error rates were 20% relative to current taxonomic understanding. Barcode Index Numbers (BINs) assigned using Refined Single Linkage Analysis (RESL) and barcode gaps using the Automatic Barcode Gap Discovery (ABGD) method were also assessed. RESL was incongruent for 44.5% of species, although some cryptic diversity may exist. Forty-three of 110 species were part of merged BINs with multiple species. The barcode gap is non-existent for the data set as a whole and ABGD showed levels of discordance similar to the RESL. The viridatum species-group is particularly problematic, so that DNA barcodes alone would be misleading for species delimitation and specimen identification. Character-based methods using fixed nucleotide substitutions could improve specimen identification success in some cases. The use of DNA barcoding for species discovery for standard taxonomic practice in the absence of a well-defined barcode gap is discussed.
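The best close match rule mentioned in this abstract can be sketched as follows. This is one common reading of the rule, not the analysis pipeline used in the study: the query takes the species name of its nearest reference barcode only if that barcode lies within a distance threshold, and is called ambiguous if equally close barcodes belong to different species. The 3% threshold, the uncorrected p-distance, and all names are assumptions for illustration.

```python
def p_distance(a, b):
    """Uncorrected proportion of differing sites between two aligned sequences."""
    sites = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not sites:
        return 1.0  # no comparable sites: treat as maximally distant
    return sum(x != y for x, y in sites) / len(sites)


def best_close_match(query, library, threshold=0.03):
    """Identify a query against a library of (species, aligned_sequence) pairs.

    Returns a species name, "ambiguous", or "no match".
    """
    scored = [(p_distance(query, seq), species) for species, seq in library]
    best = min(d for d, _ in scored)
    if best > threshold:
        return "no match"
    # All species tied at the smallest distance.
    nearest = {species for d, species in scored if d == best}
    return nearest.pop() if len(nearest) == 1 else "ambiguous"


# Toy library, invented for the example.
library = [
    ("sp1", "ACGTACGTAC"),
    ("sp2", "ACGTTCGTAC"),
    ("sp3", "TTTTTTTTTT"),
]
print(best_close_match("ACGTACGTAC", library))
print(best_close_match("GGGGGGGGGG", library))
```

When species share identical or near-identical barcodes, as reported above for merged BINs, this rule returns "ambiguous" or a wrong name, which is how identification error rates of the kind quoted in the abstract arise.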
Bucklin, Ann; Steinke, Dirk; Blanco-Bercial, Leocadio
More than 230,000 known species representing 31 metazoan phyla populate the world's oceans. Perhaps another 1,000,000 or more species remain to be discovered. There is reason for concern that species extinctions may outpace discovery, especially in diverse and endangered marine habitats such as coral reefs. DNA barcodes (i.e., short DNA sequences for species recognition and discrimination) are useful tools to accelerate species-level analysis of marine biodiversity and to facilitate conservation efforts. This review focuses on the usual barcode region for metazoans: a ~648 base-pair region of the mitochondrial cytochrome c oxidase subunit I (COI) gene. Barcodes have also been used for population genetic and phylogeographic analysis, identification of prey in gut contents, detection of invasive species, forensics, and seafood safety. More controversially, barcodes have been used to delimit species boundaries, reveal cryptic species, and discover new species. Emerging frontiers are the use of barcodes for rapid and increasingly automated biodiversity assessment by high-throughput sequencing, including environmental barcoding and the use of barcodes to detect species for which formal identification or scientific naming may never be possible.
.... The mobile barcodes must be used for marketing, promotional or educational purposes. They may not be... POSTAL SERVICE 39 CFR Part 111 Mobile Barcode Promotion AGENCY: Postal Service™. ACTION: Final... and flats, and Standard Mail® letters and flats bearing two-dimensional mobile barcodes. DATES...
Hollingsworth, Peter M.; Graham, Sean W.; Little, Damon P.
The main aim of DNA barcoding is to establish a shared community resource of DNA sequences that can be used for organismal identification and taxonomic clarification. This approach was successfully pioneered in animals using a portion of the cytochrome oxidase 1 (CO1) mitochondrial gene. In plants, establishing a standardized DNA barcoding system has been more challenging. In this paper, we review the process of selecting and refining a plant barcode; evaluate the factors which influence the discriminatory power of the approach; describe some early applications of plant barcoding and summarise major emerging projects; and outline tool development that will be necessary for plant DNA barcoding to advance. PMID:21637336
Nielsen, Rasmus; Matz, M.
The use of DNA as a tool for species identification has become known as "DNA barcoding" (Floyd et al., 2002; Hebert et al., 2003; Remigio and Hebert, 2003). The basic idea is straightforward: a small amount of DNA is extracted from the specimen, amplified and sequenced. The gene region sequenced...... is chosen so that it is nearly identical among individuals of the same species, but different between species, and therefore its sequence, can serve as an identification tag for the species ("DNA barcode"). By matching the sequence obtained from an unidentified specimen ("query" sequence) to the database...
G. A. Kukharev
The paper analyzes existing approaches to barcode generation and describes the structure of a system for generating barcodes from facial images. A method for generating standard linear barcodes from facial images is proposed. The method is based on differences of intensity gradients, which represent the images in the form of initial features. These features are then averaged into a limited number of intervals, the results are quantized into decimal digits from 0 to 9, and a table conversion into the standard barcode is performed. Testing was conducted on the Face94 database and a database of composite faces of different ages. It showed that the proposed method ensures the stability of the generated barcodes under changes of scale, pose and mirroring of facial images, as well as changes of facial expression and shadows on faces from local lighting. The proposed solutions are computationally low-cost and do not require any specialized image processing software to generate facial barcodes in real-time systems.
Yang, Cheng-Hong; Wu, Kuo-Chuan; Chuang, Li-Yeh; Chang, Hsueh-Wei
DNA barcode sequences are accumulating in large data sets. A barcode is generally a sequence larger than 1000 base pairs and generates a computational burden. Although DNA barcodes were originally envisioned as straightforward species tags, the use of barcode sequences for identification is rarely emphasized currently. Single-nucleotide polymorphism (SNP) association studies suggest that SNPs may be the ideal target of feature selection to discriminate between different species. We hypothesize that SNP-based barcodes may be more effective than full-length DNA barcode sequences for species discrimination. To address this issue, we tested a ribulose diphosphate carboxylase (rbcL) SNP barcoding (RSB) strategy using a decision tree algorithm. After alignment and trimming, 31 SNPs were discovered in the rbcL sequences from 38 Brassicaceae plant species. In the decision tree construction, these SNPs were used to set up the decision rules that assign the sequences into 2 groups level by level. After algorithm processing, 37 nodes and 31 loci were required for discriminating 38 species. Finally, sequence tags consisting of 31 rbcL SNP barcodes were identified for discriminating 38 Brassicaceae species based on the decision tree-selected SNP pattern using the RSB method. Taken together, this study provides the rationale that the SNP aspect of the DNA barcode for the rbcL gene is a useful and effective sequence for tagging 38 Brassicaceae species.
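The step from full aligned barcodes to short SNP tags can be sketched as below. The sketch swaps in a simple greedy site selection for the decision tree algorithm used in the study, so it illustrates the idea of SNP-based tags rather than the RSB method itself; the function names and the toy alignment are assumptions.

```python
def variable_sites(alignment):
    """Indices of alignment columns holding more than one base (candidate SNPs)."""
    seqs = list(alignment.values())
    return [i for i in range(len(seqs[0])) if len({s[i] for s in seqs}) > 1]


def snp_tags(alignment, sites):
    """Reduce each aligned sequence to its bases at the chosen sites."""
    return {name: "".join(seq[i] for i in sites) for name, seq in alignment.items()}


def count_distinct(alignment, sites):
    """Number of distinct tags produced by the chosen sites."""
    return len(set(snp_tags(alignment, sites).values()))


def greedy_site_selection(alignment):
    """Add SNP sites one at a time until every sequence has a distinct tag.

    Greedy stand-in for a decision tree: each round keeps the site that
    separates the most sequences, and stops when no site helps further.
    """
    candidates = variable_sites(alignment)
    chosen = []
    while candidates and count_distinct(alignment, chosen) < len(alignment):
        best = max(candidates, key=lambda i: count_distinct(alignment, chosen + [i]))
        if count_distinct(alignment, chosen + [best]) == count_distinct(alignment, chosen):
            break  # remaining sites cannot separate anything further
        chosen.append(best)
        candidates.remove(best)
    return sorted(chosen)


# Toy alignment of equal-length sequences, invented for the example.
alignment = {"sp1": "ACGTA", "sp2": "ACGTC", "sp3": "ATGTA", "sp4": "ATGTC"}
sites = greedy_site_selection(alignment)
print(sites, snp_tags(alignment, sites))
```

In the toy alignment two of five columns are enough to give each sequence a unique tag, which mirrors the abstract's point that a small set of SNP loci can replace the full-length barcode for discrimination.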
This book chapter details the protocols for DNA barcoding in plants, from DNA isolation, sequencing, and sequence annotation using MEGA through to the identification of barcode gaps. A good chapter for beginners in plant taxonomy.
Walt, David R
This tutorial review describes how fibre optic microarrays can be used to create a variety of sensing and measurement systems. This review covers the basics of optical fibres and arrays, the different microarray architectures, and describes a multitude of applications. Such arrays enable multiplexed sensing for a variety of analytes including nucleic acids, vapours, and biomolecules. Polymer-coated fibre arrays can be used for measuring microscopic chemical phenomena, such as corrosion and localized release of biochemicals from cells. In addition, these microarrays can serve as a substrate for fundamental studies of single molecules and single cells. The review covers topics of interest to chemists, biologists, materials scientists, and engineers.
BACKGROUND: Populus is an ecologically and economically important genus of trees, but distinguishing between wild species is relatively difficult due to extensive interspecific hybridization and introgression, and the high level of intraspecific morphological variation. The DNA barcoding approach is a potential solution to this problem. METHODOLOGY/PRINCIPAL FINDINGS: Here, we tested the discrimination power of five chloroplast barcodes and one nuclear barcode (ITS) among 95 trees that represent 21 Populus species from western China. Among all single barcode candidates, the discrimination power is highest for the nuclear ITS, progressively lower for chloroplast barcodes matK (M), trnG-psbK (G) and psbK-psbI (P), and trnH-psbA (H) and rbcL (R); the discrimination efficiency of the nuclear ITS (I) is also higher than any two-, three-, or even the five-locus combination of chloroplast barcodes. Among the five combinations of a single chloroplast barcode plus the nuclear ITS, H+I and P+I differentiated the highest and lowest portion of species, respectively. The highest discrimination rate for the barcodes or barcode combinations examined here is 55.0% (H+I), and usually discrimination failures occurred among species from sympatric or parapatric areas. CONCLUSIONS/SIGNIFICANCE: In this case study, we showed that when discriminating Populus species from western China, the nuclear ITS region represents a more promising barcode than any maternally inherited chloroplast region or combination of chloroplast regions. Meanwhile, combining the ITS region with chloroplast regions may improve the barcoding success rate and assist in detecting recent interspecific hybridizations. Failure to discriminate among several groups of Populus species from sympatric or parapatric areas may have been the result of incomplete lineage sorting, frequent interspecific hybridizations and introgressions. We agree with a previous proposal for constructing a tiered barcoding system in
McFadden, Catherine S; Benayahu, Yehuda; Pante, Eric; Thoma, Jana N; Nevarez, P Andrew; France, Scott C
The widespread assumption that COI and other mitochondrial genes will be ineffective DNA barcodes for anthozoan cnidarians has not been well tested for most anthozoans other than scleractinian corals. Here we examine the limitations of mitochondrial gene barcoding in the sub-class Octocorallia, a large, diverse, and ecologically important group of anthozoans. Pairwise genetic distance values (uncorrected p) were compared for three candidate barcoding regions: the Folmer region of COI; a fragment of the octocoral-specific mitochondrial protein-coding gene, msh1; and an extended barcode of msh1 plus COI with a short, adjacent intergenic region (igr1). Intraspecific variation was barcodes, and there was no discernible barcoding gap between intra- and interspecific p values. In a case study to assess regional octocoral biodiversity, COI and msh1 barcodes each identified 70% of morphospecies. In a second case study, a nucleotide character-based analysis correctly identified 70% of species in the temperate genus Alcyonium. Although interspecific genetic distances were 2× greater for msh1 than COI, each marker identified similar numbers of species in the two case studies, and the extended COI + igr1 + msh1 barcode more effectively discriminated sister taxa in Alcyonium. Although far from perfect for species identification, a COI + igr1 + msh1 barcode nonetheless represents a valuable addition to the depauperate set of characters available for octocoral taxonomy. © 2010 Blackwell Publishing Ltd.
WERNER-WASHBURNE, MARGARET; DAVIDSON, GEORGE S.
Collaboration between Sandia National Laboratories and the University of New Mexico Biology Department resulted in the capability to train students in microarray techniques and the interpretation of data from microarray experiments. These studies provide for a better understanding of the role of stationary phase and the gene regulation involved in exit from stationary phase, which may eventually have important clinical implications. Importantly, this research trained numerous students and is the basis for three new Ph.D. projects.
Xu, Yueshuang; Wang, Huan; Luan, Chengxin; Liu, Yuxiao; Chen, Baoan; Zhao, Yuanjin
Rapid and sensitive diagnosis of hematological infections based on the separation and detection of pathogenic bacteria in the patient's blood is a significant challenge. To address this, we herein present a new barcode technology that can simultaneously capture and detect multiple types of pathogenic bacteria from a complex sample. The barcodes are poly(ethylene glycol) (PEG) hydrogel inverse opal particles with characteristic reflection peak codes that remain stable during bacteria capture on their surfaces. As the spherical surface of the particles has an ordered porous nanostructure, the barcodes can provide not only more surface area for probe immobilization and reaction, but also a nanopatterned platform for highly efficient bioreactions. In addition, the PEG hydrogel scaffold could decrease non-specific adsorption through its anti-adhesive effect, and the decorated aptamer probes in the scaffolds could increase the sensitivity, reliability, and specificity of the bacteria capture and detection. Moreover, the tagged magnetic nanoparticles in the PEG scaffold could impart the barcodes with controllable movement under magnetic fields, which can be used to significantly increase the reaction speed and simplify the processing of the bioassays. Based on the described barcodes, it was demonstrated that the bacteria could be captured and identified even at low bacterial concentrations (100 CFU mL⁻¹) within 2.5 h, which is considerably shorter than the clinical "gold standard". These features make the barcodes ideal for capturing and detecting multiple bacteria from clinical samples for hematological infection diagnostics. Copyright © 2017 Elsevier B.V. All rights reserved.
Cartilaginous fish are particularly vulnerable to anthropogenic stressors and environmental change because of their K-selected reproductive strategy. Accurate data from scientific surveys and landings are essential to assess conservation status and to develop robust protection and management plans. Currently available data are often incomplete or incorrect as a result of inaccurate species identifications, due to a high level of morphological stasis, especially among closely related taxa. Moreover, several diagnostic characters clearly visible in adult specimens are less evident in juveniles. Here we present results generated by the ELASMOMED Consortium, a regional network aiming to sample and DNA-barcode the Mediterranean Chondrichthyans with the ultimate goal of providing a comprehensive DNA barcode reference library. This library will support and improve the molecular taxonomy of this group and the effectiveness of management and conservation measures. We successfully barcoded 882 individuals belonging to 42 species (17 sharks, 24 batoids and one chimaera), including four endemic and several threatened ones. Morphological misidentifications were found across most orders, further confirming the need for a comprehensive DNA barcoding library as a valuable tool for the reliable identification of specimens in support of taxonomists who are reviewing current identification keys. Despite low intraspecific variation among their barcode sequences and reduced sample size, five species showed preliminary evidence of phylogeographic structure. Overall, the ELASMOMED initiative further emphasizes the key role accurate DNA barcoding libraries play in establishing reliable diagnostic species-specific features in otherwise taxonomically problematic groups for biodiversity management and conservation actions.
The mitochondrial cytochrome c-oxidase subunit I (COI) can serve as a fast and accurate marker for the identification of animal species, and has been applied in a number of studies on birds. We here sequenced the COI gene for 387 individuals of 147 species of birds from the Netherlands, with 83 species being represented by >2 sequences. The Netherlands occupies a small geographic area and 95% of all samples were collected within a 50 km radius from one another. The intraspecific divergences averaged 0.29% among this assemblage, but most values were lower; the interspecific divergences averaged 9.54%. In all, 95% of species were represented by a unique barcode, with 6 species of gulls and skua (Larus and Stercorarius) having at least one shared barcode. This is best explained by these species representing recent radiations with ongoing hybridization. In contrast, one species, the Lesser Whitethroat Sylvia curruca, showed deep divergences, averaging 5.76% and up to 8.68% between individuals. These possibly represent two distinct taxa, S. curruca and S. blythi, both clearly separated in a haplotype network analysis. Our study adds to a growing body of DNA barcodes that have become available for birds, and shows that a DNA barcoding approach makes it possible to identify known Dutch bird species with very high resolution. In addition, some species were flagged up for further detailed taxonomic investigation, illustrating that even in ornithologically well-known areas such as the Netherlands, more is to be learned about the birds that are present.
Rouse Richard JD
Abstract Background Successful microarray experimentation requires a complex interplay between the slide chemistry, the printing pins, the nucleic acid probes and targets, and the hybridization milieu. Optimization of these parameters and a careful evaluation of emerging slide chemistries are a prerequisite to any large scale array fabrication effort. We have developed a 'microarray meter' tool which assesses the inherent variations associated with microarray measurement prior to embarking on large scale projects. Findings The microarray meter consists of nucleic acid targets (reference and dynamic range control) and probe components. Different plate designs containing identical probe material were formulated to accommodate different robotic and pin designs. We examined the variability in probe quality and quantity (as judged by the amount of DNA printed and remaining post-hybridization) using three robots equipped with capillary printing pins. Discussion The generation of microarray data with minimal variation requires consistent quality control of the (DNA) microarray manufacturing and experimental processes. Spot reproducibility is a measure primarily of the variations associated with printing. The microarray meter assesses array quality by measuring the DNA content for every feature. It provides a post-hybridization analysis of array quality by scoring probe performance using three metrics: (a) a measure of variability in the signal intensities, (b) a measure of the signal dynamic range and (c) a measure of variability of the spot morphologies.
In this paper we proposed a novel approach that will increase the capacity of barcode ... and data security and compression, over the traditional black and white ... A literature survey on 2D colour barcode brought about a new development to ...
... Mail barcodes and POSTNET (Postal Numeric Encoding Technique) barcodes are USPS-developed methods to... sealed envelope (the preferred method) or, if unenveloped, must be sealed or glued completely along all... routing code appears in the lower right corner. * * * * * [Delete current 5.6, DPBC Numeric Equivalent, in...
Lahaye, Renaud; van der Bank, Michelle; Bogarin, Diego; Warner, Jorge; Pupulin, Franco; Gigot, Guillaume; Maurin, Olivier; Duthoit, Sylvie; Barraclough, Timothy G; Savolainen, Vincent
DNA barcoding is a technique in which species identification is performed by using DNA sequences from a small fragment of the genome, with the aim of contributing to a wide range of ecological and conservation studies in which traditional taxonomic identification is not practical. DNA barcoding is well established in animals, but there is not yet any universally accepted barcode for plants. Here, we undertook intensive field collections in two biodiversity hotspots (Mesoamerica and southern Africa). Using >1,600 samples, we compared eight potential barcodes. Going beyond previous plant studies, we assessed to what extent a "DNA barcoding gap" is present between intra- and interspecific variations, using multiple accessions per species. Given its adequate rate of variation, easy amplification, and alignment, we identified a portion of the plastid matK gene as a universal DNA barcode for flowering plants. Critically, we further demonstrate the applicability of DNA barcoding for biodiversity inventories. In addition, analyzing >1,000 species of Mesoamerican orchids, DNA barcoding with matK alone reveals cryptic species and proves useful in identifying species listed in Convention on International Trade of Endangered Species (CITES) appendixes.
... eligibility for the use of POSTNET barcodes and allow only Intelligent Mail barcodes (IMbs) for automation price eligibility purposes, including Qualified Business Reply Mail (QBRM) prices. The Postal Service... and working with individual mailers and software providers to ensure that the use of an Intelligent...
This article describes how DNA barcoding investigations bring biology to life. Biologists recognize the power of DNA barcoding not just to teach biology through connections to the real world but also to immerse students in the exciting process of science. As an investigator in the Program for the Human Environment at Rockefeller University in New…
Hollingsworth, Peter M.; Forrest, Laura L.; Spouge, John L.; Hajibabaei, Mehrdad; Ratnasingham, Sujeevan; van der Bank, Michelle; Chase, Mark W.; Cowan, Robyn S.; Erickson, David L.; Fazekas, Aron J.; Graham, Sean W.; James, Karen E.; Kim, Ki-Joong; Kress, W. John; Schneider, Harald; van AlphenStahl, Jonathan; Barrett, Spencer C.H.; van den Berg, Cassio; Bogarin, Diego; Burgess, Kevin S.; Cameron, Kenneth M.; Carine, Mark; Chacón, Juliana; Clark, Alexandra; Clarkson, James J.; Conrad, Ferozah; Devey, Dion S.; Ford, Caroline S.; Hedderson, Terry A.J.; Hollingsworth, Michelle L.; Husband, Brian C.; Kelly, Laura J.; Kesanakurti, Prasad R.; Kim, Jung Sung; Kim, Young-Dong; Lahaye, Renaud; Lee, Hae-Lim; Long, David G.; Madriñán, Santiago; Maurin, Olivier; Meusnier, Isabelle; Newmaster, Steven G.; Park, Chong-Wook; Percy, Diana M.; Petersen, Gitte; Richardson, James E.; Salazar, Gerardo A.; Savolainen, Vincent; Seberg, Ole; Wilkinson, Michael J.; Yi, Dong-Keun; Little, Damon P.
DNA barcoding involves sequencing a standard region of DNA as a tool for species identification. However, there has been no agreement on which region(s) should be used for barcoding land plants. To provide a community recommendation on a standard plant barcode, we have compared the performance of 7 leading candidate plastid DNA regions (atpF–atpH spacer, matK gene, rbcL gene, rpoB gene, rpoC1 gene, psbK–psbI spacer, and trnH–psbA spacer). Based on assessments of recoverability, sequence quality, and levels of species discrimination, we recommend the 2-locus combination of rbcL+matK as the plant barcode. This core 2-locus barcode will provide a universal framework for the routine use of DNA sequence data to identify specimens and contribute toward the discovery of overlooked species of land plants. PMID:19666622
Mlinar, V.; Zunger, A.
Self-assembled semiconductor quantum dots (QDs) show in high-resolution single-dot spectra a multitude of sharp lines, resembling a barcode, due to various neutral and charged exciton complexes. Here we propose the 'spectral barcoding' method that deciphers structural motifs of dots by using such barcode as input to an artificial-intelligence learning system. Thus, we invert the common practice of deducing spectra from structure by deducing structure from spectra. This approach (i) lays the foundation for building a much needed structure-spectra understanding for large nanostructures and (ii) can guide future design of desired optical features of QDs by controlling during growth only those structural motifs that decide given optical features.
Bezeng, B S; Davies, T J; Daru, B H; Kabongo, R M; Maurin, O; Yessoufou, K; van der Bank, H; van der Bank, M
The African Centre for DNA Barcoding (ACDB) was established in 2005 as part of a global initiative to accurately and rapidly survey biodiversity using short DNA sequences. The mitochondrial cytochrome c oxidase 1 gene (CO1) was rapidly adopted as the de facto barcode for animals. Following the evaluation of several candidate loci for plants, the Plant Working Group of the Consortium for the Barcoding of Life in 2009 recommended that two plastid genes, rbcLa and matK, be adopted as core DNA barcodes for terrestrial plants. To date, numerous studies continue to test the discriminatory power of these markers across various plant lineages. Over the past decade, we at the ACDB have used these core DNA barcodes to generate a barcode library for southern Africa. To date, the ACDB has contributed more than 21 000 plant barcodes and over 3000 CO1 barcodes for animals to the Barcode of Life Database (BOLD). Building upon this effort, we at the ACDB have addressed questions related to community assembly, biogeography, phylogenetic diversification, and invasion biology. Collectively, our work demonstrates the diverse applications of DNA barcoding in ecology, systematics, evolutionary biology, and conservation.
Roy, Kevin R; Smith, Justin D; Vonesch, Sibylle C; Lin, Gen; Tu, Chelsea Szu; Lederer, Alex R; Chu, Angela; Suresh, Sundari; Nguyen, Michelle; Horecka, Joe; Tripathi, Ashutosh; Burnett, Wallace T; Morgan, Maddison A; Schulz, Julia; Orsley, Kevin M; Wei, Wu; Aiyar, Raeka S; Davis, Ronald W; Bankaitis, Vytas A; Haber, James E; Salit, Marc L; St Onge, Robert P; Steinmetz, Lars M
Our understanding of how genotype controls phenotype is limited by the scale at which we can precisely alter the genome and assess the phenotypic consequences of each perturbation. Here we describe a CRISPR-Cas9-based method for multiplexed accurate genome editing with short, trackable, integrated cellular barcodes (MAGESTIC) in Saccharomyces cerevisiae. MAGESTIC uses array-synthesized guide-donor oligos for plasmid-based high-throughput editing and features genomic barcode integration to prevent plasmid barcode loss and to enable robust phenotyping. We demonstrate that editing efficiency can be increased more than fivefold by recruiting donor DNA to the site of breaks using the LexA-Fkh1p fusion protein. We performed saturation editing of the essential gene SEC14 and identified amino acids critical for chemical inhibition of lipid signaling. We also constructed thousands of natural genetic variants, characterized guide mismatch tolerance at the genome scale, and ascertained that cryptic Pol III termination elements substantially reduce guide efficacy. MAGESTIC will be broadly useful to uncover the genetic basis of phenotypes in yeast.
A type barcode is a DNA barcode unequivocally tied to an authoritatively identified specimen, preferably the primary type specimen. Type barcodes are analogous, albeit subordinate, to type specimens, providing a stable reference to which other barcodes can be compared. We here designate and describe...
Abstract Background Members of the aquatic monocot family Lemnaceae (commonly called duckweeds) represent the smallest and fastest growing flowering plants. Their highly reduced morphology and infrequent flowering result in a dearth of characters for distinguishing between the nearly 38 species that exhibit these tiny, closely-related and often morphologically similar features within the same family of plants. Results We developed a simple and rapid DNA-based molecular identification system for the Lemnaceae based on sequence polymorphisms. We compared the barcoding potential of the seven plastid markers proposed by the CBOL (Consortium for the Barcode of Life) plant-working group to discriminate species within the land plants in 97 accessions representing 31 species from the family of Lemnaceae. A Lemnaceae-specific set of PCR and sequencing primers was designed for four plastid coding genes (rpoB, rpoC1, rbcL and matK) and three noncoding spacers (atpF-atpH, psbK-psbI and trnH-psbA) based on the Lemna minor chloroplast genome sequence. We assessed the ease of amplification and sequencing for these markers, examined the extent of the barcoding gap between intra- and inter-specific variation by pairwise distances, and evaluated successful identifications based on direct sequence comparison of the "best close match" and the construction of a phylogenetic tree. Conclusions Based on its reliable amplification, straightforward sequence alignment, and rates of DNA variation between species and within species, we propose that the atpF-atpH noncoding spacer could serve as a universal DNA barcoding marker for species-level identification of duckweeds.
Spouge, John L; Mariño-Ramírez, Leonardo
This chapter describes a workflow for measuring the efficacy of a barcode in identifying species. First, assemble individual sequence databases corresponding to each barcode marker. A controlled collection of taxonomic data is preferable to GenBank data, because GenBank data can be problematic, particularly when comparing barcodes based on more than one marker. To ensure proper controls when evaluating species identification, specimens not having a sequence in every marker database should be discarded. Second, select a computer algorithm for assigning species to barcode sequences. No algorithm has yet improved notably on assigning a specimen to the species of its nearest neighbor within a barcode database. Because global sequence alignments (e.g., with the Needleman-Wunsch algorithm, or some related algorithm) examine entire barcode sequences, they generally produce better species assignments than local sequence alignments (e.g., with BLAST). No neighboring method (e.g., global sequence similarity, global sequence distance, or evolutionary distance based on a global alignment) has yet shown a notable superiority in identifying species. Finally, "the probability of correct identification" (PCI) provides an appropriate measurement of barcode efficacy. The overall PCI for a data set is the average of the species PCIs, taken over all species in the data set. This chapter states explicitly how to calculate PCI, how to estimate its statistical sampling error, and how to use data on PCR failure to set limits on how much improvements in PCR technology can improve species identification.
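The workflow in this abstract (nearest-neighbor assignment, then averaging species PCIs) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the chapter's exact procedure: a species PCI is taken here to be the fraction of its specimens whose leave-one-out nearest neighbors are all conspecific, a simple mismatch fraction on pre-aligned sequences stands in for a global-alignment distance, and the function names and toy sequences are hypothetical.

```python
from collections import defaultdict


def mismatch_distance(a: str, b: str) -> float:
    """Fraction of differing sites between two pre-aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def overall_pci(records):
    """Leave-one-out nearest-neighbor PCI.

    records: list of (species_label, aligned_sequence) tuples.
    Returns (overall PCI, dict of per-species PCI).
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for i, (species, seq) in enumerate(records):
        # Distances from this specimen to every other specimen.
        others = [(mismatch_distance(seq, s), sp)
                  for j, (sp, s) in enumerate(records) if j != i]
        best = min(d for d, _ in others)
        nearest = {sp for d, sp in others if d == best}
        total[species] += 1
        # Assumption: a success requires every nearest neighbor to be
        # conspecific, so ties across species count as failures.
        if nearest == {species}:
            correct[species] += 1
    species_pci = {sp: correct[sp] / total[sp] for sp in total}
    # Overall PCI is the unweighted average of the species PCIs.
    overall = sum(species_pci.values()) / len(species_pci)
    return overall, species_pci


# Hypothetical toy database: sp_C is a singleton and can never be
# identified under leave-one-out.
toy = [
    ("sp_A", "AAAAAAAA"),
    ("sp_A", "AAAAAAAT"),
    ("sp_B", "CCCCCCCC"),
    ("sp_B", "CCCCCCCA"),
    ("sp_C", "AAAAAATT"),
]
overall, per_species = overall_pci(toy)
```

Averaging over species rather than over specimens keeps heavily sampled species from dominating the score, which matches the abstract's definition of the overall PCI.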
Vences, Miguel; Nagy, Zoltán T; Sonet, Gontran; Verheyen, Erik
Only a few major research programs are currently targeting COI barcoding of amphibians and reptiles (including chelonians and crocodiles), two major groups of tetrapods. Amphibian and reptile species are typically old, strongly divergent, and contain deep conspecific lineages which might lead to problems in species assignment with incomplete reference databases. As far as is known, there is no single pair of COI primers that will guarantee a sufficient rate of success across all amphibian and reptile taxa, or within major subclades of amphibians and reptiles, which means that the PCR amplification strategy needs to be adjusted depending on the specific research question. In general, many more amphibian and reptile taxa have been sequenced for 16S rDNA, which for some purposes may be a suitable complementary marker, at least until a more comprehensive COI reference database becomes available. DNA barcoding has successfully been used to identify amphibian larval stages (tadpoles) in species-rich tropical assemblages. Tissue sampling, DNA extraction, and amplification of COI are straightforward in amphibians and reptiles. Single primer pairs are likely to have a failure rate between 5 and 50% if taxa of a wide taxonomic range are targeted; in such cases the use of primer cocktails or subsequent hierarchical usage of different primer pairs is necessary. If the target group is taxonomically limited, many studies have followed a strategy of designing specific primers which then allow an easy and reliable amplification of all samples.
Nair, V.R.; Kidangan, F.X.; Prabhu, R.G.; Bucklin, A.; Nair, S.
Chaetognatha are the second most abundant zooplankton group in the Indian waters. Precise identification of the species is critical for biogeographical studies. DNA barcodes using mitochondrial cytochrome c oxidase (COI) of seven dominant...
Metri, Rahul; Jerath, Gaurav; Kailas, Govind; Gacche, Nitin; Pal, Adityabarna; Ramakrishnan, Vibin
A reduced representation in the format of a barcode has been developed to provide an overview of the topological nature of a given protein structure from a 3D coordinate file. The molecular structure of a protein coordinate file from the Protein Data Bank is first expressed in terms of an alpha-numero code and further converted to a barcode image. The barcode representation can be used to compare and contrast different proteins based on their structure. The utility of this method has been exemplified by comparing structural barcodes of proteins that belong to the same fold family, and across different folds. In addition to this, we have attempted to provide an illustration of (i) the structural changes often seen in a given protein molecule upon interaction with ligands and (ii) modifications in the overall topology of a given protein during evolution. The program is fully downloadable from the website http://www.iitg.ac.in/probar/. © 2013 The Protein Society.
Foottit, Robert G.; Maw, Eric; Hebert, P. D. N.
Background Many studies have shown the suitability of sequence variation in the 5′ region of the mitochondrial cytochrome c oxidase I (COI) gene as a DNA barcode for the identification of species in a wide range of animal groups. We examined 471 species in 147 genera of Hemiptera: Auchenorrhyncha drawn from specimens in the Canadian National Collection of Insects to assess the effectiveness of DNA barcoding in this group. Methodology/Principal Findings Analysis of the COI gene revealed less than 2% intra-specific divergence in 93% of the taxa examined, while minimum interspecific distances exceeded 2% in 70% of congeneric species pairs. Although most species are characterized by a distinct sequence cluster, sequences for members of many groups of closely related species either shared sequences or showed close similarity, with 25% of species separated from their nearest neighbor by less than 1%. Conclusions/Significance This study, although preliminary, provides DNA barcodes for about 8% of the species of this hemipteran suborder found in North America north of Mexico. Barcodes can enable the identification of many species of Auchenorrhyncha, but members of some species groups cannot be discriminated. Future use of DNA barcodes in regulatory, pest management, and environmental applications will be possible as the barcode library for Auchenorrhyncha expands to include more species and broader geographic coverage. PMID:25004106
Gerlach, Rebecca; Pinard, Dan; Weaver, Matt; Alattar, Adnan
This paper presents a speed comparison between the use of Digimarc® Barcodes and the Universal Product Code (UPC) for customer checkout at point of sale (POS). The recently introduced Digimarc Barcode promises to increase the speed of scanning packaged goods at POS. When this increase is exploited by workforce optimization systems, the retail industry could potentially save billions of dollars. The Digimarc Barcode is based on Digimarc's watermarking technology, and it is imperceptible, very robust, and does not require any special ink, material, or printing processes. Using an image-based scanner, a checker can quickly scan consumer packaged goods (CPG) embedded with the Digimarc Barcode without the need to reorient the packages with respect to the scanner. Faster scanning of packages saves money and enhances customer satisfaction. It reduces the length of the queues at checkout, reduces the cost of cashier labor, and makes self-checkout more convenient. This paper quantifies the increase in POS scanning rates resulting from the use of the Digimarc Barcode versus the traditional UPC. It explains the testing methodology, describes the experimental setup, and analyzes the obtained results. It concludes that the Digimarc Barcode increases the number of items per minute (IPM) scanned by at least 50% over the traditional UPC.
Hachesu, Peyman Rezaei; Zyaei, Leila; Hassankhani, Hadi
Lack of attention to the proper use of barcodes leads to their non-use or misuse in hospitals. The present research aimed to investigate the requirements for and barriers to using barcode technology and to present suggestions for its use. The research is observational-descriptive. The data were collected using a purpose-designed checklist whose validity was assessed. The checklist consists of two parts: "requirements" for and "barriers" to using barcodes. The research community included 10 teaching hospitals and 65 participants drawn from the hospitals. The collected data were analyzed using descriptive statistics. Required changes to workflow processes in the hospital and their compliance with hospital policy are requirements that had not been met in 90% of hospitals. Prioritization of some hospital processes for barcoding, system integration with the Hospital Information System (HIS), staff training, and budgeting are requirements for successful implementation that had not been met in 80% of hospitals. Dissatisfaction with the quality of barcode labels and the lack of adequate scanners, both at a rate of 100%, and the lack of understanding of the necessary requirements for implementing barcodes, at 80%, were the most important barriers. Integration of the barcode system with the clinical workflow should be considered. Lack of knowledge and understanding of the infrastructure, inadequate staff training, and technological problems are considered the greatest barriers.
Background DNA barcoding provides a rapid, accurate, and standardized method for species-level identification using short DNA sequences. Such a standardized identification method is useful for mapping all the species on Earth, particularly when DNA sequencing technology is cheaply available. There are many nations in Asia with many biodiversity resources that need to be mapped and registered in databases. Results We have built BioBarcode, a general DNA barcode data processing system based on open source software, which serves as a general-purpose database and server. It uses MySQL RDBMS 5.0, BLAST2, and the Apache httpd server. An exemplary database of BioBarcode has around 11,300 specimen entries (including GenBank data) and registers the biological species to map their genetic relationships. The BioBarcode database contains a chromatogram viewer which improves the performance in DNA sequence analyses. Conclusion Asia has a very high degree of biodiversity and the BioBarcode database server system aims to provide an efficient bioinformatics protocol that can be freely used by Asian researchers and research organizations interested in DNA barcoding. BioBarcode promotes the rapid acquisition of biological species DNA sequence data that meet global standards by providing specialized services, and provides useful tools that will make barcoding cheaper and faster in the biodiversity community, such as standardization, depository, management, and analysis of DNA barcode data. The system can be downloaded upon request, and an exemplary server has been constructed with which to build an Asian biodiversity system (http://www.asianbarcode.org). PMID:19958506
But, there is growing interest in barcoding on the part of diverse countries from around the world, as demonstrated at the Third International Barcode of Life conference held in Mexico, November 2009. This project will allow iBOL to expand the application of barcoding to developing countries. This will involve establishing ...
Barcode scanning has become more than just fun. Now libraries and businesses are leveraging barcode technology as an innovative tool to market their products and ideas. Developed and popularized in Japan, these Quick Response (QR) or two-dimensional barcodes allow marketers to provide interactive content in an otherwise static environment. In this…
Makarova, Olga; Contaldo, Nicoletta; Paltrinieri, Samanta
Phytoplasma identification has proved difficult due to their inability to be maintained in vitro. DNA barcoding is an identification method based on comparison of a short DNA sequence with known sequences from a database. A DNA barcoding tool has been developed for phytoplasma identification. While other sequence-based methods may be well adapted to identification of particular strains of phytoplasmas, often they cannot be used for the simultaneous identification of phytoplasmas from different groups. The phytoplasma DNA barcoding protocol in this chapter, based on the tuf and 16S rRNA genes, can be used to identify the following phytoplasma groups: 16SrI, 16SrII, 16SrIII, 16SrIV, 16SrV, 16SrVI, 16SrVII, 16SrIX, 16SrX, 16SrXI, 16SrXII, 16SrXV, 16SrXX, 16SrXXI.
Kress, W John; García-Robledo, Carlos; Uriarte, Maria; Erickson, David L
The use of DNA barcodes, which are short gene sequences taken from a standardized portion of the genome and used to identify species, is entering a new phase of application as more and more investigations employ these genetic markers to address questions relating to the ecology and evolution of natural systems. The suite of DNA barcode markers now applied to specific taxonomic groups of organisms are proving invaluable for understanding species boundaries, community ecology, functional trait evolution, trophic interactions, and the conservation of biodiversity. The application of next-generation sequencing (NGS) technology will greatly expand the versatility of DNA barcodes across the Tree of Life, habitats, and geographies as new methodologies are explored and developed. Published by Elsevier Ltd.
Cellular barcoding and other single-cell lineage-tracing strategies form experimental methodologies for analysis of in vivo cell fate that have been instrumental in several significant recent discoveries. Due to the highly nonlinear nature of proliferation and differentiation, interrogation of the resulting data for evaluation of potential lineage pathways requires a new quantitative framework complete with appropriate statistical tests. Here, we develop such a framework, illustrating its utility by analyzing data from barcoded multipotent cells of the blood system. This application demonstrates that the data require additional paths beyond those found in the classical model, which leads us to propose that hematopoietic differentiation follows a loss-of-potential mechanism and to suggest further experiments to test this deduction. Our quantitative framework can evaluate the compatibility of lineage trees with barcoded data from any proliferating and differentiating cell system.
Wang, Bin; Zheng, Xuedong; Zhou, Shihua; Zhou, Changjun; Wei, Xiaopeng; Zhang, Qiang; Wei, Ziqi
Following the completion of the human genome project, a large amount of high-throughput bio-data was generated. To analyze these data, massively parallel sequencing, namely next-generation sequencing, was rapidly developed. DNA barcodes are used to identify the ownership between sequences and samples when they are attached at the beginning or end of sequencing reads. Constructing DNA barcode sets provides the candidate DNA barcodes for this application. To increase the accuracy of DNA barcode sets, a particle swarm optimization (PSO) algorithm has been modified and used to construct the DNA barcode sets in this paper. Compared with the extant results, some lower bounds of DNA barcode sets are improved. The results show that the proposed algorithm is effective in constructing DNA barcode sets.
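The abstract does not describe the modified PSO itself, so the following is only a baseline sketch of the underlying problem: collecting barcodes whose pairwise Hamming distance stays above a minimum so that sequencing errors do not turn one barcode into another. It uses a plain greedy search in place of particle swarm optimization; the function names and the parameters `length` and `min_dist` are assumptions made for illustration.

```python
from itertools import product


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def greedy_barcode_set(length: int, min_dist: int) -> list[str]:
    """Greedily collect barcodes with pairwise Hamming distance >= min_dist.

    Candidates are visited in lexicographic order. This is a simple baseline,
    not the particle swarm optimization described in the abstract.
    """
    chosen: list[str] = []
    for bases in product("ACGT", repeat=length):
        candidate = "".join(bases)
        if all(hamming(candidate, c) >= min_dist for c in chosen):
            chosen.append(candidate)
    return chosen


barcodes = greedy_barcode_set(length=4, min_dist=3)
print(len(barcodes), barcodes[:4])
```

The size of the set a constructor finds is what the "lower bounds" in the abstract refer to: a larger valid set at the same length and minimum distance is a better bound. A greedy pass gives a quick reference point against which a heuristic such as PSO could be compared.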
For decades, researchers have been trying to create intuitive virtual environments by blending reality and virtual reality, thus enabling general users to interact with the digital domain as easily as with the real world. The result is “augmented reality” (AR). AR seamlessly superimposes virtual objects onto a real environment in three dimensions (3D) and in real time. One of the most important components that helps close the gap between virtuality and reality is the marker used in the AR system. While pictorial markers and bar-code markers are the two most commonly used marker types on the market, they have some disadvantages in visual and processing performance. In this paper, we present a novel method that combines the bar-code with the original features of a colour picture (e.g., photos, trading cards, advertisement figures). Our method overlays additional features on the original pictorial image using a single stereogram image that optically conceals a multi-level (3D) bar-code. Thus, it can store more data than a general 1D barcode. This new type of marker has the potential to address the issues that current marker types face. It not only keeps the original information of the picture but also contains encoded numeric information. In our limited evaluation, this pictorial bar-code shows relatively robust performance under various conditions and scaling; thus, it provides a promising AR approach for many applications such as trading card games, education, and advertisements.
Nithaniyal, Stalin; Vassou, Sophie Lorraine; Poovitha, Sundar; Raju, Balaji; Parani, Madasamy
Plants are the major source of therapeutic ingredients in complementary and alternative medicine (CAM). However, species adulteration in traded medicinal plant raw drugs threatens the reliability and safety of CAM. Since morphological features of medicinal plants are often not intact in the raw drugs, DNA barcoding was employed for species identification. Adulteration in 112 traded raw drugs was tested after creating a reference DNA barcode library consisting of 1452 rbcL and matK barcodes from 521 medicinal plant species. Species resolution of this library was 74.4%, 90.2%, and 93.0% for rbcL, matK, and rbcL + matK, respectively. DNA barcoding revealed adulteration in about 20% of the raw drugs, and at least 6% of them were derived from plants with completely different medicinal or toxic properties. Raw drugs in the form of dried roots, powders, and whole plants were found to be more prone to adulteration than rhizomes, fruits, and seeds. Morphological resemblance, co-occurrence, mislabeling, confusing vernacular names, and unauthorized or fraudulent substitutions might have contributed to species adulteration in the raw drugs. Therefore, this library can be routinely used to authenticate traded raw drugs for the benefit of all stakeholders: traders, consumers, and regulatory agencies.
Melta Rini Fahmi
Species identification is a challenge in the management of introduced ornamental fish, for both aquaculture and conservation purposes. This study aimed to molecularly identify introduced ornamental fish circulating among ornamental fish farmers and markets in Indonesia using DNA barcodes of the COI gene. Fish samples were obtained from ornamental fish farmers and importers in the Bandung and Jakarta areas. Total DNA was extracted from caudal fin tissue using a column method. The target gene was amplified using the primers FishF1, FishF2, FishR1, and FishR2. The resulting DNA sequence reads were aligned with sequences in GenBank using the BLAST program. Identification was based on relationships in the phylogenetic tree and on the percentage similarity index with GenBank sequences. The results showed that the tested samples fell into five groups: Synodontis, comprising five species; Corydoras, four species; Pseudoplatystoma, three species; Botia, three species; and Leporinus, three species, with bootstrap values of 99-100. The sequence similarity index showed that 11 species had 99%-100% similarity with GenBank data, namely Synodontis decorus, Synodontis eupterus, Synodontis greshoffi, Botia kubotai, Botia lohachata, Rasbora erythromicron, Corydoras aeneus, Gyrinocheilus aymonieri, Eigenmannia virescens, Leporinus affinis, and Phractocephalus hemioliopterus. Two species were identified as products of hybridization (crossbreeding): Leopard catfish (100% identical to Pseudoplatystoma fasciatum) and Synodontis leopard (100% identical to Synodontis notatus). Analysis of diagnostic nucleotides yielded seven nucleotides for Synodontis decorus, 10 for Synodontis tanganyicae, 13 for Synodontis eupterus, four for Synodontis notatus, and 14 for Synodontis greshoffi. Clear identification of fish species is the main key in aquaculture, trade, management, conservation, and development
Sambrook, Joseph; Bowtell, David
.... DNA Microarrays provides authoritative, detailed instruction on the design, construction, and applications of microarrays, as well as comprehensive descriptions of the software tools and strategies...
Chabbert, Christophe D; Adjalley, Sophie H; Steinmetz, Lars M; Pelechano, Vicent
Chromatin immunoprecipitation followed by sequencing (ChIP-Seq) or microarray hybridization (ChIP-on-chip) are standard methods for the study of transcription factor binding sites and histone chemical modifications. However, these approaches only allow profiling of a single factor or protein modification at a time. In this chapter, we present Bar-ChIP, a higher throughput version of ChIP-Seq that relies on the direct ligation of molecular barcodes to chromatin fragments. Bar-ChIP enables the concurrent profiling of multiple DNA-protein interactions and is therefore amenable to experimental scale-up, without the need for any robotic instrumentation.
Françoso, E; Arias, M C
Bees (Apidae), of which there are more than 19 900 species, are extremely important for ecosystem services and economic purposes, so taxon identity is a major concern. The goal of this study was to optimize the DNA barcode technique based on the Cytochrome c oxidase (COI) mitochondrial gene region. This approach has previously been shown to be useful in resolving taxonomic inconsistencies and for species identification when morphological data are poor. Specifically, we designed and tested new primers and standardized PCR conditions to amplify the barcode region for bees, focusing on the corbiculate Apids. In addition, primers were designed to amplify small COI amplicons and tested with pinned specimens. Short barcode sequences were easily obtained for some Bombus century-old museum specimens and shown to be useful as mini-barcodes. The new primers and PCR conditions established in this study proved to be successful for the amplification of the barcode region for all species tested, regardless of the conditions of tissue preservation. We saw no evidence of Wolbachia or numts amplification by these primers, and so we suggest that these new primers are of broad value for corbiculate bee identification through DNA barcode. © 2013 John Wiley & Sons Ltd.
Elías-Gutiérrez, M; León-Regagnon, V
DNA barcoding has become an important scientific trend for understanding the world's biodiversity. In mega-diverse hot spots like Mexico, this technique is an important tool for taxonomists, allowing them to concentrate on the species highlighted by the barcodes instead of analyzing entire sets of specimens. This tendency resulted in the creation of a national network named Mexican Barcode of Life (MEXBOL), whose main goals are to train students and to promote interaction and collective work among researchers interested in this topic. As a result, the number of records in the Barcode of Life Database (BOLD) for some groups, such as the Mammalia, Actinopterygii, Polychaeta, Branchiopoda, Ostracoda, Maxillopoda, Nematoda, Pinophyta, Ascomycota and Basidiomycota, places Mexico among the top ten countries in the generation of these data. This special issue presents only a few of the many interesting findings in this region of the world resulting from the use of this technique and its integration with other methodologies. © 2013 John Wiley & Sons Ltd.
... discontinue POSTNET barcodes for automation letter and flat price eligibility. There were six comments... on each piece, to facilitate processing by presort companies. We added language to specifically allow..., with or without prepayment of postage, for return to the address on the reply piece. If postage is...
Ward, R D; Hanner, R; Hebert, P D N
FISH-BOL, the Fish Barcode of Life campaign, is an international research collaboration that is assembling a standardized reference DNA sequence library for all fishes. Analysis is targeting a 648 base pair region of the mitochondrial cytochrome c oxidase I (COI) gene. More than 5000 species have already been DNA barcoded, with an average of five specimens per species, typically vouchers with authoritative identifications. The barcode sequence from any fish, fillet, fin, egg or larva can be matched against these reference sequences using BOLD; the Barcode of Life Data System (http://www.barcodinglife.org). The benefits of barcoding fishes include facilitating species identification, highlighting cases of range expansion for known species, flagging previously overlooked species and enabling identifications where traditional methods cannot be applied. Results thus far indicate that barcodes separate c. 98 and 93% of already described marine and freshwater fish species, respectively. Several specimens with divergent barcode sequences have been confirmed by integrative taxonomic analysis as new species. Past concerns in relation to the use of fish barcoding for species discrimination are discussed. These include hybridization, recent radiations, regional differentiation in barcode sequences and nuclear copies of the barcode region. However, current results indicate these issues are of little concern for the great majority of specimens.
Liu, D; Liu, L; Guo, G; Wang, W; Sun, Q; Parani, M; Ma, J
DNA barcoding is a novel concept for taxonomic identification using short, specific genetic markers and has been applied to study a large number of eukaryotes. The huge amount of data generated by DNA barcoding requires well-organized information systems. Besides the Barcode of Life Data system (BOLD) established in Canada, a mirror system is also important for the international barcode of life project (iBOL). For this purpose, we developed BOLDMirror, a global mirror system of DNA barcode data. It is open source and runs in the LAMP (Linux + Apache + MySQL + PHP) environment. BOLDMirror has data synchronization, data representation and statistics modules, and also provides space to store user operation history. BOLDMirror can be accessed at http://www.boldmirror.net and several countries have used it to set up their own DNA barcoding sites. © 2012 John Wiley & Sons Ltd.
Little, Damon P
Small portions of the barcode region - mini-barcodes - may be used in place of full-length barcodes to overcome DNA degradation for samples with poor DNA preservation. 591,491,286 rbcL mini-barcode primer combinations were electronically evaluated for PCR universality, and two novel highly universal sets of priming sites were identified. Novel and published rbcL mini-barcode primers were evaluated for PCR amplification [determined with a validated electronic simulation (n = 2765) and empirically (n = 188)], Sanger sequence quality [determined empirically (n = 188)], and taxonomic discrimination [determined empirically (n = 30,472)]. PCR amplification for all mini-barcodes, as estimated by validated electronic simulation, was successful for 90.2-99.8% of species. Overall Sanger sequence quality for mini-barcodes was very low - the best mini-barcode tested produced sequences of adequate quality (B20 ≥ 0.5) for 74.5% of samples. The majority of mini-barcodes provide correct identifications of families in excess of 70.1% of the time. Discriminatory power noticeably decreased at lower taxonomic levels. At the species level, the discriminatory power of the best mini-barcode was less than 38.2%. For samples believed to contain DNA from only one species, an investigator should attempt to sequence, in decreasing order of utility and probability of success, mini-barcodes F (rbcL1/rbcLB), D (F52/R193) and K (F517/R604). For samples believed to contain DNA from more than one species, an investigator should amplify and sequence mini-barcode D (F52/R193). © 2013 John Wiley & Sons Ltd.
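The electronic evaluation of primer pairs mentioned above can be pictured with a very small in silico PCR sketch. This is not the validated simulation used in the study: it only checks whether a forward primer and the reverse complement of a reverse primer both occur in a template within a mismatch limit, and returns the region they bracket. The primer and template strings are made-up placeholders, and `max_mismatches` is an assumed parameter.

```python
from typing import Optional


def revcomp(seq: str) -> str:
    """Reverse complement of an unambiguous DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_site(template: str, primer: str, max_mismatches: int, start: int = 0) -> Optional[int]:
    """Index of the first window matching primer within max_mismatches, else None."""
    for i in range(start, len(template) - len(primer) + 1):
        window = template[i:i + len(primer)]
        if sum(a != b for a, b in zip(window, primer)) <= max_mismatches:
            return i
    return None


def in_silico_amplicon(template: str, fwd: str, rev: str, max_mismatches: int = 1) -> Optional[str]:
    """Return the predicted amplicon (primer sites included), or None if either site is missing."""
    f = find_site(template, fwd, max_mismatches)
    if f is None:
        return None
    # The reverse primer anneals to the opposite strand, so search for its reverse complement.
    r = find_site(template, revcomp(rev), max_mismatches, start=f + len(fwd))
    if r is None:
        return None
    return template[f:r + len(rev)]


# Hypothetical toy sequences, not real rbcL primers.
toy_fwd = "ATGCCA"
toy_rev = "AATCAG"
toy_template = "GGATGCCATTTACGGAACCTGATTCG"
print(in_silico_amplicon(toy_template, toy_fwd, toy_rev))
```

Running such a check over many reference sequences and counting the fraction that yield an amplicon gives a rough universality score per primer pair. A realistic simulation would also weigh mismatch position and melting temperature, which this sketch ignores.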
-based identification systems and the dwindling pool of taxonomists highlight the need for alternate methods for species identification which should be quick, cost effective and efficient. DNA barcoding emerges as a most favoured alternate method by the researchers..., electronics and computer science. The mission of the CBOL is to develop DNA barcoding as a global standard in taxonomy, rapidly accelerate compiling of DNA barcodes of known and newly discovered plant and animal species, establish a public library...
Sarkar, Indra Neil; Trizna, Michael
With the volume of molecular sequence data that is systematically being generated globally, there is a need for centralized resources for data exploration and analytics. DNA Barcode initiatives are on track to generate a compendium of molecular sequence–based signatures for identifying animals and plants. To date, the range of available data exploration and analytic tools to explore these data have only been available in a boutique form—often representing a frustrating hurdle for many researchers that may not necessarily have resources to install or implement algorithms described by the analytic community. The Barcode of Life Data Portal (BDP) is a first step towards integrating the latest biodiversity informatics innovations with molecular sequence data from DNA barcoding. Through establishment of community driven standards, based on discussion with the Data Analysis Working Group (DAWG) of the Consortium for the Barcode of Life (CBOL), the BDP provides an infrastructure for incorporation of existing and next-generation DNA barcode analytic applications in an open forum. PMID:21818249
Kim, Sungmin; Song, Kyo-Hong; Ree, Han-Il; Kim, Won
Non-biting midges (Diptera: Chironomidae) are a diverse population that commonly causes respiratory allergies in humans. Chironomid larvae can be used to indicate freshwater pollution, but accurate identification on the basis of morphological characteristics is difficult. In this study, we constructed a mitochondrial cytochrome c oxidase subunit I (COI)-based DNA barcode library for Korean chironomids. This library consists of 211 specimens from 49 species, including adults and unidentified larvae. The interspecies and intraspecies COI sequence variations were analyzed. Sophisticated indexes were developed in order to properly evaluate indistinct barcode gaps that are created by insufficient sampling on both the interspecies and intraspecies levels and by variable mutation rates across taxa. In a variety of insect datasets, these indexes were useful for re-evaluating large barcode datasets and for defining COI barcode gaps. The COI-based DNA barcode library will provide a rapid and reliable tool for the molecular identification of Korean chironomid species. Furthermore, this reverse-taxonomic approach will be improved by the continuous addition of other species’ sequences to the library. PMID:22138764
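The indexes developed in the study are not specified in the abstract, so the sketch below shows only the classic barcode gap they build on: for each species, the smallest distance to any other species minus the largest distance within the species. It assumes pre-aligned, equal-length sequences and uses the uncorrected p-distance; the records are made-up toy data.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def barcode_gaps(records: list[tuple[str, str]]) -> dict[str, float]:
    """Per species: minimum interspecific minus maximum intraspecific distance.

    A positive value means the species shows a barcode gap in this sample.
    """
    max_intra: dict[str, float] = {}
    min_inter: dict[str, float] = {}
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        d = p_distance(s1, s2)
        if sp1 == sp2:
            max_intra[sp1] = max(max_intra.get(sp1, 0.0), d)
        else:
            for sp in (sp1, sp2):
                min_inter[sp] = min(min_inter.get(sp, 1.0), d)
    return {sp: min_inter[sp] - max_intra.get(sp, 0.0) for sp in min_inter}


# Hypothetical aligned fragments, not data from the study.
toy_records = [
    ("species_A", "ACGTACGTAC"),
    ("species_A", "ACGTACGTAT"),
    ("species_B", "TTGTACGAAC"),
    ("species_B", "TTGTACGAAC"),
]
gaps = barcode_gaps(toy_records)
print(gaps)
```

A species represented by a single specimen gets a maximum intraspecific distance of zero here, which inflates its apparent gap. That is one form of the insufficient-sampling problem the abstract says its indexes were designed to handle.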
Hajibabaei, Mehrdad; deWaard, Jeremy R; Ivanova, Natalia V; Ratnasingham, Sujeevan; Dooh, Robert T; Kirk, Stephanie L; Mackie, Paula M; Hebert, Paul D.N
Large-scale DNA barcoding projects are now moving toward activation while the creation of a comprehensive barcode library for eukaryotes will ultimately require the acquisition of some 100 million barcodes. To satisfy this need, analytical facilities must adopt protocols that can support the rapid, cost-effective assembly of barcodes. In this paper we discuss the prospects for establishing high volume DNA barcoding facilities by evaluating key steps in the analytical chain from specimens to barcodes. Alliances with members of the taxonomic community represent the most effective strategy for provisioning the analytical chain with specimens. The optimal protocols for DNA extraction and subsequent PCR amplification of the barcode region depend strongly on the condition of the specimens, but production targets of 100K barcode records per year are now feasible for facilities working with compliant specimens. The analysis of museum collections is currently challenging, but PCR cocktails that combine polymerases with repair enzyme(s) promise future success. Barcode analysis is already a cost-effective option for species identification in some situations and this will increasingly be the case as reference libraries are assembled and analytical protocols are simplified. PMID:16214753
Huemer, Peter; Karsholt, Ole; Mutanen, Marko
We explore the potential value of DNA barcode divergence for species delimitation in the genus Caryocolum Gregor & Povolný, 1954 (Lepidoptera, Gelechiidae), based on data from 44 European species (including 4 subspecies). Low intraspecific divergence of the DNA barcodes of the mtCOI (cytochrome c oxidase 1) gene and/or distinct barcode gaps to the nearest neighbor support species status for all examined nominal taxa. However, in 8 taxa we observed deep splits with a maximum intraspecific barcode divergence beyond a threshold of 3%, thus indicating possible cryptic diversity. The taxonomy...
DNA barcoding, the identification of species using one or a few short standardized DNA sequences, is an important complement to traditional taxonomy. However, there are particular challenges for barcoding plants, especially for species with complex evolutionary histories. We herein evaluated the utility of five candidate sequences - rbcL, matK, trnH-psbA, trnL-F and the internal transcribed spacer (ITS) - for barcoding Rhodiola species, a group of high-altitude plants frequently used as adaptogens, hemostatics and tonics in traditional Tibetan medicine. Rhodiola has been suggested to have diversified rapidly and recently. The genus is thus a good model for testing DNA barcoding strategies for recently diversified medicinal plants. This study analyzed 189 accessions, representing 47 of the 55 recognized Rhodiola species in the Flora of China treatment. Based on intraspecific and interspecific divergence and degree of monophyly statistics, ITS was the best single-locus barcode, resolving 66% of the Rhodiola species. The core combination rbcL+matK resolved only 40.4% of them. Unsurprisingly, the combined use of all five loci provided the highest discrimination power, resolving 80.9% of the species. However, this is weaker than the discrimination power generally reported in barcoding studies of other plant taxa. The observed complications may be due to the recent diversification, incomplete lineage sorting and reticulate evolution of the genus. These processes are common features of numerous plant groups in the high-altitude regions of the Qinghai-Tibetan Plateau.
Karim, Asma; Iqbal, Asad; Akhtar, Rehan; Rizwan, Muhammad; Amar, Ali; Qamar, Usman; Jahan, Shah
DNA barcoding is a taxonomic method that uses small genetic markers in organisms' mitochondrial DNA (mtDNA) to identify particular species. It uses sequence diversity in a 658-base pair fragment near the 5' end of the mitochondrial cytochrome c oxidase subunit 1 (CO1) gene as a tool for species identification. DNA barcoding is a more accurate and reliable method than morphological identification. It is equally useful in juvenile and adult stages of fishes. The present study was conducted to genetically identify three farmed fish species of Pakistan (Cyprinus carpio, Cirrhinus mrigala, and Ctenopharyngodon idella), all of which belong to the family Cyprinidae. The CO1 gene was amplified. PCR products were sequenced and analyzed with bioinformatics software. Conspecific, congeneric, and confamilial K2P nucleotide divergence was estimated. From these findings, it was concluded that the CO1 gene sequence may serve as a milestone for the identification of related species at the molecular level.
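The K2P (Kimura two-parameter) divergence mentioned above corrects the raw proportion of differing sites by treating transitions (A<->G, C<->T) and transversions separately: with P the proportion of transitions and Q the proportion of transversions, d = -1/2 ln((1 - 2P - Q) sqrt(1 - 2Q)). The sketch below is a minimal illustration on pre-aligned sequences, not the software used in the study; skipping sites with gaps or ambiguity codes is an assumption made here.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(a: str, b: str) -> float:
    """Kimura two-parameter distance between two aligned sequences.

    Sites where either sequence has a gap or ambiguity code are skipped.
    Raises ValueError if the sequences are too divergent for the correction.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for x, y in zip(a.upper(), b.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue
        compared += 1
        if x == y:
            continue
        pair = {x, y}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    inner = (1 - 2 * p - q) * math.sqrt(1 - 2 * q) if 1 - 2 * q > 0 else 0.0
    if inner <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(inner)


# One transition in ten sites: raw p-distance is 0.1, K2P is slightly larger.
print(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"))
```

Averaging this distance over all pairs within a species, within a genus, and within a family gives the conspecific, congeneric, and confamilial divergences the abstract refers to.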
Gonzalez, Mailyn Adriana; Baraloto, Christopher; Engel, Julien; Mori, Scott A; Pétronelli, Pascal; Riéra, Bernard; Roger, Aurélien; Thébaud, Christophe; Chave, Jérôme
Large-scale plant diversity inventories are critical to develop informed conservation strategies. However, the workload required for classic taxonomic surveys remains high and is particularly problematic for megadiverse tropical forests. Based on a comprehensive census of all trees in two hectares of a tropical forest in French Guiana, we examined whether plant DNA barcoding could contribute to increasing the quality and the pace of tropical plant biodiversity surveys. Of the eight plant DNA markers we tested (rbcLa, rpoC1, rpoB, matK, ycf5, trnL, psbA-trnH, ITS), matK and ITS had a low rate of sequencing success. More critically, none of the plastid markers achieved a rate of correct plant identification greater than 70%, either alone or combined. The performance of all barcoding markers was noticeably low in a few species-rich clades, such as the Laureae and the Sapotaceae. A field test of the approach enabled us to detect 130 molecular operational taxonomic units in a sample of 252 juvenile trees. Including molecular markers increased the identification rate of juveniles from 72% (morphology alone) to 96% (morphology and molecular) of the individuals assigned to a known tree taxon. We conclude that while DNA barcoding is an invaluable tool for detecting errors in identifications and for identifying plants at juvenile stages, its limited ability to identify collections will constrain the practical implementation of DNA-based tropical plant biodiversity programs.
Collins, R A; Cruickshank, R H
Despite the broad benefits that DNA barcoding can bring to a diverse range of biological disciplines, a number of shortcomings still exist in terms of the experimental design of studies incorporating this approach. One underlying reason for this lies in the confusion that often exists between species discovery and specimen identification, and this is reflected in the way that hypotheses are generated and tested. Although these aims can be associated, they are quite distinct and require different methodological approaches, but their conflation has led to the frequently inappropriate use of common analytical methods such as neighbour-joining trees, bootstrap resampling and fixed distance thresholds. Furthermore, the misidentification of voucher specimens can also have serious implications for end users of reference libraries such as the Barcode of Life Data Systems, and in this regard we advocate increased diligence in the a priori identification of specimens to be used for this purpose. This commentary provides an assessment of seven deficiencies that we identify as common in the DNA barcoding literature, and outlines some potential improvements for its adaptation and adoption towards more reliable and accurate outcomes. © 2012 John Wiley & Sons Ltd.
Biodiversity research is becoming increasingly dependent on genomics, which allows the unprecedented digitization and understanding of the planet's biological heritage. The use of genetic markers, i.e. DNA barcoding, has proved to be a powerful tool in species identification. However, full exploitation of this approach is hampered by high sequencing costs and the absence of equipped facilities in biodiversity-rich countries. In the present work, we developed a portable sequencing laboratory based on the portable DNA sequencer from Oxford Nanopore Technologies, the MinION. Complementary laboratory equipment and reagents were selected to be used in remote and tough environmental conditions. The performance of the MinION sequencer and the portable laboratory was tested for DNA barcoding in a simulated tropical environment, as well as in a remote rainforest of Tanzania lacking electricity. Despite the relatively high sequencing error rate of the MinION, the development of a suitable pipeline for data analysis allowed the accurate identification of different species of vertebrates including amphibians, reptiles and mammals. In situ sequencing of a wild frog allowed us to rapidly identify the species captured, thus confirming that effective DNA barcoding in the field is possible. These results open new perspectives for real-time, on-site DNA sequencing, thus potentially increasing opportunities for the understanding of biodiversity in areas lacking conventional laboratory facilities.
Beilharz, Traude H; Preiss, Thomas
Nearly all eukaryotic mRNAs terminate in a poly(A) tail that serves important roles in mRNA utilization. In the cytoplasm, the poly(A) tail promotes both mRNA stability and translation, and these functions are frequently regulated through changes in tail length. To identify the scope of poly(A) tail length control in a transcriptome, we developed the polyadenylation state microarray (PASTA) method. It involves the purification of mRNA based on poly(A) tail length using thermal elution from poly(U) sepharose, followed by microarray analysis of the resulting fractions. In this chapter we detail our PASTA approach and describe some methods for bulk and mRNA-specific poly(A) tail length measurements of use to monitor the procedure and independently verify the microarray data.
Abstract Background Image analysis of microarrays and, in particular, spot quantification and spot quality control, is one of the most important steps in statistical analysis of microarray data. Recent methods of spot quality control are still at an early stage of development, often leading to underestimation of true positive microarray features and, consequently, to loss of important biological information. Therefore, improving and standardizing the statistical approaches of spot quality control are essential to facilitate the overall analysis of microarray data and subsequent extraction of biological information. Findings We evaluated the performance of two image analysis packages, MAIA and GenePix (GP), using two complementary experimental approaches with a focus on the statistical analysis of spot quality factors. First, we developed control microarrays with a priori known fluorescence ratios to verify the accuracy and precision of the ratio estimation of signal intensities. Next, we developed advanced semi-automatic protocols of spot quality evaluation in MAIA and GP and compared their performance with available facilities of spot quantitative filtering in GP. We evaluated these algorithms for standardised spot quality analysis in a whole-genome microarray experiment assessing well-characterised transcriptional modifications induced by the transcription regulator SNAI1. Using a set of RT-PCR or qRT-PCR validated microarray data, we found that the semi-automatic protocol of spot quality control we developed with MAIA allowed recovering approximately 13% more spots and 38% more differentially expressed genes (at FDR = 5%) than GP with default spot filtering conditions. Conclusion Careful control of spot quality characteristics with advanced spot quality evaluation can significantly increase the amount of confident and accurate data resulting in more meaningful biological conclusions.
Dimension reduction has become inevitable for pre-processing of high dimensional data. “Gene expression microarray data” is an instance of such high dimensional data. Gene expression microarray data displays the maximum number of genes (features) simultaneously at a molecular level with a very small number of samples. The copious numbers of genes are usually provided to a learning algorithm for producing a complete characterization of the classification task. However, most of the time the majority of the genes are irrelevant or redundant to the learning task. Including them degrades learning accuracy and training speed and leads to overfitting. Thus, dimension reduction of microarray data is a crucial preprocessing step for prediction and classification of disease. Various feature selection and feature extraction techniques have been proposed in the literature to identify the genes that have a direct impact on the machine learning algorithms used for classification and to eliminate the remaining ones. This paper describes the taxonomy of dimension reduction methods with their characteristics, evaluation criteria, advantages and disadvantages. It also presents a review of numerous dimension reduction approaches for microarray data, mainly those methods that have been proposed over the past few years.
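As one concrete instance of the feature selection family surveyed above, the sketch below applies a variance filter: genes whose expression barely changes across samples are dropped before any classifier is trained. This is only one of the simplest filter methods and is not claimed to be a method the review recommends; the gene names and values are invented, and `k` is an assumed parameter.

```python
from statistics import pvariance


def top_variance_genes(expression: dict[str, list[float]], k: int) -> list[str]:
    """Return the k gene names with the highest variance across samples.

    expression maps each gene name to its expression values, one per sample.
    """
    ranked = sorted(expression, key=lambda gene: pvariance(expression[gene]), reverse=True)
    return ranked[:k]


# Hypothetical expression matrix: three genes measured in four samples.
toy_expression = {
    "gene_a": [1.0, 1.0, 1.0, 1.0],
    "gene_b": [0.0, 4.0, 0.0, 4.0],
    "gene_c": [1.0, 2.0, 1.0, 2.0],
}
selected = top_variance_genes(toy_expression, k=2)
print(selected)
```

A variance filter ignores class labels, so it can discard a low-variance gene that is nonetheless informative. Supervised filters, wrappers, and feature extraction methods such as PCA trade that simplicity for better use of the labels or of correlations between genes.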
Ferri, G; Corradini, B; Ferrari, F; Santunione, A L; Palazzoli, F; Alu', M
The ambitious idea of using a short piece of DNA for large-scale species identification (DNA barcoding) is already a powerful tool for scientists, and the application of this standard technique seems promising in a range of fields including forensic genetics. While DNA barcoding enjoyed remarkable success for animal identification through cytochrome c oxidase I (COI) analysis, the attempts to identify a single barcode for plants remained a vain hope for a long time. From the beginning, the Consortium for the Barcode of Life (CBOL) showed a lack of agreement on a core plant barcode, reflecting the diversity of viewpoints. Different research groups advocated various markers with divergent sets of criteria until the recent publication by the CBOL-Plant Working Group. After a four-year effort, in 2009 the international team agreed on standard markers, promoting a multilocus solution (rbcL and matK) with 70-75% discrimination to the species level. In 2009 our group first proposed the broad application of DNA barcoding principles as a tool for identification of trace botanical evidence through the analysis of two chloroplast loci (trnH-psbA and trnL-trnF) in plant species belonging to local flora. Difficulties and drawbacks that were encountered included poor coverage of species in specific databases and the lack of authenticated reference sequences for the selected markers. Successful preliminary results were obtained, providing an approach to progressively identify unknown plant specimens to a given taxonomic rank, usable by any non-specialist botanist or in case of a shortage of taxonomic expertise. We now consider it mandatory to update and compare our previous findings with the newly selected plastid markers (matK+rbcL), taking into account forensic requirements. Features of all four loci (the two previously analyzed, trnH-psbA+trnL-trnF, and matK+rbcL) were compared singly and in multilocus solutions to assess the most suitable combination for
Mallo, Diego; Posada, David
The unprecedented amount of data resulting from next-generation sequencing has opened a new era in phylogenetic estimation. Although large datasets should, in theory, increase phylogenetic resolution, massive, multilocus datasets have uncovered a great deal of phylogenetic incongruence among different genomic regions, due both to stochastic error and to the action of different evolutionary processes such as incomplete lineage sorting, gene duplication and loss and horizontal gene transfer. This incongruence violates one of the fundamental assumptions of the DNA barcoding approach, which assumes that gene history and species history are identical. In this review, we explain some of the most important challenges we will have to face to reconstruct the history of species, and the advantages and disadvantages of different strategies for the phylogenetic analysis of multilocus data. In particular, we describe the evolutionary events that can generate species tree-gene tree discordance, compare the most popular methods for species tree reconstruction, highlight the challenges we need to face when using them and discuss their potential utility in barcoding. Current barcoding methods sacrifice a great amount of statistical power by only considering one locus, and a transition to multilocus barcodes would not only improve current barcoding methods, but also facilitate an eventual transition to species-tree-based barcoding strategies, which could better accommodate scenarios where the barcode gap is too small or nonexistent. This article is part of the themed issue 'From DNA barcodes to biomes'. © 2016 The Authors.
... between mini-barcode and the full- length DNA barcode was carried out in Microsoft Excel. (http://www.office.microsoft.com). ..... Received 15 June 2013, in final revised form 5 April 2014; accepted 3 June 2014. Unedited version published ...
Oct 15, 2012 ... to geography-based vs clade-based sampling of amphibians. ANDREA ... phylogenetic sampling, the addition of DNA barcoding to RAPs may present a greater challenge for the identification ...... odes for soil nematode identification. Mol. .... barcoding amphibians: take the chance, meet the challenge. Mol.
Background: DNA barcoding is a technique used to identify species based on species-specific differences in short regions of their DNA. It is widely used in species discrimination of medicinal plants and traditional medicines. Materials and Methods: In the present study, four potential DNA barcodes, namely rbcL, matK, ...
Hansen, Daniel Kold; Nasrollahi, Kamal; Rasmussen, Christoffer Bøgelund
Barcodes, in their different forms, can be found on almost any package available on the market. Detecting and then decoding barcodes therefore has many applications. We describe how to adapt the state-of-the-art deep learning-based detector You Only Look Once (YOLO) for the purpose...
Bystrykh, Leonid V.
The diversity and scope of multiplex parallel sequencing applications are steadily increasing. Critically, multiplex parallel sequencing methods rely on the use of barcoded primers for sample identification, and the quality of the barcodes directly impacts the quality of the resulting
Frézal, Lise; Leblois, Raphael
Research using cytochrome c oxidase barcoding techniques on zoological specimens was initiated by Hebert et al. [Hebert, P.D.N., Ratnasingham, S., deWaard, J.R., 2003. Barcoding animal life: cytochrome c oxidase subunit 1 divergences among closely related species. Proc. R. Soc. Lond. B 270, S96-S99]. By March 2004, the Consortium for the Barcode of Life started to promote the use of a standardized DNA barcoding approach, consisting of identifying a specimen as belonging to a certain animal species based on a single universal marker: the DNA barcode sequence. Over the last 4 years, this approach has become increasingly popular, and advances as well as limitations have clearly emerged as increasing numbers of organisms have been studied. Our purpose is to briefly present DNA Barcode of Life principles, pros and cons, relevance and universality. The initially proposed Barcode of Life framework has greatly evolved, giving rise to a flexible description of DNA barcoding and a larger range of applications.
DNA barcoding is a widely used molecular approach for species cataloging for unambiguous identification and conservation. In the present study, DNA barcoding of some West African mammals was performed with six new mitochondrial CO1 sequences for Civettictis civetta, Tadarida nigeriae, Orycteropus afer, ...
Cornils, Kerstin; Thielecke, Lars; Hüser, Svenja; Forgber, Michael; Thomaschewski, Michael; Kleist, Nadja; Hussein, Kais; Riecken, Kristoffer; Volz, Tassilo; Gerdes, Sebastian; Glauche, Ingmar; Dahl, Andreas; Dandri, Maura; Roeder, Ingo; Fehse, Boris
RGB marking and DNA barcoding are two cutting-edge technologies in the field of clonal cell marking. To combine the virtues of both approaches, we equipped LeGO vectors encoding red, green or blue fluorescent proteins with complex DNA barcodes carrying color-specific signatures. For these vectors, we generated highly complex plasmid libraries that were used for the production of barcoded lentiviral vector particles. In proof-of-principle experiments, we used barcoded vectors for RGB marking of cell lines and primary murine hepatocytes. We applied single-cell polymerase chain reaction to decipher barcode signatures of individual RGB-marked cells expressing defined color hues. This enabled us to prove clonal identity of cells with one and the same RGB color. Also, we made use of barcoded vectors to investigate clonal development of leukemia induced by ectopic oncogene expression in murine hematopoietic cells. In conclusion, by combining RGB marking and DNA barcoding, we have established a novel technique for the unambiguous genetic marking of individual cells in the context of normal regeneration as well as malignant outgrowth. Moreover, the introduction of color-specific signatures in barcodes will facilitate studies on the impact of different variables (e.g. vector type, transgenes, culture conditions) in the context of competitive repopulation studies. PMID:24476916
Vu, D.; Eberhardt, U.; Szöke, S.; Groenewald, M.; Robert, V.
This paper presents a laboratory information management system for DNA sequences (LIMS) created and based on the needs of a DNA barcoding project at the CBS-KNAW Fungal Biodiversity Centre (Utrecht, the Netherlands). DNA barcoding is a global initiative for species identification through simple DNA
This research aimed at exploring the diversity of Sapindaceae in West and Central Africa with particular emphasis on identification of the plant samples as well as generation of DNA barcodes with a view to sharing the DNA barcode sequence(s) in a public database. These were achieved following standard protocols.
Blaxter, Mark; Mann, Jenna; Chapman, Tom; Thomas, Fran; Whitton, Claire; Floyd, Robin; Abebe, Eyualem
The scale of diversity of life on this planet is a significant challenge for any scientific programme hoping to produce a complete catalogue, whatever means is used. For DNA barcoding studies, this difficulty is compounded by the realization that any chosen barcode sequence is not the gene 'for' speciation and that taxa have evolutionary histories. How are we to disentangle the confounding effects of reticulate population genetic processes? Using the DNA barcode data from meiofaunal surveys, here we discuss the benefits of treating the taxa defined by barcodes without reference to their correspondence to 'species', and suggest that using this non-idealist approach facilitates access to taxon groups that are not accessible to other methods of enumeration and classification. Major issues remain, in particular the methodologies for taxon discrimination in DNA barcode data.
An, Jeung Hee; Lee, Kwon-Jai; Choi, Jeong-Woo
Nanotechnology-based bio-barcode amplification analysis offers an innovative approach for detecting neurotransmitters. We evaluated the efficacy of this method for detecting norepinephrine in normal and oxidative-stress damaged dopaminergic cells. Our approach uses a combination of DNA barcodes and bead-based immunoassays for detecting neurotransmitters with surface-enhanced Raman spectroscopy (SERS), and provides polymerase chain reaction (PCR)-like sensitivity. This method relies on magnetic Dynabeads containing antibodies and nanoparticles that are loaded both with DNA barcodes and with antibodies that can sandwich the target protein captured by the Dynabead-bound antibodies. The aggregate sandwich structures are magnetically separated from the solution and treated to remove the conjugated barcode DNA. The DNA barcodes are then identified by SERS and PCR analysis. The concentration of norepinephrine in dopaminergic cells can be readily detected using the bio-barcode assay, which is a rapid, high-throughput screening tool for detecting neurotransmitters.
Ronald Pamela C
Full Text Available Abstract Background Few microarrays have been quantitatively calibrated to identify optimal hybridization conditions because it is difficult to precisely determine the hybridization characteristics of a microarray using biologically variable cDNA samples. Results Using synthesized samples with known concentrations of specific oligonucleotides, a series of microarray experiments was conducted to evaluate microarrays designed by PICKY, an oligo microarray design software tool, and to test a direct microarray calibration method based on the PICKY-predicted, thermodynamically closest nontarget information. The complete set of microarray experiment results is archived in the GEO database with series accession number GSE14717. Additional data files and Perl programs described in this paper can be obtained from the website http://www.complex.iastate.edu under the PICKY Download area. Conclusion PICKY-designed microarray probes are highly reliable over a wide range of hybridization temperatures and sample concentrations. The microarray calibration method reported here allows researchers to experimentally optimize their hybridization conditions. Because this method is straightforward, uses existing microarrays and relatively inexpensive synthesized samples, it can be used by any lab that uses microarrays designed by PICKY. In addition, other microarrays can be reanalyzed by PICKY to obtain the thermodynamically closest nontarget information for calibration.
This paper reviews basics and updates of each microarray technology and serves to .... through protein microarrays. Protein microarrays also known as protein chips are nothing but grids that ... conditioned media, patient sera, plasma and urine. Clontech ... based antibody arrays) is similar to membrane-based antibody ...
Dufva, Hans Martin; Christensen, C.B.V.
DNA microarrays have changed the field of biomedical sciences over the past 10 years. For several reasons, antibody and other protein microarrays have not developed at the same rate. However, protein and antibody arrays have emerged as a powerful tool to complement DNA microarrays during the post...
Osathanunkul, M; Madesis, P; Ounjai, S; Pumiputavon, K; Somboonchai, R; Lithanatudom, P; Chaowasku, T; Wipasa, J; Suwannapoom, C
DNA barcoding, which was developed about a decade ago, relies on short, standardized regions of the genome to identify plant and animal species. This method can be used not only to identify known species but also to discover novel ones. Numerous sequences are stored in online databases worldwide. One way to save cost and time in species identification (by omitting the sequencing step) is to use available barcode data to design optimized primers for further analysis, such as high-resolution melting analysis (HRM). This study aimed to determine the effectiveness of the hybrid method Bar-HRM (DNA barcoding combined with HRM) in identifying species that share similar external morphological features, as an alternative to traditional taxonomic identification, which requires major parts (leaf, flower, fruit) of the specimens. The specimens used for testing were those that could not be identified at the species level and that, according to morphological identification, could be either Uvaria longipes or Uvaria wrayias. Primer pairs derived from chloroplast regions (matK, psbA-trnH, rbcL, and trnL) were used in the Bar-HRM. The results obtained from the psbA-trnH primers were good enough to help in identifying the specimens, while the rest were not. Bar-HRM analysis proved to be a fast and cost-effective method for plant species identification.
Full Text Available Abstract – In the production and trade of food products in the era of globalization, consumers, especially Muslims, need adequate knowledge, information and access in order to obtain correct information about the halal status of the products they buy. Using a barcode scanner on a mobile platform to retrieve halal product information is considered effective and useful for the general public to find out information about a product. Barcodes can be read by optical scanners called barcode readers or scanned from an image by special software. In Indonesia, most mobile phones have scanning software for 2D codes, and similar devices are available via smartphone. Keywords: Barcode Scanner, Mobile Platform, Halal Products, Smartphone
Keskın, Emre; Atar, Hasan H
DNA barcoding was used in the identification of 89 commercially important freshwater and marine fish species found in Turkish ichthyofauna. A total of 1765 DNA barcodes were generated using a 654-bp-long fragment of the mitochondrial cytochrome c oxidase subunit I gene. These species belong to 70 genera, 40 families and 19 orders from class Actinopterygii, and all were associated with a distinct DNA barcode. Nine and 12 of the COI barcode clusters represent the first species records submitted to the BOLD and GenBank databases, respectively. All COI barcodes (except sequences of first species records) were matched with reference sequences of expected species, according to morphological identification. Average nucleotide frequencies of the data set were calculated as T = 29.7%, C = 28.2%, A = 23.6% and G = 18.6%. Average pairwise genetic distances among individuals were estimated as 0.32%, 9.62%, 17.90% and 22.40% for conspecific, congeneric, confamilial and within-order comparisons, respectively. Kimura 2-parameter genetic distance values were found to increase with taxonomic level. For most of the species analysed in our data set, there is a barcoding gap, and an overlap in the barcoding gap exists for only two genera. Neighbour-joining trees were drawn based on DNA barcodes and all the specimens clustered in agreement with their taxonomic classification at species level. Results of this study supported DNA barcoding as an efficient molecular tool for better monitoring, conservation and management of fisheries. © 2013 John Wiley & Sons Ltd.
Full Text Available BACKGROUND: DNA barcoding has been advanced as a promising tool to aid species identification and discovery through the use of short, standardized gene targets. Despite extensive taxonomic studies, for a variety of reasons the identification of fishes can be problematic, even for experts. DNA barcoding is proving to be a useful tool in this context. However, its broad application is impeded by the need to construct a comprehensive reference sequence library for all fish species. Here, we make a regional contribution to this grand challenge by calibrating the species discrimination efficiency of barcoding among 125 Argentine fish species, representing nearly one third of the known fauna, and examine the utility of these data to address several key taxonomic uncertainties pertaining to species in this region. METHODOLOGY/PRINCIPAL FINDINGS: Specimens were collected and morphologically identified during cruises conducted between 2005 and 2008. The standard BARCODE fragment of COI was amplified and bi-directionally sequenced from 577 specimens (mean of 5 specimens/species), and all specimens and sequence data were archived and interrogated using analytical tools available on the Barcode of Life Data System (BOLD; www.barcodinglife.org). Nearly all species exhibited discrete clusters of closely related haplogroups which permitted the discrimination of 95% of the species examined (i.e. 119/125), while cases of shared haplotypes were detected among just three species-pairs. Notably, barcoding aided the identification of a new species of skate, Dipturus argentinensis, permitted the recognition of Genypterus brasiliensis as a valid species and questions the generic assignment of Paralichthys isosceles. CONCLUSIONS/SIGNIFICANCE: This study constitutes a significant contribution to the global barcode reference sequence library for fishes and demonstrates the utility of barcoding for regional species identification. As an independent assessment of alpha
Rowena F Stern
Full Text Available Dinoflagellates are an ecologically important group of protists with important functions as primary producers, coral symbionts and in toxic red tides. Although widely studied, the natural diversity of dinoflagellates is not well known. DNA barcoding has been utilized successfully for many protist groups. We used this approach to systematically sample known "species", as a reference to measure the natural diversity in three marine environments. In this study, we assembled a large cytochrome c oxidase 1 (COI) barcode database from 8 public algal culture collections plus 3 private collections worldwide resulting in 336 individual barcodes linked to specific cultures. We demonstrate that COI can identify to the species level in 15 dinoflagellate genera, generally in agreement with existing species names. Exceptions were found in species belonging to genera that were generally already known to be taxonomically challenging, such as Alexandrium or Symbiodinium. Using this barcode database as a baseline for cultured dinoflagellate diversity, we investigated the natural diversity in three diverse marine environments (Northeast Pacific, Northwest Atlantic, and Caribbean), including an evaluation of single-cell barcoding to identify uncultivated groups. From all three environments, the great majority of barcodes were not represented by any known cultured dinoflagellate, and we also observed an explosion in the diversity of genera that previously contained a modest number of known species, belonging to Kareniaceae. In total, 91.5% of non-identical environmental barcodes represent distinct species, but only 51 out of 603 unique environmental barcodes could be linked to cultured species using a conservative cut-off based on distances between cultured species. COI barcoding was successful in identifying species from 70% of cultured genera. When applied to environmental samples, it revealed a massive amount of natural diversity in dinoflagellates. This highlights
Kara K S Layton
Full Text Available DNA barcoding has proven an effective tool for species identification in varied groups of marine invertebrates including crustaceans, molluscs, polychaetes and echinoderms. In this study, we further validate its utility by analyzing almost half of the 300 species of Echinodermata known from Canadian waters. COI sequences from 999 specimens were assigned to 145 BINs. In most cases, species discrimination was straightforward due to the large difference (25-fold) between mean intra- (0.48%) and inter- (12.0%) specific divergence. Six species were flagged for further taxonomic investigation because specimens assigned to them fell into two or three discrete sequence clusters. The potential influence of larval dispersal capacity and glacial events on patterns of genetic diversity is discussed for 19 trans-oceanic species. Although additional research is needed to clarify biogeographic patterns and resolve taxonomic questions, this study represents an important step in the assembly of a DNA barcode library for all Canadian echinoderms, a valuable resource for future biosurveillance programs.
Scott, Adrian C; Ludlow, Catherine L; Cromie, Gareth A; Dudley, Aimée M
Tetrad analysis is a valuable tool for yeast genetics, but the laborious manual nature of the process has hindered its application on large scales. Barcode Enabled Sequencing of Tetrads (BEST) replaces the manual processes of isolating, disrupting and spacing tetrads. BEST isolates tetrads by virtue of a sporulation-specific GFP fusion protein that permits fluorescence-activated cell sorting of tetrads directly onto agar plates, where the ascus is enzymatically digested and the spores are disrupted and randomly arrayed by glass bead plating. The haploid colonies are then assigned sister spore relationships, i.e. information about which spores originated from the same tetrad, using molecular barcodes read during genotyping. By removing the bottleneck of manual dissection, hundreds or even thousands of tetrads can be isolated in minutes. Here we present a detailed description of the experimental procedures required to perform BEST in the yeast Saccharomyces cerevisiae, starting with a heterozygous diploid strain through the isolation of colonies derived from the haploid meiotic progeny.
Fernandez, Paula; Soria, Marcelo; Blesa, David; DiRienzo, Julio; Moschen, Sebastian; Rivarola, Maximo; Clavijo, Bernardo Jose; Gonzalez, Sergio; Peluffo, Lucila; Príncipi, Dario; Dosio, Guillermo; Aguirrezabal, Luis; García-García, Francisco; Conesa, Ana; Hopp, Esteban; Dopazo, Joaquín; Heinz, Ruth Amelia; Paniego, Norma
Oligonucleotide-based microarrays with accurate gene coverage represent a key strategy for transcriptional studies in orphan species such as sunflower, H. annuus L., which lacks full genome sequences. The goal of this study was the development and functional annotation of a comprehensive sunflower unigene collection and the design and validation of a custom sunflower oligonucleotide-based microarray. A large-scale EST (>130,000 ESTs) curation, assembly and sequence annotation was performed using Blast2GO (www.blast2go.de). The EST assembly comprises 41,013 putative transcripts (12,924 contigs and 28,089 singletons). The resulting Sunflower Unigene Resource (SUR version 1.0) was used to design an oligonucleotide-based Agilent microarray for cultivated sunflower. This microarray includes a total of 42,326 features: 1,417 Agilent controls, 74 control probes for sunflower replicated 10 times (740 controls) and 40,169 different non-control probes. Microarray performance was validated using a model experiment examining the induction of senescence by water deficit. Pre-processing and differential expression analysis of Agilent microarrays was performed using the Bioconductor limma package. The analyses based on p-values calculated by eBayes (p ... sunflower unigene collection, and a custom, validated sunflower oligonucleotide-based microarray using Agilent technology. Both the curated unigene collection and the validated oligonucleotide microarray provide key resources for sunflower genome analysis, transcriptional studies, and molecular breeding for crop improvement.
Ladayya, Faroh; Purnami, Santi Wulan; Irhamah
DNA microarray data contain gene expression measurements with small sample sizes and a high number of features. Furthermore, class imbalance is a common problem in microarray data. It occurs when a dataset is dominated by a class that has significantly more instances than the minority classes. A classification method that handles high-dimensional and imbalanced data is therefore needed. Support Vector Machine (SVM) is a classification method capable of handling large or small samples, nonlinearity, high dimensionality, overfitting and local minimum issues. SVM has been widely applied to DNA microarray data classification, and it has been shown to provide the best performance among machine learning methods. However, imbalanced data are a problem because SVM treats all samples as equally important, so the results are biased against the minority class. To overcome the imbalance, Fuzzy SVM (FSVM) is proposed. This method applies a fuzzy membership to each input point and reformulates the SVM so that different input points make different contributions to the classifier. The minority classes have large fuzzy memberships, so FSVM pays more attention to the samples with larger fuzzy membership. Because DNA microarray data are high dimensional with a very large number of features, feature selection is first performed using the Fast Correlation Based Filter (FCBF). In this study, the data are analyzed by SVM and FSVM, each with and without FCBF, and the classification performance of the methods is compared. Based on the overall results, FSVM on selected features has the best classification performance compared to SVM.
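The idea of giving minority-class samples larger fuzzy memberships can be sketched in a few lines of Python. The membership rule below (inverse class frequency, scaled so that the rarest class gets 1.0) and the function names are assumptions chosen for illustration; the abstract does not state which membership function the authors used. The second function evaluates the membership-weighted hinge penalty, sum of s_i * max(0, 1 - y_i f(x_i)), which is the slack term a fuzzy SVM penalises.

```python
from collections import Counter

def class_memberships(labels):
    """Assign each sample a fuzzy membership in (0, 1], larger for rarer classes.

    Assumed rule for illustration: the smallest class gets 1.0 and a class
    k times larger gets 1/k.
    """
    counts = Counter(labels)
    smallest = min(counts.values())
    return [smallest / counts[y] for y in labels]

def weighted_hinge_loss(scores, labels, memberships):
    """Membership-weighted hinge loss: sum of s_i * max(0, 1 - y_i * f(x_i))."""
    return sum(s * max(0.0, 1.0 - y * f)
               for f, y, s in zip(scores, labels, memberships))

labels = [+1, +1, +1, +1, -1]          # four majority samples, one minority
scores = [0.5, 0.5, 0.5, 0.5, -0.5]    # hypothetical decision values f(x_i)
weights = class_memberships(labels)
loss = weighted_hinge_loss(scores, labels, weights)
```

With these memberships the single minority sample contributes as much to the penalty as the four majority samples combined, which is how the reformulated classifier is pushed to attend to the minority class.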
Full Text Available Genomic microarrays are powerful research tools in bioinformatics and modern medicinal research because they enable massively parallel assays and simultaneous monitoring of the expression of thousands of genes in biological samples. However, even a simple microarray experiment often yields very high-dimensional data and a huge amount of information, which challenges researchers to extract the important features and reduce the high dimensionality. In this paper, a nonlinear dimensionality reduction kernel method based on locally linear embedding (LLE) is proposed, and a fuzzy K-nearest neighbors algorithm, which denoises datasets, is introduced as a replacement for the classical LLE's KNN algorithm. In addition, a kernel method based support vector machine (SVM) is used to classify genomic microarray data sets. We demonstrate the application of the techniques to two published DNA microarray data sets. The experimental results confirm the superiority and high success rates of the presented method.
Herbáth, Melinda; Balogh, Andrea; Matkó, János; Papp, Krisztián; Prechl, József
Protein microarray technology is becoming the method of choice for identifying protein interaction partners, detecting specific proteins, carbohydrates and lipids, or for characterizing protein interactions and serum antibodies in a massively parallel manner. Availability of the well-established instrumentation of DNA arrays and development of new fluorescent detection instruments promoted the spread of this technique. Fluorescent detection has the advantage of high sensitivity, specificity, simplicity and wide dynamic range required by most measurements. Fluorescence through specifically designed probes and an increasing variety of detection modes offers an excellent tool for such microarray platforms. Measuring for example the level of antibodies, their isotypes and/or antigen specificity simultaneously can offer more complex and comprehensive information about the investigated biological phenomenon, especially if we take into consideration that hundreds of samples can be measured in a single assay. Not only body fluids, but also cell lysates, extracted cellular components, and intact living cells can be analyzed on protein arrays for monitoring functional responses to printed samples on the surface. As a rapidly evolving area, protein microarray technology offers a great bulk of information and new depth of knowledge. These are the features that endow protein arrays with wide applicability and robust sample analyzing capability. On the whole, protein arrays are emerging new tools not just in proteomics, but glycomics, lipidomics, and are also important for immunological research. In this review we attempt to summarize the technical aspects of planar fluorescent microarray technology along with the description of its main immunological applications. (topical review)
Houni, Karim; Sawaya, Wadih; Delignon, Yves
In the convergence context of identification technology and information-data transmission, the barcode found its place as the simplest and the most pervasive solution for new uses, especially within mobile commerce, bringing youth to this long-lived technology. From a communication theory point of view, a barcode is a singular coding based on a graphical representation of the information to be transmitted. We present an information theoretic approach for 1D image-based barcode reading analysis. With a barcode facing the camera, distortions and acquisition are modeled as a communication channel. The performance of the system is evaluated by means of the average mutual information quantity. On the basis of this theoretical criterion for a reliable transmission, we introduce two new measures: the theoretical depth of field and the theoretical resolution. Simulations illustrate the gain of this approach.
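The average mutual information used as the performance criterion can be computed from a joint distribution of channel input and output. The Python sketch below does this for a binary symmetric channel, which is only a stand-in chosen for illustration: the paper models camera distortions and acquisition, and that channel model is not given in the abstract. Function names are hypothetical.

```python
import math

def mutual_information(joint):
    """Mutual information I(X;Y) in bits from a joint probability table joint[x][y]."""
    px = [sum(row) for row in joint]            # marginal of the input
    py = [sum(col) for col in zip(*joint)]      # marginal of the output
    info = 0.0
    for i, row in enumerate(joint):
        for j, pxy in enumerate(row):
            if pxy > 0:                         # 0 * log(0) is taken as 0
                info += pxy * math.log2(pxy / (px[i] * py[j]))
    return info

def binary_symmetric_joint(flip):
    """Joint distribution of a uniform input bit and its output over a
    binary symmetric channel with the given flip probability (assumed model)."""
    return [[0.5 * (1 - flip), 0.5 * flip],
            [0.5 * flip, 0.5 * (1 - flip)]]
```

A noiseless channel (flip probability 0) carries one bit per bar, while a flip probability of 0.5 carries none; intermediate distortion levels fall in between.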
Little, Damon P; Jeanson, Marc L
Herbal dietary supplements made from saw palmetto (Serenoa repens; Arecaceae) fruit are commonly consumed to ameliorate benign prostate hyperplasia. A novel DNA mini-barcode assay to accurately identify [specificity = 1.00 (95% confidence interval = 0.74-1.00); sensitivity = 1.00 (95% confidence interval = 0.66-1.00); n = 31] saw palmetto dietary supplements was designed from a DNA barcode reference library created for this purpose. The mini-barcodes were used to estimate the frequency of mislabeled saw palmetto herbal dietary supplements on the market in the United States of America. Of the 37 supplements examined, amplifiable DNA could be extracted from 34 (92%). Mini-barcode analysis of these supplements demonstrated that 29 (85%) contain saw palmetto and that 2 (6%) supplements contain related species that cannot be legally sold as herbal dietary supplements in the United States of America. The identity of 3 (9%) supplements could not be conclusively determined.
Huh, Jin Wook; Chung, Woong Sik; Chung, Wan Kyun
This paper addresses the localization and navigation problem using invisible two-dimensional barcodes on the floor. Compared with other methods using natural or artificial landmarks, the proposed localization method has great advantages in cost and appearance, since the location of the robot is perfectly known from the barcode information after the mapping is finished. We also propose a navigation algorithm which uses the topological structure. For the topological information, we define nodes and edges which are suitable for indoor navigation, especially for large areas having multiple rooms, many walls and many static obstacles. The proposed algorithm also has the advantage that the errors occurring at each node are mutually independent and can be compensated exactly after some navigation using barcodes. Simulations and experiments were performed to verify the algorithm in the barcode environment, and the results showed excellent performance. After mapping, it is also possible to solve the kidnapped-robot case and generate paths using topological information.
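Path generation over a topological map of nodes and edges can be illustrated with a breadth-first search in Python. This is a generic sketch, not the authors' navigation algorithm, which the abstract does not describe; the node names and the function name are hypothetical.

```python
from collections import deque

def shortest_path(edges, start, goal):
    """Return the path with the fewest edges from start to goal, or None.

    edges is a list of undirected (node, node) pairs describing the
    topological map.
    """
    adjacency = {}
    for a, b in edges:
        adjacency.setdefault(a, []).append(b)
        adjacency.setdefault(b, []).append(a)
    previous = {start: None}        # also serves as the visited set
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:  # walk back to the start
                path.append(node)
                node = previous[node]
            return path[::-1]
        for neighbour in adjacency.get(node, []):
            if neighbour not in previous:
                previous[neighbour] = node
                queue.append(neighbour)
    return None

# Hypothetical map: three rooms connected through a hallway node.
rooms = [("room_a", "hall"), ("hall", "room_b"), ("hall", "room_c")]
route = shortest_path(rooms, "room_a", "room_c")
```

Because a barcode read fixes the robot's position at a node, a planner of this kind could in principle be restarted from whatever node the robot finds itself at, for example after being moved unexpectedly.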
Casas, Princess Angelie S; Sing, Kong-Wah; Lee, Ping-Shin; Nuñeza, Olga M; Villanueva, Reagan Joseph T; Wilson, John-James
Reliable species identification provides a sounder basis for use of species in the order Odonata as biological indicators and for their conservation, an urgent concern as many species are threatened with imminent extinction. We generated 134 COI barcodes from 36 morphologically identified species of Odonata collected from Mindanao Island, representing 10 families and 19 genera. Intraspecific sequence divergences ranged from 0 to 6.7% with four species showing more than 2%, while interspecific sequence divergences ranged from 0.5 to 23.3% with seven species showing less than 2%. Consequently, no distinct gap was observed between intraspecific and interspecific DNA barcode divergences. The numerous islands of the Philippine archipelago may have facilitated rapid speciation in the Odonata and resulted in low interspecific sequence divergences among closely related groups of species. This study contributes DNA barcodes for 36 morphologically identified species of Odonata reported from Mindanao including 31 species with no previous DNA barcode records.
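The comparison of intraspecific and interspecific divergences described above can be illustrated with a minimal sketch. It assumes pre-aligned sequences of equal length and uses the simple uncorrected p-distance; the study itself may have used a different distance model, and the function names and toy sequences below are hypothetical.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def barcode_gap(records):
    """Summarise divergences for a list of (species, sequence) records.

    Returns (max_intraspecific, min_interspecific, has_gap), where has_gap is
    True only if every between-species distance exceeds every within-species one.
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        (intra if sp1 == sp2 else inter).append(p_distance(s1, s2))
    max_intra = max(intra, default=0.0)
    min_inter = min(inter, default=float("inf"))
    return max_intra, min_inter, min_inter > max_intra


# Toy data (hypothetical, not taken from the study).
toy = [
    ("species_A", "ACGTACGTAC"),
    ("species_A", "ACGTACGTAT"),
    ("species_B", "ACGTTCGTTT"),
    ("species_B", "ACGTTCGTTT"),
]
max_intra, min_inter, has_gap = barcode_gap(toy)
```

When the largest intraspecific distance reaches or exceeds the smallest interspecific distance, as reported for the Mindanao Odonata, `has_gap` is `False` and a single divergence threshold cannot separate all species.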
Lewinska, Anna Malgorzata; Hoof, Jakob Blæsbjerg; Peuhkuri, Ruut Hannele
Detection and identification of indoor fungi in water-damaged buildings are crucial for the prevention and control of fungal growth. This study focuses on a molecular method called DNA barcoding. It evaluates commonly used sequences in DNA barcoding for fungal species identification Chaetomium...... and Stachybotrys. The existing DNA barcodes (ITS, SSU, LSU, B-TUB, CMD, RP and TEF-1α) do not give sufficient species resolution to be considered DNA barcodes for the two genera. Therefore, novel barcodes are needed for them. Potential barcodes, such as HOG1 and NAHA, were identified using bioinformatics...
Alivisatos, A. Paul; Scher, Erik C.; Manna, Liberato
Graded core/shell semiconductor nanorods and shaped nanorods are disclosed comprising Group II-VI, Group III-V and Group IV semiconductors and methods of making the same. Also disclosed are nanorod barcodes using core/shell nanorods where the core is a semiconductor or metal material, and with or without a shell. Methods of labeling analytes using the nanorod barcodes are also disclosed.
Kress, W. John; Wurdack, Kenneth J.; Zimmer, Elizabeth A.; Weigt, Lee A.; Janzen, Daniel H.
Methods for identifying species by using short orthologous DNA sequences, known as “DNA barcodes,” have been proposed and initiated to facilitate biodiversity studies, identify juveniles, associate sexes, and enhance forensic analyses. The cytochrome c oxidase 1 sequence, which has been found to be widely applicable in animal barcoding, is not appropriate for most species of plants because of a much slower rate of cytochrome c oxidase 1 gene evolution in higher plants than in animals. We ther...
Natasha de Vere
We present the first national DNA barcode resource that covers the native flowering plants and conifers for the nation of Wales (1143 species). Using the plant DNA barcode markers rbcL and matK, we have assembled 97.7% coverage for rbcL, 90.2% for matK, and a dual-locus barcode for 89.7% of the native Welsh flora. We have sampled multiple individuals for each species, resulting in 3304 rbcL and 2419 matK sequences. The majority of our samples (85%) are from DNA extracted from herbarium specimens. Recoverability of DNA barcodes is lower using herbarium specimens, compared to freshly collected material, mostly due to lower amplification success, but this is balanced by the increased efficiency of sampling species that have already been collected, identified, and verified by taxonomic experts. The effectiveness of the DNA barcodes for identification (level of discrimination) is assessed using four approaches: the presence of a barcode gap (using pairwise and multiple alignments), formation of monophyletic groups using Neighbour-Joining trees, and sequence similarity in BLASTn searches. These approaches yield similar results, providing relative discrimination levels of 69.4 to 74.9% of all species and 98.6 to 99.8% of genera using both markers. Species discrimination can be further improved using spatially explicit sampling. Mean species discrimination using barcode gap analysis (with a multiple alignment) is 81.6% within 10×10 km squares and 93.3% for 2×2 km squares. Our database of DNA barcodes for Welsh native flowering plants and conifers represents the most complete coverage of any national flora, and offers a valuable platform for a wide range of applications that require accurate species identification.
Trevino, Victor; Falciani, Francesco; Barrera-Saldaña, Hugo A
Among the many benefits of the Human Genome Project are new and powerful tools such as the genome-wide hybridization devices referred to as microarrays. Initially designed to measure gene transcriptional levels, microarray technologies are now used for comparing other genome features among individuals and their tissues and cells. Results provide valuable information on disease subcategories, disease prognosis, and treatment outcome. Likewise, they reveal differences in genetic makeup, regulat...
Keskin, Emre; Atar, Hasan Hüseyin
DNA barcoding was used to identify aquatic invertebrates sampled from fisheries bycatch and discards. A total of 440 unique cytochrome c oxidase subunit I (COI) barcodes were generated for 22 species from three important phyla (Arthropoda, Cnidaria, and Mollusca). All the species were sequenced and submitted to the GenBank and Barcode of Life Database (BOLD) databases using a 654 bp fragment of the mitochondrial COI gene. Two of them (Pontastacus leptodactylus and Rapana bezoar) were first records of the species for the BOLD database and six of them (Carcinus aestuarii, Loligo vulgaris, Melicertus kerathurus, Nephrops norvegicus, Scyllarides latus, and Scyllarus arctus) were first standard (>648 bp) COI barcode records for the GenBank database. COI barcodes were analyzed for nucleotide composition, nucleotide pair frequencies, and Kimura's two-parameter genetic distance. Mean genetic distance among species was found to increase at higher taxonomic levels. The neighbor-joining trees generated were congruent with morphometric-based taxonomic classification. The findings of this study clearly demonstrate that DNA barcodes can be used as an efficient molecular tool for identifying not only target species from fisheries but also bycatch and discard species, and could thus provide leverage for better monitoring and management of fisheries and biodiversity.
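Kimura's two-parameter (K2P) distance, mentioned above, corrects the observed proportion of differences by treating transitions (P) and transversions (Q) separately: d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)]. The following is a minimal sketch, assuming two pre-aligned sequences; the function name and the choice to skip sites with gaps or ambiguous bases are illustrative assumptions, not details from the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned sequences.

    Sites where either sequence has a gap or ambiguous base are skipped
    (an assumption made for this sketch).
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; everything else is a transversion.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    inner = (1 - 2 * p - q) * math.sqrt(1 - 2 * q)
    if inner <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(inner)
```

For closely related sequences the K2P value is only slightly larger than the raw proportion of differing sites; the correction grows as sequences diverge.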
Wittkowski Knut M
Background: Microscopists are familiar with many blemishes that fluorescence images can have due to dust and debris, glass flaws, uneven distribution of fluids or surface coatings, etc. Microarray scans show similar artefacts, which affect the analysis, particularly when one tries to detect subtle changes. However, most blemishes are hard to find by the unaided eye, particularly in high-density oligonucleotide arrays (HDONAs). Results: We present a method that harnesses the statistical power provided by having several HDONAs available, which are obtained under similar conditions except for the experimental factor. This method "harshlights" blemishes and renders them evident. We find empirically that about 25% of our chips are blemished, and we analyze the impact of masking them on screening for differentially expressed genes. Conclusion: Experiments attempting to assess subtle expression changes should be carefully screened for blemishes on the chips. The proposed method provides investigators with a novel robust approach to improve the sensitivity of microarray analyses. By utilizing topological information to identify and mask blemishes prior to model-based analyses, the method prevents artefacts from confounding the process of background correction, normalization, and summarization.
Hoffmann, Katrin; Firth, Martin J; Beesley, Alex H; Klerk, Nicholas H de; Kees, Ursula R
Recent findings from microarray studies have raised the prospect of a standardized diagnostic gene expression platform to enhance accurate diagnosis and risk stratification in paediatric acute lymphoblastic leukaemia (ALL). However, the robustness as well as the format for such a diagnostic test remains to be determined. As a step towards clinical application of these findings, we have systematically analyzed a published ALL microarray data set using Robust Multi-array Analysis (RMA) and Random Forest (RF). We examined published microarray data from 104 ALL patient specimens that represent six different subgroups defined by cytogenetic features and immunophenotypes. Using the decision-tree-based supervised learning algorithm Random Forest (RF), we determined a small set of genes for optimal subgroup distinction and subsequently validated their predictive power in an independent patient cohort. We achieved very high overall ALL subgroup prediction accuracies of about 98%, and were able to verify the robustness of these genes in an independent panel of 68 specimens obtained from a different institution and processed in a different laboratory. Our study established that the selection of discriminating genes is strongly dependent on the analysis method. This may have profound implications for clinical use, particularly when the classifier is reduced to a small set of genes. We have demonstrated that as few as 26 genes yield accurate class prediction and, importantly, almost 70% of these genes have not been previously identified as essential for class distinction of the six ALL subgroups. Our finding supports the feasibility of qRT-PCR technology for standardized diagnostic testing in paediatric ALL and should, in conjunction with conventional cytogenetics, lead to a more accurate classification of the disease. In addition, we have demonstrated that microarray findings from one study can be confirmed in an independent study, using an entirely independent patient cohort.
Sphingolipid metabolism. Glycerolipid metabolism. Arginine and proline metabolism. Brassinosteroid biosynthesis. Protein processing in endoplasmic reticulum. Cyanoamino acid metabolism. Circadian rhythm - plant. Fructose and mannose metabolism. Pyruvate metabolism. Photosynthesis. beta-Alanine metabolism.
Stropp, Thomas; McPhillips, Timothy; Ludäscher, Bertram; Bieda, Mark
Microarray data analysis has been the subject of extensive and ongoing pipeline development due to its complexity, the availability of several options at each analysis step, and the development of new analysis demands, including integration with new data sources. Bioinformatics pipelines are usually custom built for different applications, making them typically difficult to modify, extend and repurpose. Scientific workflow systems are intended to address these issues by providing general-purpose frameworks in which to develop and execute such pipelines. The Kepler workflow environment is a well-established system under continual development that is employed in several areas of scientific research. Kepler provides a flexible graphical interface, featuring clear display of parameter values, for design and modification of workflows. It has capabilities for developing novel computational components in the R, Python, and Java programming languages, all of which are widely used for bioinformatics algorithm development, along with capabilities for invoking external applications and using web services. We developed a series of fully functional bioinformatics pipelines addressing common tasks in microarray processing in the Kepler workflow environment. These pipelines consist of a set of tools for GFF file processing of NimbleGen chromatin immunoprecipitation on microarray (ChIP-chip) datasets and more comprehensive workflows for Affymetrix gene expression microarray bioinformatics and basic primer design for PCR experiments, which are often used to validate microarray results. Although functional in themselves, these workflows can be easily customized, extended, or repurposed to match the needs of specific projects and are designed to be a toolkit and starting point for specific applications. These workflows illustrate a workflow programming paradigm focusing on local resources (programs and data) and therefore are close to traditional shell scripting or R
DNA barcoding revealed the presence of the polyphagous leafminer pest Liriomyza sativae Blanchard in Bangladesh. DNA barcode sequences for mitochondrial COI were generated for Agromyzidae larvae, pupae and adults collected from field populations across Bangladesh. BLAST sequence similarity searches ...
Min Yu; Lichao Jiao; Juan Guo; Alex C. Wiedenhoeft; Tuo He; Xiaomei Jiang; Yafang Yin
ITS2+trnH-psbA was the best combination of DNA barcode to resolve the Dalbergia wood species studied. We demonstrate the feasibility of building a DNA barcode reference database using xylarium wood specimens.
Geary, Janis; Camicioli, Emma; Bubela, Tania
Paul Hebert and colleagues first described DNA barcoding in 2003, which led to international efforts to promote and coordinate its use. Since its inception, DNA barcoding has generated considerable media coverage. We analysed whether this coverage reflected both the scientific and social mandates of international barcoding organizations. We searched newspaper databases to identify 900 English-language articles from 2003 to 2013. Coverage of the science of DNA barcoding was highly positive but lacked context for key topics. Coverage omissions pose challenges for public understanding of the science and applications of DNA barcoding; these included coverage of governance structures and issues related to the sharing of genetic resources across national borders. Our analysis provided insight into how barcoding communication efforts have translated into media coverage; more targeted communication efforts may focus media attention on previously omitted, but important topics. Our analysis is timely as the DNA barcoding community works to establish the International Society for the Barcode of Life.
Deng, Wupeng; Hu, Jiwei; Liu, Quan; Lou, Ping
With the development of barcodes for commercial use, the demand for detecting barcodes with smartphones is becoming increasingly pressing. The low quality of barcode images captured by mobile phones often reduces decoding and recognition rates. This paper focuses on locating and decoding EAN-13 barcodes in fuzzy images. We present a more accurate locating algorithm based on segment length and a highly fault-tolerant algorithm for decoding barcodes. Unlike existing approaches, our locating algorithm is based on the edge segment length of EAN-13 barcodes, while our decoding algorithm tolerates fuzzy regions in the barcode image. Experiments were performed on damaged, contaminated and scratched digital images, and gave quite promising results for EAN-13 barcode location and decoding.
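The abstract does not describe the decoding algorithm itself. One standard property of EAN-13 that a decoder can use to reject misreads is the check digit: the first twelve digits are weighted alternately by 1 and 3, and the thirteenth digit brings the weighted sum up to a multiple of 10. The sketch below shows only this validation step; the function names are hypothetical and it is not a reconstruction of the paper's fault-tolerant decoder.

```python
def ean13_check_digit(first12):
    """Compute the EAN-13 check digit from the first twelve digits."""
    if len(first12) != 12 or not first12.isdigit():
        raise ValueError("expected a string of exactly 12 digits")
    # Weights alternate 1, 3, 1, 3, ... starting from the leftmost digit.
    total = sum(int(d) * (3 if i % 2 else 1) for i, d in enumerate(first12))
    return (10 - total % 10) % 10


def is_valid_ean13(code):
    """Return True if the 13-digit string has a consistent check digit."""
    return (
        len(code) == 13
        and code.isdigit()
        and ean13_check_digit(code[:12]) == int(code[12])
    )
```

A decoder working on degraded images could, for example, use `is_valid_ean13` to choose among several candidate readings of an uncertain region.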
Layton, Kara K S; Martel, André L; Hebert, Paul D N
Molluscs are the most diverse marine phylum and this high diversity has resulted in considerable taxonomic problems. Because the number of species in Canadian oceans remains uncertain, there is a need to incorporate molecular methods into species identifications. A 648 base pair segment of the cytochrome c oxidase subunit I gene has proven useful for the identification and discovery of species in many animal lineages. While the utility of DNA barcoding in molluscs has been demonstrated in other studies, this is the first effort to construct a DNA barcode registry for marine molluscs across such a large geographic area. This study examines patterns of DNA barcode variation in 227 species of Canadian marine molluscs. Intraspecific sequence divergences ranged from 0-26.4% and a barcode gap existed for most taxa. Eleven cases of relatively deep (>2%) intraspecific divergence were detected, suggesting the possible presence of overlooked species. Structural variation was detected in COI with indels found in 37 species, mostly bivalves. Some indels were present in divergent lineages, primarily in the region of the first external loop, suggesting certain areas are hotspots for change. Lastly, mean GC content varied substantially among orders (24.5%-46.5%), and showed a significant positive correlation with nearest neighbour distances. DNA barcoding is an effective tool for the identification of Canadian marine molluscs and for revealing possible cases of overlooked species. Some species with deep intraspecific divergence showed a biogeographic partition between lineages on the Atlantic, Arctic and Pacific coasts, suggesting the role of Pleistocene glaciations in the subdivision of their populations. Indels were prevalent in the barcode region of the COI gene in bivalves and gastropods. This study highlights the efficacy of DNA barcoding for providing insights into sequence variation across a broad taxonomic group on a large geographic scale.
Smith, M Alex; Bertrand, Claudia; Crosby, Kate; Eveleigh, Eldon S; Fernandez-Triana, Jose; Fisher, Brian L; Gibbs, Jason; Hajibabaei, Mehrdad; Hallwachs, Winnie; Hind, Katharine; Hrcek, Jan; Huang, Da-Wei; Janda, Milan; Janzen, Daniel H; Li, Yanwei; Miller, Scott E; Packer, Laurence; Quicke, Donald; Ratnasingham, Sujeevan; Rodriguez, Josephine; Rougerie, Rodolphe; Shaw, Mark R; Sheffield, Cory; Stahlhut, Julie K; Steinke, Dirk; Whitfield, James; Wood, Monty; Zhou, Xin
Wolbachia is a genus of bacterial endosymbionts that impacts the breeding systems of its hosts. Wolbachia can confuse the patterns of mitochondrial variation, including DNA barcodes, because it influences the pathways through which mitochondria are inherited. We examined the extent to which these endosymbionts are detected in routine DNA barcoding, assessed their impact on insect sequence divergence and identification accuracy, and considered the variation present in Wolbachia COI. Using both standard PCR assays (Wolbachia surface coding protein, wsp) and bacterial COI fragments, we found evidence of Wolbachia in insect total genomic extracts created for DNA barcoding library construction. When >2 million insect COI trace files were examined on the Barcode of Life Datasystem (BOLD), Wolbachia COI was present in 0.16% of the cases. It is possible to generate Wolbachia COI using standard insect primers; however, that amplicon was never confused with the COI of the host. Wolbachia alleles recovered were predominantly Supergroup A and were broadly distributed geographically and phylogenetically. We conclude that the presence of Wolbachia DNA in total genomic extracts made from insects is unlikely to compromise the accuracy of the DNA barcode library; in fact, the ability to query this DNA library (the database and the extracts) for endosymbionts is one of the ancillary benefits of such a large-scale endeavor, of which we provide several examples. It is our conclusion that regular assays for Wolbachia presence and type can, and should, be adopted by large-scale insect barcoding initiatives. While COI is one of the five multi-locus sequence typing (MLST) genes used for categorizing Wolbachia, there is limited overlap with the eukaryotic DNA barcode region.
DNA microarrays are becoming increasingly important in the field of clinical diagnostics. These microarrays, also called DNA chips, are small solid substrates, typically having a maximum surface area of a few cm², onto which many spots are arrayed in a pre-determined pattern. Each of these spots contains
Fangel, Jonatan Ulrik; Pedersen, H.L.; Vidal-Melgosa, S.
Almost all plant cells are surrounded by glycan-rich cell walls, which form much of the plant body and collectively are the largest source of biomass on earth. Plants use polysaccharides for support, defense, signaling, cell adhesion, and as energy storage, and many plant glycans are also important...... industrially and nutritionally. Understanding the biological roles of plant glycans and the effective exploitation of their useful properties requires a detailed understanding of their structures, occurrence, and molecular interactions. Microarray technology has revolutionized the massively high...... for plant research and can be used to map glycan populations across large numbers of samples to screen antibodies, carbohydrate binding proteins, and carbohydrate binding modules and to investigate enzyme activities....
de Koning, Dirk-Jan; Jaffrézic, Florence; Lund, Mogens Sandø
Microarray analyses have become an important tool in animal genomics. While their use is becoming widespread, there is still a lot of ongoing research regarding the analysis of microarray data. In the context of a European Network of Excellence, 31 researchers representing 14 research groups from...... 10 countries performed and discussed the statistical analyses of real and simulated 2-colour microarray data that were distributed among participants. The real data consisted of 48 microarrays from a disease challenge experiment in dairy cattle, while the simulated data consisted of 10 microarrays...... statistical weights, to omitting a large number of spots or omitting entire slides. Surprisingly, these very different approaches gave quite similar results when applied to the simulated data, although not all participating groups analysed both real and simulated data. The workshop was very successful...
Contaldo, Nicoletta; Paltrinieri, Samanta; Makarova, Olga
DNA barcoding is an identification method based on comparison of a short DNA sequence with known sequences from a database. A DNA barcoding tool has been developed for phytoplasma identification. This phytoplasma DNA barcoding protocol based on the tuf gene has been shown to identify phytoplasmas...
Jeffrey M. Marcus
DNA barcodes are very useful for species identification especially when identification by traditional morphological characters is difficult. However, the short mitochondrial and chloroplast barcodes currently in use often fail to distinguish between closely related species, are prone to lateral transfer, and provide inadequate phylogenetic resolution, particularly at deeper nodes. The deficiencies of short barcode identifiers are similar to the deficiencies of the short year identifiers that caused the Y2K problem in computer science. The resolution of the Y2K problem was to increase the size of the year identifiers. The performance of conventional mitochondrial COI barcodes for phylogenetics was compared with the performance of complete mitochondrial genomes and nuclear ribosomal RNA repeats obtained by genome skimming for a set of caddisfly taxa (Insect Order Trichoptera). The analysis focused on Trichoptera Family Hydropsychidae, the net-spinning caddisflies, which demonstrates many of the frustrating limitations of current barcodes. To conduct phylogenetic comparisons, complete mitochondrial genomes (15 kb each) and nuclear ribosomal repeats (9 kb each) from six caddisfly species were sequenced, assembled, and are reported for the first time. These sequences were analyzed in comparison with eight previously published trichopteran mitochondrial genomes and two trichopteran rRNA repeats, plus outgroup sequences from sister clade Lepidoptera (butterflies and moths). COI trees were not well-resolved, had low bootstrap support, and differed in topology from prior phylogenetic analyses of the Trichoptera. Phylogenetic trees based on mitochondrial genomes or rRNA repeats were well-resolved with high bootstrap support and were largely congruent with each other. Because they are easily sequenced by genome skimming, provide robust phylogenetic resolution at various phylogenetic depths, can better distinguish between closely related species, and (in the
Hebert, Paul D N; Dewaard, Jeremy R; Landry, Jean-François
This study reports DNA barcodes for more than 1300 Lepidoptera species from the eastern half of North America, establishing that 99.3 per cent of these species possess diagnostic barcode sequences. Intraspecific divergences averaged just 0.43 per cent among this assemblage, but most values were lower. The mean was elevated by deep barcode divergences (greater than 2%) in 5.1 per cent of the species, often involving the sympatric occurrence of two barcode clusters. A few of these cases have been analysed in detail, revealing species overlooked by the current taxonomic system. This study also provided a large-scale test of the extent of regional divergence in barcode sequences, indicating that geographical differentiation in the Lepidoptera of eastern North America is small, even when comparisons involve populations as much as 2800 km apart. The present results affirm that a highly effective system for the identification of Lepidoptera in this region can be built with few records per species because of the limited intra-specific variation. As most terrestrial and marine taxa are likely to possess a similar pattern of population structure, an effective DNA-based identification system can be developed with modest effort.
Oba, Yuichi; Ôhira, Hitoo; Murase, Yukio; Moriyama, Akihiko; Kumazawa, Yoshinori
Click beetles (Coleoptera: Elateridae) represent one of the largest groups of beetle insects. Some click beetles in larval form, known as wireworms, are destructive agricultural pests. Morphological identification of click beetles is generally difficult and requires taxonomic expertise. This study reports on the DNA barcoding of Japanese click beetles to enable their rapid and accurate identification. We collected and assembled 762 cytochrome oxidase subunit I barcode sequences from 275 species, which cover approximately 75% of the common species found on the Japanese main island, Honshu. This barcode library also contains 20 out of the 21 potential pest species recorded in Japan. Our analysis shows that most morphologically identified species form distinct phylogenetic clusters separated from each other by large molecular distances. This supports the general usefulness of the DNA barcoding approach for quick and reliable identification of Japanese elaterid species for environmental impact assessment, agricultural pest control, and biodiversity analysis. On the other hand, the taxonomic boundary in dozens of species did not agree with the boundary of barcode index numbers (a criterion for sequence-based species delimitation). These findings urge taxonomic reinvestigation of these mismatched taxa.
Giovanna Câmara Giudicelli
DNA barcoding is a technique for discriminating and identifying species using short, variable, and standardized DNA regions. Here, we tested for the first time the performance of plastid and nuclear regions as DNA barcodes in Passiflora. This genus is highly variable, with more than 900 species of high ecological, commercial, and ornamental importance. We analyzed 1034 accessions of 222 species representing the four subgenera of Passiflora and evaluated the effectiveness of five plastid regions and three nuclear datasets currently employed as DNA barcodes in plants using barcoding gap, similarity-based, and tree-based methods. The plastid regions were able to identify less than 45% of species, whereas the nuclear datasets were efficient for more than 50% using the “best match” and “best close match” methods of the TaxonDNA software. In all subgenera, interspecific pairwise distances were higher than intraspecific distances, although the two did not fully separate, and similarity-based methods showed better results than tree-based methods. The nuclear ribosomal internal transcribed spacer 1 (ITS1) region presented a higher discrimination power than the other datasets and also showed other desirable characteristics as a DNA barcode for this genus. Therefore, we suggest that this region should be used as a starting point to identify Passiflora species.
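The similarity-based "best close match" idea can be sketched as follows: a query is assigned the species of its nearest reference barcode, but only if that nearest distance falls within a threshold, and only if all equally near references belong to one species. This is a simplified illustration under stated assumptions, not TaxonDNA's implementation; the uncorrected p-distance, the function names, the threshold value and the toy sequences are all hypothetical.

```python
def p_distance(a, b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def best_close_match(query, references, threshold):
    """Classify a query against (species, sequence) reference records.

    Returns (status, species): status is "match", "ambiguous" or "no match";
    species is None unless status is "match".
    """
    distances = [(p_distance(query, seq), species) for species, seq in references]
    best = min(d for d, _ in distances)
    if best > threshold:
        return "no match", None
    nearest_species = {sp for d, sp in distances if d == best}
    if len(nearest_species) > 1:
        return "ambiguous", None
    return "match", nearest_species.pop()


# Toy reference library (hypothetical sequences and names).
library = [
    ("species_A", "ACGTACGTAC"),
    ("species_A", "ACGTACGTAT"),
    ("species_B", "ACGTTCGTTT"),
]
result = best_close_match("ACGTACGTAC", library, threshold=0.03)
```

Setting the threshold to infinity reduces this to a plain "best match" rule, which always returns the nearest species however distant it is.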
Nam, Jwa-Min; Jang, Kyung-Jin; Groves, Jay T
The colorimetric bio-barcode assay is a red-to-blue color change-based protein detection method with ultrahigh sensitivity. This assay is based on both the bio-barcode amplification method, which allows detection of minuscule amounts of target with attomolar sensitivity, and the gold nanoparticle-based colorimetric DNA detection method, which allows simple and straightforward detection of biomolecules of interest (here we detect interleukin-2, an important biomarker (cytokine) for many immunodeficiency-related diseases and cancers). The protocol is composed of the following steps: (i) conjugation of target capture molecules and barcode DNA strands onto silica microparticles, (ii) target capture with probes, (iii) separation and release of barcode DNA strands from the separated probes, (iv) detection of released barcode DNA using DNA-modified gold nanoparticle probes and (v) red-to-blue color change analysis with graphics software. Actual target detection and quantification steps with premade probes take approximately 3 h (the whole protocol, including probe preparation, takes approximately 3 days).
G. A. Kukharev
This paper proposes a method for generating standard linear barcodes from facial images. The method computes the brightness histogram of the facial image, averages the histogram over a limited number of intervals, quantizes the results to decimal digits from 0 to 9, and converts them by table lookup into the final barcode. The proposed solution is computationally inexpensive and does not require specialized image-processing software, which allows facial barcodes to be generated on mobile systems; the method can therefore be regarded as an express method. Tests on the Face94 and CUHK Face Sketch FERET databases showed that the proposed method is suitable for real-world use and that the generated barcodes remain stable under changes in scale, pose, and mirroring of a facial image, as well as changes in facial expression and shadows cast by local lighting. Because the standard barcode is generated directly from the facial image, it contains subjective information about a person's face.
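The histogram-based steps described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the number of intervals, and the choice to scale each interval mean by the largest mean before quantizing are all assumptions made for illustration, and the final table conversion of the digit string into a standard linear barcode symbology is left out.

```python
from typing import List, Sequence


def brightness_histogram(pixels: Sequence[int], levels: int = 256) -> List[int]:
    """Count how many pixels fall on each brightness level (0 .. levels-1)."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    return hist


def histogram_to_digits(hist: Sequence[int], n_intervals: int = 12) -> str:
    """Average the histogram over n_intervals and quantize each mean to 0..9.

    Assumption: means are scaled relative to the largest interval mean,
    so the tallest interval always maps to 9.
    """
    n = len(hist)
    means = []
    for i in range(n_intervals):
        start = i * n // n_intervals
        end = (i + 1) * n // n_intervals
        means.append(sum(hist[start:end]) / (end - start))
    peak = max(means)
    if peak == 0:
        return "0" * n_intervals
    # Round to the nearest digit in 0..9.
    return "".join(str(int(9 * m / peak + 0.5)) for m in means)


# Usage: a tiny "image" given as a flat list of 8-bit brightness values.
digits = histogram_to_digits(brightness_histogram([0, 0, 0, 255]), n_intervals=4)
print(digits)
```

Because the digits depend only on the shape of the brightness distribution, mirroring an image leaves them unchanged, which is consistent with the stability properties the abstract reports.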
Adamowicz, Sarah J; Steinke, Dirk
DNA barcoding--the sequencing of short, standardized DNA regions for specimen identification and species discovery--has promised to facilitate rapid access to biodiversity knowledge by diverse users. Here, we advance our opinion that increased global participation in genetics research is beneficial, both to scientists and for science, and explore the premise that DNA barcoding can help to democratize participation in genetics research. We examine publication patterns (2003-2014) in the DNA barcoding literature and compare trends with those in the broader, related domain of genomics. While genomics is the older and much larger field, the number of nations contributing to the published literature is similar between disciplines. Meanwhile, DNA barcoding exhibits a higher pace of growth in the number of publications as well as greater evenness among nations in their proportional contribution to total authorships. This exploration revealed DNA barcoding to be a highly international discipline, with growing participation by researchers in especially biodiverse nations. We briefly consider several of the challenges that may hinder further participation in genetics research, including access to training and molecular facilities as well as policy relating to the movement of genetic resources.
Rodriguez, Tony; Haaga, Don; Calhoon, Sean
The Digimarc® Barcode is a digital watermark applied to packages and variable data labels that carries GS1 standard GTIN-14 data traditionally carried by a 1-D barcode. The Digimarc Barcode can be read with smartphones and imaging-based barcode readers commonly used in grocery and retail environments. Using smartphones, consumers can engage with products and retailers can materially increase the speed of check-out, increasing store margins and providing a better experience for shoppers. Internal testing has shown an average 53% increase in scanning throughput, enabling hundreds of millions of dollars in cost savings for retailers when deployed at scale. To get to scale, the process of embedding a digital watermark must be automated and integrated within existing workflows. Creating the tools and processes to do so represents a new challenge for the watermarking community. This paper presents a description and an analysis of the workflow implemented by Digimarc to deploy the Digimarc Barcode at scale. An overview of the tools created and lessons learned during the introduction of the technology to the market is provided.
Thanapal, P.; Prabhu, J.; Jakhar, Mridula
In recent years, many industries have started implementing new technologies for tracing and tracking their products. These technologies are a boon to their management systems. The technology and the management system have to work in parallel to avoid loopholes. Many technologies are available, and the most difficult and important part is choosing the best among them. When choosing a technology, it is important to make sure that it integrates properly with the other parameters of the management system. An industry management system consists of many levels, such as the initial level, intermediate level, final level, and tracking. Tracking a product from its initial stage is becoming a trend. To keep up with this trend and with company demand, products are being integrated with barcodes, RFID tags, NFC tags, or other traceable technologies. Many supply chain management systems are also adopting these techniques.
Hammond, Maria; Nong, Rachel Yuan; Ericsson, Olle; Pardali, Katerina; Landegren, Ulf
Patterns of protein interactions provide important insights in basic biology, and their analysis plays an increasing role in drug development and diagnostics of disease. We have established a scalable technique to compare two biological samples for the levels of all pairwise interactions among a set of targeted protein molecules. The technique is a combination of the proximity ligation assay with readout via dual tag microarrays. In the proximity ligation assay, protein identities are encoded as DNA sequences by attaching DNA oligonucleotides to antibodies directed against the proteins of interest. Upon binding by pairs of antibodies to proteins present in the same molecular complexes, ligation reactions give rise to reporter DNA molecules that contain the combined sequence information from the two DNA strands. The ligation reactions also serve to incorporate a sample barcode in the reporter molecules to allow for direct comparison between pairs of samples. The samples are evaluated using a dual tag microarray where information is decoded, revealing which pairs of tags have become joined. As a proof of concept, we demonstrate that this approach can be used to detect a set of five proteins and their pairwise interactions both in cellular lysates and in fixed tissue culture cells. This paper provides a general strategy to analyze the extent of any pairwise interactions in large sets of molecules by decoding reporter DNA strands that identify the interacting molecules.
Background: Microarray analysis has become a widely used technique for the study of gene-expression patterns on a genomic scale. As more and more laboratories are adopting microarray technology, there is a need for powerful and easy-to-use microarray databases facilitating array fabrication, labeling, hybridization, and data analysis. The wealth of data generated by this high-throughput approach renders adequate database and analysis tools crucial for the pursuit of insights into the transcriptomic behavior of cells. Results: MARS (Microarray Analysis and Retrieval System) provides a comprehensive MIAME-supportive suite for storing, retrieving, and analyzing multi-color microarray data. The system comprises a laboratory information management system (LIMS), quality control management, and a sophisticated user management system. MARS is fully integrated into an analytical pipeline of microarray image analysis, normalization, gene expression clustering, and mapping of gene expression data onto biological pathways. The incorporation of ontologies and the use of MAGE-ML enable the export of studies stored in MARS to public repositories and other databases accepting these documents. Conclusion: We have developed an integrated system tailored to serve the specific needs of microarray-based research projects using a unique fusion of Web-based and standalone applications connected to the latest J2EE application server technology. The presented system is freely available for academic and non-profit institutions. More information can be found at http://genome.tugraz.at.
Liu, Hongfang; Li, Xin; Yoon, Victoria; Clarke, Robert
As the most common cancer among women, breast cancer results from the accumulation of mutations in essential genes. Recent advances in high-throughput gene expression microarray technology have inspired researchers to use the technology to assist breast cancer diagnosis, prognosis, and treatment prediction. However, the high dimensionality of microarray experiments and public access to data from many experiments have caused inconsistencies, which initiated the development of controlled terminologies and ontologies for annotating microarray experiments, such as the standard Microarray Gene Expression Data (MGED) ontology (MO). In this paper, we developed BCM-CO, an ontology tailored specifically for indexing clinical annotations of breast cancer microarray samples from the NCI Thesaurus. Our research showed that the coverage of the NCI Thesaurus is very limited with respect to i) terms used by researchers to describe breast cancer histology (covering 22 out of 48 histology terms); ii) breast cancer cell lines (covering one out of 12 cell lines); and iii) classes corresponding to breast cancer grading and staging. By incorporating a wider range of those terms into BCM-CO, we were able to index breast cancer microarray samples from GEO using BCM-CO and the MGED ontology, and we developed a prototype system with a web interface that allows the retrieval of microarray data based on the ontology annotations. PMID:18999108
Background: Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algorithms is not possible due to the lack of biological ground truth information. To overcome this fundamental problem, the use of simulated microarray data for algorithm validation has been proposed. Results: We present a microarray simulation model which can be used to validate different kinds of data analysis algorithms. The proposed model is unique in the sense that it includes all the steps that affect the quality of real microarray data. These steps include the simulation of biological ground truth data, the application of biological and measurement-technology-specific error models, and finally the simulation of microarray slide manufacturing and hybridization. After all these steps are taken into account, the simulated data has realistic biological and statistical characteristics. The applicability of the proposed model is demonstrated by several examples. Conclusion: The proposed microarray simulation model is modular and can be used in different kinds of applications. It includes several error models that have been proposed earlier and it can be used with different types of input data. The model can be used to simulate both spotted two-channel and oligonucleotide-based single-channel microarrays. All this makes the model a valuable tool, for example, in the validation of data analysis algorithms.
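To make the idea of validating against simulated ground truth concrete, the following is a toy sketch of a two-channel simulation. It is not the model from the paper, whose error models the abstract does not specify: the multiplicative-plus-additive noise form, the parameter values, and all function names here are assumptions chosen for illustration. Slide manufacturing and hybridization effects are not modeled.

```python
import math
import random
from typing import List, Tuple


def noisy_intensity(true_intensity: float, rng: random.Random,
                    mult_sd: float, add_sd: float) -> float:
    """Apply an assumed multiplicative (log-normal) plus additive error."""
    value = true_intensity * math.exp(rng.gauss(0.0, mult_sd)) + rng.gauss(0.0, add_sd)
    return max(value, 1.0)  # floor so that the logarithm stays defined


def simulate_two_channel(n_genes: int = 1000, frac_changed: float = 0.1,
                         mult_sd: float = 0.1, add_sd: float = 20.0,
                         seed: int = 0) -> Tuple[List[float], List[float]]:
    """Return (true log2 ratios, observed log2 ratios) for n_genes spots."""
    rng = random.Random(seed)
    truth, observed = [], []
    for _ in range(n_genes):
        log2_ref = rng.gauss(10.0, 1.5)  # ground-truth reference abundance
        # Only a fraction of genes is truly differentially expressed.
        true_lr = rng.gauss(0.0, 1.0) if rng.random() < frac_changed else 0.0
        ref = noisy_intensity(2.0 ** log2_ref, rng, mult_sd, add_sd)
        sample = noisy_intensity(2.0 ** (log2_ref + true_lr), rng, mult_sd, add_sd)
        truth.append(true_lr)
        observed.append(math.log2(sample / ref))
    return truth, observed


# Usage: an analysis algorithm can be scored by how well it recovers `truth`
# from `observed`, which is impossible with real data.
truth, observed = simulate_two_channel(n_genes=200, seed=1)
```

Fixing the seed makes a simulated data set reproducible, so competing algorithms can be compared on exactly the same spots.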
Choe, Jae Gol; Shin, Kyung Ho; Lee, Min Soo; Kim, Meyoung Kon [Korea University Medical School, Seoul (Korea, Republic of)]
Microarray technology allows the simultaneous analysis of gene expression patterns of thousands of genes, in a systematic fashion, under a similar set of experimental conditions, thus making the data highly comparable. In some cases arrays are used simply as a primary screen leading to downstream molecular characterization of individual gene candidates. In other cases, the goal of expression profiling is to begin to identify complex regulatory networks underlying developmental processes and disease states. Microarrays were originally used with cell lines or other simple model systems. More recently, microarrays have been used in the analysis of more complex biological tissues including neural systems and the brain. The application of cDNA arrays in neuropsychiatry has lagged behind other fields for a number of reasons. These include a requirement for a large amount of input probe RNA in fluorescent-glass based array systems and the cellular complexity introduced by multicellular brain and neural tissues. An additional factor that impacts the general use of microarrays in neuropsychiatry is the lack of availability of sequenced clone sets from model systems. While human cDNA clones have been widely available, high-quality rat, mouse, and Drosophila clones, among others, are just becoming widely available. A final factor in the application of cDNA microarrays in neuropsychiatry is the cost of commercial arrays. As academic microarray facilities become more commonplace, custom-made arrays will become more widely available at a lower cost, allowing more widespread applications. In summary, microarray technology is rapidly having an impact on many areas of biomedical research. Radioisotope-nylon based microarrays offer alternatives that may in some cases be more sensitive, flexible, inexpensive, and universal as compared to other array formats, such as fluorescent-glass arrays. In some situations of limited RNA or exotic species, radioactive membrane microarrays may be the most
Shang, Yuqin; Zeng, Yun; Zeng, Yong
Protein glycosylation is one of the key processes that play essential roles in biological functions and dysfunctions. However, progress in glycomics has considerably lagged behind genomics and proteomics, due in part to the enormous challenges in analysis of glycans. Here we present a new integrated and automated microfluidic lectin barcode platform to substantially improve the performance of lectin array for focused glycomic profiling. The chip design and flow control were optimized to promote the lectin-glycan binding kinetics and speed of lectin microarray. Moreover, we established an on-chip lectin assay which employs a very simple blocking method to effectively suppress the undesired background due to lectin binding of antibodies. Using this technology, we demonstrated focused differential profiling of tissue-specific glycosylation changes of a biomarker, CA125 protein purified from ovarian cancer cell line and different tissues from ovarian cancer patients in a fast, reproducible, and high-throughput fashion. Highly sensitive CA125 detection was also demonstrated with a detection limit much lower than the clinical cutoff value for cancer diagnosis. This microfluidic platform holds the potential to integrate with sample preparation functions to construct a fully integrated “sample-to-answer” microsystem for focused differential glycomic analysis. Thus, our technology should present a powerful tool in support of rapid advance in glycobiology and glyco-biomarker development.
Vu, Thuy Duong; Eberhardt, Ursula; Szöke, Szániszló; Groenewald, Marizeth; Robert, Vincent
This paper presents a laboratory information management system for DNA sequences (LIMS) created and based on the needs of a DNA barcoding project at the CBS-KNAW Fungal Biodiversity Centre (Utrecht, the Netherlands). DNA barcoding is a global initiative for species identification through simple DNA sequence markers. We aim at generating barcode data for all strains (or specimens) included in the collection (currently ca. 80 k). The LIMS has been developed to better manage large amounts of sequence data and to keep track of the whole experimental procedure. The system has allowed us to classify strains more efficiently as the quality of sequence data has improved, and as a result, up-to-date taxonomic names have been given to strains and more accurate correlation analyses have been carried out.
Schirripa Spagnolo, Giuseppe; Cozzella, Lorenzo; Simonetti, Carla
Nowadays all the National Central Banks are continuously studying innovative anti-counterfeiting systems for banknotes. In this note, an innovative solution is proposed, which combines the potentiality of a hylemetric approach (methodology conceptually similar to biometry), based on notes' intrinsic characteristics, with a well-known and consolidated 2D barcode identification system. In particular, in this note we propose to extract from the banknotes a univocal binary control sequence (template) and insert an encrypted version of it in a barcode printed on the same banknote. For a more acceptable look and feel of a banknote, the superposed barcode can be stamped using IR ink that is visible to near-IR image sensors. This makes the banknote verification simpler. (technical design note)
Fernández-Álvarez, Fernando Ángel; Machordom, Annie
For several groups, like nemerteans, morphology-based identification is a hard discipline, but DNA barcoding may help non-experts in the identification process. In this study, DNA barcoding is used to reveal the cryptic invasion of Pacific Cephalothrix cf. simula into Atlantic and Mediterranean coasts. Although DNA barcoding is a promising method for the identification of Nemertea, only 6 % of the known number of nemertean species is currently associated with a correct DNA barcode. Therefore, additional morphological and molecular studies are necessary to advance the utility of DNA barcoding in the characterisation of possible nemertean alien invasions.
Schori, M.; Showalter, A.M.
DNA barcoding involves the generation of DNA sequencing data from particular genetic regions in an organism and the use of these sequence data to identify or 'barcode' that organism and distinguish it from other species. Here, DNA barcoding is being used to identify several medicinal plants found in Pakistan and distinguish them from other similar species. Several challenges to the successful implementation of plant DNA barcoding are presented and discussed. Despite these challenges, DNA barcoding has the potential to uniquely identify medicinal plants and provide quality control and standardization of the plant material supplied to the pharmaceutical industry. (author)
Agasti, Sarit S; Liong, Monty; Peterson, Vanessa M; Lee, Hakho; Weissleder, Ralph
DNA barcoding is an attractive technology, as it allows sensitive and multiplexed target analysis. However, DNA barcoding of cellular proteins remains challenging, primarily because barcode amplification and readout techniques are often incompatible with the cellular microenvironment. Here we describe the development and validation of a photocleavable DNA barcode-antibody conjugate method for rapid, quantitative, and multiplexed detection of proteins in single live cells. Following target binding, this method allows DNA barcodes to be photoreleased in solution, enabling easy isolation, amplification, and readout. As a proof of principle, we demonstrate sensitive and multiplexed detection of protein biomarkers in a variety of cancer cells.
Takeuchi, Ichiro; Nakagawa, Masao; Seto, Masao
In many microarray studies, gene set selection is an important preliminary step for subsequent main tasks such as tumor classification and cancer subtype identification. In this paper, we investigate the possibility of using metric learning as an alternative to gene set selection. We develop a simple metric learning algorithm for use in microarray data analysis. Exploiting a property of the algorithm, we introduce a novel approach for extending the metric learning to be adaptive. We apply the algorithm to previously studied microarray data on malignant lymphoma subtype identification.
Ellen L Kenchington
DNA barcode sequences were developed from 557 mesopelagic and upper bathypelagic teleost specimens collected in waters off Atlantic Canada. Confident morphological identifications were available for 366 specimens, of 118 species and 93 genera, which yielded 328 haplotypes. Five of the species were novel to the Barcode of Life Database (BOLD). Most of the 118 species conformed to expectations of monophyly and the presence of a "barcode gap", though some known weaknesses in existing taxonomy were confirmed and a deficiency in published keys was revealed. Of the specimens for which no firm morphological identification was available, 156 were successfully identified to species, and a further 11 to genus, using their barcode sequences and a combination of distance- and character-based methods. The remaining 24 specimens were from species for which no reference barcode is yet available or else ones confused by apparent misidentification of publicly available sequences in BOLD. Addition of the new sequences to those previously in BOLD contributed support to recent taxonomic revisions of Chiasmodon and Poromitra, while it also revealed 18 cases of potential cryptic speciation. Most of the latter appear to result from genetic divergence among populations in different ocean basins, while the general lack of strong horizontal environmental gradients within the deep sea has allowed morphology to be conserved. Other examples of divergence appear to distinguish individuals living under the sub-tropical gyre of the North Atlantic from those under that ocean's sub-polar gyre. In contrast, the available sequences for two myctophid species, Benthosema glaciale and Notoscopelus elongatus, showed genetic structuring on finer geographic scales. The observed structure was not consistent with recent suggestions that "resident" populations of myctophids can maintain allopatry despite the mixing of ocean waters. Rather, it indicates that the very rapid speciation
Pei, Weike; Feyerabend, Thorsten B; Rössler, Jens; Wang, Xi; Postrach, Daniel; Busch, Katrin; Rode, Immanuel; Klapproth, Kay; Dietlein, Nikolaus; Quedenau, Claudia; Chen, Wei; Sauer, Sascha; Wolf, Stephan; Höfer, Thomas; Rodewald, Hans-Reimer
Developmental deconvolution of complex organs and tissues at the level of individual cells remains challenging. Non-invasive genetic fate mapping has been widely used, but the low number of distinct fluorescent marker proteins limits its resolution. Much higher numbers of cell markers have been generated using viral integration sites, viral barcodes, and strategies based on transposons and CRISPR-Cas9 genome editing; however, temporal and tissue-specific induction of barcodes in situ has not been achieved. Here we report the development of an artificial DNA recombination locus (termed Polylox) that enables broadly applicable endogenous barcoding based on the Cre-loxP recombination system. Polylox recombination in situ reaches a practical diversity of several hundred thousand barcodes, allowing tagging of single cells. We have used this experimental system, combined with fate mapping, to assess haematopoietic stem cell (HSC) fates in vivo. Classical models of haematopoietic lineage specification assume a tree with few major branches. More recently, driven in part by the development of more efficient single-cell assays and improved transplantation efficiencies, different models have been proposed, in which unilineage priming may occur in mice and humans at the level of HSCs. We have introduced barcodes into HSC progenitors in embryonic mice, and found that the adult HSC compartment is a mosaic of embryo-derived HSC clones, some of which are unexpectedly large. Most HSC clones gave rise to multilineage or oligolineage fates, arguing against unilineage priming, and suggesting coherent usage of the potential of cells in a clone. The spreading of barcodes, both after induction in embryos and in adult mice, revealed a basic split between common myeloid-erythroid development and common lymphocyte development, supporting the long-held but contested view of a tree-like haematopoietic structure.
Grant-Braham, Bruce; Britton, John
Sponsorship of Formula One (F1) motor racing, which has been used as an indirect medium of tobacco advertising for several decades, was prohibited by the 2005 European Union Tobacco Advertising Directive. Most F1 tobacco sponsorship of motor racing in the EU has since ceased, with the exception of the Scuderia Ferrari team, which continues to be funded by Philip Morris. In 2007, the Marlboro logo on Ferrari cars and other race regalia was replaced by an evolving 'barcode' design, which Ferrari later claimed was part of the livery of the car, and not a Marlboro advertisement. This study aimed to determine whether the 'barcode' graphics used by Ferrari represent 'alibi' Marlboro advertising. Academic and grey literature, and online tobacco industry document archives, were searched using terms relevant to tobacco marketing and motorsport. Tobacco sponsorship of F1 motor racing began in 1968, and Philip Morris has sponsored F1 teams since 1972. Philip Morris first used a 'barcode' design, comprising red vertical parallel lines below the word Marlboro, on the British Racing Motors F1 car in 1972. Vertical or horizontal 'barcode' designs have been used in this way, latterly without the word Marlboro, ever since. The modern 'barcode' logos occupied the same position on cars and drivers' clothing as conventional Marlboro logos in the past. The shared use of red colour by Marlboro and Ferrari is also recognised by Philip Morris as a means of promoting brand association between Marlboro and Ferrari. The Ferrari 'barcode' designs are alibi Marlboro logos and hence constitute advertising prohibited by the 2005 EU Tobacco Advertising Directive.
Rössler, Jens; Wang, Xi; Postrach, Daniel; Busch, Katrin; Rode, Immanuel; Klapproth, Kay; Dietlein, Nikolaus; Quedenau, Claudia; Chen, Wei; Sauer, Sascha; Wolf, Stephan; Höfer, Thomas; Rodewald, Hans-Reimer
Developmental deconvolution of complex organs and tissues at the level of individual cells remains challenging. Non-invasive genetic fate mapping has been widely used, but the low number of distinct fluorescent marker proteins limits its resolution. Much higher numbers of cell markers have been generated using viral integration sites, viral barcodes, and strategies based on transposons and CRISPR/Cas9 genome editing; however, temporal and tissue-specific induction of barcodes in situ has not been achieved. Here we report the development of an artificial DNA recombination locus (termed Polylox) that enables broadly applicable endogenous barcoding based on the Cre-loxP recombination system. Polylox recombination in situ reaches a practical diversity of several hundred thousand barcodes, allowing tagging of single cells. We have used this experimental system, combined with fate mapping, to assess haematopoietic stem cell (HSC) fates in vivo. Classical models of haematopoietic lineage specification assume a tree with few major branches. More recently, driven in part by the development of more efficient single-cell assays and improved transplantation efficiencies, different models have been proposed, in which unilineage priming may occur in mice and humans at the level of HSCs. We have introduced barcodes into HSC progenitors in embryonic mice, and found that the adult HSC compartment is a mosaic of embryo-derived HSC clones, some of which are unexpectedly large. Most HSC clones gave rise to multilineage or oligolineage fates, arguing against unilineage priming, and suggesting coherent usage of the potential of cells in a clone. The spreading of barcodes, both after induction in embryos and in adult mice, revealed a basic split between common myeloid-erythroid development and common lymphocyte development, supporting the long-held but contested view of a tree-like haematopoietic structure. PMID:28813413
Quilang, Jonas P; Yu, Shiny Cathlynne S
Many species of catfish are important resources for human consumption, for sport fishing and for use in the aquarium industry. In the Philippines, some species are cultivated, some are caught in the wild for food, and a few introduced species have become invasive. In this study, DNA barcoding using the mitochondrial cytochrome c oxidase I (COI) gene was done on commercially and economically important Philippine catfishes. A total of 75 specimens belonging to 11 species and 5 families were DNA barcoded. The genetic distances were computed and Neighbor-Joining (NJ) trees were constructed based on the Kimura 2-Parameter (K2P) method. The average K2P distances within species, genus, family and order were 0.2, 8.2, 12.7 and 21.9%, respectively. COI sequences clustered according to their species designation for 7 of the 11 catfishes. DNA barcoding was not able to discriminate between Arius dispar and A. manillensis and between Pterygoplichthys disjunctivus and P. pardalis. The morphological characters that are used to distinguish between these species do not complement molecular identification through DNA barcoding. DNA barcoding also showed that Clarias batrachus from the Philippines is different from the species found in India and Thailand, which supports earlier suggestions based on morphology that those found in India should be designated as C. magur and those in mainland Southeast Asia as C. aff. batrachus "Indochina". This study has shown that DNA barcoding can be used for species delineation and for tagging some species for further taxonomic investigation, which has implications for proper management and conservation strategies.
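The K2P distances reported above follow the standard Kimura two-parameter formula, d = -(1/2) ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of sites showing transitions and transversions. The sketch below computes it for a pair of aligned sequences; the function name and the decision to skip sites containing gaps or ambiguous bases are assumptions of this example, not details taken from the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1: str, seq2: str) -> float:
    """Kimura 2-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    compared = transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # assumption: ignore gaps and ambiguity codes
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    term1 = 1.0 - 2.0 * p - q
    term2 = 1.0 - 2.0 * q
    if term1 <= 0.0 or term2 <= 0.0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1 * math.sqrt(term2))


# Usage: one transition across ten aligned sites.
d = k2p_distance("AAAAAAAAAA", "GAAAAAAAAA")
print(f"{100 * d:.1f}%")
```

Averaging such pairwise values within species, genera, families and orders gives summary figures of the kind quoted in the abstract.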
Kim, Sungmin; Eo, Hae-Seok; Koo, Hyeyoung; Choi, Jun-Kil; Kim, Won
In this study, we applied DNA barcoding to identify species using short DNA sequence analysis. We examined the utility of DNA barcoding by identifying 53 Korean freshwater fish species, 233 other freshwater fish species, and 1339 saltwater fish species. We successfully developed a web-based molecular identification system for fish (MISF) using a profile hidden Markov model. MISF facilitates efficient and reliable species identification, overcoming the limitations of conventional taxonomic approaches. MISF is freely accessible at http://bioinfosys.snu.ac.kr:8080/MISF/misf.jsp .
Adriana E. Radulovici
‘Biodiversity’ means the variety of life, and it can be studied at different levels (genetic, species, ecosystem) and scales (spatial and temporal). Recent decades have shown that marine biodiversity has been severely underestimated at all levels. In order to investigate diversity patterns and underlying processes, there is a need to know what species live in the marine environment. An emerging tool for species identification, DNA barcoding can reliably assign unknown specimens to known species, while also flagging potential cryptic species and genetically distant populations. This paper reviews the role of DNA barcoding in the study of marine biodiversity at the species level.
Lee, Y. H.; Kim, T. K.; Kang, I. S.; Cho, H. S.; Son, J. S. [KAERI, Taejon (Korea, Republic of)
Solid radioactive wastes are generated from the post-irradiated fuel examination facility, the irradiated material examination facility, the research reactor, and the laboratories at KAERI. A bar-code system for solid radioactive waste management in a research organization became necessary during development of RAWMIS (Radioactive Waste Management Integration System), which can generate individual history records for efficient management of waste, documents, and all kinds of statistics. This paper introduces the design of an input and output application program that loads the resulting data into a database, and the treatment process flow derived from analyzing the current state of waste generation and the data collected by the bar-code system.
An, Jeung Hee; Kim, Tae-Hyung; Oh, Byung-Keun; Choi, Jeong Woo
Nanotechnology-based bio-barcode-amplification analysis may be an innovative approach to dopamine detection. In this study, we evaluated the efficacy of this bio-barcode DNA method in detecting dopamine from dopaminergic cells. Herein, a combination DNA barcode and bead-based immunoassay for neurotransmitter detection with PCR-like sensitivity is described. This method relies on magnetic nanoparticles with antibodies and nanoparticles that are encoded with DNA, and antibodies that can sandwich the target protein captured by the nanoparticle-bound antibodies. The aggregate sandwich structures are magnetically separated from solution, and treated in order to remove the conjugated barcode DNA. The DNA barcodes were then identified via PCR analysis. The dopamine concentration in dopaminergic cells can be readily and rapidly detected via the bio-barcode assay method. The bio-barcode assay method is, therefore, a rapid and high-throughput screening tool for the detection of neurotransmitters such as dopamine.
Ferri, Gianmarco; Alù, Milena; Corradini, Beatrice; Beduschi, Giovanni
Forensic botany can provide significant supporting evidence during criminal investigations. However, it is still an underutilized field of investigation, with its most common application limited to identifying specific as well as suspected illegal plants. The ubiquitous presence of plant species can be useful in forensics, but the absence of an accurate identification system remains the major obstacle to routinely and correctly identifying trace botanical evidence. Many plant materials cannot be identified and differentiated to the species level by traditional morphological characteristics when botanical specimens are degraded and lack physical features. By taking advantage of a universal barcode system, DNA sequencing, and other biomolecular techniques used routinely in forensic investigations, two chloroplast DNA regions were evaluated for their use as "barcoding" markers for plant identification in the field of forensics. We therefore investigated the forensic use of two non-coding plastid regions, psbA-trnH and trnL-trnF, to create a multimarker system for species identification that could be useful throughout the plant kingdom. The sequences from 63 plants belonging to our local flora were submitted and registered on the GenBank database. Sequence comparison through Blast algorithms, used to establish the level of identification (species, genus, or family), allowed us to assess the suitability of this method. The results confirmed the effectiveness of our botanic universal multimarker assay in forensic investigations.
blood glucose > 16.7 mmol/L were used as the model group and treated with Dendrobium mixture. (DEN ... Keywords: Diabetes, Gene expression, Dendrobium mixture, Microarray testing ..... homeostasis in airway smooth muscle. Am J.
Background: Microarray core facilities are commonplace in biological research organizations, and need systems for accurately tracking various logistical aspects of their operation. Although these different needs could be handled separately, an integrated management system provides benefits in organization, automation and reduction in errors. Results: We present SLIMarray (System for Lab Information Management of Microarrays), an open source, modular database web application capable of managing microarray inventories, sample processing and usage charges. The software allows modular configuration and is well suited for further development, providing users the flexibility to adapt it to their needs. SLIMarray Lite, a version of the software that is especially easy to install and run, is also available. Conclusion: SLIMarray addresses the previously unmet need for free and open source software for managing the logistics of a microarray core facility.
Bell, Cameron; Guerinet, Julien; Atkinson, Katherine M; Wilson, Kumanan
Two-dimensional (2D) barcoding has the potential to enhance documentation of vaccine encounters at the point of care. However, this is currently limited to environments equipped with dedicated barcode scanners and compatible record systems. Mobile devices may present a cost-effective alternative to leverage 2D vaccine vial barcodes and improve vaccine product-specific information residing in digital health records. Mobile devices have the potential to capture product-specific information from 2D vaccine vial barcodes. We sought to examine the feasibility, performance, and potential limitations of scanning 2D barcodes on vaccine vials using 4 different mobile phones. A unique barcode scanning app was developed for Android and iOS operating systems. The impact of 4 variables on the scan success rate, data accuracy, and time to scan were examined: barcode size, curvature, fading, and ambient lighting conditions. Two experimenters performed 4 trials 10 times each, amounting to a total of 2160 barcode scan attempts. Of the 1832 successful scans performed in this evaluation, zero produced incorrect data. Five-millimeter barcodes were the slowest to scan, although only by 0.5 seconds on average. Barcodes with up to 50% fading had a 100% success rate, but success rate deteriorated beyond 60% fading. Curved barcodes took longer to scan compared with flat, but success rate deterioration was only observed at a vial diameter of 10 mm. Light conditions did not affect success rate or scan time between 500 lux and 20 lux. Conditions below 20 lux impeded the device's ability to scan successfully. Variability in scan time was observed across devices in all trials performed. 2D vaccine barcoding is possible using mobile devices and is successful under the majority of conditions examined. Manufacturers utilizing 2D barcodes should take into consideration the impact of factors that limit scan success rates. Future studies should evaluate the effect of mobile barcoding on workflow and
Despite the large number of software tools developed to address different areas of microarray data analysis, very few offer an all-in-one solution with little learning curve. For microarray core labs, there are even fewer software packages available to help with their routine but critical tasks, such as data quality control (QC) and inventory management. We have developed a simple-to-use web portal to allow bench biologists to analyze and query complicated microarray data and related biological pathways without prior training. Both experiment-based and gene-based analysis can be easily performed, even for the first-time user, through the intuitive multi-layer design and interactive graphic links. While being friendly to inexperienced users, most parameters in Goober can be easily adjusted via drop-down menus to allow advanced users to tailor their needs and perform more complicated analysis. Moreover, we have integrated graphic pathway analysis into the website to help users examine microarray data within the relevant biological content. Goober also contains features that cover most of the common tasks in microarray core labs, such as real time array QC, data loading, array usage and inventory tracking. Overall, Goober is a complete microarray solution to help biologists instantly discover valuable information from a microarray experiment and enhance the quality and productivity of microarray core labs. The whole package is freely available at http://sourceforge.net/projects/goober. A demo web server is available at http://www.goober-array.org.
Tissue microarrays, a very potent technique, are commonly used in modern pathology for cancer tissue evaluation. Tissue microarray slides are often scanned to perform computer-aided histopathological analysis of the tissue cores. To process the image, the whole virtual slide must first be split into images of individual cores. The only way to distinguish cores corresponding to specimens in the tissue microarray is through their arrangement. Unfortunately, determining the correct order of cores is not a trivial task, as they are not labelled directly on the slide. The main aim of this study was to create a procedure capable of automatically finding and extracting cores from archival images of tissue microarrays. This software supports the work of scientists who want to perform further image processing on single cores. The proposed method is an efficient and fast procedure, working in fully automatic or semi-automatic mode. A total of 89% of punches were correctly extracted with automatic selection. With the addition of manual correction, it is possible to fully prepare the whole slide image for extraction in 2 min per tissue microarray. The proposed technique requires minimal skill and time to parse a large array of cores from a tissue microarray whole slide image into individual core images.
Zhang, Wenqian; Yu, Ying; Hertwig, Falk; Thierry-Mieg, Jean; Zhang, Wenwei; Thierry-Mieg, Danielle; Wang, Jian; Furlanello, Cesare; Devanarayan, Viswanath; Cheng, Jie; Deng, Youping; Hero, Barbara; Hong, Huixiao; Jia, Meiwen; Li, Li; Lin, Simon M; Nikolsky, Yuri; Oberthuer, André; Qing, Tao; Su, Zhenqiang; Volland, Ruth; Wang, Charles; Wang, May D; Ai, Junmei; Albanese, Davide; Asgharzadeh, Shahab; Avigad, Smadar; Bao, Wenjun; Bessarabova, Marina; Brilliant, Murray H; Brors, Benedikt; Chierici, Marco; Chu, Tzu-Ming; Zhang, Jibin; Grundy, Richard G; He, Min Max; Hebbring, Scott; Kaufman, Howard L; Lababidi, Samir; Lancashire, Lee J; Li, Yan; Lu, Xin X; Luo, Heng; Ma, Xiwen; Ning, Baitang; Noguera, Rosa; Peifer, Martin; Phan, John H; Roels, Frederik; Rosswog, Carolina; Shao, Susan; Shen, Jie; Theissen, Jessica; Tonini, Gian Paolo; Vandesompele, Jo; Wu, Po-Yen; Xiao, Wenzhong; Xu, Joshua; Xu, Weihong; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Dong, Zirui; Zhang, Ke K; Yin, Ye; Zhao, Chen; Zheng, Yuanting; Wolfinger, Russell D; Shi, Tieliu; Malkas, Linda H; Berthold, Frank; Wang, Jun; Tong, Weida; Shi, Leming; Peng, Zhiyu; Fischer, Matthias
Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.
Oligonucleotide-based microarrays with accurate gene coverage represent a key strategy for transcriptional studies in orphan species such as sunflower, H. annuus L., which lacks full genome sequences. The goal of this study was the development and functional annotation of a comprehensive sunflower unigene collection and the design and validation of a custom sunflower oligonucleotide-based microarray. Large-scale EST (>130,000 ESTs) curation, assembly and sequence annotation were performed using Blast2GO (www.blast2go.de). The EST assembly comprises 41,013 putative transcripts (12,924 contigs and 28,089 singletons). The resulting Sunflower Unigen Resource (SUR version 1.0) was used to design an oligonucleotide-based Agilent microarray for cultivated sunflower. This microarray includes a total of 42,326 features: 1,417 Agilent controls, 74 control probes for sunflower replicated 10 times (740 controls) and 40,169 different non-control probes. Microarray performance was validated using a model experiment examining the induction of senescence by water deficit. Pre-processing and differential expression analysis of Agilent microarrays was performed using the Bioconductor limma package. The analyses based on p-values calculated by eBayes (p<0.01) allowed the detection of 558 differentially expressed genes between water stress and control conditions; from these, ten genes were further validated by qPCR. Over-represented ontologies were identified using FatiScan in the Babelomics suite. This work generated a curated and trustworthy sunflower unigene collection, and a custom, validated sunflower oligonucleotide-based microarray using Agilent technology. Both the curated unigene collection and the validated oligonucleotide microarray provide key resources for sunflower genome analysis, transcriptional studies, and molecular breeding for crop improvement.
Background: We report the development of a microarray platform for rapid and cost-effective genetic mapping, and its evaluation using rice as a model. In contrast to methods employing whole-genome tiling microarrays for genotyping, our method is based on low-cost spotted microarray production, focusing only on known polymorphic features. Results: We have produced a genotyping microarray for rice, comprising 880 single feature polymorphism (SFP) elements derived from insertions/deletions identified by aligning genomic sequences of the japonica cultivar Nipponbare and the indica cultivar 93-11. The SFPs were experimentally verified by hybridization with labeled genomic DNA prepared from the two cultivars. Using the genotyping microarrays, we found high levels of polymorphism across diverse rice accessions, and were able to classify all five subpopulations of rice with high bootstrap support. The microarrays were used for mapping of a gene conferring resistance to Magnaporthe grisea, the causative organism of rice blast disease, by quantitative genotyping of samples from a recombinant inbred line population pooled by phenotype. Conclusion: We anticipate this microarray-based genotyping platform, based on its low cost-per-sample, to be particularly useful in applications requiring whole-genome molecular marker coverage across large numbers of individuals.
Because of the increasing demand for herbal remedies and for authentication of the source material, it is vital to provide a single database containing information about authentic plant materials and their potential adulterants. The database should provide DNA barcodes for data retrieval and similar...
The recognition of biological species by means of a standardized DNA barcode has taken off enormously in recent times. Driven on the one hand by the biodiversity crisis and possible global change, and on the other hand by both extremely rapid technological progress as well as
Laiho, Juha; Ståhls, Gunilla
A majority of the known Colias species (Lepidoptera: Pieridae, Coliadinae) occur in the mountainous regions of Central-Asia, vast areas that are hard to access, rendering the knowledge of many species limited due to the lack of extensive sampling. Two gene regions, the mitochondrial COI 'barcode' region and the nuclear ribosomal protein RpS2 gene region were used for exploring the utility of these DNA markers for species identification. A comprehensive sampling of COI barcodes for Central Asian Colias butterflies showed that the barcodes facilitated identification of most of the included species. Phylogenetic reconstruction based on parsimony and Neighbour-Joining recovered most species as monophyletic entities. For the RpS2 gene region species-specific sequences were registered for some of the included Colias spp. Nevertheless, this gene region was not deemed useful as additional molecular 'barcode'. A parsimony analysis of the combined COI and RpS2 data did not support the current subgeneric classification based on morphological characteristics.
Gu, Liangcai; Li, Chao; Aach, John; Hill, David E.; Vidal, Marc; Church, George M.
In contrast with advances in massively parallel DNA sequencing1, high-throughput protein analyses2-4 are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule (SM) protein detection achieved using optical methods5 is limited by the number of spectrally nonoverlapping chromophores. Here, we introduce a single molecular interaction-sequencing (SMI-Seq) technology for parallel protein interaction profiling leveraging SM advantages. DNA barcodes are attached to proteins collectively via ribosome display6 or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide (PAA) thin film to construct a random SM array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies)7 and analyzed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimeter. Furthermore, protein interactions can be measured based on the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor (GPCR) and antibody binding profiling, were demonstrated. SMI-Seq enables “library vs. library” screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity. PMID:25252978
Deoxyribonucleic acid (DNA) barcoding is a novel technology that uses a standard DNA sequence to facilitate species identification. Species identification is necessary for the authentication of traditional plant based medicines. Although a consensus has not been agreed regarding which DNA sequences can be used as ...
... in planning for future mailings and preparing for system changes necessary to adopt the new IMpb... Code 128 barcodes, which make use of Application Identifiers (AI) to define the encoded data and how it... capabilities, the Postal Service is providing advance notice of a future proposal to require customers to...
Walther, G.; Pawlowska, J.; Alastruey-Izquierdo, A.; Wrzosek, M.; Rodriguez-Tudela, J.L.; Dolatabadi, S.; Chakrabarti, A.; de Hoog, G.S.
The order Mucorales comprises predominantly fast-growing saprotrophic fungi, some of which are used for the fermentation of foodstuffs but it also includes species known to cause infections in patients with severe immune or metabolic impairments. To inventory biodiversity in Mucorales ITS barcodes
Crockett, J.D.; Carr, C.C.
Over the past several years, the use, tracking, and documentation of measuring and test equipment (M&TE) has become a major issue. New regulations are forcing companies to develop new policies for providing use history, traceability, and accountability of M&TE. This paper discusses how the Fast Flux Test Facility (FFTF), operated by Westinghouse Hanford Company and located at the Hanford site in Richland, Washington, overcame these obstacles by using a computerized system exercising bar-code technology. A data base was developed to identify M&TE containing 33 separate fields, such as manufacturer, model, range, bar-code number, and other pertinent information. A bar-code label was attached to each piece of M&TE. A second data base was created to identify the employee using the M&TE. The fields contained pertinent user information such as name, location, and payroll number. Each employee's payroll number was bar coded and attached to the back of their identification badge. A computer program was developed to automate certain tasks previously performed and tracked by hand. Bar-code technology was combined with this computer program to control the input and distribution of information, eliminate common mistakes, electronically store information, and reduce the time required to check out the M&TE for use.
Shokralla, Shadi; Hellberg, Rosalee S; Handy, Sara M; King, Ian; Hajibabaei, Mehrdad
Species substitution is a form of seafood fraud for the purpose of economic gain. DNA barcoding utilizes species-specific DNA sequence information for specimen identification. Previous work has established the usability of short DNA sequences (mini-barcodes) for identification of specimens harboring degraded DNA. This study aims at establishing a DNA mini-barcoding system for all fish species commonly used in processed fish products in North America. Six mini-barcode primer pairs targeting short (127-314 bp) fragments of the cytochrome c oxidase I (CO1) DNA barcode region were developed by examining over 8,000 DNA barcodes from species in the U.S. Food and Drug Administration (FDA) Seafood List. The mini-barcode primer pairs were then tested against 44 processed fish products representing a range of species and product types. Of the 44 products, 41 (93.2%) could be identified at the species or genus level. The greatest mini-barcoding success rate found with an individual primer pair was 88.6%, compared to the 20.5% success rate achieved by the full-length DNA barcode primers. Overall, this study presents a mini-barcoding system that can be used to identify a wide range of fish species in commercial products and may be utilized in high throughput DNA sequencing for authentication of heavily processed fish products.
Sikes, Derek S; Bowser, Matthew; Morton, John M; Bickford, Casey; Meierotto, Sarah; Hildebrandt, Kyndall
Climate change may result in ecological futures with novel species assemblages, trophic mismatch, and mass extinction. Alaska has a limited taxonomic workforce to address these changes. We are building a DNA barcode library to facilitate a metabarcoding approach to monitoring non-marine arthropods. Working with the Canadian Centre for DNA Barcoding, we obtained DNA barcodes from recently collected and authoritatively identified specimens in the University of Alaska Museum (UAM) Insect Collection and the Kenai National Wildlife Refuge collection. We submitted tissues from 4776 specimens, of which 81% yielded DNA barcodes representing 1662 species and 1788 Barcode Index Numbers (BINs), of primarily terrestrial, large-bodied arthropods. This represents 84% of the species available for DNA barcoding in the UAM Insect Collection. There are now 4020 Alaskan arthropod species represented by DNA barcodes, after including all records in Barcode of Life Data Systems (BOLD) of species that occur in Alaska - i.e., 48.5% of the 8277 Alaskan, non-marine-arthropod, named species have associated DNA barcodes. An assessment of the identification power of the library in its current state yielded fewer species-level identifications than expected, but the results were not discouraging. We believe we are the first to deliberately begin development of a DNA barcode library of the entire arthropod fauna for a North American state or province. Although far from complete, this library will become increasingly valuable as more species are added and costs to obtain DNA sequences fall.
Lyons, Eli; Sheridan, Paul; Tremmel, Georg; Miyano, Satoru; Sugano, Sumio
High-throughput screens allow for the identification of specific biomolecules with characteristics of interest. In barcoded screens, DNA barcodes are linked to target biomolecules in a manner allowing for the target molecules making up a library to be identified by sequencing the DNA barcodes using Next Generation Sequencing. To be useful in experimental settings, the DNA barcodes in a library must satisfy certain constraints related to GC content, homopolymer length, Hamming distance, and blacklisted subsequences. Here we report a novel framework to quickly generate large-scale libraries of DNA barcodes for use in high-throughput screens. We show that our framework dramatically reduces the computation time required to generate large-scale DNA barcode libraries, compared with a naïve approach to DNA barcode library generation. As a proof of concept, we demonstrate that our framework is able to generate a library consisting of one million DNA barcodes for use in a fragment antibody phage display screening experiment. We also report generating a general purpose one billion DNA barcode library, the largest such library yet reported in literature. Our results demonstrate the value of our novel large-scale DNA barcode library generation framework for use in high-throughput screening applications.
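The four constraints listed in the abstract above (GC content, homopolymer length, Hamming distance, blacklisted subsequences) can be made concrete with a small sketch. What follows is a plain rejection-sampling generator with an all-pairs distance check, roughly the kind of naive baseline the abstract contrasts against; it is not the authors' framework. All function names, the thresholds, and the blacklisted motif are hypothetical values chosen for illustration.

```python
import random


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def longest_homopolymer(seq):
    """Length of the longest run of one repeated base in a non-empty sequence."""
    longest = run = 1
    for prev, cur in zip(seq, seq[1:]):
        run = run + 1 if cur == prev else 1
        longest = max(longest, run)
    return longest


def hamming(a, b):
    """Number of mismatching positions between equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def passes_filters(seq, gc_range=(0.4, 0.6), max_homopolymer=2,
                   blacklist=("GATC",)):
    """Per-barcode constraints; all default thresholds are illustrative."""
    low, high = gc_range
    return (low <= gc_fraction(seq) <= high
            and longest_homopolymer(seq) <= max_homopolymer
            and not any(bad in seq for bad in blacklist))


def generate_library(n, length=8, min_distance=3, seed=0, max_tries=100_000):
    """Naive generator: draw random candidates, keep those that pass the
    per-barcode filters and differ from every kept barcode by at least
    min_distance positions. The pairwise check makes the cost grow
    quadratically with library size."""
    rng = random.Random(seed)
    library = []
    for _ in range(max_tries):
        if len(library) == n:
            break
        candidate = "".join(rng.choice("ACGT") for _ in range(length))
        if not passes_filters(candidate):
            continue
        if all(hamming(candidate, kept) >= min_distance for kept in library):
            library.append(candidate)
    return library


if __name__ == "__main__":
    for barcode in generate_library(10):
        print(barcode)
```

Because every accepted candidate is compared against all barcodes already kept, this approach becomes impractical well before the million-barcode scale reported in the abstract, which is the motivation for a faster generation scheme.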
Zhang, Wei; Fan, Xiaohong; Zhu, Shuifang; Zhao, Hong; Fu, Lianzhong
Comprehensive sampling is crucial to DNA barcoding, but it is rarely performed because materials are usually unavailable. In practice, only a few rather than all species of a genus are required to be identified. Thus identification of a given species using a limited sample is of great importance in current application of DNA barcodes. Here, we selected 70 individuals representing 48 species from each major lineage of Solanum, one of the most species-rich genera of seed plants, to explore whether DNA barcodes can provide reliable discrimination of specific species in the context of incomplete sampling. The chloroplast genes ndhF and trnS-trnG and the nuclear gene waxy, the commonly used markers in Solanum phylogeny, were selected as the supplementary barcodes. The tree-building and modified barcode gap methods were employed to assess species resolution. The results showed that four Solanum species of quarantine concern could be successfully identified through the two-step barcoding sampling strategy. In addition, discrepancies between nuclear and cpDNA barcodes in some samples demonstrated the ability to discriminate hybrid species, and highlighted the necessity of using barcode regions with different modes of inheritance. We conclude that efficient phylogenetic markers are good candidates as the supplementary barcodes in a given taxonomic group. Critically, we hypothesized that a specific species could be identified from a phylogenetic framework using incomplete sampling; through this, DNA barcoding will greatly benefit its current fields of application.
Erika Sendra Tavares
BACKGROUND: Towards lower latitudes the number of recognized species is not only higher, but also phylogeographic subdivision within species is more pronounced. Moreover, new genetically isolated populations are often described in recent phylogenies of Neotropical birds suggesting that the number of species in the region is underestimated. Previous COI barcoding of Argentinean bird species showed more complex patterns of regional divergence in the Neotropical than in the North American avifauna. METHODS AND FINDINGS: Here we analyzed 1,431 samples from 561 different species to extend the Neotropical bird barcode survey to lower latitudes, and detected even higher geographic structure within species than reported previously. About 93% (520) of the species were identified correctly from their DNA barcodes. The remaining 41 species were not monophyletic in their COI sequences because they shared barcode sequences with closely related species (N = 21) or contained very divergent clusters suggestive of putative new species embedded within the gene tree (N = 20). Deep intraspecific divergences overlapping with among-species differences were detected in 48 species, often with samples from large geographic areas and several including multiple subspecies. This strong population genetic structure often coincided with breaks between different ecoregions or areas of endemism. CONCLUSIONS: The taxonomic uncertainty associated with the high incidence of non-monophyletic species and discovery of putative species obscures studies of historical patterns of species diversification in the Neotropical region. We showed that COI barcodes are a valuable tool to indicate which taxa would benefit from more extensive taxonomic revisions with multilocus approaches. Moreover, our results support hypotheses that the megadiversity of birds in the region is associated with multiple geographic processes starting well before the Quaternary and extending to more recent
Zhang, Xiaomei; Li, Na; Yao, Yuanyuan; Liang, Xuming; Qu, Xianyou; Liu, Xiang; Zhu, Yingjie; Yang, Dajian; Sun, Wei
Species of the genus Tripterygium (Celastraceae) have attracted much attention owing to their excellent effect in treating autoimmune and inflammatory diseases. However, because high market demand has caused overexploitation, natural populations of the genus Tripterygium have rapidly declined. Tripterygium medicinal materials are mainly collected from the wild, making the quality of medicinal materials unstable. Additionally, identification of herbal materials from Tripterygium species and their adulterants is difficult based on morphological characters. Therefore, an accurate, convenient, and stable method is urgently needed. In this work, we developed a DNA barcoding technique to distinguish T. wilfordii HOOK. f., T. hypoglaucum (LÉVL.) HUTCH, and T. regelii SPRAGUE et TAKEDA and their adulterants based on four uniform and standard DNA regions (internal transcribed spacer 2 (ITS2), matK, rbcL, and psbA-trnH). DNA was extracted from fresh leaves collected at 26 locations. A phylogenetic tree was constructed with the Neighbor-Joining (NJ) method, while the barcoding gap was analyzed to assess identification efficiency. Compared with the other DNA barcodes applied individually or in combination, ITS2+psbA-trnH was demonstrated to be the optimal barcode. T. hypoglaucum and T. wilfordii can be considered conspecific, while T. regelii was recognized as a separate species. Furthermore, identification of commercial Tripterygium samples was conducted using BLAST against GenBank and the Species Identification System for Traditional Chinese Medicine. Our results indicated that DNA barcoding is a convenient, effective, and stable method to identify and distinguish Tripterygium and its adulterants, and could be applied as quality control for Tripterygium medicinal preparations and for monitoring of the medicinal herb trade in markets.
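The barcoding gap analysis mentioned in the abstract above can be sketched in a few lines. One common formulation, assumed here because the abstract does not give the authors' exact definition, is the smallest between-species distance minus the largest within-species distance; a positive value means the marker separates the sampled species cleanly. The function name, sample labels, and distances below are hypothetical toy values.

```python
def barcode_gap(pairwise, species_of):
    """Return min(interspecific distance) - max(intraspecific distance).

    pairwise   : iterable of (sample_a, sample_b, distance) tuples
    species_of : dict mapping each sample name to its species label
    This definition is one common convention, assumed for illustration.
    """
    intra = []
    inter = []
    for a, b, dist in pairwise:
        if species_of[a] == species_of[b]:
            intra.append(dist)
        else:
            inter.append(dist)
    if not intra or not inter:
        raise ValueError("need both within- and between-species pairs")
    return min(inter) - max(intra)


if __name__ == "__main__":
    # Toy data: two samples per species, made-up genetic distances.
    species_of = {"s1": "sp_A", "s2": "sp_A", "s3": "sp_B", "s4": "sp_B"}
    pairwise = [
        ("s1", "s2", 0.002),
        ("s3", "s4", 0.004),
        ("s1", "s3", 0.080),
        ("s1", "s4", 0.085),
        ("s2", "s3", 0.090),
        ("s2", "s4", 0.120),
    ]
    gap = barcode_gap(pairwise, species_of)
    print("barcode gap:", gap, "(positive means the species separate)")
```

Under this convention, a zero or negative gap is the pattern expected for taxa that a marker cannot tell apart, such as the two species the abstract treats as conspecific.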
Although mosquitoes are important disease vectors, their biodiversity in Pakistan is poorly known. Recent epidemics of dengue fever have revealed the need for more detailed understanding of the diversity and distributions of mosquito species in this region. DNA barcoding improves the accuracy of mosquito inventories because morphological differences between many species are subtle, leading to misidentifications. Sequence variation in the barcode region of the mitochondrial COI gene was used to identify mosquito species, reveal genetic diversity, and map the distribution of the dengue-vector species in Pakistan. Analysis of 1684 mosquitoes from 491 sites in Punjab and Khyber Pakhtunkhwa during 2010-2013 revealed 32 species, with the assemblage dominated by Culex quinquefasciatus (61% of the collection). The genus Aedes (Stegomyia) comprised 15% of the specimens, and was represented by six taxa, with the two dengue vector species, Ae. albopictus and Ae. aegypti, dominant and broadly distributed. Anopheles made up another 6% of the catch, with An. subpictus dominating. Barcode sequence divergence in conspecific specimens ranged from 0 to 2.4%, while congeneric species showed 2.3 to 17.8% divergence. A global haplotype analysis of disease vectors showed the presence of multiple haplotypes, although a single haplotype of each dengue-vector species was dominant in most countries. The geographic distribution of Ae. aegypti and Ae. albopictus showed that the latter species was dominant and found in both rural and urban environments. As the first DNA-based analysis of mosquitoes in Pakistan, this study has begun the construction of a barcode reference library for the mosquitoes of this region. Levels of genetic diversity varied among species. Because of its capacity to differentiate species, even those with subtle morphological differences, DNA barcoding aids accurate tracking of vector populations.
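The conspecific and congeneric divergence ranges quoted in this abstract are pairwise sequence distances. The abstract does not say which distance model was used, so the following is only a minimal sketch using the uncorrected p-distance (the fraction of differing sites); the function name and the example sequences are hypothetical.

```python
def p_distance(seq_a, seq_b):
    """Uncorrected p-distance between two aligned sequences.

    Sites where either sequence has a gap or an ambiguous base are
    skipped, so the result is differences / compared sites.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in valid or b not in valid:
            continue  # gap or ambiguity code: not comparable
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Hypothetical 10-bp fragments differing at one site.
print(p_distance("ACGTACGTAC", "ACGTACGTAT"))
```

Multiplying the result by 100 gives a percentage comparable in form to the divergence values reported above, although a corrected model would give slightly different numbers for divergent sequences.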
Novak, Jaroslav P; Kim, Seon-Young; Xu, Jun
BACKGROUND: DNA microarrays are a powerful technology that can provide a wealth of gene expression data for disease studies, drug development, and a wide scope of other investigations. Because of the large volume and inherent variability of DNA microarray data, many new statistical methods have...
Peña, V; Hernandez-Kantun, J; Grall, J; Pardo, C; Lopez, L; Barbara, I; Le Gall, L; Barreiro, R
Fertile gametangial plants of Phymatolithon calcareum, which are seldom reported in the Atlantic European coasts, were collected as encrusting, epilithic plants in a subtidal maerl bed in Brittany (France). Based on their morphological features, the plants were identified as P. calcareum. This identification was further confirmed by DNA barcodes using as a reference COI-5P sequences obtained from the neotype together with recent collections from the Atlantic European maerl beds. The reproduct...
Model organisms have played an important role in the elucidation of multiple genes and cellular processes that regulate aging. In this study we utilized the budding yeast, Saccharomyces cerevisiae, in a large-scale screen for genes that function in the regulation of chronological lifespan, which is defined by the number of days that non-dividing cells remain viable. A pooled collection of viable haploid gene deletion mutants, each tagged with unique identifying DNA "bar-code" sequences, was chronologically aged in liquid culture. Viable mutants in the aging population were selected at several time points and then detected using a microarray DNA hybridization technique that quantifies abundance of the barcode tags. Multiple short- and long-lived mutants were identified using this approach. Among the confirmed short-lived mutants were those defective for autophagy, indicating a key requirement for the recycling of cellular organelles in longevity. Defects in autophagy also prevented lifespan extension induced by limitation of amino acids in the growth media. Among the confirmed long-lived mutants were those defective in the highly conserved de novo purine biosynthesis pathway (the ADE genes), which ultimately produces IMP and AMP. Blocking this pathway extended lifespan to the same degree as calorie (glucose) restriction. A recently discovered cell-extrinsic mechanism of chronological aging involving acetic acid secretion and toxicity was suppressed in a long-lived ade4Δ mutant and exacerbated by a short-lived atg16Δ autophagy mutant. The identification of multiple novel effectors of yeast chronological lifespan will greatly aid in the elucidation of mechanisms that cells and organisms utilize in slowing down the aging process.
Marco-Herrero, Elena; González-Gordillo, J. Ignacio; Cuesta, José A.
The morphology of the megalopa stage of the panopeid Rhithropanopeus harrisii is redescribed and illustrated in detail from plankton specimens identified by DNA barcode (16S mtDNA) as previous descriptions do not meet the current standard of brachyuran larval description. Several morphological characters vary widely from those of other panopeid species which could cast some doubt on the species' placement in the same family. Besides, some anomalous megalopae of R. harrisii were found among specimens reared at the laboratory from zoeae collected in the plankton. These anomalous morphological features are discussed in terms of problems associated with laboratory rearing conditions.
Chen, Hua; Li, Jun
Microarrays are important tools for high-throughput analysis of biomolecules. The use of microarrays for parallel screening of nucleic acid and protein profiles has become an industry standard. A few limitations of microarrays are the requirement for relatively large sample volumes and long incubation times, as well as the limit of detection. In addition, traditional microarrays rely on bulky instrumentation for detection, and sample amplification and labeling are quite laborious, which increases analysis cost and delays results. These problems keep microarray techniques from point-of-care and field applications. One strategy for overcoming these problems is to develop nanoarrays, particularly electronics-based nanoarrays. With further miniaturization, higher sensitivity, and simplified sample preparation, nanoarrays could potentially be employed for biomolecular analysis in personal healthcare and monitoring of trace pathogens. This chapter introduces the concept and advantages of nanotechnology and then describes current methods and protocols for novel nanoarrays in three aspects: (1) label-free nucleic acid analysis using nanoarrays, (2) nanoarrays for protein detection by conventional optical fluorescence microscopy as well as by novel label-free methods such as atomic force microscopy, and (3) nanoarrays for enzyme-based assays. These nanoarrays will have significant applications in drug discovery, medical diagnosis, genetic testing, environmental monitoring, and food safety inspection.
Hu, Jianjun; Li, Haifeng; Waterman, Michael S; Zhou, Xianghong Jasmine
Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in the Stanford Microarray Database contain fewer than eight samples. We present the integrative Missing Value Estimation method (iMISS), which incorporates information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm, by up to 15% in our benchmark tests. We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over state-of-the-art missing value estimation approaches such as LLS and are especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets.
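To make the idea of neighbor-gene imputation concrete, here is a minimal sketch of plain k-nearest-neighbour imputation on a single expression matrix. It is not iMISS and not LLS: it uses no reference datasets and no regression, only an average over the most similar genes. The function name, the `None` convention for missing values, and the toy matrix are all assumptions made for illustration.

```python
import math


def knn_impute(matrix, k=2):
    """Fill missing entries (None) with the mean of the k nearest genes.

    matrix: list of gene rows, each a list of floats or None.
    Distance between two genes is the root-mean-square difference over
    the samples observed in both rows.
    """
    def distance(row_a, row_b):
        pairs = [(a, b) for a, b in zip(row_a, row_b)
                 if a is not None and b is not None]
        if not pairs:
            return math.inf  # nothing in common: never a neighbour
        return math.sqrt(sum((a - b) ** 2 for a, b in pairs) / len(pairs))

    imputed = [list(row) for row in matrix]
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate neighbours must have an observed value in column j.
            candidates = sorted(
                (distance(row, other), other[j])
                for idx, other in enumerate(matrix)
                if idx != i and other[j] is not None
            )
            neighbours = [v for d, v in candidates[:k] if d != math.inf]
            if neighbours:
                imputed[i][j] = sum(neighbours) / len(neighbours)
    return imputed


# Toy data: gene 0 is missing its third sample.
genes = [
    [1.0, 2.0, None],
    [1.0, 2.0, 3.0],
    [1.1, 2.1, 3.2],
    [5.0, 6.0, 9.0],
]
print(knn_impute(genes, k=2)[0])
```

With few samples per gene, the distances above rest on very little data, which is the situation the abstract describes as motivating the use of additional reference datasets when choosing neighbours.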
Kokel, David; Rennekamp, Andrew J; Shah, Asmi H; Liebel, Urban; Peterson, Randall T
For decades, studying the behavioral effects of individual drugs and genetic mutations has been at the heart of efforts to understand and treat nervous system disorders. High-throughput technologies adapted from other disciplines (e.g., high-throughput chemical screening, genomics) are changing the scale of data acquisition in behavioral neuroscience. Massive behavioral datasets are beginning to emerge, particularly from zebrafish labs, where behavioral assays can be performed rapidly and reproducibly in 96-well, high-throughput format. Mining these datasets and making comparisons across different assays are major challenges for the field. Here, we review behavioral barcoding, a process by which complex behavioral assays are reduced to a string of numeric features, facilitating analysis and comparison within and across datasets. Copyright © 2012 Elsevier Ltd. All rights reserved.
The Division of Birds, National Museum of Natural History, Smithsonian Institution in Washington, DC, has obtained and released DNA barcodes for 2,808 frozen tissue samples. Of the 1,403 species represented by these samples, 1,147 species have not been barcoded previously. This data release increases the number of bird species with standard barcodes by 91%. These records meet the data standard of the Consortium for the Barcode of Life and they have the reserved keyword BARCODE in GenBank. The data are now available on GenBank and the Barcode of Life Data Systems.
Cao, Cuong; Dhumpa, Raghuram; Bang, Dang Duong
In this paper, a coupling of fluorophore-DNA barcode and bead-based immunoassay for detecting avian influenza virus (AIV) with PCR-like sensitivity is reported. The assay is based on the use of sandwich immunoassay and fluorophore-tagged oligonucleotides as representative barcodes. The detection involves the sandwiching of the target AIV between magnetic immunoprobes and barcode-carrying immunoprobes. Because each barcode-carrying immunoprobe is functionalized with a multitude of fluorophore-DNA barcode strands, many DNA barcodes are released for each positive binding event resulting...
Li, Lei; Wang, Xiangfeng; Stolc, Viktor
Sequencing and computational annotation revealed several features, including high gene numbers, unusual composition of the predicted genes and a large number of genes lacking homology to known genes, that distinguish the rice (Oryza sativa) genome from that of other fully sequenced model species. We report here a full-genome transcription analysis of the indica rice subspecies using high-density oligonucleotide tiling microarrays. Our results provided expression data support for the existence of 35,970 (81.9%) annotated gene models and identified 5,464 unique transcribed intergenic regions that share similar compositional properties with the annotated exons and have significant homology to other plant proteins. Elucidating and mapping of all transcribed regions revealed an association between global transcription and cytological chromosome features, and an overall similarity of transcriptional...
Unlike significance analysis of gene expression, which looks for genes that are differentially regulated, feature selection in microarray-based prognostic gene expression analysis aims at finding a subset of marker genes that are not only differentially expressed but also informative for prediction. Unfortunately, feature selection in the microarray literature is dominated by the simple heuristic univariate gene filter paradigm, which selects differentially expressed genes according to their statistical significance. We introduce a combinatory feature selection strategy that integrates differential gene expression analysis with the Gram-Schmidt process to identify prognostic genes that are both statistically significant and highly informative for predicting tumour survival outcomes. Empirical application to leukemia and ovarian cancer survival data through within- and cross-study validations shows that the feature space can be largely reduced while achieving improved testing performance.
In biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD), to discover patterns of biological progression underlying microarray gene expression data. SPD assumes that individual samples of a microarray dataset are related by an unknown biological process (i.e., differentiation, development, cell cycle, disease progression), and that each sample represents one unknown point along the progression of that process. SPD aims to organize the samples in a manner that reveals the underlying progression and to simultaneously identify subsets of genes that are responsible for that progression. We demonstrate the performance of SPD on a variety of microarray datasets that were generated by sampling a biological process at different points along its progression, without providing SPD any information about the underlying process. When applied to a cell cycle time series microarray dataset, SPD was not provided any prior knowledge of the samples' time order or of which genes are cell-cycle regulated, yet SPD recovered the correct time order and identified many genes that have been associated with the cell cycle. When applied to B-cell differentiation data, SPD recovered the correct order of stages of normal B-cell differentiation and the linkage between preB-ALL tumor cells and their cell of origin, preB. When applied to mouse embryonic stem cell differentiation data, SPD uncovered a landscape of ESC differentiation into various lineages and genes that represent both generic and lineage-specific processes. When applied to a prostate cancer microarray dataset, SPD identified gene modules that reflect a progression consistent with disease stages. SPD may be best viewed as a novel tool for synthesizing biological hypotheses because it provides a likely biological progression underlying a microarray dataset and, perhaps more importantly, the
Smith, M Alex; Eveleigh, Eldon S; McCann, Kevin S; Merilo, Mark T; McCarthy, Peter C; Van Rooyen, Kathleen I
The efficient and effective monitoring of individuals and populations is critically dependent on correct species identification. While this point may seem obvious, identifying the majority of the more than 100 natural enemies involved in the spruce budworm (Choristoneura fumiferana, SBW) food web remains a non-trivial endeavor. Insect parasitoids play a major role in the processes governing the population dynamics of SBW throughout eastern North America. However, these species are at the leading edge of the taxonomic impediment, and integrating standardized identification capacity into existing field programs would provide clear benefits. We asked to what extent DNA barcoding the SBW food web would alter our understanding of the diversity and connectance of the food web and the frequency of generalists vs. specialists in different forest habitats. We DNA barcoded over 10% of the insects collected from the SBW food web in three New Brunswick forest plots from 1983 to 1993. For 30% of these specimens, we amplified at least one additional nuclear region. When the nodes of the food web were estimated based on barcode divergences (using molecular operational taxonomic units (MOTU) or phylogenetic diversity (PD)), the food web became much more diverse and connectance was reduced. We tested one measure of food web structure (the "bird feeder effect") and found no difference compared to the morphologically based predictions. Many, but not all, of the presumably polyphagous parasitoids now appear to be morphologically cryptic host specialists. To our knowledge, this project is the first to barcode a food web in which interactions have already been well documented and described in space, time and abundance. It is poised to be a system in which field-based methods permit the identification capacity required by forestry scientists. Food web barcoding provided an effective tool for the accurate identification of all species involved in the cascading effects of future budworm
Andersen, G.L.; He, Z.; DeSantis, T.Z.; Brodie, E.L.; Zhou, J.
Microarrays have proven to be a useful and high-throughput method to provide targeted DNA sequence information for up to many thousands of specific genetic regions in a single test. A microarray consists of multiple DNA oligonucleotide probes that, under high stringency conditions, hybridize only to specific complementary nucleic acid sequences (targets). A fluorescent signal indicates the presence and, in many cases, the abundance of genetic regions of interest. In this chapter we will look at how microarrays are used in microbial ecology, especially with the recent increase in microbial community DNA sequence data. Of particular interest to microbial ecologists, phylogenetic microarrays are used for the analysis of phylotypes in a community and functional gene arrays are used for the analysis of functional genes, and, by inference, phylotypes in environmental samples. A phylogenetic microarray that has been developed by the Andersen laboratory, the PhyloChip, will be discussed as an example of a microarray that targets the known diversity within the 16S rRNA gene to determine microbial community composition. Using multiple, confirmatory probes to increase the confidence of detection and a mismatch probe for every perfect match probe to minimize the effect of cross-hybridization by non-target regions, the PhyloChip is able to simultaneously identify any of thousands of taxa present in an environmental sample. The PhyloChip is shown to reveal greater diversity within a community than rRNA gene sequencing due to the placement of the entire gene product on the microarray compared with the analysis of up to thousands of individual molecules by traditional sequencing methods. A functional gene array that has been developed by the Zhou laboratory, the GeoChip, will be discussed as an example of a microarray that dynamically identifies functional activities of multiple members within a community. The recent version of GeoChip contains more than 24,000 50mer
Gaharwar, Akhilesh K.; Arpanaei, Ayyoob; Andresen, Thomas Lars
Three dimensional (3D) biomaterial microarrays hold enormous promise for regenerative medicine because of their ability to accelerate the design and fabrication of biomimetic materials. Such tissue-like biomaterials can provide an appropriate microenvironment for stimulating and controlling stem cell differentiation into tissue-specific lineages. The use of 3D biomaterial microarrays can, if optimized correctly, result in a more than 1000-fold reduction in biomaterials and cells consumption when engineering optimal materials combinations, which makes these miniaturized systems very attractive for tissue engineering and drug screening applications.
Baum, Andreas; Dominiak, Malgorzata Maria; Vidal-Melgosa, Silvia
... and carbohydrate microarray analysis were performed directly on the crude lime peel extracts during the time course of the extractions. Multivariate analysis of the data was carried out to predict final pectin yields. Fourier transform infrared spectroscopy (FTIR) was found applicable for determining the optimal extraction time for the enzymatic and acidic extraction processes, respectively. The combined results of FTIR and carbohydrate microarray analysis suggested major differences in the crude pectin extracts obtained by enzymatic and acid extraction, respectively. Enzymatically extracted pectin, thus, showed ... and that FTIR and carbohydrate microarray analysis have potential to be developed into online process analysis tools for prediction of pectin extraction yields and pectin features from measurements on crude pectin extracts.
Ashfaq, Muhammad; Akhtar, Saleem; Khan, Arif M; Adamowicz, Sarah J; Hebert, Paul D N
DNA barcodes were obtained for 81 butterfly species belonging to 52 genera from sites in north-central Pakistan to test the utility of barcoding for their identification and to gain a better understanding of regional barcode variation. These species represent 25% of the butterfly fauna of Pakistan and belong to five families, although the Nymphalidae were dominant, comprising 38% of the total specimens. Barcode analysis showed that maximum conspecific divergence was 1.6%, while there was 1.7-14.3% divergence from the nearest neighbour species. Barcode records for 55 species showed Barcode of Life Data Systems (BOLD), but only 26 of these cases involved specimens from neighbouring India and Central Asia. Analysis revealed that most species showed little incremental sequence variation when specimens from other regions were considered, but a threefold increase was noted in a few cases. There was a clear gap between maximum intraspecific and minimum nearest neighbour distance for all 81 species. Neighbour-joining cluster analysis showed that members of each species formed a monophyletic cluster with strong bootstrap support. The barcode results revealed two provisional species that could not be clearly linked to known taxa, while 24 other species gained their first coverage. Future work should extend the barcode reference library to include all butterfly species from Pakistan as well as neighbouring countries to gain a better understanding of regional variation in barcode sequences in this topographically and climatically complex region. © 2013 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd.
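The "gap" described in this abstract compares, for each species, the maximum intraspecific distance with the minimum distance to any other species. The sketch below shows that comparison under simplifying assumptions: sequences are already aligned to equal length, the distance is a plain fraction of differing sites, and at least two species are present. The function names and toy sequences are hypothetical, not taken from the study.

```python
from itertools import combinations


def site_difference(seq_a, seq_b):
    """Fraction of aligned sites at which two sequences differ."""
    return sum(a != b for a, b in zip(seq_a, seq_b)) / len(seq_a)


def barcode_gap(records):
    """Summarise the barcode gap for each species.

    records: dict mapping species name -> list of aligned sequences.
    Returns dict mapping species -> (max_intraspecific,
    min_nearest_neighbour, has_gap).
    """
    summary = {}
    for species, seqs in records.items():
        # Largest distance between any two specimens of the same species;
        # 0.0 when only one specimen is available.
        max_intra = max(
            (site_difference(a, b) for a, b in combinations(seqs, 2)),
            default=0.0,
        )
        # Smallest distance to any specimen of any other species.
        min_nn = min(
            site_difference(a, b)
            for other, other_seqs in records.items()
            if other != species
            for a in seqs
            for b in other_seqs
        )
        summary[species] = (max_intra, min_nn, max_intra < min_nn)
    return summary


# Toy example with two hypothetical species.
toy = {
    "species_1": ["AAAAAAAAAA", "AAAAAAAAAT"],
    "species_2": ["AAAATTTTTT"],
}
print(barcode_gap(toy))
```

A species has a gap in this sense when its third value is `True`, which is the condition the abstract reports for all 81 species.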
Liu, Chang; Shi, Linchun; Xu, Xiaolan; Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin
DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, "DNA barcode" actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers (ITS2, rbcL, matK, psbA-trnH, and CO1) was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and a relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interest, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications.
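Coding capacity matters here because a barcode sequence of several hundred bases has to fit into one symbol. One common way to shrink DNA text before encoding it is to store each base in two bits. The sketch below illustrates that idea only; it is an assumption for illustration and not the compression scheme evaluated in the study or used by its web server. It handles only unambiguous A, C, G and T and raises `KeyError` on anything else.

```python
_CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
_BASES = "ACGT"


def pack_dna(seq):
    """Pack an A/C/G/T string into bytes, four bases per byte.

    Returns (length, data); the length is needed to drop the padding
    bits of the last byte when unpacking.
    """
    seq = seq.upper()
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | _CODE[base]
        # Left-align a short final chunk so unpacking reads it first.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return len(seq), bytes(out)


def unpack_dna(length, data):
    """Inverse of pack_dna."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(_BASES[(byte >> shift) & 3])
    return "".join(bases[:length])


length, packed = pack_dna("ACGTAC")
print(length, packed, unpack_dna(length, packed))
```

Packed this way, a sequence needs a quarter of the bytes of its plain-text form, at the cost of having to store the length and of losing ambiguity codes such as N.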
Xu, Songzhi; Li, Dezhu; Li, Jianwu; Xiang, Xiaoguo; Jin, Weitao; Huang, Weichang; Jin, Xiaohua; Huang, Luqi
DNA barcoding has been proposed to be one of the most promising tools for accurate and rapid identification of taxa. However, few publications have evaluated the efficiency of DNA barcoding for the large genera of flowering plants. Dendrobium, one of the largest genera of flowering plants, contains many species that are important in horticulture, medicine and biodiversity conservation. Besides, Dendrobium is a notoriously difficult group to identify. DNA barcoding was expected to be a supplementary means for species identification, conservation and future studies in Dendrobium. We assessed the power of 11 candidate barcodes on the basis of 1,698 accessions of 184 Dendrobium species obtained primarily from mainland Asia. Our results indicated that five single barcodes, i.e., ITS, ITS2, matK, rbcL and trnH-psbA, can be easily amplified and sequenced with the currently established primers. Four barcodes, ITS, ITS2, ITS+matK, and ITS2+matK, have distinct barcoding gaps. ITS+matK was the optimal barcode based on all evaluation methods. Furthermore, the efficiency of ITS+matK was verified in four other large genera including Ficus, Lysimachia, Paphiopedilum, and Pedicularis in this study. Therefore, we tentatively recommend the combination of ITS+matK as a core DNA barcode for large flowering plant genera.
Dai, Yilin; Guo, Ling; Li, Meng; Chen, Yi-Bu
Microarray data analysis presents a significant challenge to researchers who are unable to use the powerful Bioconductor and its numerous tools due to their lack of knowledge of R language. Among the few existing software programs that offer a graphic user interface to Bioconductor packages, none have implemented a comprehensive strategy to address the accuracy and reliability issue of microarray data analysis due to the well known probe design problems associated with many widely used microarray chips. There is also a lack of tools that would expedite the functional analysis of microarray results. We present Microarray Я US, an R-based graphical user interface that implements over a dozen popular Bioconductor packages to offer researchers a streamlined workflow for routine differential microarray expression data analysis without the need to learn R language. In order to enable a more accurate analysis and interpretation of microarray data, we incorporated the latest custom probe re-definition and re-annotation for Affymetrix and Illumina chips. A versatile microarray results output utility tool was also implemented for easy and fast generation of input files for over 20 of the most widely used functional analysis software programs. Coupled with a well-designed user interface, Microarray Я US leverages cutting edge Bioconductor packages for researchers with no knowledge in R language. It also enables a more reliable and accurate microarray data analysis and expedites downstream functional analysis of microarray results.
Kerr Kathleen F
Background: As part of its broad and ambitious mission, the MicroArray Quality Control (MAQC) project reported the results of experiments using External RNA Controls (ERCs) on five microarray platforms. For most platforms, several different methods of data processing were considered. However, there was no similar consideration of different methods for processing the data from the Agilent two-color platform. While this omission is understandable given the scale of the project, it can create the false impression that there is consensus about the best way to process Agilent two-color data. It is also important to consider whether ERCs are representative of all the probes on a microarray. Results: A comparison of different methods of processing Agilent two-color data shows substantial differences among methods for low-intensity genes. The sensitivity and specificity for detecting differentially expressed genes vary substantially among methods. Analysis also reveals that the ERCs in the MAQC data only span the upper half of the intensity range, and therefore cannot be representative of all genes on the microarray. Conclusion: Although ERCs demonstrate good agreement between observed and expected log-ratios on the Agilent two-color platform, such an analysis is incomplete. Simple loess normalization outperformed data processing with Agilent's Feature Extraction software for accurate identification of differentially expressed genes. Results from studies using ERCs should not be over-generalized when ERCs are not representative of all probes on a microarray.
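The intensity-dependent normalization mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not give the loess settings used, so the span, the tricube weighting, the local linear fit and the function name `loess_normalize` are all assumptions made here for illustration, not the author's procedure.

```python
import math

def loess_normalize(red, green, span=0.4):
    """Return loess-normalized log2 ratios (M) for two-color intensities.

    Hypothetical helper: fits a local weighted line of M on A around each
    spot and subtracts the fitted value. O(n^2 log n), for small examples only.
    """
    m = [math.log2(r / g) for r, g in zip(red, green)]        # log ratio
    a = [0.5 * math.log2(r * g) for r, g in zip(red, green)]  # mean log intensity
    n = len(m)
    k = min(n, max(3, math.ceil(span * n)))  # neighbours per local fit (assumed)
    normalized = []
    for i in range(n):
        nearest = sorted(range(n), key=lambda j: abs(a[j] - a[i]))[:k]
        h = max(abs(a[j] - a[i]) for j in nearest) or 1.0
        # Tricube weights: 1 at the target spot, 0 at the farthest neighbour.
        w = [(1 - (abs(a[j] - a[i]) / h) ** 3) ** 3 for j in nearest]
        sw = sum(w)
        x_bar = sum(wj * a[j] for wj, j in zip(w, nearest)) / sw
        y_bar = sum(wj * m[j] for wj, j in zip(w, nearest)) / sw
        sxx = sum(wj * (a[j] - x_bar) ** 2 for wj, j in zip(w, nearest))
        sxy = sum(wj * (a[j] - x_bar) * (m[j] - y_bar) for wj, j in zip(w, nearest))
        slope = sxy / sxx if sxx > 0 else 0.0  # fall back to a local mean
        fitted = y_bar + slope * (a[i] - x_bar)
        normalized.append(m[i] - fitted)
    return normalized
```

When the dye bias is an exactly linear function of intensity, the local fits reproduce it and the normalized log ratios come out at zero; real data would leave residual scatter around zero.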
Santoro, Stephanie L; Hashimoto, Sayaka; McKinney, Aimee; Mihalic Mosher, Theresa; Pyatt, Robert; Reshmi, Shalini C; Astbury, Caroline; Hickey, Scott E
Maternal uniparental disomy (UPD) 15 is one of the molecular causes of Prader-Willi syndrome (PWS), a multisystem disorder which presents with neonatal hypotonia and feeding difficulty. Current diagnostic algorithms differ regarding the use of SNP microarray to detect PWS. We retrospectively examined the frequency with which SNP microarray could identify regions of homozygosity (ROH) in patients with PWS. We determined that 7/12 (58%) patients with previously confirmed PWS by methylation analysis and microsatellite-positive UPD studies had ROH (>10 Mb) by SNP microarray. Additional assessment of 5,000 clinical microarrays, performed from 2013 to present, determined that only a single case of ROH for chromosome 15 was not caused by an imprinting disorder or identity by descent. We observed that ROH for chromosome 15 is rarely incidental and strongly associated with hypotonic infants having features of PWS. Although UPD microsatellite studies remain essential to definitively establish the presence of UPD, SNP microarray has important utility in the timely diagnostic algorithm for PWS. © 2017 S. Karger AG, Basel.
Saarela, Jeffery M.; Sokoloff, Paul C.; Gillespie, Lynn J.; Consaul, Laurie L.; Bull, Roger D.
Accurate identification of Arctic plant species is critical for understanding potential climate-induced changes in their diversity and distributions. To facilitate rapid identification we generated DNA barcodes for the core plastid barcode loci (rbcL and matK) for 490 vascular plant species, representing nearly half of the Canadian Arctic flora and 93% of the flora of the Canadian Arctic Archipelago. Sequence recovery was higher for rbcL than matK (93% and 81%), and rbcL was easier to recover than matK from herbarium specimens (92% and 77%). Distance-based and sequence-similarity analyses of combined rbcL + matK data discriminate 97% of genera, 56% of species, and 7% of infraspecific taxa. There is a significant negative correlation between the number of species sampled per genus and the percent species resolution per genus. We characterize barcode variation in detail in the ten largest genera sampled (Carex, Draba, Festuca, Pedicularis, Poa, Potentilla, Puccinellia, Ranunculus, Salix, and Saxifraga) in the context of their phylogenetic relationships and taxonomy. Discrimination with the core barcode loci in these genera ranges from 0% in Salix to 85% in Carex. Haplotype variation in multiple genera does not correspond to species boundaries, including Taraxacum, in which the distribution of plastid haplotypes among Arctic species is consistent with plastid variation documented in non-Arctic species. Introgression of Poa glauca plastid DNA into multiple individuals of P. hartzii is problematic for identification of these species with DNA barcodes. Of three supplementary barcode loci (psbA–trnH, psbK–psbI, atpF–atpH) collected for a subset of Poa and Puccinellia species, only atpF–atpH improved discrimination in Puccinellia, compared with rbcL and matK. Variation in matK in Vaccinium uliginosum and rbcL in Saxifraga oppositifolia corresponds to variation in other loci used to characterize the phylogeographic histories of these Arctic-alpine species.
Background: The selection of genes that discriminate disease classes from microarray data is widely used for the identification of diagnostic biomarkers. Although various gene selection methods are currently available and some of them have shown excellent performance, no single method can retain the best performance for all types of microarray datasets. It is desirable to use a comparative approach to find the best gene selection result after rigorous testing of different methodological strategies for a given microarray dataset. Results: FiGS is a web-based workbench that automatically compares various gene selection procedures and provides the optimal gene selection result for an input microarray dataset. FiGS builds up diverse gene selection procedures by aligning different feature selection techniques and classifiers. In addition to the highly reputed techniques, FiGS diversifies the gene selection procedures by incorporating gene clustering options in the feature selection step and different data pre-processing options in the classifier training step. All candidate gene selection procedures are evaluated by the .632+ bootstrap errors and listed with their classification accuracies and selected gene sets. FiGS runs on parallelized computing nodes that support heavy computations. FiGS is freely accessible at http://gexp.kaist.ac.kr/figs. Conclusion: FiGS is a web-based application that automates an extensive search for the optimized gene selection analysis for a microarray dataset in a parallel computing environment. FiGS will provide both an efficient and comprehensive means of acquiring optimal gene sets that discriminate disease states from microarray datasets.
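As an illustration of the .632+ bootstrap error named in this abstract, the following sketch combines a training (resubstitution) error, an out-of-bag bootstrap error and a no-information error rate. The abstract does not give the formula FiGS uses; this follows a commonly cited form of the estimator, with the out-of-bag error capped at the no-information rate as a simplifying assumption, and the function and parameter names are hypothetical.

```python
def bootstrap_632_plus(train_err, oob_err, no_info_rate):
    """Sketch of a .632+ style error estimate (assumed form, not FiGS code).

    train_err    -- error on the data used for fitting (resubstitution error)
    oob_err      -- mean error on samples left out of each bootstrap draw
    no_info_rate -- error expected if predictors and labels were independent
    """
    oob = min(oob_err, no_info_rate)  # cap at the no-information rate
    if oob > train_err and no_info_rate > train_err:
        # Relative overfitting rate, between 0 (none) and 1 (maximal).
        r = (oob - train_err) / (no_info_rate - train_err)
    else:
        r = 0.0
    weight = 0.632 / (1 - 0.368 * r)  # 0.632 with no overfitting, 1 at r = 1
    return (1 - weight) * train_err + weight * oob
```

With no overfitting the weight stays at 0.632 and the result reduces to the ordinary .632 estimate; as overfitting grows, the estimate moves toward the out-of-bag error.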
Hayward, T J; Hong, B; Vyas, K N; Palfreyman, J J; Cooper, J F K; Jiang, Z; Llandro, J; Mitrelias, T; Bland, J A C; Barnes, C H W; Jeong, J R
We present proof-of-principle experiments and simulations that demonstrate a new biological assay technology in which microscopic tags carrying multi-bit magnetic codes are used to label probe biomolecules. It is demonstrated that these 'micro-barcode tags' can be encoded, transported using micro-fluidics and are compatible with surface chemistry. We also present simulations and experimental results which suggest the feasibility of decoding the micro-barcode tags using magnetoresistive sensors. Together, these results demonstrate substantial progress towards meeting the critical requirements of a magnetically encoded, high-throughput and portable biological assay platform. We also show that an extension of our technology could potentially be used to label libraries consisting of ∼10^4 distinct probe molecules, and could therefore have a strong impact on mainstream medical diagnostics.
This article outlines the development of a mobile application for the Ryerson University Library. The application provides for ISBN barcode scanning that results in a lookup of library copies and services for the book scanned, as well as QR code scanning. Two versions of the application were developed, one for iOS and one for Android. The article includes some details on the free packages used for barcode scanning functionality. Source code for the Ryerson iOS and Android applications is freely available, and instructions are provided on customizing the Ryerson application for use in other library environments. Some statistics on the number of downloads of the Ryerson mobile app by users are included.
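A scanned ISBN would typically be checked before any catalogue lookup. The sketch below validates the standard ISBN-13 check digit; it is written in Python purely for illustration and is not taken from the Ryerson iOS or Android source, and the function name is hypothetical.

```python
def is_valid_isbn13(code: str) -> bool:
    """Check the ISBN-13 check digit of a scanned or typed code."""
    cleaned = code.replace("-", "").replace(" ", "")
    if len(cleaned) != 13 or not (cleaned.isascii() and cleaned.isdigit()):
        return False
    # Digits are weighted 1, 3, 1, 3, ... and the total must be a multiple of 10.
    total = sum(int(d) * (3 if i % 2 else 1) for i, d in enumerate(cleaned))
    return total % 10 == 0
```

For example, `is_valid_isbn13("978-0-306-40615-7")` passes the check, while changing the final digit makes it fail.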
Huang, Zuhao; Tu, Feiyun
The avian genera Calidris and Tringa are the largest of the widespread family Scolopacidae. The phylogeny of members of the two genera is still a matter of controversy. Mitochondrial cytochrome c oxidase subunit I (COI) can serve as a fast and accurate marker for the identification and phylogeny of animal species. In this study, we analyzed the COI barcodes of thirty-one species of the two genera. All the species had distinct COI sequences. Two hundred and twenty-one variable sites were identified. Kimura two-parameter distances were calculated between barcodes. Neighbor-joining and maximum likelihood methods were used to construct phylogenetic trees. All the species could be discriminated by their distinct clades in the phylogenetic trees. The phylogenetic trees grouped the species of Calidris and Tringa into separate monophyletic clades. COI data showed a well-supported phylogeny for Calidris and Tringa species.
Elizabeth L Clare
DNA barcoding using the cytochrome c oxidase subunit 1 gene (COI) is frequently employed as an efficient method of species identification in animal life and may also be used to estimate species richness, particularly in understudied faunas. Despite numerous past demonstrations of the efficiency of this technique, few studies have attempted to employ DNA barcoding methodologies on a large geographic scale, particularly within tropical regions. In this study we survey current and potential species diversity using DNA barcodes with a collection of more than 9000 individuals from 163 species of Neotropical bats (order Chiroptera). This represents one of the largest surveys to employ this strategy on any animal group and is certainly the largest to date for land vertebrates. Our analysis documents the utility of this tool over great geographic distances and across extraordinarily diverse habitats. Among the 163 included species 98.8% possessed distinct sets of COI haplotypes making them easily recognizable at this locus. We detected only a single case of shared haplotypes. Intraspecific diversity in the region was high among currently recognized species (mean of 1.38%, range 0-11.79%) with respect to birds, though comparable to other bat assemblages. In 44 of 163 cases, well-supported, distinct intraspecific lineages were identified which may suggest the presence of cryptic species though mean and maximum intraspecific divergence were not good predictors of their presence. In all cases, intraspecific lineages require additional investigation using complementary molecular techniques and additional characters such as morphology and acoustic data. Our analysis provides strong support for the continued assembly of DNA barcoding libraries and ongoing taxonomic investigation of bats.
Wang, Pengfei; Tian, Cheng; Li, Xiang; Mao, Chengde
Barcode-like (BC) nanopatterns from programmed self-assembly of nucleic acids (DNA and RNA) are reported. BC nanostructures are generated by the introduction of open spaces at selected sites to an otherwise closely packed, plain, rectangle nucleic acid nanostructure. This strategy is applied to nanostructures assembled from both origami approach and single stranded tile approach. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Five different loci (18S, UPA, rbcL, ITS and tufA) were tested for their use as deoxyribonucleic acid (DNA) barcodes in this study. Although the UPA primers were designed to amplify all phototrophic algae and cyanobacteria, UPA and 18S did not amplify at all for the genus Chlorella while ITS1, ITS2 rDNA and rbcL markers ...
Wilkins, N.; Rodríguez, Á. E.
Zooplankton is composed of animals that drift within the water column. The study of zooplankton biodiversity and distribution is crucial to understand oceanic ecosystems and anticipate the effects of climate change. In this study our focus is on ichthyoplankton (fish eggs and larvae). Our aim is to employ molecular genetic techniques such as DNA barcoding to begin a detailed characterization of ichthyoplankton diversity, abundance and community structure in the Hampton Roads Bay Estuary (HRBE). A sampling of zooplankton was performed on June 19, 2015. Samples were taken in triplicate with a 0.5 m, 200 µm mesh net at two stations: inner shore in the mouth of Jones Creek and 5 miles off Hampton in the lower part of Chesapeake Bay. Physical parameters (dissolved oxygen, salinity, temperature and water transparency) were measured simultaneously. Species were identified by DNA barcoding using the mitochondrial DNA (mtDNA) Cytochrome Oxidase 1 (CO1) gene. Fish eggs of Opisthonema oglinum (Atlantic thread herring) were identified at the offshore station, while eggs of Anchoa mitchilli were found at both stations. Both species are common to the area, and species composition differed between stations as observed. O. oglinum eggs were found only offshore, which is the species' reported habitat, whereas A. mitchilli eggs were found at both stations, consistent with the species' known wider salinity tolerance. This work indicates that mtDNA-CO1 barcoding is suitable for identifying ichthyoplankton to the species level and helps validate DNA barcoding as a faster taxonomic approach. The long-term objective of this project is to provide taxonomic composition and biodiversity assessment of ichthyoplankton in HRBE. These data will be a reference for broad monitoring programs and for a better understanding and management of ecologically and commercially important species in the HRBE. Monthly samplings will be performed for a year beginning September 2015.
Sudhindra Mahoorkar; Anoop Jain
Over the years, various denture marking systems have been reported in the literature for personal identification. They have been broadly divided into surface marking and inclusion methods. In this technique, patient's unique identification number and barcode printed in the patient's Aadhaar card issued by Unique Identification Authority of India (UIDAI) are used as denture markers. This article describes a simple, quick, and economical method for identification of individual.
Blagoev, Gergin A; deWaard, Jeremy R; Ratnasingham, Sujeevan; deWaard, Stephanie L; Lu, Liuqiong; Robertson, James; Telfer, Angela C; Hebert, Paul D N
Approximately 1460 species of spiders have been reported from Canada, 3% of the global fauna. This study provides a DNA barcode reference library for 1018 of these species based upon the analysis of more than 30,000 specimens. The sequence results show a clear barcode gap in most cases with a mean intraspecific divergence of 0.78% vs. a minimum nearest-neighbour (NN) distance averaging 7.85%. The sequences were assigned to 1359 Barcode index numbers (BINs) with 1344 of these BINs composed of specimens belonging to a single currently recognized species. There was a perfect correspondence between BIN membership and a known species in 795 cases, while another 197 species were assigned to two or more BINs (556 in total). A few other species (26) were involved in BIN merges or in a combination of merges and splits. There was only a weak relationship between the number of specimens analysed for a species and its BIN count. However, three species were clear outliers with their specimens being placed in 11-22 BINs. Although all BIN splits need further study to clarify the taxonomic status of the entities involved, DNA barcodes discriminated 98% of the 1018 species. The present survey conservatively revealed 16 species new to science, 52 species new to Canada and major range extensions for 426 species. However, if most BIN splits detected in this study reflect cryptic taxa, the true species count for Canadian spiders could be 30-50% higher than currently recognized. © 2015 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
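The barcode gap described in this abstract compares mean intraspecific divergence with the distance from each species to its nearest neighbour. The sketch below computes both summaries from a species label list and a symmetric pairwise distance matrix; the function name, the input layout and the toy numbers in the usage note are assumptions for illustration, not the authors' pipeline or data.

```python
from itertools import combinations
from statistics import mean

def barcode_gap(species, dist):
    """Return (mean intraspecific distance, mean minimum nearest-neighbour distance).

    species -- one species label per specimen
    dist    -- symmetric matrix (list of lists) of pairwise distances
    At least one species needs two or more specimens, and at least two
    species must be present; otherwise statistics.mean raises StatisticsError.
    """
    intra = []
    nearest = {}  # species -> smallest distance to any other species
    for i, j in combinations(range(len(species)), 2):
        d = dist[i][j]
        if species[i] == species[j]:
            intra.append(d)
        else:
            for s in (species[i], species[j]):
                nearest[s] = min(nearest.get(s, d), d)
    return mean(intra), mean(nearest.values())
```

As a toy usage example, five specimens from three hypothetical species with within-species distances of 0.01 and 0.02 and between-species distances of 0.07 or more give a mean intraspecific distance of 0.015 against a mean nearest-neighbour distance of about 0.077, which is the kind of separation a barcode gap refers to.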
Mocellin, Simone; Rossi, Carlo Riccardo
The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
The main aim of this master's thesis was the simultaneous detection of four selected plant viruses (Apple mosaic virus, Plum pox virus, Prunus necrotic ringspot virus and Prune dwarf virus) by microarrays. The intermediate step in the detection process was the optimization of multiplex polymerase chain reaction (PCR).
Oct 20, 2014 ... the advent of DNA microarray techniques (Lee et al. 2007). ... atoms of ribose to form a bicyclic ribosyl structure. It is the .... 532 nm and emission at 570 nm. The signal ..... sis and validation using real-time PCR. Nucleic Acids ...
Hybridization of labeled cDNA to microarrays is an intuitively simple and a vastly underestimated process. If it is not performed, optimized, and standardized with the same attention to detail as e.g., RNA amplification, information may be overlooked or even lost. Careful balancing of the amount ...
Barnard, Betsy; Sussman, Michael; BonDurant, Sandra Splinter; Nienhuis, James; Krysan, Patrick
We have developed and optimized the necessary laboratory materials to make DNA microarray technology accessible to all high school students at a fraction of both cost and data size. The primary component is a DNA chip/array that students "print" by hand and then analyze using research tools that have been adapted for classroom use. The…
Thygesen, Helene H.; Zwinderman, Aeilko H.
Background: When DNA microarray data are used for gene clustering, genotype/phenotype correlation studies, or tissue classification the signal intensities are usually transformed and normalized in several steps in order to improve comparability and signal/noise ratio. These steps may include
Yáñez-Rivera, Beatriz; Carrera-Parra, Luis Fernando
Abstract The species of the genus Notopygos Grube, 1855 are characterized by an ovate body, a prominent caruncle with three lobes, dendritic branchiae, and double dorsal cirri. Twenty-two species belonging to Notopygos have been described, mostly from the Indo-Pacific region. In America, few species are frequently recorded: Notopygos crinita Grube, 1855 from St. Helena Island (Atlantic) and Notopygos ornata Grube and Ørsted in Grube 1857 from Costa Rica (Pacific). Notopygos crinita is a widely distributed species in the Western Atlantic with additional reports in the Mediterranean Sea (as a questionable alien species) and in the Pacific Ocean. However, only the genus features have been considered, consequently some records could be misidentifications. During a revision of materials from collections and the barcode project, ‘Mexican Barcode of Life, MEXBOL’, we found specimens of Notopygos megalops and an undescribed species from reef zones in the Caribbean; the former had been considered a junior synonym of Notopygos crinita. Herein, Notopygos megalops is reestablished and Notopygos caribea sp. n. is described. A morphological and DNA barcode approach was used to explain the records of Notopygos ornata in the Atlantic and to show the differences with the new species, since both species share features such as complex pigmentation patterns, and circular projections in the median lobe of the caruncle. PMID:23459182
Molecular methods, such as DNA barcoding, have the potential to enhance biomonitoring programs worldwide. Altering routinely used sample preservation methods to protect DNA from degradation may pose a potential impediment to application of DNA barcoding and metagenomics for biom...
Aquilino, Sean V L; Tango, Jazzlyn M; Fontanilla, Ian K C; Pagulayan, Roberto C; Basiao, Zubaida U; Ong, Perry S; Quilang, Jonas P
This study represents the first molecular survey of the ichthyofauna of Taal Lake and the first DNA barcoding attempt in Philippine fishes. Taal Lake, the third largest lake in the Philippines, is considered a very important fisheries resource and is home to the world's only freshwater sardine, Sardinella tawilis. However, overexploitation and introduction of exotic fishes have caused a massive decline in the diversity of native species as well as in overall productivity of the lake. In this study, 118 individuals of 23 native, endemic and introduced fishes of Taal Lake were barcoded using the partial DNA sequence of the mitochondrial cytochrome c oxidase subunit I (COI) gene. These species belong to 21 genera, 17 families and 9 orders. Divergence of sequences within and between species was determined using Kimura 2-parameter (K2P) distance model, and a neighbour-joining tree was generated with 1000 bootstrap replications using the K2P model. All COI sequences for each of the 23 species were clearly discriminated among genera. The average within species, within genus, within family and within order percent genetic divergence was 0.60%, 11.07%, 17.67% and 24.08%, respectively. Our results provide evidence that COI DNA barcodes are effective for the rapid and accurate identification of fishes and for identifying certain species that need further taxonomic investigation. © 2011 Blackwell Publishing Ltd.
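The Kimura 2-parameter (K2P) distance used in this abstract can be written as d = -1/2 ln((1 - 2P - Q) sqrt(1 - 2Q)), where P and Q are the proportions of sites showing transitions and transversions. The following minimal sketch applies that formula to two pre-aligned sequences; skipping gaps and ambiguity codes is an assumption made here, and the function name is hypothetical.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq_a: str, seq_b: str) -> float:
    """Kimura 2-parameter distance between two aligned DNA sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    transitions = transversions = compared = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes (assumed handling)
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    inside = (1 - 2 * p - q) * math.sqrt(1 - 2 * q) if 1 - 2 * q > 0 else 0.0
    if inside <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(inside)
```

Pairwise values from a function like this would form the distance matrix from which a neighbour-joining tree is built; tree construction itself is not shown.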
Peikon, Ian D; Kebschull, Justus M; Vagin, Vasily V; Ravens, Diana I; Sun, Yu-Chi; Brouzes, Eric; Corrêa, Ivan R; Bressan, Dario; Zador, Anthony M
The function of a neural circuit is determined by the details of its synaptic connections. At present, the only available method for determining a neural wiring diagram with single synapse precision-a 'connectome'-is based on imaging methods that are slow, labor-intensive and expensive. Here, we present SYNseq, a method for converting the connectome into a form that can exploit the speed and low cost of modern high-throughput DNA sequencing. In SYNseq, each neuron is labeled with a unique random nucleotide sequence-an RNA 'barcode'-which is targeted to the synapse using engineered proteins. Barcodes in pre- and postsynaptic neurons are then associated through protein-protein crosslinking across the synapse, extracted from the tissue, and joined into a form suitable for sequencing. Although our failure to develop an efficient barcode joining scheme precludes the widespread application of this approach, we expect that with further development SYNseq will enable tracing of complex circuits at high speed and low cost. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Nussinov, Ruth; Ma, Buyong; Tsai, Chung-Jung; Csermely, Peter
The cellular network is highly interconnected. Pathways merge and diverge. They proceed through shared proteins and may change directions. How are cellular pathways controlled and their directions decided, coded, and read? These questions become particularly acute when we consider that a small number of pathways, such as signaling pathways that regulate cell fates, cell proliferation, and cell death in development, are extensively exploited. This review focuses on these signaling questions from the structural standpoint and discusses the literature in this light. All co-occurring allosteric events (including posttranslational modifications, pathogen binding, and gain-of-function mutations) collectively tag the protein functional site with a unique barcode. The barcode shape is read by an interacting molecule, which transmits the signal. A conformational barcode provides an intracellular address label, which selectively favors binding to one partner and quenches binding to others, and, in this way, determines the pathway direction, and, eventually, the cell's response and fate. Copyright © 2013 Elsevier Ltd. All rights reserved.
Smurthwaite, Cameron A; Williams, Wesley; Fetsko, Alexandra; Abbadessa, Darin; Stolp, Zachary D; Reed, Connor W; Dharmawan, Andre; Wolkowicz, Roland
Fluorescent proteins, fluorescent dyes and fluorophores in general have revolutionized the field of molecular cell biology. In particular, the discovery of fluorescent proteins and their genes have enabled the engineering of protein fusions for localization, the analysis of transcriptional activation and translation of proteins of interest, or the general tracking of individual cells and cell populations. The use of fluorescent protein genes in combination with retroviral technology has further allowed the expression of these proteins in mammalian cells in a stable and reliable manner. Shown here is how one can utilize these genes to give cells within a population of cells their own biosignature. As the biosignature is achieved with retroviral technology, cells are barcoded 'indefinitely'. As such, they can be individually tracked within a mixture of barcoded cells and utilized in more complex biological applications. The tracking of distinct populations in a mixture of cells is ideal for multiplexed applications such as discovery of drugs against a multitude of targets or the activation profile of different promoters. The protocol describes how to elegantly develop and amplify barcoded mammalian cells with distinct genetic fluorescent markers, and how to use several markers at once or one marker at different intensities. Finally, the protocol describes how the cells can be further utilized in combination with cell-based assays to increase the power of analysis through multiplexing.
Yachie, Nozomu; Petsalaki, Evangelia; Mellor, Joseph C; Weile, Jochen; Jacob, Yves; Verby, Marta; Ozturk, Sedide B; Li, Siyang; Cote, Atina G; Mosca, Roberto; Knapp, Jennifer J; Ko, Minjeong; Yu, Analyn; Gebbia, Marinella; Sahni, Nidhi; Yi, Song; Tyagi, Tanya; Sheykhkarimli, Dayag; Roth, Jonathan F; Wong, Cassandra; Musa, Louai; Snider, Jamie; Liu, Yi-Chun; Yu, Haiyuan; Braun, Pascal; Stagljar, Igor; Hao, Tong; Calderwood, Michael A; Pelletier, Laurence; Aloy, Patrick; Hill, David E; Vidal, Marc; Roth, Frederick P
High-throughput binary protein interaction mapping is continuing to extend our understanding of cellular function and disease mechanisms. However, we remain one or two orders of magnitude away from a complete interaction map for humans and other major model organisms. Completion will require screening at substantially larger scales with many complementary assays, requiring further efficiency gains in proteome-scale interaction mapping. Here, we report Barcode Fusion Genetics-Yeast Two-Hybrid (BFG-Y2H), by which a full matrix of protein pairs can be screened in a single multiplexed strain pool. BFG-Y2H uses Cre recombination to fuse DNA barcodes from distinct plasmids, generating chimeric protein-pair barcodes that can be quantified via next-generation sequencing. We applied BFG-Y2H to four different matrices ranging in scale from ~25 K to 2.5 M protein pairs. The results show that BFG-Y2H increases the efficiency of protein matrix screening, with quality that is on par with state-of-the-art Y2H methods. © 2016 The Authors. Published under the terms of the CC BY 4.0 license.
In the present study, we investigated the effectiveness of DNA barcoding for characterizing honeybee pollen pellets, a food supplement widely used in human nutrition for its therapeutic properties. We collected pollen pellets using modified beehives placed in three zones within an alpine protected area (Grigna Settentrionale Regional Park, Italy). A DNA barcoding reference database, including rbcL and trnH-psbA sequences from 693 plant species (104 sequenced in this study), was assembled. The database was used to identify pollen collected from the hives. Fifty-two plant species were identified at the molecular level. Results suggested rbcL alone could not distinguish among congeneric plants; however, psbA-trnH identified most of the pollen samples at the species level. Substantial variability in pollen composition was observed between the highest elevation locality (Alpe Moconodeno), characterized by arid grasslands and a rocky substrate, and the other two sites (Cornisella and Ortanella) at lower altitudes. Pollen from Ortanella and Cornisella showed the presence of typical deciduous forest species; however, in samples collected at Ortanella, pollen of the invasive Lonicera japonica and the ornamental Pelargonium x hortorum was observed. Our results indicated pollen composition was largely influenced by local floristic biodiversity, plant phenology, and the presence of alien flowering species. Therefore, pollen molecular characterization based on DNA barcoding might prove useful to beekeepers in obtaining honeybee products with specific nutritional or therapeutic characteristics desired by the food market.
Hubert, Nicolas; Espiau, Benoit; Meyer, Christopher; Planes, Serge
Marine fishes exhibit spectacular phenotypic changes during their ontogeny, and the identification of their early stages is challenging due to the paucity of diagnostic morphological characters at the species level. Meanwhile, the importance of early life stages in dispersal and connectivity has recently experienced an increasing interest in conservation programmes for coral reef fishes. This study aims at assessing the effectiveness of DNA barcoding for the automated identification of coral reef fish larvae through large-scale ecosystemic sampling. Fish larvae were mainly collected using bongo nets and light traps around Moorea between September 2008 and August 2010 in 10 sites distributed in open waters. Fish larvae ranged from 2 to 100 mm of total length, with the most abundant individuals being <5 mm. Among the 505 individuals DNA barcoded, 373 larvae (i.e. 75%) were identified to the species level. A total of 106 species were detected, among which 11 corresponded to pelagic and bathypelagic species, while 95 corresponded to species observed at the adult stage on neighbouring reefs. This study highlights the benefits and pitfalls of using standardized molecular systems for species identification and illustrates the new possibilities enabled by DNA barcoding for future work on coral reef fish larval ecology. © 2014 John Wiley & Sons Ltd.
Sîrbu, Alina; Crane, Martin; Ruskin, Heather J
Microarray technologies have been the basis of numerous important findings regarding gene expression in the last few decades. Studies have generated large amounts of data describing various processes, which, due to the existence of public databases, are widely available for further analysis. Given their lower cost and higher maturity compared to newer sequencing technologies, these data continue to be produced, even though data quality has been the subject of some debate. However, given the large volume of data generated, integration can help overcome some issues related, e.g., to noise or reduced time resolution, while providing additional insight on features not directly addressed by sequencing methods. Here, we present an integration test case based on public Drosophila melanogaster datasets (gene expression, binding site affinities, known interactions). Using an evolutionary computation framework, we show how integration can enhance the ability to recover transcriptional gene regulatory networks from these data, as well as indicating which data types are more important for quantitative and qualitative network inference. Our results show a clear improvement in performance when multiple datasets are integrated, indicating that microarray data will remain a valuable and viable resource for some time to come.
Gresham Cathy R
Background: Modeling results from chicken microarray studies is challenging for researchers because little functional annotation is associated with these arrays. The Affymetrix GeneChip chicken genome array, one of the biggest arrays that serve as a key research tool for the study of chicken functional genomics, is among the few arrays that link gene products to Gene Ontology (GO). However, the GO annotation data presented by Affymetrix are incomplete; for example, they do not show references linked to manually annotated functions. In addition, there is no tool that lets microarray researchers directly retrieve functional annotations for their datasets from the annotated arrays. This costs researchers time in searching multiple GO databases for functional information. Results: We have improved the breadth of functional annotations of the gene products associated with probesets on the Affymetrix chicken genome array by 45% and the quality of annotation by 14%. We have also identified the most significant diseases and disorders, different types of genes, and known drug targets represented on the Affymetrix chicken genome array. To facilitate functional annotation of other arrays and microarray experimental datasets, we developed an Array GO Mapper (AGOM) tool to help researchers quickly retrieve corresponding functional information for their dataset. Conclusion: Results from this study will directly facilitate annotation of other chicken arrays and microarray experimental datasets. Researchers will be able to quickly model their microarray dataset into more reliable biological functional information by using the AGOM tool. The diseases, disorders, gene types and drug targets revealed in the study will allow researchers to learn more about how genes function in complex biological systems and may lead to new drug discovery and development of therapies. The GO annotation data generated will be available for public use via AgBase website and
Yu, Min; Jiao, Lichao; Guo, Juan; Wiedenhoeft, Alex C; He, Tuo; Jiang, Xiaomei; Yin, Yafang
ITS2+trnH-psbA was the best combination of DNA barcode to resolve the Dalbergia wood species studied. We demonstrate the feasibility of building a DNA barcode reference database using xylarium wood specimens. The increase in illegal logging and timber trade of CITES-listed tropical species necessitates the development of unambiguous identification methods at the species level. For these methods to be fully functional and deployable for law enforcement, they must work using wood or wood products. DNA barcoding of wood has been promoted as a promising tool for species identification; however, the main barrier to extensive application of DNA barcoding to wood is the lack of a comprehensive and reliable DNA reference library of barcodes from wood. In this study, xylarium wood specimens of nine Dalbergia species were selected from the Wood Collection of the Chinese Academy of Forestry and DNA was then extracted from them for further PCR amplification of eight potential DNA barcode sequences (ITS2, matK, trnL, trnH-psbA, trnV-trnM1, trnV-trnM2, trnC-petN, and trnS-trnG). The barcodes were tested singly and in combination for species-level discrimination ability by tree-based [neighbor-joining (NJ)] and distance-based (TaxonDNA) methods. We found that the discrimination ability of DNA barcodes in combination was higher than any single DNA marker among the Dalbergia species studied, with the best two-marker combination of ITS2+trnH-psbA analyzed with NJ trees performing the best (100% accuracy). These barcodes are relatively short regions (wood as the source material, a necessary factor to apply DNA barcoding to timber trade. The present results demonstrate the feasibility of using vouchered xylarium specimens to build DNA barcoding reference databases.
This paper presents microarray BASICA: an integrated image processing tool for background adjustment, segmentation, image compression, and analysis of cDNA microarray images. BASICA uses a fast Mann-Whitney test-based algorithm to segment cDNA microarray images, and performs postprocessing to eliminate the segmentation irregularities. The segmentation results, along with the foreground and background intensities obtained with the background adjustment, are then used for independent compression of the foreground and background. We introduce a new distortion measurement for cDNA microarray image compression and devise a coding scheme by modifying the embedded block coding with optimized truncation (EBCOT) algorithm (Taubman, 2000) to achieve optimal rate-distortion performance in lossy coding while still maintaining outstanding lossless compression performance. Experimental results show that the bit rate required to ensure sufficiently accurate gene expression measurement varies and depends on the quality of cDNA microarray images. For homogeneously hybridized cDNA microarray images, BASICA is able to provide, from a bit rate as low as 5 bpp, gene expression data that are 99% in agreement with those of the original 32 bpp images.
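The abstract above does not spell out how BASICA applies the Mann-Whitney test, so the following is only a minimal sketch of the underlying statistic, not the tool's algorithm. It assumes two hypothetical lists of pixel intensities (a candidate spot region and a background region) and computes the rank-sum U statistic in plain Python; the function name and the sample values are illustrative assumptions.

```python
from itertools import chain


def mann_whitney_u(sample_a, sample_b):
    """Return the U statistic of sample_a versus sample_b (ties get average ranks)."""
    pooled = sorted(chain(sample_a, sample_b))
    # Map each distinct value to the average of the 1-based ranks it occupies.
    rank_of = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j
    rank_sum_a = sum(rank_of[v] for v in sample_a)
    n_a = len(sample_a)
    return rank_sum_a - n_a * (n_a + 1) / 2


# Hypothetical pixel intensities, invented for illustration only.
candidate_pixels = [820, 790, 805, 840]
background_pixels = [110, 130, 125, 118]

u_stat = mann_whitney_u(candidate_pixels, background_pixels)
# U equals len(a) * len(b) when every candidate pixel is brighter than every background pixel.
fully_separated = u_stat == len(candidate_pixels) * len(background_pixels)
```

A U value close to the product of the two sample sizes indicates that the candidate pixels are consistently brighter than the background, which is the kind of evidence a rank-based segmentation step could use; turning U into a significance decision would additionally need a critical value or p-value, which this sketch leaves out.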
Background: Microarray data have a high dimension of variables and a small sample size. In microarray data analyses, two important issues are how to choose genes, which provide reliable and good prediction for disease status, and how to determine the final gene set that is best for classification. Associations among genetic markers mean one can exploit information redundancy to potentially reduce classification cost in terms of time and money. Results: To deal with redundant information and improve classification, we propose a gene selection method, Recursive Feature Addition, which combines supervised learning and statistical similarity measures. To determine the final optimal gene set for prediction and classification, we propose an algorithm, Lagging Prediction Peephole Optimization. By using six benchmark microarray gene expression data sets, we compared Recursive Feature Addition with recently developed gene selection methods: Support Vector Machine Recursive Feature Elimination, Leave-One-Out Calculation Sequential Forward Selection and several others. Conclusions: On average, with the use of popular learning machines including Nearest Mean Scaled Classifier, Support Vector Machine, Naive Bayes Classifier and Random Forest, Recursive Feature Addition outperformed other methods. Our studies also showed that Lagging Prediction Peephole Optimization is superior to random strategy; Recursive Feature Addition with Lagging Prediction Peephole Optimization obtained better testing accuracies than the gene selection method varSelRF.
Yu, Hong; Kong, Lingfeng; Li, Qi
In this study, we evaluated the efficacy of 12 mitochondrial protein-coding genes from 238 mitochondrial genomes of 140 molluscan species as potential DNA barcodes for mollusks. Three barcoding methods (distance, monophyly and character-based methods) were used in species identification. The species recovery rates based on genetic distances for the 12 genes ranged from 70.83 to 83.33%. There were no significant differences in intra- or interspecific variability among the 12 genes. The monophyly and character-based methods provided higher resolution than the distance-based method in species delimitation. The character-based method showed some advantages, especially in closely related taxa. The results suggested that, besides the standard COI barcode, the other 11 mitochondrial protein-coding genes could also potentially be used as molecular diagnostics for molluscan species discrimination. Our results also showed that combining mitochondrial genes did not enhance the efficacy of species identification and that a single mitochondrial gene would be fully competent.
Jennifer A Hipp
Background: Conventional tissue microarrays (TMAs) consist of cores of tissue inserted into a recipient paraffin block such that a tissue section on a single glass slide can contain numerous patient samples in a spatially structured pattern. Scanning TMAs into digital slides for subsequent analysis by computer-aided diagnostic (CAD) algorithms offers the possibility of evaluating candidate algorithms against a near-complete repertoire of variable disease morphologies. This parallel interrogation approach simplifies the evaluation, validation, and comparison of such candidate algorithms. Two recently developed digital tools, digital core (dCORE) and image microarray maker (iMAM), enable the capture of uniformly sized and resolution-matched images, with these representing key morphologic features and fields of view, aggregated into a single monolithic digital image file in an array format, which we define as an image microarray (IMA). We further define the TMA-IMA construct as IMA-based images derived from whole slide images of TMAs themselves. Methods: Here we describe the first combined use of the previously described dCORE and iMAM tools, toward the goal of generating a higher-order image construct, with multiple TMA cores from multiple distinct conventional TMAs assembled as a single digital image montage. This image construct served as the basis for a massively parallel image analysis exercise, based on the use of the previously described spatially invariant vector quantization (SIVQ) algorithm. Results: Multicase, multifield TMA-IMAs of follicular lymphoma and follicular hyperplasia were separately rendered, using the aforementioned tools. Each of these two IMAs contained a distinct spectrum of morphologic heterogeneity with respect to both tingible body macrophage (TBM) appearance and apoptotic body morphology. SIVQ-based pattern matching, with ring vectors selected to screen for either tingible body macrophages or apoptotic
Chen, Shilin; Yao, Hui; Han, Jianping; Liu, Chang; Song, Jingyuan; Shi, Linchun; Zhu, Yingjie; Ma, Xinye; Gao, Ting; Pang, Xiaohui; Luo, Kun; Li, Ying; Li, Xiwen; Jia, Xiaocheng; Lin, Yulin; Leon, Christine
The plant working group of the Consortium for the Barcode of Life recommended the two-locus combination of rbcL+matK as the plant barcode, yet the combination was shown to successfully discriminate among 907 samples from 550 species at the species level with a probability of 72%. The group admits that the two-locus barcode is far from perfect due to the low identification rate, and the search is not over. Here, we compared seven candidate DNA barcodes (psbA-trnH, matK, rbcL, rpoC1, ycf5, ITS2, and ITS) from medicinal plant species. Our ranking criteria included PCR amplification efficiency, differential intra- and inter-specific divergences, and the DNA barcoding gap. Our data suggest that the second internal transcribed spacer (ITS2) of nuclear ribosomal DNA represents the most suitable region for DNA barcoding applications. Furthermore, we tested the discrimination ability of ITS2 in more than 6600 plant samples belonging to 4800 species from 753 distinct genera and found that the rate of successful identification with the ITS2 was 92.7% at the species level. The ITS2 region can be potentially used as a standard DNA barcode to identify medicinal plants and their closely related species. We also propose that ITS2 can serve as a novel universal barcode for the identification of a broader range of plant taxa.
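The "DNA barcoding gap" criterion mentioned in the abstract above compares divergence within species against divergence between species. As a minimal sketch only, and not the authors' pipeline, the check can be illustrated with uncorrected p-distances on pre-aligned sequences; the function names, the toy species labels and the toy sequences below are all hypothetical.

```python
from itertools import combinations


def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(a != b for a, b in zip(seq_a, seq_b)) / len(seq_a)


def barcoding_gap(records):
    """Return (max intraspecific distance, min interspecific distance).

    records is a list of (species, aligned_sequence) pairs and must contain
    at least one within-species pair and one between-species pair.
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        (intra if sp1 == sp2 else inter).append(p_distance(s1, s2))
    return max(intra), min(inter)


# Toy aligned sequences, invented for illustration only.
toy_records = [
    ("species_A", "ACGTACGTAC"),
    ("species_A", "ACGTACGTAT"),
    ("species_B", "ACGAACCTAG"),
    ("species_B", "ACGAACCTAG"),
]

max_intra, min_inter = barcoding_gap(toy_records)
has_gap = min_inter > max_intra  # a gap exists when the two ranges do not overlap
```

In this toy data the largest within-species distance is 0.1 and the smallest between-species distance is 0.3, so a gap exists. Published studies typically use model-corrected distances (for example Kimura 2-parameter) and far longer alignments; the p-distance here is a simplifying assumption.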
Melilotus, an annual or biennial herb, belongs to the tribe Trifolieae (Leguminosae) and consists of 19 species. As an important green manure crop, diverse Melilotus species have different values as feed and medicine. To identify different Melilotus species, we examined the efficiency of five candidate regions as barcodes, including the internal transcribed spacer (ITS) and two chloroplast loci, rbcL and matK, and two non-coding loci, trnH-psbA and trnL-F. In total, 198 individuals from 98 accessions representing 18 Melilotus species were sequenced for these five potential barcodes. Based on inter-specific divergence, we analysed sequences and confirmed that each candidate barcode was able to identify some of the 18 species. The resolution of a single barcode and its combinations ranged from 33.33% to 88.89%. Analysis of pairwise distances showed that matK+rbcL+trnL-F+trnH-psbA+ITS (MRTPI) had the greatest value and rbcL the least. Barcode gap values and similarity value analyses confirmed these trends. The results indicated that the ITS region, successfully identifying 13 of 18 species, was the most appropriate single barcode and that the combination of all five potential barcodes identified 16 of the 18 species. We conclude that MRTPI is the most effective tool for Melilotus species identification. Taking full advantage of the barcode system, a clear taxonomic relationship can be applied to identify Melilotus species and enhance their practical production.
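The resolution percentages quoted in the abstract above follow from simple counts over the 18 species. The short sketch below reproduces that arithmetic; the function name is hypothetical, and the count of 6 species behind the 33.33% lower bound is an inference from the percentage, not a figure stated in the abstract.

```python
def resolution_rate(identified, total):
    """Percentage of species resolved, rounded to two decimals."""
    return round(100 * identified / total, 2)


TOTAL_SPECIES = 18

its_alone = resolution_rate(13, TOTAL_SPECIES)    # ITS identified 13 of 18 species
all_five = resolution_rate(16, TOTAL_SPECIES)     # five-locus combination identified 16 of 18
lower_bound = resolution_rate(6, TOTAL_SPECIES)   # 6 of 18 is inferred from the reported 33.33%
```

Under these counts the ITS region alone resolves about 72% of the species, between the reported extremes of 33.33% and 88.89%.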
DNA barcodes have proven invaluable in identifying and distinguishing insect pests, for example for determining the provenance of exotic invasives, but relatively few insect natural enemies have been barcoded. We used Folmer et al.’s universal invertebrate primers (1994), and those designed by Heber...
Hausmann, A.; Godfray, H.C.J.; Huemer, J.; Mutane, M.; Rougerie, R.; Nieukerken, van E.J.; Ratnasingham, S.; Hebert, P.D.N.
Background: The geometrid moths of Europe are one of the best investigated insect groups in traditional taxonomy making them an ideal model group to test the accuracy of the Barcode Index Number (BIN) system of BOLD (Barcode of Life Datasystems), a method that supports automated, rapid species
Six DNA regions were evaluated in a multi-national, multi-laboratory consortium as potential DNA barcodes for Fungi, the second largest kingdom of eukaryotic life. The region of the mitochondrial cytochrome c oxidase subunit 1 used as the animal barcode was excluded as a potential marker, because it...
Moftah, Marie; Abdel Aziz, Sayeda H.; Elramah, Sara; Favereaux, Alexandre
The identification of species constitutes the first basic step in phylogenetic studies, biodiversity monitoring and conservation. DNA barcoding, i.e. the sequencing of a short standardized region of DNA, has been proposed as a new tool for animal species identification. The present study provides an update on the shark species composition in the Egyptian Mediterranean waters off Alexandria, since the latest study to date was performed 30 years ago. DNA barcoding was used in addition to classical taxonomical methodologies. Thus, 51 specimens were DNA barcoded for a 667 bp region of the mitochondrial COI gene. Although DNA barcoding aims at developing species identification systems, some phylogenetic signals were apparent in the data. In the neighbor-joining tree, 8 major clusters were apparent, each of them containing individuals belonging to the same species, and most with 100% bootstrap value. This study is the first to our knowledge to use DNA barcoding of the mitochondrial COI gene in order to confirm the presence of the species Squalus acanthias, Oxynotus centrina, Squatina squatina, Scyliorhinus canicula, Scyliorhinus stellaris, Mustelus mustelus, Mustelus punctulatus and Carcharhinus altimus in the Egyptian Mediterranean waters. Finally, our study is the starting point of a new barcoding database concerning shark composition in the Egyptian Mediterranean waters (Barcoding of Egyptian Mediterranean Sharks [BEMS], http://www.boldsystems.org/views/projectlist.php?Barcoding%20Fish%20%28FishBOL%29). PMID:22087242
Stielow, J B; Lévesque, C A; Seifert, K A; Meyer, W; Irinyi, L; Smits, D; Renfurm, R; Verkley, G J M; Groenewald, M; Chaduli, D; Lomascolo, A; Welti, S; Lesage-Meessen, L; Favel, A; Al-Hatmi, A M S; Damm, U; Yilmaz, N.; Houbraken, J.; Lombard, L.; Quaedvlieg, W.; Binder, M.; Vaas, L.A.I.; Vu, D.; Yurkov, A.; Begerow, D.; Roehl, O.; Guerreiro, M.; Fonseca, A.; Samerpitak, K.; Diepeningen, A.D. van; Dolatabadi, S.; Moreno, L.F.; Casaregola, S.; Mallet, S.; Jacques, N.; Roscini, L.; Egidi, E.; Bizet, C.; Garcia-Hermoso, D.; Martín, M.P.; Deng, S.; Groenewald, J.Z.; Boekhout, T.; Beer, Z.W. de; Barnes, I.; Duong, T.A.; Wingfield, M.J.; Hoog, G.S. de; Crous, P.W.; Lewis, C.T.; Hambleton, S.; Moussa, T.A.A.; Al-Zahrani, H.S.; Almaghrabi, O.A.; Louis-Seize, G.; Assabgui, R.; McCormick, W.; Omer, G.; Dukik, K.; Cardinali, G.; Eberhardt, U.; Vries, M. de; Robert, V.
The aim of this study was to assess potential candidate gene regions and corresponding universal primer pairs as secondary DNA barcodes for the fungal kingdom, additional to ITS rDNA as primary barcode. Amplification efficiencies of 14 (partially) universal primer pairs targeting eight genetic
Stielow, J.B.; Lévesque, C.A.; Seifert, K.A.; Meyer, W.; Irinyi, L.; Smits, D.; Renfurm, R.; Verkley, G.J.M.; Groenewald, M.; Chaduli, D.; Lomascolo, A.; Welti, S.; Lesage-Meessen, L.; Favel, A.; Al-Hatmi, A.M.S.; Damm, U.; Yilmaz, N.; Houbraken, J.; Lombard, L.; Quaedvlieg, W.; Binder, M.; Vaas, L.A.I.; Vu, D.; Yurkov, A.; Begerow, D.; Roehl, O.; Guerreiro, M.; Fonseca, A.; Samerpitak, K.; Diepeningen, van A.D.; Dolatabadi, S.; Moreno, L.F.; Casaregola, S.; Mallet, S.; Jacques, N.; Roscini, L.; Egidi, E.; Bizet, C.; Garcia-Hermoso, D.; Martin, M.P.; Deng, S.; Groenewald, J.Z.; Boekhout, T.; Beer, de Z.W.; Barnes, I.; Duong, T.A.; Wingfield, M.J.; Hoog, de G.S.; Crous, P.W.; Lewis, C.T.; Hambleton, S.; Moussa, T.A.A.; Al-Zahrani, H.S.; Almaghrabi, O.A.; Louis-Seize, G.; Assabgui, R.; McCormick, W.; Omer, G.; Dukik, K.; Cardinali, G.; Eberhardt, U.; Vries, de M.; Robert, V.
The aim of this study was to assess potential candidate gene regions and corresponding universal primer pairs as secondary DNA barcodes for the fungal kingdom, additional to ITS rDNA as primary barcode. Amplification efficiencies of 14 (partially) universal primer pairs targeting eight genetic markers
Schoch, C.L.; Seifert, K.A.; Huhndorf, S.; Robert, V.; Spouge, J.L.; Levesque, C.A.; Chen, W.; Crous, P.W.; Boekhout, T.; Damm, U.; Hoog, de G.S.; Eberhardt, U.; Groenewald, J.Z.; Groenewald, M.; Hagen, F.; Houbraken, J.; Quaedvlieg, W.; Stielow, B.; Vu, T.D.; Walther, G.
Six DNA regions were evaluated as potential DNA barcodes for Fungi, the second largest kingdom of eukaryotic life, by a multinational, multilaboratory consortium. The region of the mitochondrial cytochrome c oxidase subunit 1 used as the animal barcode was excluded as a potential marker, because it
Tekin, Ender; Coughlan, James M
While there are many barcode readers available for identifying products in a supermarket or at home on mobile phones (e.g., Red Laser iPhone app), such readers are inaccessible to blind or visually impaired persons because of their reliance on visual feedback from the user to center the barcode in the camera's field of view. We describe a mobile phone application that guides a visually impaired user to the barcode on a package in real-time using the phone's built-in video camera. Once the barcode is located by the system, the user is prompted with audio signals to bring the camera closer to the barcode until it can be resolved by the camera, which is then decoded and the corresponding product information read aloud using text-to-speech. Experiments with a blind volunteer demonstrate proof of concept of our system, which allowed the volunteer to locate barcodes which were then translated to product information that was announced to the user. We successfully tested a series of common products, as well as user-generated barcodes labeling household items that may not come with barcodes.
Fournier-Wirth, C; Coste, J
Until the late 1990s, mandatory blood screening for transmissible infectious agents depended entirely on antigen/antibody-based detection assays. The recent emergence of Nucleic acid Amplification Technologies (NAT) has revolutionised viral diagnosis, not only by increasing the level of sensitivity but also by facilitating the detection of several viruses in parallel by multiplexing specific primers. In more complex biological situations, when a broad spectrum of pathogens must be screened, the limitations of these first generation technologies became apparent. High throughput systems, such as DNA Arrays, permit a conceptually new approach. These miniaturised micro systems allow the detection of hundreds of different targets simultaneously, inducing a dramatic decrease in reagent consumption, a reduction in the number of confirmation tests and a simplification of data interpretation. However, the systems currently available require additional instrumentation and reagents for sample preparation and target amplification prior to detection on the DNA array. A major challenge in the area of DNA detection is the development of methods that do not rely on target amplification systems. Likewise, the advances of protein microarrays have lagged because of poor stability of proteins, complex coupling chemistry and weak detection signals. Emerging technologies like Biosensors and nano-particle based DNA or Protein Bio-Barcode Amplification Assays are promising diagnostic tools for a wide range of clinical applications, including blood donation screening.
Chen, Shi-Lin; Yao, Hui; Han, Jian-Ping; Xin, Tian-Yi; Pang, Xiao-Hui; Shi, Lin-Chun; Luo, Kun; Song, Jing-Yuan; Hou, Dian-Yun; Shi, Shang-Mei; Qian, Zhong-Zhi
Since the research of molecular identification of Chinese Materia Medica (CMM) using DNA barcode is rapidly developing and popularizing, the principle of this method is approved to be listed in the Supplement of the Pharmacopoeia of the People's Republic of China. Based on the study on comprehensive samples, the DNA barcoding systems have been established to identify CMM, i.e. ITS2 as a core barcode and psbA-trnH as a complementary locus for identification of planta medica, and COI as a core barcode and ITS2 as a complementary locus for identification of animal medica. This article introduced the principle of molecular identification of CMM using DNA barcoding and its drafting instructions. Furthermore, its application perspective was discussed.
Gaikwad, Swapnil; Warudkar, Ashwin; Shouche, Yogesh
DNA barcoding has emerged as an additional tool for taxonomy and as an aid in overcoming taxonomic impediments. Due to their extensive morphological variation, spiders are taxonomically challenging. Therefore, attempts are being made all over the world to DNA barcode species of spiders. Until now, no attempts had been made to DNA barcode Indian spiders despite their rich diversity. We have generated DNA barcodes for 60 species (n = 112) of spiders for the first time from India. Although only 17 species were correctly identified at the species level, DNA barcoding correctly discriminated 99% of the species studied here. We have also found high intraspecies nucleotide divergence in Plexippus paykulli, suggesting cryptic diversity that needs to be studied in detail. Our study also showed non-specific amplification of the Cytochrome Oxidase I (COI) gene of the endosymbiont bacterium Wolbachia. However, these cases are very rare and could be resolved by the use of modified or group-specific primers.
We present a DNA barcoding study of Neotropical odonates from the Upper Plata basin, Brazil. A total of 38 species were collected in a transition region of "Cerrado" and Atlantic Forest, both regarded as biological hotspots, and 130 cytochrome c oxidase subunit I (COI) barcodes were generated for the collected specimens. The distinct gap between intraspecific (0-2%) and interspecific variation (15% and above) in COI, and the resulting separation of Barcode Index Numbers (BINs), allowed for successful identification of specimens in 94% of cases. The 6% fail rate was due to a shared BIN between two separate nominal species. DNA barcoding, based on COI, thus seems to be a reliable and efficient tool for identifying Neotropical odonate specimens down to the species level. These results underscore the utility of DNA barcoding to aid specimen identification in diverse biological hotspots, areas that require urgent action regarding taxonomic surveys and biodiversity conservation.
Xenitidis, P; Seimenis, I; Kakolyris, S; Adamopoulos, A
High-throughput technology like microarrays is widely used in the inference of gene regulatory networks (GRNs). We focused on time series data since we are interested in the dynamics of GRNs and the identification of dynamic networks. We evaluated the amount of information that exists in artificial time series microarray data and the ability of an inference process to produce accurate models based on them. We used dynamic artificial gene regulatory networks in order to create artificial microarray data. Key features that characterize microarray data such as the time separation of directly triggered genes, the percentage of directly triggered genes and the triggering function type were altered in order to reveal the limits that are imposed by the nature of microarray data on the inference process. We examined the effect of various factors on the inference performance such as the network size, the presence of noise in microarray data, and the network sparseness. We used a system theory approach and examined the relationship between the pole placement of the inferred system and the inference performance. We examined the relationship between the inference performance in the time domain and the true system parameter identification. Simulation results indicated that time separation and the percentage of directly triggered genes are crucial factors. Also, network sparseness, the triggering function type and noise in input data affect the inference performance. When two factors were simultaneously varied, it was found that variation of one parameter significantly affects the dynamic response of the other. Crucial factors were also examined using a real GRN and acquired results confirmed simulation findings with artificial data. Different initial conditions were also used as an alternative triggering approach. Relevant results confirmed that the number of datasets constitutes the most significant parameter with regard to the inference performance. Copyright © 2017 Elsevier
at identifying the exact breakpoints where DNA has been gained or lost. In this thesis, three popular methods are compared and a realistic simulation model is presented for generating artificial data with known breakpoints and known DNA copy number. By using simulated data, we obtain a realistic evaluation......During the past few years, innovations in the DNA sequencing technology has led to an explosion in available DNA sequence information. This has revolutionized biological research and promoted the development of high throughput analysis methods that can take advantage of the vast amount of sequence...... data. For this, the DNA microarray technology has gained enormous popularity due to its ability to measure the presence or the activity of thousands of genes simultaneously. Microarrays for high throughput data analyses are not limited to a few organisms but may be applied to everything from bacteria...
Satish Balasaheb Nimse
The highly programmable positioning of molecules (biomolecules, nanoparticles, nanobeads, nanocomposite materials) on surfaces has potential applications in the fields of biosensors, biomolecular electronics, and nanodevices. However, conventional techniques, including self-assembled monolayers, fail to position the molecules on the nanometer scale to produce highly organized monolayers on the surface. The present article elaborates different techniques for the immobilization of biomolecules on the surface to produce microarrays and their diagnostic applications. The advantages and the drawbacks of various methods are compared. This article also sheds light on the applications of the different technologies for the detection and discrimination of viral/bacterial genotypes and the detection of biomarkers. A brief survey with 115 references covering the last 10 years on the biological applications of microarrays in various fields is also provided.
Schlecht, Ulrich; Primig, Michael
Gametogenesis is a key developmental process that involves complex transcriptional regulation of numerous genes including many that are conserved between unicellular eukaryotes and mammals. Recent expression-profiling experiments using microarrays have provided insight into the co-ordinated transcription of several hundred genes during mitotic growth and meiotic development in budding and fission yeast. Furthermore, microarray-based studies have identified numerous loci that are regulated during the cell cycle or expressed in a germ-cell specific manner in eukaryotic model systems like Caenorhabditis elegans, Mus musculus as well as Homo sapiens. The unprecedented amount of information produced by post-genome biology has spawned novel approaches to organizing biological knowledge using currently available information technology. This review outlines experiments that contribute to an emerging comprehensive picture of the molecular machinery governing sexual reproduction in eukaryotes.
Kierzek, Elzbieta; Kierzek, Ryszard; Turner, Douglas H; Catrina, Irina E
Determining RNA secondary structure is important for understanding structure-function relationships and identifying potential drug targets. This paper reports the use of microarrays with heptamer 2'-O-methyl oligoribonucleotides to probe the secondary structure of an RNA and thereby improve the prediction of that secondary structure. When experimental constraints from hybridization results are added to a free-energy minimization algorithm, the prediction of the secondary structure of Escherichia coli 5S rRNA improves from 27 to 92% of the known canonical base pairs. Optimization of buffer conditions for hybridization and application of 2'-O-methyl-2-thiouridine to enhance binding and improve discrimination between AU and GU pairs are also described. The results suggest that probing RNA with oligonucleotide microarrays can facilitate determination of secondary structure.
Gogalic, S.; Hageneder, S.; Ctortecka, C.; Bauch, M.; Khan, I.; Preininger, Claudia; Sauer, U.; Dostalek, J.
Plasmonic amplification of fluorescence signal in bioassays with microarray detection format is reported. A crossed relief diffraction grating was designed to couple an excitation laser beam to surface plasmons at the wavelength overlapping with the absorption and emission bands of fluorophore Dy647 that was used as a label. The surface of periodically corrugated sensor chip was coated with surface plasmon-supporting gold layer and a thin SU8 polymer film carrying epoxy groups. These groups were employed for the covalent immobilization of capture antibodies at arrays of spots. The plasmonic amplification of fluorescence signal on the developed microarray chip was tested by using interleukin 8 sandwich immunoassay. The readout was performed ex situ after drying the chip by using a commercial scanner with high numerical aperture collecting lens. Obtained results reveal the enhancement of fluorescence signal by a factor of 5 when compared to a regular glass chip.
Bychkov, Dmitrii; Turkki, Riku; Haglund, Caj; Linder, Nina; Lundin, Johan
Recent advances in computer vision enable increasingly accurate automated pattern classification. In the current study we evaluate whether a convolutional neural network (CNN) can be trained to predict disease outcome in patients with colorectal cancer based on images of tumor tissue microarray samples. We compare the prognostic accuracy of CNN features extracted from the whole, unsegmented tissue microarray spot image with that of CNN features extracted from the epithelial and non-epithelial compartments, respectively. The prognostic accuracy of visually assessed histologic grade is used as a reference. The image data set consists of digitized hematoxylin-eosin (H&E) stained tissue microarray samples obtained from 180 patients with colorectal cancer. The patient samples represent a variety of histological grades and have data available on a series of clinicopathological variables, including long-term outcome and ground truth annotations performed by experts. The CNN features extracted from images of the epithelial tissue compartment significantly predicted outcome (hazard ratio (HR) 2.08; CI95% 1.04-4.16; area under the curve (AUC) 0.66) in a test set of 60 patients, as compared to the CNN features extracted from unsegmented images (HR 1.67; CI95% 0.84-3.31, AUC 0.57) and visually assessed histologic grade (HR 1.96; CI95% 0.99-3.88, AUC 0.61). In conclusion, a deep-learning classifier can be trained to predict the outcome of colorectal cancer based on images of H&E stained tissue microarray samples, and the CNN features extracted from the epithelial compartment alone gave a prognostic discrimination comparable to that of visually determined histologic grade.
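The AUC values quoted in this abstract can be read as the probability that a randomly chosen patient with a poor outcome receives a higher risk score than a randomly chosen patient with a good outcome. The following is a minimal sketch of that pairwise definition for a binary outcome; it is an illustration only, since the abstract does not say how the authors computed AUC (for example, whether censoring was taken into account). The function name `auc` and the example scores are hypothetical.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison.

    scores: predicted risk per sample (higher means more likely positive).
    labels: 1 for the positive class (e.g. poor outcome), 0 otherwise.
    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

# Hypothetical scores for four samples: two negatives, two positives.
example_auc = auc([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1])  # 3 of 4 pairs ordered correctly
```

An AUC of 0.5 corresponds to random ordering, which is why values such as 0.57 to 0.66 indicate only modest discrimination.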
Barrios Mello, Rafael; Regis Silva, Maria Regina; Seixas Alves, Maria Teresa; Evison, Martin; Guimarães, Marco Aurélio; Francisco, Rafaella Arrabaça; Dias Astolphi, Rafael; Miazato Iwamura, Edna Sadayo
Taphonomic processes affecting bone post mortem are important in forensic, archaeological and palaeontological investigations. In this study, the application of tissue microarray (TMA) analysis to a sample of femoral bone specimens from 20 exhumed individuals of known period of burial and age at death is described. TMA allows multiplexing of subsamples, permitting standardized comparative analysis of adjacent sections in 3-D and of representative cross-sections of a large number of specimens....
Luo, Arong; Zhang, Aibing; Ho, Simon Yw; Xu, Weijun; Zhang, Yanzhou; Shi, Weifeng; Cameron, Stephen L; Zhu, Chaodong
A well-informed choice of genetic locus is central to the efficacy of DNA barcoding. Current DNA barcoding in animals involves the use of the 5' half of the mitochondrial cytochrome oxidase 1 gene (CO1) to diagnose and delimit species. However, there is no compelling a priori reason for the exclusive focus on this region, and it has been shown that it performs poorly for certain animal groups. To explore alternative mitochondrial barcoding regions, we compared the efficacy of the universal CO1 barcoding region with the other mitochondrial protein-coding genes in eutherian mammals. Four criteria were used for this comparison: the number of recovered species, sequence variability within and between species, resolution to taxonomic levels above that of species, and the degree of mutational saturation. Based on 1,179 mitochondrial genomes of eutherians, we found that the universal CO1 barcoding region is a good representative of mitochondrial genes as a whole because the high species-recovery rate (> 90%) was similar to that of other mitochondrial genes, and there were no significant differences in intra- or interspecific variability among genes. However, an overlap between intra- and interspecific variability was still problematic for all mitochondrial genes. Our results also demonstrated that any choice of mitochondrial gene for DNA barcoding failed to offer significant resolution at higher taxonomic levels. We suggest that the CO1 barcoding region, the universal DNA barcode, is preferred among the mitochondrial protein-coding genes as a molecular diagnostic at least for eutherian species identification. Nevertheless, DNA barcoding with this marker may still be problematic for certain eutherian taxa and our approach can be used to test potential barcoding loci for such groups.
DNA barcoding enhances the prospects for species-level identifications globally using a standardized and authenticated DNA-based approach. Reference libraries comprising validated DNA barcodes (COI) constitute robust datasets for testing query sequences, providing considerable utility to identify marine fish and other organisms. Here we test the feasibility of using DNA barcoding to assign species to tissue samples from fish collected in the central Mediterranean Sea, a major contributor to European marine ichthyofaunal diversity. A dataset of 1278 DNA barcodes, representing 218 marine fish species, was used to test the utility of DNA barcodes to assign species from query sequences. We tested query sequences against (1) a reference library of ranked DNA barcodes from the neighbouring North East Atlantic, and (2) the public databases BOLD and GenBank. In the first case, a reference library comprising DNA barcodes with reliability grades for 146 fish species was used as a diagnostic dataset to screen 486 query DNA sequences from fish specimens collected in the central basin of the Mediterranean Sea. Of all query sequences suitable for comparison, 98% were unambiguously confirmed through complete match with reference DNA barcodes. In the second case, it was possible to assign species to 83% (BOLD-IDS) and 72% (GenBank) of the sequences from the Mediterranean. Relatively high intraspecific genetic distances were found in 7 species (2.2%-18.74%), most of them of high commercial relevance, suggesting possible cryptic species. We emphasize the discriminatory power of COI barcodes and their application to cases requiring species-level resolution starting from query sequences. Results highlight the value of public reference libraries of reliability grade-annotated DNA barcodes to identify species from different geographical origins. The ability to assign species with high precision from DNA samples of disparate quality and origin has major utility in several
Phelan, Don; Jackson, Carl; Redfern, R. Michael; Morrison, Alan P.; Mathewson, Alan
New Geiger-mode avalanche photodiodes (GM-APDs) have been designed and characterized specifically for use in microarray systems. Critical parameters such as excess reverse bias voltage, hold-off time and optimum operating temperature have been experimentally determined for these photon-counting devices. The photon detection probability, dark count rate and afterpulsing probability have been measured under different operating conditions. An active-quench circuit (AQC) is presented for operating these GM-APDs. This circuit is relatively simple and robust, and has such benefits as reducing average power dissipation and afterpulsing. Arrays of these GM-APDs have already been designed and, together with AQCs, open up the possibility of a solid-state microarray detector that enables parallel analysis on a single chip. Another advantage of these GM-APDs over current technology is their low-voltage CMOS compatibility, which could allow for the fabrication of an AQC on the same device. Small-area detectors have already been employed in the time-resolved detection of fluorescence from labeled proteins. It is envisaged that operating these new GM-APDs with this active-quench circuit will have numerous applications for the detection of fluorescence in microarray systems.
Akçaalan, Reyhan; Albay, Meric; Koker, Latife; Baudart, Julia; Guillebault, Delphine; Fischer, Sabine; Weigel, Wilfried; Medlin, Linda K
Monitoring drinking water quality is an important public health issue. Two objectives of the four-year, six-nation EU project μAqua were to develop hierarchically specific probes to detect and quantify pathogens in drinking water using a PCR-free microarray platform, and to design a standardised water sampling program for different sources in Europe to obtain sufficient material for downstream analysis. Our phylochip contains barcodes (probes) that specifically identify freshwater pathogens posing human health risks in a taxonomically hierarchical fashion, such that if a species is present, the entire taxonomic hierarchy (genus, family, order, phylum, kingdom) leading to it must also be present, which avoids false positives. Molecular tools are more rapid, accurate and reliable than traditional methods, which means faster mitigation strategies with less harm to humans and the community. We present microarray results for the presence of freshwater pathogens in a Turkish lake used for drinking water, and inferred cyanobacterial cell equivalents from samples concentrated from 40 L to 1 L in 45 min using hollow fibre filters. In two companion studies of the same samples, cyanobacterial toxins were analysed using chemical methods, and the dates with the highest toxin values also had the highest cell equivalents as inferred from this microarray study.
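The hierarchical rule described in this abstract (a species call is accepted only if the probes for every higher rank leading to it are also positive) can be sketched as follows. This is a hedged illustration of the logic only, not the project's actual analysis software; the function name `confirmed_species`, the `RANKS` list and the placeholder probe labels are all hypothetical.

```python
# Ranks named in the abstract, from broadest to most specific.
RANKS = ["kingdom", "phylum", "order", "family", "genus", "species"]

def confirmed_species(lineage, positive_probes):
    """Accept a species call only if every rank in its lineage is positive.

    lineage: dict mapping each rank to the probe label for that taxon.
    positive_probes: set of probe labels that gave a signal on the array.
    """
    return all(lineage[rank] in positive_probes for rank in RANKS)

# Placeholder lineage with made-up probe labels (not real taxa).
lineage = {
    "kingdom": "kingdom_K",
    "phylum": "phylum_P",
    "order": "order_O",
    "family": "family_F",
    "genus": "genus_G",
    "species": "species_S",
}

all_levels_hit = set(lineage.values())
family_missing = all_levels_hit - {"family_F"}

accepted = confirmed_species(lineage, all_levels_hit)   # full hierarchy present
rejected = confirmed_species(lineage, family_missing)   # species signal alone is not enough
```

The second call shows how a lone species-level signal, which might arise from cross-hybridisation, would be discarded under this rule.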
Background The increasing number of gene expression microarray studies represents an important resource in biomedical research. As a result, gene expression based diagnosis has entered clinical practice for patient stratification in breast cancer. However, the integration and combined analysis of microarray studies still remains a challenge. We assessed the potential benefit of data integration on the classification accuracy and systematically evaluated the generalization performance of selected methods on four breast cancer studies comprising almost 1000 independent samples. To this end, we introduced an evaluation framework which aims to establish good statistical practice and a graphical way to monitor differences. The classification goal was to correctly predict estrogen receptor status (negative/positive) and histological grade (low/high) of each tumor sample in an independent study which was not used for the training. For the classification we chose support vector machines (SVM), predictive analysis of microarrays (PAM), random forest (RF) and k-top scoring pairs (kTSP). Guided by considerations relevant for classification across studies, we developed a generalization of kTSP which we evaluated in addition. Our derived version (DV) aims to improve the robustness of the intrinsic invariance of kTSP with respect to technologies and preprocessing. Results For each individual study the generalization error was benchmarked via complete cross-validation and was found to be similar for all classification methods. The misclassification rates were substantially higher in classification across studies, when each single study was used as an independent test set while all remaining studies were combined for the training of the classifier. However, with increasing number of independent microarray studies used in the training, the overall classification performance improved. DV performed better than the average and showed slightly less variance. In
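The invariance of top scoring pairs mentioned in this abstract comes from the fact that the classifier only looks at which of two genes is more highly expressed within a sample, so any monotone rescaling of a sample leaves the prediction unchanged. Below is a minimal sketch of the basic single-pair version (k = 1) on toy data; it is not the derived version (DV) from the abstract, whose details are not given here, and the names `top_scoring_pair` and `predict` are hypothetical.

```python
from itertools import combinations

def top_scoring_pair(samples, labels):
    """Find the gene pair (i, j) whose ordering best separates two classes.

    The score is |P(x_i < x_j | class 0) - P(x_i < x_j | class 1)|.
    Returns (score, i, j, less_means_class1).
    """
    n_genes = len(samples[0])
    best = None
    for i, j in combinations(range(n_genes), 2):
        p = []
        for c in (0, 1):
            rows = [s for s, y in zip(samples, labels) if y == c]
            p.append(sum(s[i] < s[j] for s in rows) / len(rows))
        score = abs(p[0] - p[1])
        if best is None or score > best[0]:
            best = (score, i, j, p[1] > p[0])
    return best

def predict(sample, pair):
    """Classify one sample using only the ordering of the selected pair."""
    _, i, j, less_means_class1 = pair
    return int((sample[i] < sample[j]) == less_means_class1)

# Toy data: in class 0 gene 0 exceeds gene 1, in class 1 the order flips.
samples = [[5, 1, 3], [6, 2, 9], [4, 3, 1],
           [1, 5, 3], [2, 6, 0], [3, 4, 8]]
labels = [0, 0, 0, 1, 1, 1]

pair = top_scoring_pair(samples, labels)
# A sample on a very different intensity scale is still classified by ordering alone.
call_a = predict([10, 20, 15], pair)
call_b = predict([2000, 1000, 0], pair)
```

Because only within-sample orderings are used, data from different platforms can in principle be combined without cross-study normalisation, which is the property the abstract refers to.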
Xiang, Li; Tang, Huan; Cheng, Jin-le; Chen, Yi-long; Deng, Wen; Zheng, Xia-sheng; Lai, Zhi-tian; Chen, Shi-lin
Ultrafine powder and cell wall-broken powder of herbal medicines lack morphological characters and microscopic identification features. This makes it hard to verify an herb's authenticity with traditional methods. In this study we tested the ITS2 sequence as a DNA barcode for identifying herbal medicines in ultrafine powder and cell wall-broken powder. We extracted genomic DNA from 93 samples of 31 representative herbal medicines (28 species), which included whole plants, roots and bulbs, stems, leaves, flowers, fruits and seeds. The ITS2 sequences were amplified and sequenced bidirectionally. The ITS2 sequences were identified using the Basic Local Alignment Search Tool (BLAST) against the GenBank database and a DNA barcoding system. Genetic distances were analyzed using the Kimura 2-parameter (K2P) model, and a neighbor-joining (NJ) phylogenetic tree was constructed using MEGA 6.0. The results showed that DNA could be extracted successfully from the 93 samples and that high-quality ITS2 sequences could be amplified. All 31 herbal medicines were correctly identified via BLAST. The ITS2 sequences of the raw material, ultrafine powder and cell wall-broken powder were identical in 26 herbal medicines, while those of the other 5 herbal medicines exhibited variation. The maximum intraspecific genetic distance of each species was less than the minimum interspecific genetic distance. In the NJ tree, the ITS2 sequences of each species clustered with their standard DNA barcodes. Therefore, the ITS2 barcode can accurately and effectively distinguish ultrafine powder and cell wall-broken powder of herbal medicines. It provides a new molecular method for identifying ultrafine powder and cell wall-broken powder of herbal medicines in quality control and market supervision.
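The two quantities this abstract relies on, the Kimura 2-parameter (K2P) distance and the comparison of maximum intraspecific with minimum interspecific distance, can be sketched briefly. The K2P formula used is the standard one, d = -(1/2) ln(1 - 2P - Q) - (1/4) ln(1 - 2Q), where P and Q are the proportions of transitions and transversions. The code is an illustration only and not the MEGA implementation; the function names and example sequences are hypothetical, and sequences are assumed to be pre-aligned.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    sites = transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes
        sites += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if sites == 0:
        raise ValueError("no comparable sites")
    p = transitions / sites
    q = transversions / sites
    # math.log raises ValueError if the sequences are too divergent (saturated).
    return -0.5 * math.log(1 - 2 * p - q) - 0.25 * math.log(1 - 2 * q)

def has_barcoding_gap(intraspecific, interspecific):
    """True if the largest within-species distance is below the smallest between-species one."""
    return max(intraspecific) < min(interspecific)

# One transition (A->G) in ten sites: P = 0.1, Q = 0.
d_transition = k2p_distance("AAAAAAAAAA", "GAAAAAAAAA")
# Hypothetical distance lists, for illustration only.
gap = has_barcoding_gap([0.0, 0.01], [0.05, 0.20])
```

When the two distance ranges overlap, `has_barcoding_gap` returns False, which is the situation in which distance-based identification becomes ambiguous.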
Wilson, John-James; Sing, Kong-Wah; Sofian-Azirun, Mohd
The objective of this study was to build a DNA barcode reference library for the true butterflies of Peninsula Malaysia and assess the value of attaching subspecies names to DNA barcode records. A new DNA barcode library was constructed with butterflies from the Museum of Zoology, University of Malaya collection. The library was analysed in conjunction with publicly available DNA barcodes from other Asia-Pacific localities to test the ability of the DNA barcodes to discriminate species and subspecies. Analyses confirmed the capacity of the new DNA barcode reference library to distinguish the vast majority of species (92%) and revealed that most subspecies possessed unique DNA barcodes (84%). In some cases conspecific subspecies exhibited genetic distances between their DNA barcodes that are typically seen between species, and these were often taxa that have previously been regarded as full species. Subspecies designations as shorthand for geographically and morphologically differentiated groups provide a useful heuristic for assessing how such groups correlate with clustering patterns of DNA barcodes, especially as the number of DNA barcodes per species in reference libraries increases. Our study demonstrates the value in attaching subspecies names to DNA barcode records as they can reveal a history of taxonomic concepts and expose important units of biodiversity. PMID:24282514
Ortman, Brian D.; Bucklin, Ann; Pagès, Francesc; Youngbluth, Marsh
The Medusozoa are a clade within the Cnidaria comprising the classes Hydrozoa, Scyphozoa, and Cubozoa. Identification of medusozoan species is challenging, even for taxonomic experts, due to their fragile forms and complex, morphologically distinct life history stages. In this study 231 sequences for a portion of the mitochondrial Cytochrome Oxidase I (mtCOI) gene were obtained from 95 species of medusozoans, including 84 hydrozoans (61 siphonophores, eight anthomedusae, four leptomedusae, seven trachymedusae, and four narcomedusae), 10 scyphozoans (three coronatae, four semaeostomae, two rhizostomae, and one stauromedusae), and one cubozoan. This region of mtCOI has been used as a DNA barcode (i.e., a molecular character for species recognition and discrimination) for a diverse array of taxa, including some Cnidaria. Kimura 2-parameter (K2P) genetic distances between sequence variants within species ranged from 0 to 0.057 (mean 0.013). Within the 13 genera for which multiple species were available, K2P distance between congeneric species ranged from 0.056 to 0.381. A cluster diagram generated by Neighbor Joining (NJ) using K2P distances reliably clustered all barcodes of the same species with ≥99% bootstrap support, ensuring accurate identification of species. Intra- and interspecific variation of the mtCOI gene in the Medusozoa is appropriate for this gene to be used as a DNA barcode for species-level identification, but not for phylogenetic analysis or taxonomic classification of unknown sequences at higher taxonomic levels. This study provides a set of molecular tools that can be used to address questions of speciation, biodiversity, life history, and population boundaries in the Medusozoa.
In most freshwater ecosystems, aquatic insects are dominant in terms of diversity; however, there is a disproportionately low number of records of alien species when compared to other freshwater organisms. The Chironomidae is one aquatic insect family that includes some examples of alien species around the world. During a study on aquatic insects in Amazonas state (Brazil), we collected specimens of Chironomidae that are similar, at the morphological level, to Chironomus kiiensis Tokunaga and Chironomus striatipennis Kieffer, both with distributions restricted to Asia. The objectives of this study were to provide morphological information on this Chironomus population, to investigate its identity using DNA barcoding, and to provide bionomic information about this species. Chironomus DNA barcode data were obtained from GenBank and Barcode of Life Data Systems (BOLD) and, together with our data, were analyzed using the neighbor-joining method with 1000 bootstrap replicates; genetic distances were estimated using the Kimura 2-parameter model. At the morphological level, the Brazilian population cannot be distinguished from either C. striatipennis or C. kiiensis, which together form a species complex, but at the molecular level our studied population is placed in a clade together with C. striatipennis from South Korea. Bionomic characteristics of the Brazilian Chironomus population differ from those of C. kiiensis from Japan, the only species in this species complex with bionomic information available. The Brazilian Chironomus population has a smaller size, lays twice the number of eggs and inhabits oligotrophic water in artificial containers. In the molecular analysis, populations of C. striatipennis and C. kiiensis are placed in a clade formed by two groups: Group A (which includes populations of both named species from different Asian regions and our Brazilian population) and Group B (with populations of C. kiiensis from Japan and South Korea
Chaveerach, Arunrat; Tanee, Tawatchai; Sanubol, Arisa; Monkheang, Pansa; Sudmoon, Runglawan
Piper species are used for spices, in traditional and processed forms of medicines, in cosmetic compounds, in cultural activities and as insecticides. Here barcode analysis was performed for identification of plant parts, young plants and modified forms of plants. Thirty-six Piper species were collected and the three barcode regions, matK, rbcL and the psbA-trnH spacer, were amplified, sequenced and aligned to determine their genetic distances. For intraspecific genetic distances, the most effective values for species identification ranged from no difference to very low distance values. However, Piper betle had the highest value, at 0.386 for the matK region. This finding may be due to Piper betle being an economic and cultivated species, and thus supported with growth factors, which may have affected its genetic distance. The interspecific genetic distances that were most effective for identification of different species were from the matK region and ranged from a low of 0.002 in 27 paired species to a high of 0.486. Eight species pairs (Piper kraense and Piper dominantinervium, Piper magnibaccum and Piper kraense, Piper phuwuaense and Piper dominantinervium, Piper phuwuaense and Piper kraense, Piper pilobracteatum and Piper dominantinervium, Piper pilobracteatum and Piper kraense, Piper pilobracteatum and Piper phuwuaense, and Piper sylvestre and Piper polysyphonum) presented a genetic distance of 0.000 and were identified by independently using each of the other two regions. In short, these three barcode regions are powerful tools for efficient identification of the 36 Piper species. PMID:27829794
Recent studies indicate that the discriminatory power of the core DNA barcodes (rbcLa + matK) for land plants may have been overestimated, since their performance has been tested on only a few closely related species. In this study we focused mainly on how the addition of complementary barcodes (nrITS and trnH-psbA) to the core barcodes affects their performance in discriminating closely related species from family to section levels. In general, we found that the core barcodes performed poorly compared to the various combinations tested. Using multiple criteria, we finally advocate the use of the core + trnH-psbA as a potential DNA barcode for the family Combretaceae, at least in southern Africa. Our results also indicate that the success of DNA barcoding in discriminating closely related species may be related to the evolutionary and possibly the biogeographic histories of the taxonomic group tested.
Gene microarray analysis and classification have proved to be an effective approach to the diagnosis of diseases and cancers. However, it has also been shown that basic classification techniques have intrinsic drawbacks in achieving accurate gene classification and cancer diagnosis. Classifier ensembles, on the other hand, have received increasing attention in various applications. Here, we address the gene classification issue using the RotBoost ensemble methodology. This method combines the Rotation Forest and AdaBoost techniques, preserving both desirable features of an ensemble architecture, that is, accuracy and diversity. To select a concise subset of informative genes, 5 different feature selection algorithms are considered. To assess the efficiency of RotBoost, other nonensemble/ensemble techniques including Decision Trees, Support Vector Machines, Rotation Forest, AdaBoost, and Bagging are also deployed. Experimental results reveal that the combination of the fast correlation-based feature selection method with the ICA-based RotBoost ensemble is highly effective for gene classification. In fact, the proposed method can create ensemble classifiers which outperform not only the classifiers produced by conventional machine learning but also the classifiers generated by two widely used conventional ensemble learning methods, that is, Bagging and AdaBoost.
Stokes, Todd H; Torrance, JT; Li, Henry; Wang, May D
Background A survey of microarray databases reveals that most of the repository contents and data models are heterogeneous (i.e., data obtained from different chip manufacturers), and that the repositories provide only basic biological keywords linking to PubMed. As a result, it is difficult to find datasets using research context or analysis parameter information beyond a few keywords. For example, to reduce the "curse of dimensionality" problem in microarray analysis, the number of samples is often increased by merging array data from different datasets. Knowing chip data parameters such as pre-processing steps (e.g., normalization, artefact removal, etc.), and knowing any previous biological validation of the dataset, is essential due to the heterogeneity of the data. However, most of the microarray repositories do not have meta-data information in the first place, and do not have a mechanism to add or insert this information. Thus, there is a critical need to create "intelligent" microarray repositories that (1) enable update of meta-data with the raw array data, and (2) provide standardized archiving protocols to minimize bias from the raw data sources. Results To address the problems discussed, we have developed a community-maintained system called ArrayWiki that unites disparate meta-data of microarray meta-experiments from multiple primary sources with four key features. First, ArrayWiki provides a user-friendly knowledge management interface in addition to a programmable interface using standards developed by Wikipedia. Second, ArrayWiki includes automated quality control processes (caCORRECT) and novel visualization methods (BioPNG, Gel Plots), which provide extra information about data quality unavailable in other microarray repositories. Third, it provides a user-curation capability through the familiar Wiki interface. Fourth, ArrayWiki provides users with simple text-based searches across all experiment meta-data, and exposes data to search engine crawlers
Dahl, Mathilde Borg; Brejnrod, Asker Daniel; Unterseher, Martin
Unicellular, eukaryotic organisms (protists) play a key role in soil food webs as major predators of microorganisms. However, due to the polyphyletic nature of protists, no single universal barcode can be established for this group, and the structure of many protistean communities remains...... unresolved. Plasmodial slime moulds (Myxogastria or Myxomycetes) stand out among protists by their formation of fruit bodies, which allow for a morphological species concept. By Sanger sequencing of a large collection of morphospecies, this study presents the largest database to date of dark...... match, thus thought to represent undiscovered diversity of dark-spored myxomycetes....
Abstract Background Mycotoxins are fungal secondary metabolites commonly present in feed and food, and are widely regarded as hazardous contaminants. Citrinin, one of the best-known mycotoxins, was first isolated from Penicillium citrinum, is produced by more than 10 kinds of fungi, and is possibly spread all over the world. However, information on the mechanism of action of the toxin is limited. Thus, we investigated the citrinin-induced genomic response to evaluate its toxicity. Results Citrinin inhibited growth of yeast cells at concentrations higher than 100 ppm. We monitored the citrinin-induced mRNA expression profiles in yeast using the ORF DNA microarray and Oligo DNA microarray, and the expression profiles were compared with those of other stress-inducing agents. Results obtained from both microarray experiments clustered together, but were different from those of the mycotoxin patulin. The oxidative stress response genes – AADs, FLR1, OYE3, GRE2, and MET17 – were significantly induced. In the functional category, expression of genes involved in "metabolism", "cell rescue, defense and virulence", and "energy" was significantly activated. In the category of "metabolism", genes involved in the glutathione synthesis pathway were activated, and in the category of "cell rescue, defense and virulence", the ABC transporter genes were induced. To alleviate the induced stress, these cells might pump out the citrinin after modification with glutathione. In contrast, citrinin treatment did not induce genes involved in DNA repair. Conclusion Results from both microarray studies suggest that citrinin treatment induced oxidative stress in yeast cells. The genotoxicity was less severe than that of patulin, suggesting that citrinin is less toxic than patulin. The reproducibility of the expression profiles was much better with the Oligo DNA microarray. However, the Oligo DNA microarray did not completely overcome cross
Conventional comparative genomic hybridization (CGH) profiling of neuroblastomas has identified many genomic aberrations, although the limited resolution has precluded a precise localization of sequences of interest within amplicons. To map high copy number genomic gains in clinically matched stage IV neuroblastomas, CGH analysis using a 19,200-feature cDNA microarray was used. A dedicated (freely available) algorithm was developed for rapid in silico determination of chromosomal localizations of microarray cDNA targets, and for generation of an ideogram-type profile of copy number changes. Using these methodologies, novel gene amplifications undetectable by chromosome CGH were identified, and larger MYCN amplicon sizes (in one tumor up to 6 Mb) than those previously reported in neuroblastoma were identified. The genes HPCAL1, LPIN1/KIAA0188, NAG, and NSE1/LOC151354 were found to be coamplified with MYCN. To determine whether stage IV primary tumors could be further subclassified based on their genomic copy number profiles, hierarchical clustering was performed. Cluster analysis of microarray CGH data identified three groups: (1) no amplifications evident, (2) a small MYCN amplicon as the only detectable imbalance, and (3) a large MYCN amplicon with additional gene amplifications. Application of CGH to cDNA microarray targets will help to determine both the variation of amplicon size and help better define amplification-dependent and independent pathways of progression in neuroblastoma.
Rivera, Robert; Wang, Jie; Yu, Xiaobo; Demirkan, Gokhan; Hopper, Marika; Bian, Xiaofang; Tahsin, Tasnia; Magee, D Mitchell; Qiu, Ji; LaBaer, Joshua; Wallstrom, Garrick
In recent studies involving NAPPA microarrays, extra-well fluorescence is used as a key measure for identifying disease biomarkers because there is evidence to support that it is better correlated with strong antibody responses than statistical analysis involving intraspot intensity. Because this feature is not well quantified by traditional image analysis software, identification and quantification of extra-well fluorescence is performed manually, which is both time-consuming and highly susceptible to variation between raters. A system that could automate this task efficiently and effectively would greatly improve the process of data acquisition in microarray studies, thereby accelerating the discovery of disease biomarkers. In this study, we experimented with different machine learning methods, as well as novel heuristics, for identifying spots exhibiting extra-well fluorescence (rings) in microarray images and assigning each ring a grade of 1-5 based on its intensity and morphology. The sensitivity of our final system for identifying rings was found to be 72% at 99% specificity and 98% at 92% specificity. Our system performs this task significantly faster than a human, while maintaining high performance, and therefore represents a valuable tool for microarray image analysis.
Abstract Background Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most widely studied groups of fish. Results 298,304 expressed sequence tags (ESTs) from Atlantic salmon (69% of the total), 11,664 chinook, 10,813 sockeye, 10,051 brook trout, 10,975 grayling, 8,630 lake whitefish, and 3,624 northern pike ESTs were obtained in this study and have been deposited into the public databases. Contigs were built and putative full-length Atlantic salmon clones have been identified. A database containing ESTs, assemblies, consensus sequences, open reading frames, gene predictions and putative annotation is available. The overall similarity between Atlantic salmon ESTs and those of rainbow trout, chinook, sockeye, brook trout, grayling, lake whitefish, northern pike and rainbow smelt is 93.4, 94.2, 94.6, 94.4, 92.5, 91.7, 89.6, and 86.2% respectively. An analysis of 78 transcript sets shows Salmo as a sister group to Oncorhynchus and Salvelinus within Salmoninae, and Thymallinae as a sister group to Salmoninae and Coregoninae within Salmonidae. Extensive gene duplication is consistent with a genome duplication in the common ancestor of salmonids. Using all of the available EST data, a new expanded salmonid cDNA microarray of 32,000 features was created. Cross-species hybridizations to this cDNA microarray indicate that this resource will be useful for studies of all 68 salmonid species. Conclusion An extensive collection and analysis of salmonid RNA putative transcripts indicate that Pacific salmon, Atlantic salmon and charr are 94–96% similar while the more distant whitefish, grayling, pike and smelt are 93, 92, 89 and 86% similar to salmon. The salmonid transcriptome reveals a complex history of gene duplication that is
Abstract Background Thousands of plants and animals possess pharmacological properties and there is an increased interest in using these materials for therapy and health maintenance. The efficacy of such applications depends critically on the use of genuine materials. From time to time, life-threatening poisoning occurs because a toxic adulterant or substitute is administered. DNA barcoding provides a definitive means of authentication and for conducting molecular systematics studies. Owing to the reduced cost of DNA authentication, the volume of DNA barcodes produced for medicinal materials is on the rise and necessitates the development of an integrated DNA database. Description We have developed an integrated DNA barcode multimedia information platform, the Medicinal Materials DNA Barcode Database (MMDBD), for data retrieval and similarity search. MMDBD contains over 1000 species of medicinal materials listed in the Chinese Pharmacopoeia and American Herbal Pharmacopoeia. MMDBD also contains useful information on the medicinal materials, including resources, adulterant information, medical parts, photographs, primers used for obtaining the barcodes and key references. MMDBD can be accessed at http://www.cuhk.edu.hk/icm/mmdbd.htm. Conclusions This work provides a centralized medicinal materials DNA barcode database and bioinformatics tools for data storage, analysis and exchange for promoting the identification of medicinal materials. MMDBD has the largest collection of DNA barcodes of medicinal materials and is a useful resource for researchers in conservation, systematic study, forensics, and the herbal industry.
Zhou, Zhixin; Luo, Guofeng; Wulf, Verena; Willner, Itamar
The study introduces an analytical platform for the detection of genes or aptamer-ligand complexes by nucleic acid barcode patterns generated by DNA machineries. The DNA machineries consist of nucleic acid scaffolds that include specific recognition sites for the different genes or aptamer-ligand analytes. The binding of the analytes to the scaffolds initiates, in the presence of the nucleotide mixture, a cyclic polymerization/nicking machinery that yields displaced strands of variable lengths. The electrophoretic separation of the resulting strands provides barcode patterns for the specific detection of the different analytes. Mixtures of DNA machineries that yield, upon sensing of different genes (or aptamer ligands), one-, two-, or three-band barcode patterns are described. The combination of nucleic acid scaffolds acting, in the presence of polymerase/nicking enzyme and nucleotide mixture, as DNA machineries that generate multiband barcode patterns provides an analytical platform for the detection of an individual gene out of many possible genes. The diversity of genes (or other analytes) that can be analyzed by the DNA machineries and the barcode patterned imaging is given by Pascal's triangle. As a proof-of-concept, the detection of one of six genes, that is, TP53, Werner syndrome, Tay-Sachs normal gene, BRCA1, Tay-Sachs mutant gene, and cystic fibrosis disorder gene, by six two-band barcode patterns is demonstrated. The advantages and limitations of the detection of analytes by polymerase/nicking DNA machineries that yield barcode patterns as imaging readout signals are discussed.
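The Pascal's triangle remark can be made concrete with a short sketch. It assumes (the abstract does not state this) that each analyte is encoded by a distinct subset of k bands chosen from n resolvable strand lengths, so that the number of distinguishable patterns is the binomial coefficient C(n, k). Under that assumption, four strand lengths would suffice for six two-band patterns. The function name and the strand lengths below are hypothetical.

```python
from itertools import combinations
from math import comb


def barcode_patterns(band_lengths, bands_per_pattern):
    """Enumerate every barcode pattern that uses exactly
    `bands_per_pattern` bands drawn from `band_lengths` (assumed distinct)."""
    return list(combinations(sorted(band_lengths), bands_per_pattern))


# Hypothetical strand lengths in nucleotides; not taken from the study.
lengths = [20, 30, 40, 50]
patterns = barcode_patterns(lengths, 2)

# The count equals the Pascal's triangle entry C(4, 2).
assert len(patterns) == comb(len(lengths), 2)
for pattern in patterns:
    print(pattern)
```

Allowing one-, two-, and three-band patterns together would add the corresponding entries of the same row of Pascal's triangle.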
Gwiazdowski, Rodger A.; Foottit, Robert G.; Maw, H. Eric L.; Hebert, Paul D. N.
DNA barcode reference libraries linked to voucher specimens create new opportunities for high-throughput identification and taxonomic re-evaluations. This study provides a DNA barcode library for about 45% of the recognized species of Canadian Hemiptera, and the publicly available R workflow used for its generation. The current library is based on the analysis of 20,851 specimens including 1849 species belonging to 628 genera and 64 families. These individuals were assigned to 1867 Barcode Index Numbers (BINs), sequence clusters that often coincide with species recognized through prior taxonomy. Museum collections were a key source for identified specimens, but we also employed high-throughput collection methods that generated large numbers of unidentified specimens. Many of these specimens represented novel BINs that were subsequently identified by taxonomists, adding barcode coverage for additional species. Our analyses based on both approaches include 94 species not listed in the most recent Canadian checklist, representing a potential 3% increase in the fauna. We discuss the development of our workflow in the context of prior DNA barcode library construction projects, emphasizing the importance of delineating a set of reference specimens to aid investigations in cases of nomenclatural and DNA barcode discordance. The identification for each specimen in the reference set can be annotated on the Barcode of Life Data System (BOLD), allowing experts to highlight questionable identifications; annotations can be added by any registered user of BOLD, and instructions for this are provided. PMID:25923328
Xie, Lei; Wang, Ying Wei; Guan, Shan Yue; Xie, Li Jing; Long, Xin; Sun, Cheng Ye
Poisonous plants are a deadly threat to public health in China. The traditional clinical diagnosis of the toxic plants is inefficient, fallible, and dependent upon experts. In this study, we tested the performance of DNA barcodes for identification of the most threatening poisonous plants in China. Seventy-four accessions of 27 toxic plant species in 22 genera and 17 families were sampled and three DNA barcodes (matK, rbcL, and ITS) were amplified, sequenced and tested. Three methods, Blast, pairwise global alignment (PWG) distance, and Tree-Building were tested for discrimination power. The primer universality of all the three markers was high. Except in the case of ITS for Hemerocallis minor, the three barcodes were successfully generated from all the selected species. Among the three methods applied, Blast showed the lowest discrimination rate, whereas PWG Distance and Tree-Building methods were equally effective. The ITS barcode showed highest discrimination rates using the PWG Distance and Tree-Building methods. When the barcodes were combined, discrimination rates were increased for the Blast method. DNA barcoding technique provides us a fast tool for clinical identification of poisonous plants in China. We suggest matK, rbcL, ITS used in combination as DNA barcodes for authentication of poisonous plants. Copyright © 2014 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
Daily, Ashley; Kennedy, Erin D; Fierro, Leslie A; Reed, Jenica Huddleston; Greene, Michael; Williams, Warren W; Evanson, Heather V; Cox, Regina; Koeppl, Patrick; Gerlach, Ken
Accurately recording vaccine lot number, expiration date, and product identifiers, in patient records is an important step in improving supply chain management and patient safety in the event of a recall. These data are being encoded on two-dimensional (2D) barcodes on most vaccine vials and syringes. Using electronic vaccine administration records, we evaluated the accuracy of lot number and expiration date entered using 2D barcode scanning compared to traditional manual or drop-down list entry methods. We analyzed 128,573 electronic records of vaccines administered at 32 facilities. We compared the accuracy of records entered using 2D barcode scanning with those entered using traditional methods using chi-square tests and multilevel logistic regression. When 2D barcodes were scanned, lot number data accuracy was 1.8 percentage points higher (94.3-96.1%, Pmanufacturer, month vaccine was administered, and vaccine type were associated with variation in accuracy for both lot number and expiration date. Two-dimensional barcode scanning shows promise for improving data accuracy of vaccine lot number and expiration date records. Adapting systems to further integrate with 2D barcoding could help increase adoption of 2D barcode scanning technology. Published by Elsevier Ltd.
Phytoplasmas are bacterial phytopathogens responsible for significant losses in agricultural production worldwide. Several molecular markers are available for identification of groups or strains of phytoplasmas. However, they often cannot be used for identification of phytoplasmas from different groups simultaneously or are too long for routine diagnostics. DNA barcoding recently emerged as a convenient tool for species identification. Here, the development of a universal DNA barcode based on the elongation factor Tu (tuf) gene for phytoplasma identification is reported. We designed a new set of primers and amplified a 420-444 bp fragment of tuf from all 91 phytoplasma strains tested (16S rRNA groups -I through -VII, -IX through -XII, -XV, and -XX). Comparison of NJ trees constructed from the tuf barcode and a 1.2 kbp fragment of the 16S ribosomal gene revealed that the tuf tree is highly congruent with the 16S rRNA tree and had higher inter- and intra-group sequence divergence. Mean K2P inter-/intra-group divergences of the tuf barcode did not overlap and had approximately one order of magnitude difference for most groups, suggesting the presence of a DNA barcoding gap. The use of the tuf barcode allowed separation of main ribosomal groups and most of their subgroups. Phytoplasma tuf barcodes were deposited in the NCBI GenBank and Q-bank databases. This study demonstrates that DNA barcoding principles can be applied for identification of phytoplasmas. Our findings suggest that the tuf barcode performs as well as or better than a 1.2 kbp fragment of the 16S rRNA gene and thus provides an easy procedure for phytoplasma identification. The obtained sequences were used to create a publicly available reference database that can be used by plant health services and researchers for online phytoplasma identification.
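The barcoding-gap argument above can be illustrated with a minimal sketch. It assumes pairwise distances have already been computed and that a gap is declared when the smallest inter-group distance exceeds the largest intra-group distance, which is one common convention and not necessarily the criterion the authors applied. The function name, strain identifiers, and distance values are all hypothetical.

```python
from itertools import combinations
from statistics import mean


def barcoding_gap(distances, group_of):
    """Summarise intra- and inter-group divergence.

    distances: dict mapping frozenset({id_a, id_b}) -> pairwise distance
    group_of:  dict mapping sequence id -> group label
    """
    intra, inter = [], []
    for a, b in combinations(sorted(group_of), 2):
        d = distances[frozenset((a, b))]
        (intra if group_of[a] == group_of[b] else inter).append(d)
    return {
        "mean_intra": mean(intra),
        "mean_inter": mean(inter),
        # Assumed gap criterion: no overlap between the two distributions.
        "gap": min(inter) > max(intra),
    }


# Toy data (hypothetical): two strains in each of two groups.
groups = {"s1": "I", "s2": "I", "s3": "II", "s4": "II"}
dist = {
    frozenset(("s1", "s2")): 0.01,
    frozenset(("s3", "s4")): 0.02,
    frozenset(("s1", "s3")): 0.10,
    frozenset(("s1", "s4")): 0.12,
    frozenset(("s2", "s3")): 0.11,
    frozenset(("s2", "s4")): 0.09,
}
print(barcoding_gap(dist, groups))
```

Comparing means alone can hide overlap in the tails, which is why the sketch also checks the extreme values.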
Full Text Available BACKGROUND: Widespread uptake of DNA barcoding technology for vascular plants has been slow due to the relatively poor resolution of species discrimination (∼70% and low sequencing and amplification success of one of the two official barcoding loci, matK. Studies to date have mostly focused on finding a solution to these intrinsic limitations of the markers, rather than posing questions that can maximize the utility of DNA barcodes for plants with the current technology. METHODOLOGY/PRINCIPAL FINDINGS: Here we test the ability of plant DNA barcodes using the two official barcoding loci, rbcLa and matK, plus an alternative barcoding locus, trnH-psbA, to estimate the species diversity of trees in a tropical rainforest plot. Species discrimination accuracy was similar to findings from previous studies but species richness estimation accuracy proved higher, up to 89%. All combinations which included the trnH-psbA locus performed better at both species discrimination and richness estimation than matK, which showed little enhanced species discriminatory power when concatenated with rbcLa. The utility of the trnH-psbA locus is limited however, by the occurrence of intraspecific variation observed in some angiosperm families to occur as an inversion that obscures the monophyly of species. CONCLUSIONS/SIGNIFICANCE: We demonstrate for the first time, using a case study, the potential of plant DNA barcodes for the rapid estimation of species richness in taxonomically poorly known areas or cryptic populations revealing a powerful new tool for rapid biodiversity assessment. The combination of the rbcLa and trnH-psbA loci performed better for this purpose than any two-locus combination that included matK. We show that although DNA barcodes fail to discriminate all species of plants, new perspectives and methods on biodiversity value and quantification may overshadow some of these shortcomings by applying barcode data in new ways.
Costion, Craig; Ford, Andrew; Cross, Hugh; Crayn, Darren; Harrington, Mark; Lowe, Andrew
Widespread uptake of DNA barcoding technology for vascular plants has been slow due to the relatively poor resolution of species discrimination (∼70%) and low sequencing and amplification success of one of the two official barcoding loci, matK. Studies to date have mostly focused on finding a solution to these intrinsic limitations of the markers, rather than posing questions that can maximize the utility of DNA barcodes for plants with the current technology. Here we test the ability of plant DNA barcodes using the two official barcoding loci, rbcLa and matK, plus an alternative barcoding locus, trnH-psbA, to estimate the species diversity of trees in a tropical rainforest plot. Species discrimination accuracy was similar to findings from previous studies but species richness estimation accuracy proved higher, up to 89%. All combinations which included the trnH-psbA locus performed better at both species discrimination and richness estimation than matK, which showed little enhanced species discriminatory power when concatenated with rbcLa. The utility of the trnH-psbA locus is limited however, by the occurrence of intraspecific variation observed in some angiosperm families to occur as an inversion that obscures the monophyly of species. We demonstrate for the first time, using a case study, the potential of plant DNA barcodes for the rapid estimation of species richness in taxonomically poorly known areas or cryptic populations revealing a powerful new tool for rapid biodiversity assessment. The combination of the rbcLa and trnH-psbA loci performed better for this purpose than any two-locus combination that included matK. We show that although DNA barcodes fail to discriminate all species of plants, new perspectives and methods on biodiversity value and quantification may overshadow some of these shortcomings by applying barcode data in new ways.
Abstract Background Most microarray studies are made using labelling with one or two dyes which allows the hybridization of one or two samples on the same slide. In such experiments, the most frequently used dyes are Cy3 and Cy5. Recent improvements in the technology (dye-labelling, scanner, and image analysis) allow hybridization of up to four samples simultaneously. The two additional dyes are Alexa488 and Alexa494. The triple-target or four-target technology is very promising, since it allows more flexibility in the design of experiments, an increase in the statistical power when comparing gene expressions induced by different conditions and a scaled down number of slides. However, few methods have been proposed for statistical analysis of such data. Moreover, the lowess correction of the global dye effect is available for only two-color experiments, and even if its application can be derived, it does not allow simultaneous correction of the raw data. Results We propose a two-step normalization procedure for triple-target experiments. First the dye bleeding is evaluated and corrected if necessary. Then the signal in each channel is normalized using a generalized lowess procedure to correct a global dye bias. The normalization procedure is validated using triple-self experiments and by comparing the results of triple-target and two-color experiments. Although the focus is on triple-target microarrays, the proposed method can be used to normalize p differently labelled targets co-hybridized on the same array, for any value of p greater than 2. Conclusion The proposed normalization procedure is effective: the technical biases are reduced, the number of false positives is under control in the analysis of differentially expressed genes, and the triple-target experiments are more powerful than the corresponding two-color experiments. There is room for improving the microarray experiments by simultaneously hybridizing more than two samples.
Neigel, J.; Domingo, A.; Stake, J.
DNA Barcoding (DBC) is a method for taxonomic identification of animals that is based entirely on the 5' portion of the mitochondrial gene, cytochrome oxidase subunit I (COI-5). It can be especially useful for identification of larval forms or incomplete specimens lacking diagnostic morphological characters. DBC can also facilitate the discovery of species and in defining “molecular taxonomic units” in problematic groups. However, DBC is not a panacea for coral reef taxonomy. In two of the most ecologically important groups on coral reefs, the Anthozoa and Porifera, COI-5 sequences have diverged too little to be diagnostic for all species. Other problems for DBC include paraphyly in mitochondrial gene trees and lack of differentiation between hybrids and their maternal ancestors. DBC also depends on the availability of databases of COI-5 sequences, which are still in early stages of development. A global effort to barcode all fish species has demonstrated the importance of large-scale coordination and is yielding promising results. Whether or not COI-5 by itself is sufficient for species assignments has become a contentious question; it is generally advantageous to use sequences from multiple loci.
Two snapshot surveys to establish the diversity and ecological preferences of mosquitoes (Diptera: Culicidae) in the terra firme primary rain forest surrounding the Tiputini Biodiversity Station in the UNESCO Yasuní Biosphere Reserve of eastern Amazonian Ecuador were carried out in November 1998 and May 1999. The mosquito fauna of this region is poorly known; the focus of this study was to obtain high quality link-reared specimens that could be used to unequivocally confirm species level diversity through integrated systematic study of all life stages and DNA sequences. A total of 2,284 specimens were preserved; 1,671 specimens were link-reared with associated immature exuviae, all but 108 of which are slide mounted. This study identified 68 unique taxa belonging to 17 genera and 27 subgenera. Of these, 12 are new to science and 37 comprise new country records. DNA barcodes [658 bp of the mtDNA cytochrome c oxidase I (COI) gene] are presented for 58 individuals representing 20 species and nine genera. DNA barcoding proved useful in uncovering and confirming new species and we advocate an integrated systematics approach to biodiversity studies in future. Associated bionomics of all species collected are discussed. An updated systematic checklist of the mosquitoes of Ecuador (n = 179) is presented for the first time in 60 years.
Liu, Chuan; Zhang, Yu-Xin; Liu, Yue; Chen, Yi-Long; Fan, Gang; Xiang, Li; Xu, Jiang; Zhang, Yi
The ITS2 barcode was used to identify the Tibetan medicine "Dida" and to secure its quality and safety in medication. ITS2 sequences were amplified from 151 experimental samples representing 13 species collected on the Tibetan Plateau, including Gentianaceae Swertia, Halenia, Gentianopsis, Comastoma, and Lomatogonium, and the purified PCR products were sequenced. Sequence assembly and consensus sequence generation were performed using CodonCode Aligner V3.7.1. The Kimura 2-Parameter (K2P) distances were calculated using MEGA 6.0. The neighbor-joining (NJ) phylogenetic trees were constructed. After alignment, the ITS2 sequences spanned 231 bp and comprised 31 haplotypes, with an average G+C content of 61.40%. The NJ tree strongly supported the clustering of every species into its own clade, with a high identification success rate, except that Swertia bifolia and Swertia wolfangiana could not be distinguished from each other based on the sequence divergences. DNA barcoding could be used as a fast and accurate identification method to distinguish the Tibetan medicine "Dida" and ensure its safe use. Copyright© by the Chinese Pharmaceutical Association.
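For readers unfamiliar with the K2P distance mentioned above, the following is a minimal sketch of the standard Kimura two-parameter formula, d = -1/2 ln(1 - 2P - Q) - 1/4 ln(1 - 2Q), where P and Q are the proportions of transitional and transversional differences between two aligned sequences. It is not the MEGA implementation; the function name is hypothetical, and the choice to skip gapped or ambiguous sites is an assumption.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned DNA sequences.

    Sites with a gap or ambiguity code in either sequence are skipped
    (an assumption of this sketch).
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # math.log raises ValueError if the sequences are too divergent
    # for the model (argument <= 0).
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# One transition in ten sites (toy sequences, not real ITS2 data).
print(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"))
```

Because transitions and transversions are weighted separately, the K2P distance is slightly larger than the raw proportion of differing sites.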
Sarmiento-Camacho, Stephanie; Valdez-Moreno, Martha
The substitution of high-value fish species for those of lower value is common practice. Although numerous studies have addressed this issue, few have been conducted in Mexico. In this study, we sought to identify fresh fillets of fish, sharks, and rays using DNA barcodes. We analyzed material from "La Viga" in Mexico City, and other markets located on the Gulf and Caribbean coasts of Mexico. From 134 samples, we obtained sequences from 129, identified to 9 orders, 28 families, 38 genera, and 44 species. The most common species were Seriola dumerili, Pangasianodon hypophthalmus, Carcharhinus falciformis, Carcharhinus brevipinna, and Hypanus americanus. Pangasianodon hypophthalmus was most commonly used as a substitute for higher-value species. The substitution rate was 18% of the total. A review of the conservation status of the specimens identified against the IUCN list enabled us to establish that some species marketed in Mexico are threatened: Makaira nigricans, Lachnolaimus maximus, Hyporthodus flavolimbatus, and Isurus oxyrinchus are classified as vulnerable; Lopholatilus chamaeleonticeps and Sphyrna lewini are endangered; and the status of Hyporthodus nigritus is critical. These results will demonstrate to the Mexican authorities that DNA barcoding is a reliable tool for species identification, even when morphological identification is difficult or impossible.
Microarray study enables us to obtain hundreds of thousands of expressions of genes or genotypes at once, and it is an indispensable technology for genome research. The first step is the analysis of scanned microarray images. This is the most important procedure for obtaining biologically reliable data. Currently most microarray image processing systems require burdensome manual block/spot indexing work. Since the amount of experimental data is increasing very quickly, automated microarray image analysis software becomes important. In this paper, we propose two automated methods for analyzing microarray images. First, we propose the extended -regular sequence to index blocks and spots, which enables a novel automatic gridding procedure. Second, we provide a methodology, hierarchical metagrid alignment, to allow reliable and efficient batch processing for a set of microarray images. Experimental results show that the proposed methods are more reliable and convenient than the commercial tools.
Arigi, Emma; Blixt, Klas Ola; Buschard, Karsten
, the major classes of plant and fungal GSLs. In this work, a prototype "universal" GSL-based covalent microarray has been designed, and preliminary evaluation of its potential utility in assaying protein-GSL binding interactions investigated. An essential step in development involved the enzymatic release...... of the fatty acyl moiety of the ceramide aglycone of selected mammalian GSLs with sphingolipid N-deacylase (SCDase). Derivatization of the free amino group of a typical lyso-GSL, lyso-G(M1), with a prototype linker assembled from succinimidyl-[(N-maleimidopropionamido)-diethyleneglycol] ester and 2...
Li, Shuzhao; Pozhitkov, Alexander; Brouwer, Marius
Understanding the difference in probe properties holds the key to absolute quantification of DNA microarrays. So far, Langmuir-like models have failed to link sequence-specific properties to hybridization signals in the presence of a complex hybridization background. Data from washing experiments indicate that the post-hybridization washing has no major effect on the specifically bound targets, which give the final signals. Thus, the amount of specific targets bound to probes is likely determined before washing, by the competition against nonspecific binding. Our competitive hybridization model is a viable alternative to Langmuir-like models. (comment)
Gao, Ting; Sun, Zhiying; Yao, Hui; Song, Jingyuan; Zhu, Yingjie; Ma, Xinye; Chen, Shilin
In this study, we tested the applicability of the core DNA barcode MATK for identifying species within the Fabaceae family. Based on an evaluation of genetic variation, DNA barcoding gaps, and species discrimination power, MATK is a useful barcode for Fabaceae species. Of 1355 plant samples collected from 1079 species belonging to 409 diverse genera, MATK precisely identified approximately 80 % and 96 % of them at the species and genus levels, respectively. Therefore, our research indicates that the MATK region is a valuable marker for plant species within Fabaceae. © Georg Thieme Verlag KG Stuttgart · New York.
Lee A Weigt
Full Text Available This paper represents a DNA barcode data release for 3,400 specimens representing 521 species of fishes from 6 areas across the Caribbean and western central Atlantic regions (FAO Region 31. Merged with our prior published data, the combined efforts result in 3,964 specimens representing 572 species of marine fishes and constitute one of the most comprehensive DNA barcoding "coverages" for a region reported to date. The barcode data are providing new insights into Caribbean shorefish diversity, allowing for more and more accurate DNA-based identifications of larvae, juveniles, and unknown specimens. Examples are given correcting previous work that was erroneous due to database incompleteness.
Cho, Ji Ung; Wu, Jun-Hua; Min, Ji Hyun; Lee, Ju Hun; Liu, Hong-Ling; Kim, Young Keun
We have studied the effect of an external magnetic field applied during electrodeposition of Co/Cu barcode nanowires in anodic aluminum oxide nanotemplates. The magnetic properties of the barcode nanowires were greatly enhanced for 50 nm pore diameter regardless of segment aspect ratio, but field deposition has little effect on the 200 nm nanowires. The magnetic improvement is correlated with a structural change, attributed to field modification of the growth habit of the barcode nanowires. A mechanism of growth subject to geometric confinement is proposed.
Ullal, Adeeti V; Weissleder, Ralph
We describe a DNA-barcoded antibody sensing technique for single cell protein analysis in which the barcodes are photocleaved and digitally detected without amplification steps (Ullal et al., Sci Transl Med 6:219, 2014). After photocleaving the unique ~70 mer DNA barcodes we use a fluorescent hybridization technology for detection, similar to what is commonly done for nucleic acid readouts. This protocol offers a simple method for multiplexed protein detection using 100+ antibodies and can be performed on clinical samples as well as single cells.
Marescaux, Jonathan; Van Doninck, Karine
The zebra mussel (Dreissena polymorpha) and the quagga mussel (Dreissena rostriformis bugensis) are considered the most competitive invaders in freshwaters of Europe and North America. Although shell characteristics exist to differentiate the two species, phenotypic plasticity in the genus Dreissena does not always allow a clear identification. An accurate identification method is therefore essential. DNA barcoding has been proven to be an adequate procedure to discriminate species. The cytochrome c oxidase subunit I mitochondrial gene (COI) is considered the standard barcode for animals. We tested the use of this gene as an efficient DNA barcode and found that it allows rapid and accurate identification of adult Dreissena individuals.
Levinson, S; Shemesh, Y; Ankry, N; Assido, H; German, U; Peled, O [Israel Atomic Energy Commission, Beersheba (Israel). Nuclear Research Center-Negev
A bar-code laser system for sample number reading was integrated into the FAG Alpha-Beta automatic counting system. The sample identification by means of an attached bar-code label enables unmistakable and reliable attribution of results to the counted sample. Installation of the bar-code reader system required several modifications: mechanical changes in the automatic sample changer, design and production of new sample holders, modification of the sample planchettes, changes in the electronic system, and an update of the operating software of the system.
Lukjancenko, Oksana; Ussery, David
-density microarray chip has been designed, using 116 Enterobacteriaceae genome sequences, taking into account the enteric pan-genome. Probes for the microarray were checked in silico and performance of the chip, based on experimental strains from four different genera, demonstrate a relatively high ability...... to distinguish those strains on genus, species, and pathotype/serovar levels. Additionally, the microarray performed well when investigating which genes were found in a given strain of interest. The Enterobacteriaceae pan-genome microarray, based on 116 genomes, provides a valuable tool for determination...
Zwinderman Aeilko H
Background When DNA microarray data are used for gene clustering, genotype/phenotype correlation studies, or tissue classification the signal intensities are usually transformed and normalized in several steps in order to improve comparability and signal/noise ratio. These steps may include subtraction of an estimated background signal, subtracting the reference signal, smoothing (to account for nonlinear measurement effects), and more. Different authors use different approaches, and it is generally not clear to users which method they should prefer. Results We used the ratio between biological variance and measurement variance (which is an F-like statistic) as a quality measure for transformation methods, and we demonstrate a method for maximizing that variance ratio on real data. We explore a number of transformation issues, including Box-Cox transformation, baseline shift, partial subtraction of the log-reference signal and smoothing. It appears that the optimal choice of parameters for the transformation methods depends on the data. Further, the behavior of the variance ratio, under the null hypothesis of zero biological variance, appears to depend on the choice of parameters. Conclusions The use of replicates in microarray experiments is important. Adjustment for the null-hypothesis behavior of the variance ratio is critical to the selection of transformation method.
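The variance-ratio idea in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the exact form of the statistic (variance of per-sample means divided by the mean within-sample replicate variance), the function names, the grid of Box-Cox parameters and the example intensities are all assumptions made for illustration.

```python
import math
from statistics import mean, variance


def box_cox(x, lam):
    """Box-Cox transform of a positive intensity x (log transform when lam == 0)."""
    return math.log(x) if lam == 0 else (x ** lam - 1.0) / lam


def variance_ratio(replicates, lam):
    """Assumed F-like statistic: biological variance / measurement variance.

    replicates maps each biological sample to a list of at least two
    replicate intensities for one gene.
    """
    transformed = [[box_cox(v, lam) for v in vals] for vals in replicates.values()]
    between = variance([mean(vals) for vals in transformed])  # across samples
    within = mean(variance(vals) for vals in transformed)     # across replicates
    return between / within


def best_lambda(replicates, grid):
    """Pick the Box-Cox parameter in grid that maximizes the variance ratio."""
    return max(grid, key=lambda lam: variance_ratio(replicates, lam))


# Hypothetical intensities: three samples, two replicates each.
example = {"a": [100.0, 110.0], "b": [200.0, 220.0], "c": [400.0, 440.0]}
```

Calling `best_lambda(example, [0, 0.25, 0.5, 1.0])` returns the grid value with the largest ratio for this one gene. As the abstract cautions, the ratio behaves differently under the null hypothesis for different parameter choices, so a raw maximum like this would still need the adjustment the authors describe.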
Singh, Hemant Kumar; Parveen, Iffat; Raghuvanshi, Saurabh; Babbar, Shashi B
Based on the testing of several loci, predominantly against floristic backgrounds, individual or different combinations of loci have been suggested as possible universal DNA barcodes for plants. The present investigation was undertaken to check the applicability of the recommended locus/loci for congeneric species with Dendrobium species as an illustrative example. Six loci, matK, rbcL, rpoB, rpoC1, and the trnH-psbA spacer from the chloroplast genome and ITS from the nuclear genome, were compared for their amplification, sequencing and species discrimination success rates among multiple accessions of 36 Dendrobium species. The trnH-psbA spacer could not be considered for analysis as good quality sequences were not obtained with its forward primer. Among the tested loci, ITS, recommended by some as a possible barcode for plants, provided 100% species identification. Another locus, matK, also recommended as a universal barcode for plants, resolved 80.56% of species. ITS remained the best even when sequences of investigated loci of additional Dendrobium species available on the NCBI GenBank (93, 33, 20, 18 and 17 of ITS, matK, rbcL, rpoB and rpoC1, respectively) were also considered for calculating the percent species resolution capabilities. The species discrimination of various combinations of the loci was also compared based on the 36 investigated species and an additional 16 for which sequences of all five loci were available on GenBank. The two-locus combination of matK+rbcL recommended by the Plant Working Group of Consortium for Barcoding of Life (CBOL) could discriminate 86.11% of the 36 species. The species discriminating ability of this barcode was reduced to 80.77% when additional sequences available on NCBI were included in the analysis. Among the recommended combinations, the barcode based on three loci (matK, rpoB and rpoC1) resolved the maximum number of species. Any recommended barcode based on the loci tested so far is not likely to provide 100% species identification
Dang, Ning-Xin; Sun, Feng-Hui; Lv, Yun-Yun; Zhao, Bo-Han; Wang, Ji-Chao; Murphy, Robert W; Wang, Wen-Zhi; Li, Jia-Tang
The DNA barcoding gene COI (cytochrome c oxidase subunit I) effectively identifies many species. Herein, we barcoded 172 individuals from 37 species belonging to nine genera in Rhacophoridae to test if the gene serves equally well to identify species of tree frogs. Phenetic neighbor joining and phylogenetic Bayesian inference were used to construct phylogenetic trees, which resolved all nine genera as monophyletic taxa except for Rhacophorus, two new matrilines for Liuixalus, and Polypedates leucomystax species complex. Intraspecific genetic distances ranged from 0.000 to 0.119 and interspecific genetic distances ranged from 0.015 to 0.334. Within Rhacophorus and Kurixalus, the intra- and interspecific genetic distances did not reveal an obvious barcode gap. Nevertheless, we found that COI sequences unambiguously identified rhacophorid species and helped to discover likely new cryptic species via the synthesis of genealogical relationships and divergence patterns. Our results support COI as an effective DNA barcoding marker for Rhacophoridae.
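The barcode-gap check mentioned in this abstract can be sketched as follows. This is an illustrative simplification, not the study's pipeline: it uses the uncorrected p-distance on pre-aligned sequences (the distance model used by the authors is not stated here), and the function names and toy sequences are hypothetical.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def barcode_gap(records):
    """records: list of (species_name, aligned_sequence) pairs.

    Returns (max_intraspecific, min_interspecific, has_gap), where a gap
    exists when every between-species distance exceeds every
    within-species distance.
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        (intra if sp1 == sp2 else inter).append(p_distance(s1, s2))
    max_intra = max(intra, default=0.0)
    min_inter = min(inter, default=float("inf"))
    return max_intra, min_inter, min_inter > max_intra


# Hypothetical toy alignment: two species, two individuals each.
toy = [
    ("A", "AAAAAAAAAA"),
    ("A", "AAAAAAAAAT"),
    ("B", "AAAAATTTTT"),
    ("B", "AAAAATTTTA"),
]
```

For the toy data a gap is present. Applied to the pooled ranges reported above, the same criterion would find none, because the smallest interspecific distance (0.015) is below the largest intraspecific distance (0.119).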
Hill, Haley D.; Vega, Rafael A.; Mirkin, Chad A.
The detection of bacterial genomic DNA through a non-enzymatic nanomaterials based amplification method, the bio-barcode assay, is reported. The assay utilizes oligonucleotide functionalized magnetic microparticles to capture the target of interest from the sample. A critical step in the new assay involves the use of blocking oligonucleotides during heat denaturation of the double stranded DNA. These blockers bind to specific regions of the target DNA upon cooling, and prevent the duplex DNA from re-hybridizing, which allows the particle probes to bind. Following target isolation using the magnetic particles, oligonucleotide functionalized gold nanoparticles act as target recognition agents. The oligonucleotides on the nanoparticle (barcodes) act as amplification surrogates. The barcodes are then detected using the Scanometric method. The limit of detection for this assay was determined to be 2.5 femtomolar, and this is the first demonstration of a barcode type assay for the detection of double stranded, genomic DNA. PMID:17927207
Marsic, Damien; Méndez-Gómez, Héctor R; Zolotukhin, Sergei
Biodistribution analysis is a key step in the evaluation of adeno-associated virus (AAV) capsid variants, whether natural isolates or produced by rational design or directed evolution. Indeed, when screening candidate vectors, accurate knowledge about which tissues are infected and how efficiently is essential. We describe the design, validation, and application of a new vector, pTR-UF50-BC, encoding a bioluminescent protein, a fluorescent protein and a DNA barcode, which can be used to visualize localization of transduction at the organism, organ, tissue, or cellular levels. In addition, by linking capsid variants to different barcoded versions of the vector and amplifying the barcode region from various tissue samples using barcoded primers, biodistribution of viral genomes can be analyzed with high accuracy and efficiency.
Samerpitak, Kittipan; Gerrits van den Ende, Bert H G; Stielow, J Benjamin; Menken, Steph B J; de Hoog, G Sybren
The genera Ochroconis and Verruconis (Sympoventuriaceae, Venturiales) have remarkably high molecular diversity despite relatively high degrees of phenotypic similarity. Tree topologies, inter-specific and intra-specific heterogeneities, barcoding gaps and reciprocal monophyly of all currently known
Simultaneously detecting CRISPR-based perturbations and induced transcriptional changes in the same cell is a powerful approach to unraveling genome function. Several lentiviral approaches have been developed, some of which rely on the detection of distally located genetic barcodes as an indirect proxy of sgRNA identity. Since barcodes are often several kilobases from their corresponding sgRNAs, viral recombination-mediated swapping of barcodes and sgRNAs is feasible. Using a self-circularization-based sgRNA-barcode library preparation protocol, we estimate the recombination rate to be ~50% and we trace this phenomenon to the pooled viral packaging step. Recombination is random, and decreases the signal-to-noise ratio of the assay. Our results suggest that alternative approaches can increase the throughput and sensitivity of single-cell perturbation assays.
Ondrejicka, Danielle A; Locke, Sean A; Morey, Kevin; Borisenko, Alex V; Hanner, Robert H
For over 10 years, DNA barcoding has been used to identify specimens and discern species. Its potential benefits in parasitology were recognized early, but its utility and uptake remain unclear. Here we review studies using DNA barcoding in parasites and vectors affecting humans and find that the technique is accurate (accords with author identifications based on morphology or other markers) in 94-95% of cases, although aspects of DNA barcoding (vouchering, marker implicated) have often been misunderstood. In a newly compiled checklist of parasites, vectors, and hazards, barcodes are available for 43% of all 1403 species and for more than half of 429 species of greater medical importance. This is encouraging coverage that would improve with an active campaign targeting parasites and vectors. Copyright © 2014 Elsevier Ltd. All rights reserved.
Identification by DNA barcoding is more likely to be erroneous when it is based on a large distance between the query (the barcode sequence of the specimen to identify) and its best match in a reference barcode library. The number of such false positive identifications can be decreased by setting a distance threshold above which identification has to be rejected. To this end, we proposed recently to use an ad hoc distance threshold producing identifications with an estimated relative error probability that can be fixed by the user (e.g. 5%). Here we introduce two R functions that automate the calculation of ad hoc distance thresholds for reference libraries of DNA barcodes. The scripts of both functions, a user manual and an example file are available on the JEMU website (http://jemu.myspecies.info/computer-programs) as well as on the comprehensive R archive network (CRAN, http://cran.r-project.org).
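The idea of rejecting identifications above a distance threshold can be sketched in Python. This is a deliberately simplified stand-in and does not reproduce the R functions described above: it assumes a list of best-match results whose correctness is already known (for example from leave-one-out tests on the reference library), and it simply picks the largest observed distance at which the error rate among accepted identifications stays within the user's limit. All names and data are hypothetical.

```python
def relative_error(matches, threshold):
    """Error rate among identifications accepted at the given threshold.

    matches: list of (distance_to_best_match, is_correct) pairs.
    """
    accepted = [ok for dist, ok in matches if dist <= threshold]
    if not accepted:
        return 0.0
    return accepted.count(False) / len(accepted)


def pick_threshold(matches, max_error=0.05):
    """Largest observed distance whose accepted set keeps the error <= max_error.

    Returns None if no observed distance satisfies the limit.
    """
    best = None
    for t in sorted({dist for dist, _ in matches}):
        if relative_error(matches, t) <= max_error:
            best = t
    return best


# Hypothetical leave-one-out results: (distance to best match, correct?)
results = [(0.00, True), (0.01, True), (0.02, True), (0.03, False), (0.10, False)]
```

With these toy results and a 5% limit, queries whose best match lies further than 0.02 away would be rejected.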
Rigorous diagnostics and documentation of fungal species are fundamental to their conservation. During the course of a species-level study of UK waxcap (Hygrophoraceae) diversity, two previously unrecognized species were discovered. We describe Gliophorus europerplexus sp. nov. and G. reginae sp. nov., respectively orange–brown and purple–pink waxcap mushrooms, from nutrient-poor grasslands in Britain. Both share some morphological features with specimens assigned to Gliophorus (=Hygrocybe) psittacinus. However, analysis of sequences of the nuclear ITS DNA barcode region from these and related taxa confirms the phylogenetic distinctness of these lineages. Furthermore, we demonstrated that the holotype of Hygrophorus perplexus, a North American species morphologically resembling G. europerplexus, is phylogenetically divergent from all our collections. It is likely that further collections of G. europerplexus will be revealed by sequencing European material currently filed under G. perplexus and its synonyms. However, two such collections in the Kew fungarium yielded sequences that clustered together but were divergent from those of G. europerplexus, G. perplexus and G. psittacinus and may represent a further novel taxon. By contrast, G. reginae is morphologically distinct and can usually be recognized in the field by its purplish viscid pileus and relatively stout, flexuose, pale stipe. It is named to commemorate the diamond jubilee of Her Majesty Queen Elizabeth II in 2012 and the 60th anniversary of her coronation in 2013.
Cancers often involve the synergistic effects of gene–gene interactions, but identifying these interactions remains challenging. Here, we present an odds ratio-based genetic algorithm (OR-GA) that is able to solve the problems associated with the simultaneous analysis of multiple independent single nucleotide polymorphisms (SNPs) that are associated with oral cancer. The interactions between four SNPs (rs1799782, rs2040639, rs861539, and rs2075685), belonging to four genes (XRCC1, XRCC2, XRCC3, and XRCC4, respectively), were tested in this study. The GA decomposes the SNP sets into different SNP combinations with their corresponding genotypes (called SNP barcodes). The GA can effectively identify a specific SNP barcode that has an optimized fitness value and uses this to calculate the difference between the case and control groups. The SNP barcodes with a low fitness value are naturally removed from the population. Using two to four SNPs, the best SNP barcodes with maximum differences in occurrence between the case and control groups were generated by the GA. Subsequently, the OR provides a quantitative measure of the multiple SNP synergies between the oral cancer and control groups by calculating the risk related to the best SNP barcodes and others. When these were compared to their corresponding non-SNP barcodes, the estimated ORs for oral cancer were found to be greater than 1 [approx. 1.72–2.23; confidence intervals (CIs): 0.94–5.30, p < 0.03–0.07] for various specific SNP barcodes with two to four SNPs. In conclusion, the proposed OR-GA method successfully generates SNP barcodes, which allow oral cancer risk to be evaluated, and in the process identifies possible SNP–SNP interactions.
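The odds ratio step can be made concrete with a short Python sketch. It shows only the standard 2x2-table odds ratio with a Woolf (log-based) 95% confidence interval; whether the study used this interval method is an assumption, the genetic-algorithm search itself is not reproduced, and the counts below are invented for illustration.

```python
import math


def odds_ratio(case_with, case_without, ctrl_with, ctrl_without, z=1.96):
    """Odds ratio and Woolf confidence interval for one SNP barcode.

    case_with / ctrl_with: cases / controls carrying the barcode.
    case_without / ctrl_without: cases / controls not carrying it.
    All four counts must be positive.
    """
    estimate = (case_with * ctrl_without) / (case_without * ctrl_with)
    se_log = math.sqrt(
        1 / case_with + 1 / case_without + 1 / ctrl_with + 1 / ctrl_without
    )
    lower = math.exp(math.log(estimate) - z * se_log)
    upper = math.exp(math.log(estimate) + z * se_log)
    return estimate, lower, upper


# Hypothetical counts, not taken from the study.
estimate, lower, upper = odds_ratio(20, 80, 10, 90)
```

An interval whose lower bound falls below 1, as in the 0.94–5.30 range quoted above, means the data are also compatible with no increase in risk for that barcode.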
Chambers, E Anne; Hebert, Paul D N
High rates of species discovery and loss have led to the urgent need for more rapid assessment of species diversity in the herpetofauna. DNA barcoding allows for the preliminary identification of species based on sequence divergence. Prior DNA barcoding work on reptiles and amphibians has revealed higher biodiversity counts than previously estimated due to cases of cryptic and undiscovered species. Past studies have provided DNA barcodes for just 14% of the North American herpetofauna, revealing the need for expanded coverage. This study extends the DNA barcode reference library for North American herpetofauna, assesses the utility of this approach in aiding species delimitation, and examines the correspondence between current species boundaries and sequence clusters designated by the BIN system. Sequences were obtained from 730 specimens, representing 274 species (43%) from the North American herpetofauna. Mean intraspecific divergences were 1% and 3%, while average congeneric sequence divergences were 16% and 14% in amphibians and reptiles, respectively. BIN assignments corresponded with current species boundaries in 79% of amphibians, 100% of turtles, and 60% of squamates. Deep divergences (>2%) were noted in 35% of squamate and 16% of amphibian species, and low divergences (reptiles and 23% of amphibians, patterns reflected in BIN assignments. Sequence recovery declined with specimen age, and variation in recovery success was noted among collections. Within collections, barcodes effectively flagged seven mislabeled tissues, and barcode fragments were recovered from five formalin-fixed specimens. This study demonstrates that DNA barcodes can effectively flag errors in museum collections, while BIN splits and merges reveal taxa belonging to deeply diverged or hybridizing lineages. This study is the first effort to compile a reference library of DNA barcodes for herpetofauna on a continental scale.
Song, Ming; Dong, Gang-Qiang; Zhang, Ya-Qin; Liu, Xia; Sun, Wei
Most Chinese medicinal herbs are subjected to traditional processing procedures, including stir-frying, charring, steaming, boiling, and calcining before they are released into dispensaries. The marketing and identification of processed medicinal materials is a growing issue in the marketplace. However, conventional methods of identification have limitations, while DNA mini-barcoding, based on the sequencing of a short-standardized region, has received considerable attention as a new potential means to identify processed medicinal materials. In the present study, six DNA barcode loci including ITS2, psbA-trnH, rbcL, matK, trnL (UAA) intron and its P6 loop, were employed for the authentication of 45 processed samples belonging to 15 species. We evaluated the amplification efficiency of each locus. We also examined the identification accuracy of the potential mini-barcode locus, the trnL (UAA) intron P6 loop. Our results showed that the five primary barcode loci were successfully amplified in only 8.89%-20% of the processed samples, while the trnL (UAA) intron P6 loop amplified at a higher rate, 75.56%. We compared the mini-barcode sequences with GenBank using the BLAST program. The analysis showed that 45.23% of samples could be identified to genus level, while only one sample could be identified to the species level. We conclude that the trnL (UAA) P6 loop is a candidate mini-barcode that has shown its potential and may become a universal mini-barcode, complementing other barcodes for authenticity testing, and will play an important role in medicinal materials control. Copyright © 2017 China Pharmaceutical University. Published by Elsevier B.V. All rights reserved.
Rupert A Collins
Poorly regulated international trade in ornamental fishes poses risks to both biodiversity and economic activity via invasive alien species and exotic pathogens. Border security officials need robust tools to confirm identifications, often requiring hard-to-obtain taxonomic literature and expertise. DNA barcoding offers a potentially attractive tool for quarantine inspection, but has yet to be scrutinised for aquarium fishes. Here, we present a barcoding approach for ornamental cyprinid fishes by: (1) expanding current barcode reference libraries; (2) assessing barcode congruence with morphological identifications under numerous scenarios (e.g. inclusion of GenBank data, presence of singleton species, choice of analytical method); and (3) providing supplementary information to identify difficult species. We sampled 172 ornamental cyprinid fish species from the international trade, and provide data for 91 species currently unrepresented in reference libraries (GenBank/BOLD). DNA barcodes were found to be highly congruent with our morphological assignments, achieving success rates of 90-99%, depending on the method used (neighbour-joining monophyly, bootstrap, nearest neighbour, GMYC, percent threshold). Inclusion of data from GenBank (additional 157 spp.) resulted in a more comprehensive library, but at a cost to success rate due to the increased number of singleton species. In addition to DNA barcodes, our study also provides supporting data in the form of specimen images, morphological characters, taxonomic bibliography, preserved vouchers, and nuclear rhodopsin sequences. Using this nuclear rhodopsin data we also uncovered evidence of interspecific hybridisation, and highlighted unrecognised diversity within popular aquarium species, including the endangered Indian barb Puntius denisonii. We demonstrate that DNA barcoding provides a highly effective biosecurity tool for rapidly identifying ornamental fishes. In cases where DNA barcodes are unable to
Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina
Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.
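The cross-platform agreement reported in this abstract can be quantified with a few lines of Python. The counts (105, 42 and 9) come from the abstract; the choice of the Jaccard index and per-platform fractions as summary measures, and the function names, are assumptions made for illustration rather than the authors' analysis.

```python
def shared_genes(degs_a, degs_b):
    """Genes called differentially expressed on both platforms."""
    return set(degs_a) & set(degs_b)


def overlap_summary(n_a, n_b, n_common):
    """Summarize agreement between two DEG lists from their sizes."""
    union = n_a + n_b - n_common
    return {
        "jaccard": n_common / union,    # shared / all DEGs on either platform
        "fraction_a": n_common / n_a,   # share of platform A DEGs confirmed by B
        "fraction_b": n_common / n_b,   # share of platform B DEGs confirmed by A
    }


# CodeLink: 105 DEGs, GeneChip: 42 DEGs, 9 in common (from the abstract).
summary = overlap_summary(105, 42, 9)
```

Here 9 shared genes out of 138 distinct DEGs give a Jaccard index of about 0.065, which makes the platform dependence of the results explicit.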
Background We report an attempt to extend the previously successful approach of combining SNP (single nucleotide polymorphism) microarrays and DNA pooling (SNP-MaP) employing high-density microarrays. Whereas earlier studies employed a range of Affymetrix SNP microarrays comprising from 10 K to 500 K SNPs, this most recent investigation used the 6.0 chip which displays 906,600 SNP probes and 946,000 probes for the interrogation of CNVs (copy number variations). The genotyping assay using the Affymetrix SNP 6.0 array is highly demanding on sample quality due to the small feature size, low redundancy, and lack of mismatch probes. Findings In the first study published so far using this microarray on pooled DNA, we found that pooled cheek swab DNA could not accurately predict real allele frequencies of the samples that comprised the pools. In contrast, the allele frequency estimates using blood DNA pools were reasonable, although inferior compared to those obtained with previously employed Affymetrix microarrays. However, it might be possible to improve performance by developing improved analysis methods. Conclusions Despite the decreasing costs of genome-wide individual genotyping, the pooling approach may have applications in very large-scale case-control association studies. In such cases, our study suggests that high-quality DNA preparations and lower density platforms should be preferred.
Zhao, Xiaobo; Pang, Shaojun; Shan, Tifeng; Liu, Feng
This study is part of the endeavor to construct a comprehensive DNA barcoding database for common seaweeds in China. Red seaweeds have simple morphology and anatomy, so identifying them from morphological characteristics alone is sometimes difficult. In recent years, the DNA barcode technique has become an increasingly effective tool to help solve some of the taxonomic difficulties. Some DNA markers such as COI (cytochrome oxidase subunit I) are proposed as standardized DNA barcodes for all seaweed species. In this study, COI, UPA (universal plastid amplicon, domain V of 23S rRNA), and ITS (nuclear internal transcribed spacer) were employed to analyze common species of intertidal red seaweeds in Qingdao (119.3°-121°E, 35.35°-37.09°N). The applicability of using one or a few combined barcodes to identify red seaweed species was tested. The results indicated that COI is a sensitive marker at species level. However, not all the tested species gave PCR amplification products, owing to the lack of universal primers. The second barcode, UPA, had effective universal primers but still needs to be tested for its effectiveness in resolving closely related species. More than one ITS sequence type was found in some species in this investigation, which might lead to confusion in further analysis. Therefore the ITS sequence is not recommended as a universal barcode for seaweed identification.
Hausmann, Axel; Haszprunar, Gerhard; Hebert, Paul D. N.
Background The State of Bavaria is involved in a research program that will lead to the construction of a DNA barcode library for all animal species within its territorial boundaries. The present study provides a comprehensive DNA barcode library for the Geometridae, one of the most diverse of insect families. Methodology/Principal Findings This study reports DNA barcodes for 400 Bavarian geometrid species, 98 per cent of the known fauna, and approximately one per cent of all Bavarian animal species. Although 98.5% of these species possess diagnostic barcode sequences in Bavaria, records from neighbouring countries suggest that species-level resolution may be compromised in up to 3.5% of cases. All taxa which apparently share barcodes are discussed in detail. One case of modest divergence (1.4%) revealed a species overlooked by the current taxonomic system: Eupithecia goossensiata Mabille, 1869 stat.n. is raised from synonymy with Eupithecia absinthiata (Clerck, 1759) to species rank. Deep intraspecific sequence divergences (>2%) were detected in 20 traditionally recognized species. Conclusions/Significance The study emphasizes the effectiveness of DNA barcoding as a tool for monitoring biodiversity. Open access is provided to a data set that includes records for 1,395 geometrid specimens (331 species) from Bavaria, with 69 additional species from neighbouring regions. Taxa with deep intraspecific sequence divergences are undergoing more detailed analysis to ascertain if they represent cases of cryptic diversity. PMID:21423340
Ekrem, Torbjørn; Stur, Elisabeth
Chironomidae (Diptera) pupal exuviae samples are commonly used for biological monitoring of aquatic habitats. DNA barcoding has proved useful for species identification of chironomid life stages containing cellular tissue, but the barcoding success of chironomid pupal exuviae is unknown. We assessed whether standard DNA barcoding could be efficiently used for species identification of chironomid pupal exuviae when compared with morphological techniques and if there were differences in performance between temperate and tropical ecosystems, subfamilies, and tribes. PCR, sequence, and identification success differed significantly between geographic regions and taxonomic groups. For Norway, 27 out of 190 (14.2%) pupal exuviae resulted in high-quality chironomid sequences that matched species. For Costa Rica, 69 out of 190 (36.3%) pupal exuviae resulted in high-quality sequences, but none matched known species. Standard DNA barcoding of chironomid pupal exuviae had limited success in species identification of unknown specimens due to contamination and a lack of matching references in available barcode libraries, especially from Costa Rica. Therefore, we recommend that future biodiversity studies focusing on understudied regions simultaneously use morphological and molecular identification techniques to identify all life stages of chironomids and populate the barcode reference library with identified sequences.
Smurthwaite, Cameron A; Hilton, Brett J; O'Hanlon, Ryan; Stolp, Zachary D; Hancock, Bryan M; Abbadessa, Darin; Stotland, Aleksandr; Sklar, Larry A; Wolkowicz, Roland
The discovery of the green fluorescent protein from Aequorea victoria has revolutionized the field of cell and molecular biology. Since its discovery a growing panel of fluorescent proteins, fluorophores and fluorescent-coupled staining methodologies, have expanded the analytical capabilities of flow cytometry. Here, we exploit the power of genetic engineering to barcode individual cells with genes encoding fluorescent proteins. For genetic engineering, we utilize retroviral technology, which allows for the expression of ectopic genetic information in a stable manner in mammalian cells. We have genetically barcoded both adherent and nonadherent cells with different fluorescent proteins. Multiplexing power was increased by combining both the number of distinct fluorescent proteins, and the fluorescence intensity in each channel. Moreover, retroviral expression has proven to be stable for at least a 6-month period, which is critical for applications such as biological screens. We have shown the applicability of fluorescent barcoded multiplexing to cell-based assays that rely themselves on genetic barcoding, or on classical staining protocols. Fluorescent genetic barcoding gives the cell an inherited characteristic that distinguishes it from its counterpart. Once cell lines are developed, no further manipulation or staining is required, decreasing time, nonspecific background associated with staining protocols, and cost. The increasing number of discovered and/or engineered fluorescent proteins with unique absorbance/emission spectra, combined with the growing number of detection devices and lasers, increases multiplexing versatility, making fluorescent genetic barcoding a powerful tool for flow cytometry-based analysis. © 2013 International Society for Advancement of Cytometry.
Calculating attendance is important because attendance level is one indicator of a person's credibility. At a university, for example, a student's lecture attendance record is an important component of assessment. Manual attendance systems are considered ineffective. This research presents the design of an attendance system that uses barcodes as input data representing attendance. The system is supported by three main components: a barcode on the student card (KTM), a CD-108E series CCD barcode scanner, and a computer. Managing the attendance list with this system also makes fuller use of the KTM. The system was tested with several KTMs at a variety of scanner distances and positions relative to the barcode. The tests showed that the ideal reading position is with the scanner 2 cm from the card at a 90-degree angle; at this position, accuracy reached 100%.
Myhrvold, Cameron; Baym, Michael; Hanikel, Nikita; Ong, Luvena L.; Gootenberg, Jonathan S.; Yin, Peng
Collections of DNA sequences can be rationally designed to self-assemble into predictable three-dimensional structures. The geometric and functional diversity of DNA nanostructures created to date has been enhanced by improvements in DNA synthesis and computational design. However, existing methods for structure characterization typically image the final product or laboriously determine the presence of individual, labelled strands using gel electrophoresis. Here we introduce a new method of structure characterization that uses barcode extension and next-generation DNA sequencing to quantitatively measure the incorporation of every strand into a DNA nanostructure. By quantifying the relative abundances of distinct DNA species in product and monomer bands, we can study the influence of geometry and sequence on assembly. We have tested our method using 2D and 3D DNA brick and DNA origami structures. Our method is general and should be extensible to a wide variety of DNA nanostructures.
Diaz, Patricia L; Hennell, James R; Sucher, Nikolaus J
Endophytes live inter- and/or intracellularly inside healthy aboveground tissues of plants without causing disease. Endophytic fungi are found in virtually every vascular plant species examined. The origins of this symbiotic relationship between endophytes go back to the emergence of vascular plants. Endophytic fungi receive nutrition and protection from their hosts while the plants benefit from the production of fungal secondary metabolites, which enhance the host plants' resistance to herbivores, pathogens, and various abiotic stresses. Endophytic fungi have attracted increased interest as potential sources of secondary metabolites with agricultural, industrial, and medicinal use. This chapter provides detailed protocols for isolation of genomic DNA from fungal endophytes and its use in polymerase chain reaction-based amplification of the internal transcribed spacer region between the conserved flanking regions of the small and large subunit of ribosomal RNA for barcoding purposes.
Park, Bum Chul; Kim, Young Keun
With rapid progress in nanotechnology, nanostructured materials have come closer to our life. Single-component nanowires are actively investigated because of their novel properties, attributed to their nanoscale dimensions and adjustable aspect ratio, but their technical limitations cannot be resolved easily. Heterostructured nanomaterials gained attention as alternatives because they can improve the existing single-component structure or add new functions to it. Among them, barcode nanowires (BNWs), comprising at least two different functional segments, can perform multiple functions for use in biomedical sensors, information encoding and security, and catalysts. BNW applications require reliable response to the external field. Hence, researchers have been attempting to improve the reliability of synthesis and regulate the properties precisely. This article highlights the recent progress and prospects for the synthesis, properties, and applications of metallic BNWs with focus on the dependence of the magnetic, optical, and mechanical properties on material, composition, shape, and microstructure.
Full Text Available Abstract Background Large genomes contain families of highly similar genes that cannot be individually identified by microarray probes. This limitation is due to thermodynamic restrictions and cannot be resolved by any computational method. Since gene annotations are updated more frequently than microarrays, another common issue facing microarray users is that existing microarrays must be routinely reanalyzed to determine probes that are still useful with respect to the updated annotations. Results PICKY 2.0 can design shared probes for sets of genes that cannot be individually identified using unique probes. PICKY 2.0 uses novel algorithms to track sharable regions among genes and to strictly distinguish them from other highly similar but nontarget regions during thermodynamic comparisons. Therefore, PICKY does not sacrifice the quality of shared probes when choosing them. The latest PICKY 2.1 includes the new capability to reanalyze existing microarray probes against updated gene sets to determine probes that are still valid to use. In addition, more precise nonlinear salt effect estimates and other improvements are added, making PICKY 2.1 more versatile to microarray users. Conclusions Shared probes allow expressed gene family members to be detected; this capability is generally more desirable than not knowing anything about these genes. Shared probes also enable the design of cross-genome microarrays, which facilitate multiple species identification in environmental samples. The new nonlinear salt effect calculation significantly increases the precision of probes at a lower buffer salt concentration, and the probe reanalysis function improves existing microarray result interpretations.
Microarrays offer biologists an exciting tool that allows the simultaneous assessment of gene expression levels for thousands of genes at once. At the time of their inception, microarrays were hailed as the new dawn in cancer biology and oncology practice with the hope that within a decade diseases
DNA microarray technology is a powerful functional genomics tool increasingly used for investigating global gene expression in environmental studies. Microarrays can also be used in identifying biological networks, as they give insight on the complex gene-to-gene interactions, ne...
Hal, van N.L.W.; Vorst, O.; Houwelingen, van A.M.M.L.; Kok, E.J.; Peijnenburg, A.A.C.M.; Aharoni, A.; Tunen, van A.J.; Keijer, J.
DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed.
Kerr, Kevin C R
The barcode of life project has assembled a tremendous number of mitochondrial cytochrome c oxidase I (COI) sequences. Although these sequences were gathered to develop a DNA-based system for species identification, it has been suggested that further biological inferences may also be derived from this wealth of data. Recurrent selective sweeps have been invoked as an evolutionary mechanism to explain limited intraspecific COI diversity, particularly in birds, but this hypothesis has not been formally tested. In this study, I collated COI sequences from previous barcoding studies on birds and tested them for evidence of selection. Using this expanded data set, I re-examined the relationships between intraspecific diversity and interspecific divergence and sampling effort, respectively. I employed the McDonald-Kreitman test to test for neutrality in sequence evolution between closely related pairs of species. Because amino acid sequences were generally constrained between closely related pairs, I also included broader intra-order comparisons to quantify patterns of protein variation in avian COI sequences. Lastly, using 22 published whole mitochondrial genomes, I compared the evolutionary rate of COI against the other 12 protein-coding mitochondrial genes to assess intragenomic variability. I found no conclusive evidence of selective sweeps. Most evidence pointed to an overall trend of strong purifying selection and functional constraint. The COI protein did vary across the class Aves, but to a very limited extent. COI was the least variable gene in the mitochondrial genome, suggesting that other genes might be more informative for probing factors constraining mitochondrial variation within species. © 2011 Blackwell Publishing Ltd.
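The McDonald-Kreitman comparison mentioned above can be illustrated with a minimal sketch. It assumes the standard 2x2 layout (nonsynonymous vs. synonymous changes, fixed between species vs. polymorphic within species) and uses made-up counts, not data from the study. The function names `neutrality_index` and `fisher_exact_two_sided` are hypothetical helpers written for this illustration.

```python
from math import comb

def neutrality_index(dn, ds, pn, ps):
    """NI = (Pn/Ps) / (Dn/Ds), written as a single ratio to stay exact for integers.

    NI > 1 suggests an excess of nonsynonymous polymorphism (consistent with
    purifying selection); NI < 1 suggests an excess of nonsynonymous fixed
    differences (consistent with positive selection).
    """
    return (pn * ds) / (ps * dn)

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))

# Hypothetical counts for one species pair (illustrative only).
dn, ds = 2, 20   # fixed differences: nonsynonymous, synonymous
pn, ps = 8, 20   # polymorphisms: nonsynonymous, synonymous
ni = neutrality_index(dn, ds, pn, ps)
p_value = fisher_exact_two_sided(dn, ds, pn, ps)
```

With these toy counts the neutrality index is 4.0, the direction expected under purifying selection; whether the departure from neutrality is significant is read from `p_value`.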
Biosensors such as DNA microarrays and microchips are gaining an increasing importance in medicinal, forensic, and environmental analyses. Such devices are based on the detection of supramolecular interactions called hybridizations that occur between complementary oligonucleotides, one linked to a solid surface (the probe), and the other one to be analyzed (the target). This paper focuses on the improvements that hyperbranched and perfectly defined nanomolecules called dendrimers can provide to this methodology. Two main uses of dendrimers for such purpose have been described up to now; either the dendrimer is used as linker between the solid surface and the probe oligonucleotide, or the dendrimer is used as a multilabeled entity linked to the target oligonucleotide. In the first case the dendrimer generally induces a higher loading of probes and an easier hybridization, due to moving away the solid phase. In the second case the high number of localized labels (generally fluorescent) induces an increased sensitivity, allowing the detection of small quantities of biological entities.
Chaudhry, M. Ahmad [Department of Medical Laboratory and Radiation Sciences, College of Nursing and Health Sciences, University of Vermont, 302 Rowell Building, Burlington, VT 05405 (United States) and DNA Microarray Facility, University of Vermont, Burlington, VT 05405 (United States)]
In cell populations exposed to ionizing radiation, the biological effects occur in a much larger proportion of cells than are estimated to be traversed by radiation. It has been suggested that irradiated cells are capable of providing signals to the neighboring unirradiated cells resulting in damage to these cells. This phenomenon is termed the bystander effect. The bystander effect induces persistent, long-term, transmissible changes that result in delayed death and neoplastic transformation. Because the bystander effect is relevant to carcinogenesis, it could have significant implications for risk estimation for radiation exposure. The nature of the bystander effect signal and how it impacts the unirradiated cells remains to be elucidated. Examination of the changes in gene expression could provide clues to understanding the bystander effect and could define the signaling pathways involved in sustaining damage to these cells. The microarray technology serves as a tool to gain insight into the molecular pathways leading to bystander effect. Using medium from irradiated normal human diploid lung fibroblasts as a model system we examined gene expression alterations in bystander cells. The microarray data revealed that the radiation-induced gene expression profile in irradiated cells is different from unirradiated bystander cells suggesting that the pathways leading to biological effects in the bystander cells are different from the directly irradiated cells. The genes known to be responsive to ionizing radiation were observed in irradiated cells. Several genes were upregulated in cells receiving media from irradiated cells. Surprisingly no genes were found to be downregulated in these cells. A number of genes belonging to extracellular signaling, growth factors and several receptors were identified in bystander cells. Interestingly 15 genes involved in the cell communication processes were found to be upregulated. The induction of receptors and the cell
Singh, Anup K.; Throckmorton, Daniel J.; Moran-Mirabal, Jose C.; Edel, Joshua B.; Meyer, Grant D.; Craighead, Harold G.
We present the use of micron-sized lipid domains, patterned onto planar substrates and within microfluidic channels, to assay the binding of bacterial toxins via total internal reflection fluorescence microscopy (TIRFM). The lipid domains were patterned using a polymer lift-off technique and consisted of ganglioside-populated DSPC:cholesterol supported lipid bilayers (SLBs). Lipid patterns were formed on the substrates by vesicle fusion followed by polymer lift-off, which revealed micron-sized SLBs containing either ganglioside GT1b or GM1. The ganglioside-populated SLB arrays were then exposed to either Cholera toxin subunit B (CTB) or Tetanus toxin fragment C (TTC). Binding was assayed on planar substrates by TIRFM down to 1 nM concentration for CTB and 100 nM for TTC. Apparent binding constants extracted from three different models applied to the binding curves suggest that binding of a protein to a lipid-based receptor is strongly affected by the lipid composition of the SLB and by the substrate on which the bilayer is formed. Patterning of SLBs inside microfluidic channels also allowed the preparation of lipid domains with different compositions on a single device. Arrays within microfluidic channels were used to achieve segregation and selective binding from a binary mixture of the toxin fragments in one device. The binding and segregation within the microfluidic channels was assayed with epifluorescence as proof of concept. We propose that the method used for patterning the lipid microarrays on planar substrates and within microfluidic channels can be easily adapted to proteins or nucleic acids and can be used for biosensor applications and cell stimulation assays under different flow conditions. KEYWORDS. Microarray, ganglioside, polymer lift-off, cholera toxin, tetanus toxin, TIRFM, binding constant.
Roy, Sashwati; Sen, Chandan K.
The cDNA microarray technology and related bioinformatics tools present a wide range of novel application opportunities. The technology may be productively applied to address food safety. In this mini-review article, we present an update highlighting the late-breaking discoveries that demonstrate the vitality of cDNA microarray technology as a tool to analyze food safety with reference to microbial pathogens and genetically modified foods. In order to bring the microarray technology to mainstream food safety, it is important to develop robust user-friendly tools that may be applied in a field setting. In addition, there needs to be a standardized process for regulatory agencies to interpret and act upon microarray-based data. The cDNA microarray approach is an emergent technology in diagnostics. Its value lies in being able to provide complementary molecular insight when employed in addition to traditional tests for food safety, as part of a more comprehensive battery of tests
Pedersen, Henriette Lodberg; Fangel, Jonatan Ulrik; McCleary, Barry
Microarrays are powerful tools for high throughput analysis, and hundreds or thousands of molecular interactions can be assessed simultaneously using very small amounts of analytes. Nucleotide microarrays are well established in plant research, but carbohydrate microarrays are much less establish...
Abstract Background High-throughput RNAi screening is widely applied in biological research, but remains expensive and infrastructure-intensive, and conversion of many assays to HTS applications in microplate format is not feasible. Results Here, we describe the optimization of a miniaturized cell spot microarray (CSMA) method, which facilitates utilization of the transfection microarray technique for disparate RNAi analyses. To promote rapid adaptation of the method, the concept has been tested with a panel of 92 adherent cell types, including primary human cells. We demonstrate the method in the systematic screening of 492 GPCR coding genes for impact on growth and survival of cultured human prostate cancer cells. Conclusions The CSMA method facilitates reproducible preparation of highly parallel cell microarrays for large-scale gene knockdown analyses. This will be critical towards expanding the cell based functional genetic screens to include more RNAi constructs, allow combinatorial RNAi analyses, multi-parametric phenotypic readouts or comparative analysis of many different cell types.
Vafaee Sharbaf, Fatemeh; Mosafer, Sara; Moattar, Mohammad Hossein
This paper proposes an approach for gene selection in microarray data. The proposed approach begins with a filter step based on the Fisher criterion, which reduces the initial set of genes and hence the search space and time complexity. Then, a wrapper approach based on cellular learning automata (CLA) optimized with the ant colony method (ACO) is used to find the set of features that improves the classification accuracy. CLA is applied because of its capability to learn and model complicated relationships. The features selected in the last phase are evaluated using the ROC curve, and the smallest feature subset that remains most effective is determined. The classifiers evaluated in the proposed framework are K-nearest neighbor, support vector machine and naïve Bayes. The proposed approach is evaluated on 4 microarray datasets. The evaluations confirm that the proposed approach can find the smallest subset of genes while approaching the maximum accuracy. Copyright © 2016 Elsevier Inc. All rights reserved.
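The filter step described above can be sketched as follows. The abstract does not give the exact form of the Fisher criterion used, so this sketch assumes one common two-class form, the squared difference of class means divided by the sum of class variances. The names `fisher_score` and `top_genes` and the toy expression values are hypothetical.

```python
from statistics import mean, pvariance

def fisher_score(values, labels):
    """Two-class Fisher score for one gene: (mean0 - mean1)^2 / (var0 + var1)."""
    class0 = [v for v, y in zip(values, labels) if y == 0]
    class1 = [v for v, y in zip(values, labels) if y == 1]
    spread = pvariance(class0) + pvariance(class1)
    gap = (mean(class0) - mean(class1)) ** 2
    if spread == 0:
        # No within-class variation: perfectly separating or uninformative.
        return float("inf") if gap > 0 else 0.0
    return gap / spread

def top_genes(expression, labels, k):
    """Rank genes by Fisher score and keep the k best as the reduced search space."""
    scores = {gene: fisher_score(vals, labels) for gene, vals in expression.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]

# Toy expression matrix: gene -> one value per sample (illustrative only).
labels = [0, 0, 1, 1]
expression = {
    "gene_a": [1.0, 1.2, 3.0, 3.2],  # well separated between classes
    "gene_b": [2.0, 2.5, 2.1, 2.4],  # same class means, uninformative
    "gene_c": [1.0, 2.0, 2.0, 3.0],  # weakly separated
}
selected = top_genes(expression, labels, 2)
```

Only the genes kept by such a filter would be passed on to the more expensive wrapper search, which is what keeps the search space and running time manageable.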
Advances in lithographic approaches to fabricating bio-microarrays have been extensively explored over the last two decades. However, the need for pattern flexibility, a high density, a high resolution, affordability and on-demand fabrication is promoting the development of unconventional routes for microarray fabrication. This review highlights the development and uses of a new molecular lithography approach, called “microintaglio printing technology”, for large-scale bio-microarray fabrication using a microreactor array (µRA)-based chip consisting of uniformly-arranged, femtoliter-size µRA molds. In this method, a single-molecule-amplified DNA microarray pattern is self-assembled onto a µRA mold and subsequently converted into a messenger RNA or protein microarray pattern by simultaneously producing and transferring (immobilizing) a messenger RNA or a protein from a µRA mold to a glass surface. Microintaglio printing allows the self-assembly and patterning of in situ-synthesized biomolecules into high-density (kilo-giga-density), ordered arrays on a chip surface with µm-order precision. This holistic aim, which is difficult to achieve using conventional printing and microarray approaches, is expected to revolutionize and reshape proteomics. This review is not written comprehensively, but rather substantively, highlighting the versatility of microintaglio printing for developing a prerequisite platform for microarray technology for the postgenomic era.
Geiger, M F; Herder, F; Monaghan, M T; Almada, V; Barbieri, R; Bariche, M; Berrebi, P; Bohlen, J; Casal-Lopez, M; Delmastro, G B; Denys, G P J; Dettai, A; Doadrio, I; Kalogianni, E; Kärst, H; Kottelat, M; Kovačić, M; Laporte, M; Lorenzoni, M; Marčić, Z; Özuluğ, M; Perdices, A; Perea, S; Persat, H; Porcelotti, S; Puzzi, C; Robalo, J; Šanda, R; Schneider, M; Šlechtová, V; Stoumboudi, M; Walter, S; Freyhof, J
Incomplete knowledge of biodiversity remains a stumbling block for conservation planning and even occurs within globally important Biodiversity Hotspots (BH). Although technical advances have boosted the power of molecular biodiversity assessments, the link between DNA sequences and species and the analytics to discriminate entities remain crucial. Here, we present an analysis of the first DNA barcode library for the freshwater fish fauna of the Mediterranean BH (526 spp.), with virtually complete species coverage (498 spp., 98% extant species). In order to build an identification system supporting conservation, we compared species determination by taxonomists to multiple clustering analyses of DNA barcodes for 3165 specimens. The congruence of barcode clusters with morphological determination was strongly dependent on the method of cluster delineation, but was highest with the general mixed Yule-coalescent (GMYC) model-based approach (83% of all species recovered as GMYC entity). Overall, genetic morphological discontinuities suggest the existence of up to 64 previously unrecognized candidate species. We found reduced identification accuracy when using the entire DNA-barcode database, compared with analyses on databases for individual river catchments. This scale effect has important implications for barcoding assessments and suggests that fairly simple identification pipelines provide sufficient resolution in local applications. We calculated Evolutionarily Distinct and Globally Endangered scores in order to identify candidate species for conservation priority and argue that the evolutionary content of barcode data can be used to detect priority species for future IUCN assessments. We show that large-scale barcoding inventories of complex biotas are feasible and contribute directly to the evaluation of conservation priorities. © 2014 John Wiley & Sons Ltd.
Mary Lynn Baniecki
Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25-40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections.
Parmentier, Ingrid; Duminil, Jérôme; Kuzmina, Maria; Philippe, Morgane; Thomas, Duncan W; Kenfack, David; Chuyong, George B; Cruaud, Corinne; Hardy, Olivier J
DNA barcoding of rain forest trees could potentially help biologists identify species and discover new ones. However, DNA barcodes cannot always distinguish between closely related species, and the size and completeness of barcode databases are key parameters for their successful application. We test the ability of rbcL, matK and trnH-psbA plastid DNA markers to identify rain forest trees at two sites in Atlantic central Africa under the assumption that a database is exhaustive in terms of species content, but not necessarily in terms of haplotype diversity within species. We assess the accuracy of identification to species or genus using a genetic distance matrix between samples either based on a global multiple sequence alignment (GD) or on a basic local alignment search tool (BLAST). Where a local database is available (within a 50 ha plot), barcoding was generally reliable for genus identification (95-100% success), but less for species identification (71-88%). Using a single marker, best results for species identification were obtained with trnH-psbA. There was a significant decrease of barcoding success in species-rich clades. When the local database was used to identify the genus of trees from another region and did include all genera from the query individuals but not all species, genus identification success decreased to 84-90%. The GD method performed best but a global multiple sequence alignment is not applicable on trnH-psbA. Barcoding is a useful tool to assign unidentified African rain forest trees to a genus, but identification to a species is less reliable, especially in species-rich clades, even using an exhaustive local database. Combining two markers improves the accuracy of species identification but it would only marginally improve genus identification. Finally, we highlight some limitations of the BLAST algorithm as currently implemented and suggest possible improvements for barcoding applications.
Hebert, Paul D N; Dewaard, Jeremy R; Zakharov, Evgeny V; Prosser, Sean W J; Sones, Jayme E; McKeown, Jaclyn T A; Mantle, Beth; La Salle, John
DNA barcoding protocols require the linkage of each sequence record to a voucher specimen that has, whenever possible, been authoritatively identified. Natural history collections would seem an ideal resource for barcode library construction, but they have never seen large-scale analysis because of concerns linked to DNA degradation. The present study examines the strength of this barrier, carrying out a comprehensive analysis of moth and butterfly (Lepidoptera) species in the Australian National Insect Collection. Protocols were developed that enabled tissue samples, specimen data, and images to be assembled rapidly. Using these methods, a five-person team processed 41,650 specimens representing 12,699 species in 14 weeks. Subsequent molecular analysis took about six months, reflecting the need for multiple rounds of PCR as sequence recovery was impacted by age, body size, and collection protocols. Despite these variables and the fact that specimens averaged 30.4 years old, barcode records were obtained from 86% of the species. In fact, one or more barcode compliant sequences (>487 bp) were recovered from virtually all species represented by five or more individuals, even when the youngest was 50 years old. By assembling specimen images, distributional data, and DNA barcode sequences on a web-accessible informatics platform, this study has greatly advanced accessibility to information on thousands of species. Moreover, much of the specimen data became publicly accessible within days of its acquisition, while most sequence results saw release within three months. As such, this study reveals the speed with which DNA barcode workflows can mobilize biodiversity data, often providing the first web-accessible information for a species. These results further suggest that existing collections can enable the rapid development of a comprehensive DNA barcode library for the most diverse compartment of terrestrial biodiversity - insects.
Baniecki, Mary Lynn; Faust, Aubrey L.; Schaffner, Stephen F.; Park, Daniel J.; Galinsky, Kevin; Daniels, Rachel F.; Hamilton, Elizabeth; Ferreira, Marcelo U.; Karunaweera, Nadira D.; Serre, David; Zimmerman, Peter A.; Sá, Juliana M.; Wellems, Thomas E.; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E.; Volkman, Sarah K.; Wirth, Dyann F.; Sabeti, Pardis C.
Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25–40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890
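The abstract above reports that the barcode markers are informative because their average minor allele frequency exceeds 0.1. The following is a minimal Python sketch of how such a figure could be computed from genotype calls. It assumes one base call per SNP per sample and uses "N" for a missing call; the function names and the input layout are illustrative and are not taken from the study.

```python
from collections import Counter


def minor_allele_frequency(calls):
    """Return the minor allele frequency for one SNP position.

    `calls` holds one base call per sample; None marks a missing call.
    """
    observed = [c for c in calls if c is not None]
    counts = Counter(observed)
    if len(counts) < 2:
        return 0.0  # monomorphic position (or no data): no minor allele
    # For a biallelic SNP the second most common allele is the minor allele.
    minor_count = counts.most_common(2)[1][1]
    return minor_count / len(observed)


def average_maf(barcodes):
    """Average the per-position MAF over equal-length barcode strings."""
    columns = zip(*barcodes)  # one tuple of calls per SNP position
    mafs = [
        minor_allele_frequency([None if c == "N" else c for c in col])
        for col in columns
    ]
    return sum(mafs) / len(mafs)
```

A marker set whose `average_maf` falls well below the chosen threshold would distinguish few samples from one another, which is why polymorphic positions are preferred when a barcode is assembled.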
Little, Damon P; Knopf, Patrick; Schulz, Christian
We have generated matK, rbcL, and nrITS2 DNA barcodes for 320 specimens representing all 18 extant genera of the conifer family Podocarpaceae. The sample includes 145 of the 198 recognized species. Comparative analyses of sequence quality and species discrimination were conducted on the 159 individuals from which all three markers were recovered (representing 15 genera and 97 species). The vast majority of sequences were of high quality (B30 = 0.596-0.989). Even the lowest quality sequences exceeded the minimum requirements of the BARCODE data standard. In the few instances that low quality sequences were generated, the responsible mechanism could not be discerned. There were no statistically significant differences in the discriminatory power of markers or marker combinations (p = 0.05). The discriminatory power of the barcode markers individually and in combination is low (56.7% of species at maximum). In some instances, species discrimination failed in spite of ostensibly useful variation being present (genotypes were shared among species), but in many cases there was simply an absence of sequence variation. Barcode gaps (maximum intraspecific p-distance < minimum interspecific p-distance) were observed in 50.5% of species when all three markers were considered simultaneously. The presence of a barcode gap was not predictive of discrimination success (p = 0.02) and there was no statistically significant difference in the frequency of barcode gaps among markers (p = 0.05). In addition, there was no correlation between number of individuals sampled per species and the presence of a barcode gap (p = 0.27).
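A barcode gap is conventionally said to exist for a species when its largest within-species distance is smaller than its smallest distance to any other species. The Python sketch below illustrates that test with uncorrected p-distances. It assumes the sequences are already aligned to equal length; the function names and the record layout are hypothetical and do not reproduce the authors' pipeline.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def has_barcode_gap(species, records):
    """Test one species for a barcode gap.

    `records` is a list of (species_name, aligned_sequence) tuples.
    Returns None when the gap is undefined (fewer than two conspecific
    sequences, or no sequences from any other species).
    """
    own = [seq for name, seq in records if name == species]
    others = [seq for name, seq in records if name != species]
    if len(own) < 2 or not others:
        return None
    max_intra = max(p_distance(a, b) for a, b in combinations(own, 2))
    min_inter = min(p_distance(a, b) for a in own for b in others)
    return max_intra < min_inter
```

Species represented by a single sequence return `None` rather than `False`, since the within-species distance cannot be estimated from one individual.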
Schoch, Conrad L; Seifert, Keith A; Huhndorf, Sabine; Robert, Vincent; Spouge, John L; Levesque, C André; Chen, Wen
Six DNA regions were evaluated as potential DNA barcodes for Fungi, the second largest kingdom of eukaryotic life, by a multinational, multilaboratory consortium. The region of the mitochondrial cytochrome c oxidase subunit 1 used as the animal barcode was excluded as a potential marker, because it is difficult to amplify in fungi, often includes large introns, and can be insufficiently variable. Three subunits from the nuclear ribosomal RNA cistron were compared together with regions of three representative protein-coding genes (largest subunit of RNA polymerase II, second largest subunit of RNA polymerase II, and minichromosome maintenance protein). Although the protein-coding gene regions often had a higher percent of correct identification compared with ribosomal markers, low PCR amplification and sequencing success eliminated them as candidates for a universal fungal barcode. Among the regions of the ribosomal cistron, the internal transcribed spacer (ITS) region has the highest probability of successful identification for the broadest range of fungi, with the most clearly defined barcode gap between inter- and intraspecific variation. The nuclear ribosomal large subunit, a popular phylogenetic marker in certain groups, had superior species resolution in some taxonomic groups, such as the early diverging lineages and the ascomycete yeasts, but was otherwise slightly inferior to the ITS. The nuclear ribosomal small subunit has poor species-level resolution in fungi. ITS will be formally proposed for adoption as the primary fungal barcode marker to the Consortium for the Barcode of Life, with the possibility that supplementary barcodes may be developed for particular narrowly circumscribed taxonomic groups.
BACKGROUND: The geometrid moths of Europe are one of the best investigated insect groups in traditional taxonomy, making them an ideal model group to test the accuracy of the Barcode Index Number (BIN) system of BOLD (Barcode of Life Datasystems), a method that supports automated, rapid species delineation and identification. METHODOLOGY/PRINCIPAL FINDINGS: This study provides a DNA barcode library for 219 of the 249 European geometrid moth species (88%) in five selected subfamilies. The data set includes COI sequences for 2130 specimens. Most species (93%) were found to possess diagnostic barcode sequences at the European level while only three species pairs (3%) were genetically indistinguishable in areas of sympatry. As a consequence, 97% of the European species we examined were unequivocally discriminated by barcodes within their natural areas of distribution. We found a 1:1 correspondence between BINs and traditionally recognized species for 67% of these species. Another 17% of the species (15 pairs, three triads) shared BINs, while specimens from the remaining species (18%) were divided among two or more BINs. Five of these species are mixtures, both sharing and splitting BINs. For 82% of the species with two or more BINs, the genetic splits involved allopatric populations, many of which have previously been hypothesized to represent distinct species or subspecies. CONCLUSIONS/SIGNIFICANCE: This study confirms the effectiveness of DNA barcoding as a tool for species identification and illustrates the potential of the BIN system to characterize formal genetic units independently of an existing classification. This suggests the system can be used to efficiently assess the biodiversity of large, poorly known assemblages of organisms. For the moths examined in this study, cases of discordance between traditionally recognized species and BINs arose from several causes including overlooked species, synonymy, and cases where DNA barcodes revealed
Zhou, X.; Robinson, J.L.; Geraci, C.J.; Parker, C.R.; Flint, O.S.; Etnier, D.A.; Ruiter, D.; DeWalt, R.E.; Jacobus, L.M.; Hebert, P.D.N.
Deoxyribonucleic acid (DNA) barcoding is an effective tool for species identification and lifestage association in a wide range of animal taxa. We developed a strategy for rapid construction of a regional DNA-barcode reference library and used the caddisflies (Trichoptera) of the Great Smoky Mountains National Park (GSMNP) as a model. Nearly 1000 cytochrome c oxidase subunit I (COI) sequences, representing 209 caddisfly species previously recorded from GSMNP, were obtained from the global Trichoptera Barcode of Life campaign. Most of these sequences were collected from outside the GSMNP area. Another 645 COI sequences, representing 80 species, were obtained from specimens collected in a 3-d bioblitz (short-term, intense sampling program) in GSMNP. The joint collections provided barcode coverage for 212 species, 91% of the GSMNP fauna. Inclusion of samples from other localities greatly expedited construction of the regional DNA-barcode reference library. This strategy increased intraspecific divergence and decreased average distances to nearest neighboring species, but the DNA-barcode library was able to differentiate 93% of the GSMNP Trichoptera species examined. Global barcoding projects will aid construction of regional DNA-barcode libraries, but local surveys make crucial contributions to progress by contributing rare or endemic species and full-length barcodes generated from high-quality DNA. DNA taxonomy is not a goal of our present work, but the investigation of COI divergence patterns in caddisflies is providing new insights into broader biodiversity patterns in this group and has directed attention to various issues, ranging from the need to re-evaluate species taxonomy with integrated morphological and molecular evidence to the necessity of an appropriate interpretation of barcode analyses and its implications in understanding species diversity (in contrast to a simple claim for barcoding failure).
Background Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. Results An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. Conclusions This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches. PMID:22554201
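The phenotyping principle described above, in which relative growth is read from the changing proportions of barcodes in a pool, can be sketched in Python as follows. The sketch assumes barcode counts (sequencing reads or array signal) at two time points. The function name, the pseudocount, and the log2 scale are illustrative choices and are not taken from the study.

```python
import math


def barcode_fitness(before, after, pseudocount=1):
    """Log2 change in each barcode's share of the pool between time points.

    `before` and `after` map barcode -> count. The pseudocount keeps
    barcodes that drop to zero from producing log2(0).
    """
    barcodes = set(before) | set(after)
    total_before = sum(before.get(b, 0) + pseudocount for b in barcodes)
    total_after = sum(after.get(b, 0) + pseudocount for b in barcodes)
    scores = {}
    for b in barcodes:
        share_before = (before.get(b, 0) + pseudocount) / total_before
        share_after = (after.get(b, 0) + pseudocount) / total_after
        scores[b] = math.log2(share_after / share_before)
    return scores
```

A negative score marks a mutant whose share of the pool shrank under the tested condition, and a score near zero marks a mutant that grew at about the pool average.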
Links, Matthew G; Dumonceaux, Tim J; Hemmingsen, Sean M; Hill, Janet E
Barcoding with molecular sequences is widely used to catalogue eukaryotic biodiversity. Studies investigating the community dynamics of microbes have relied heavily on gene-centric metagenomic profiling using two genes (16S rRNA and cpn60) to identify and track Bacteria. While there have been criteria formalized for barcoding of eukaryotes, these criteria have not been used to evaluate gene targets for other domains of life. Using the framework of the International Barcode of Life we evaluated DNA barcodes for Bacteria. Candidates from the 16S rRNA gene and the protein coding cpn60 gene were evaluated. Within complete bacterial genomes in the public domain representing 983 species from 21 phyla, the largest difference between median pairwise inter- and intra-specific distances ("barcode gap") was found from cpn60. Distribution of sequence diversity along the ∼555 bp cpn60 target region was remarkably uniform. The barcode gap of the cpn60 universal target facilitated the faithful de novo assembly of full-length operational taxonomic units from pyrosequencing data from a synthetic microbial community. Analysis supported the recognition of both 16S rRNA and cpn60 as DNA barcodes for Bacteria. The cpn60 universal target was found to have a much larger barcode gap than 16S rRNA suggesting cpn60 as a preferred barcode for Bacteria. A large barcode gap for cpn60 provided a robust target for species-level characterization of data. The assembly of consensus sequences for barcodes was shown to be a reliable method for the identification and tracking of novel microbes in metagenomic studies.
Yang, Cheng-Hong; Wu, Kuo-Chuan; Dahms, Hans-Uwe; Chuang, Li-Yeh; Chang, Hsueh-Wei
DNA barcodes are widely used in taxonomy, systematics, species identification, food safety, and forensic science. Most of the conventional DNA barcode sequences contain the whole information of a given barcoding gene. Most of the sequence information does not vary and is uninformative for a given group of taxa within a monophylum. We suggest here a method that reduces the amount of noninformative nucleotides in a given barcoding sequence of a major taxon, like the prokaryotes, or eukaryotic animals, plants, or fungi. The actual differences in genetic sequences, called single nucleotide polymorphism (SNP) genotyping, provide a tool for developing a rapid, reliable, and high-throughput assay for the discrimination between known species. Here, we investigated SNPs as robust markers of genetic variation for identifying different pigeon species based on available cytochrome c oxidase I (COI) data. We propose here a decision tree-based SNP barcoding (DTSB) algorithm where SNP patterns are selected from the DNA barcoding sequence of several evolutionarily related species in order to identify a single species with pigeons as an example. This approach can make use of any established barcoding system. We here firstly used as an example the mitochondrial gene COI information of 17 pigeon species (Columbidae, Aves) using DTSB after sequence trimming and alignment. SNPs were chosen which followed the rule of decision tree and species-specific SNP barcodes. The shortest barcode of about 11 bp was then generated for discriminating 17 pigeon species using the DTSB method. This method provides a sequence alignment and tree decision approach to parsimoniously assign a unique and shortest SNP barcode for any known species of a chosen monophyletic taxon where a barcoding sequence is available.
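The idea of reducing a full barcode to a few discriminating SNP positions can be illustrated with a short Python sketch. This is not the authors' DTSB algorithm: it substitutes a simple greedy heuristic that repeatedly picks the alignment column splitting the remaining species groups the most, and it does not guarantee the shortest possible barcode. It assumes one aligned sequence per species; the function name is hypothetical.

```python
def greedy_snp_barcode(seqs):
    """Pick alignment columns whose combined bases are unique per species.

    `seqs` maps species -> aligned sequence (equal lengths assumed).
    Returns the chosen column indices in ascending order.
    """
    names = list(seqs)
    length = len(seqs[names[0]])
    chosen = []
    groups = [names]  # start with every species in one unresolved group
    while any(len(g) > 1 for g in groups):
        best_col, best_groups = None, groups
        for col in range(length):
            if col in chosen:
                continue
            # Split every current group by the base found in this column.
            split = []
            for g in groups:
                by_base = {}
                for n in g:
                    by_base.setdefault(seqs[n][col], []).append(n)
                split.extend(by_base.values())
            if len(split) > len(best_groups):
                best_col, best_groups = col, split
        if best_col is None:
            raise ValueError("species cannot be separated by this alignment")
        chosen.append(best_col)
        groups = best_groups
    return sorted(chosen)
```

Reading the bases at the returned columns gives each species a short, unique string. An exhaustive or tree-based search, as in the method described above, could find a shorter set where the greedy choice is suboptimal.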
Stephen J McKenna
Background: Tissue microarrays (TMAs) are an important tool in translational research for examining multiple cancers for molecular and protein markers. Automatic immunohistochemical (IHC) scoring of breast TMA images remains a challenging problem. Methods: A two-stage approach that involves localization of regions of invasive and in-situ carcinoma followed by ordinal IHC scoring of nuclei in these regions is proposed. The localization stage classifies locations on a grid as tumor or non-tumor based on local image features. These classifications are then refined using an auto-context algorithm called spin-context. Spin-context uses a series of classifiers to integrate image feature information with spatial context information in the form of estimated class probabilities. This is achieved in a rotationally-invariant manner. The second stage estimates ordinal IHC scores in terms of the strength of staining and the proportion of nuclei stained. These estimates take the form of posterior probabilities, enabling images with uncertain scores to be referred for pathologist review. Results: The method was validated against manual pathologist scoring on two nuclear markers, progesterone receptor (PR) and estrogen receptor (ER). Errors for PR data were consistently lower than those achieved with ER data. Scoring was in terms of estimated proportion of cells that were positively stained (scored on an ordinal scale of 0-6) and perceived strength of staining (scored on an ordinal scale of 0-3). Average absolute differences between predicted scores and pathologist-assigned scores were 0.74 for proportion of cells and 0.35 for strength of staining (PR). Conclusions: The use of context information via spin-context improved the precision and recall of tumor localization. The combination of the spin-context localization method with the automated scoring method resulted in reduced IHC scoring errors.
Dobbin, Kevin K; Zhao, Yingdong; Simon, Richard M
A common goal of gene expression microarray studies is the development of a classifier that can be used to divide patients into groups with different prognoses, or with different expected responses to a therapy. These types of classifiers are developed on a training set, which is the set of samples used to train a classifier. The question of how many samples are needed in the training set to produce a good classifier from high-dimensional microarray data is challenging. We present a model-based approach to determining the sample size required to adequately train a classifier. It is shown that sample size can be determined from three quantities: standardized fold change, class prevalence, and number of genes or features on the arrays. Numerous examples and important experimental design issues are discussed. The method is adapted to address ex post facto determination of whether the size of a training set used to develop a classifier was adequate. An interactive web site for performing the sample size calculations is provided. We showed that sample size calculations for classifier development from high-dimensional microarray data are feasible, discussed numerous important considerations, and presented examples.
Background Phytohormones organize plant development and environmental adaptation through cell-to-cell signal transduction, and their action involves transcriptional activation. Recent international efforts to establish and maintain public databases of Arabidopsis microarray data have enabled the utilization of this data in the analysis of various phytohormone responses, providing genome-wide identification of promoters targeted by phytohormones. Results We utilized such microarray data for prediction of cis-regulatory elements with an octamer-based approach. Our test prediction of a drought-responsive RD29A promoter with the aid of microarray data for response to drought, ABA and overexpression of DREB1A, a key regulator of cold and drought response, provided reasonable results that fit with the experimentally identified regulatory elements. Following this success, we expanded the prediction to various phytohormone responses, including those for abscisic acid, auxin, cytokinin, ethylene, brassinosteroid, jasmonic acid, and salicylic acid, as well as for hydrogen peroxide, drought and DREB1A overexpression. In total, 622 promoters that are activated by phytohormones were subjected to the prediction. In addition, we have assigned putative functions to 53 octamers of the Regulatory Element Group (REG) that have been extracted as position-dependent cis-regulatory elements with the aid of their feature of preferential appearance in the promoter region. Conclusions Our prediction of Arabidopsis cis-regulatory elements for phytohormone responses provides guidance for experimental analysis of promoters to reveal the basis of the transcriptional network of phytohormone responses.
Gibbons, Brian; Datta, Parikkhit; Wu, Ying; Chan, Alan; Al Armour, John
Current methods for measurement of copy number do not combine all the desirable qualities of convenience, throughput, economy, accuracy and resolution. In this study, to improve the throughput associated with Multiplex Amplifiable Probe Hybridisation (MAPH) we aimed to develop a modification based on the 3-Dimensional, Flow-Through Microarray Platform from PamGene International. In this new method, electrophoretic analysis of amplified products is replaced with photometric analysis of a probed oligonucleotide array. Copy number analysis of hybridised probes is based on a dual-label approach by comparing the intensity of Cy3-labelled MAPH probes amplified from test samples co-hybridised with similarly amplified Cy5-labelled reference MAPH probes. The key feature of using a hybridisation-based end point with MAPH is that discrimination of amplified probes is based on sequence and not fragment length. In this study we showed that microarray MAPH measurement of PMP22 gene dosage correlates well with PMP22 gene dosage determined by capillary MAPH and that copy number was accurately reported in analyses of DNA from 38 individuals, 12 of which were known to have Charcot-Marie-Tooth disease type 1A (CMT1A). Measurement of microarray-based endpoints for MAPH appears to be of comparable accuracy to electrophoretic methods, and holds the prospect of fully exploiting the potential multiplicity of MAPH. The technology has the potential to simplify copy number assays for genes with a large number of exons, or of expanded sets of probes from dispersed genomic locations.
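The dual-label comparison described above can be sketched in Python as a ratio calculation. The sketch assumes that a set of control probes is present at the reference copy number (two copies) in the test sample, and it uses the median Cy3/Cy5 ratio of those probes to correct for overall dye and loading differences. The function name, the normalization step, and the input layout are assumptions for illustration and are not details given in the abstract.

```python
from statistics import median


def estimate_copy_number(cy3, cy5, control_probes, reference_copies=2):
    """Estimate per-probe copy number from dual-label intensities.

    `cy3` maps probe -> test-sample intensity and `cy5` maps
    probe -> reference-sample intensity for the same probes.
    """
    ratios = {p: cy3[p] / cy5[p] for p in cy3}
    # Control probes are assumed to sit at the reference copy number,
    # so their median ratio sets the scale for all other probes.
    scale = median(ratios[p] for p in control_probes)
    return {p: reference_copies * ratios[p] / scale for p in ratios}
```

Under these assumptions a normalized PMP22 ratio near 1.5 would correspond to three copies, the dosage expected for the PMP22 duplication that underlies CMT1A.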
Tárraga, Joaquín; Medina, Ignacio; Carbonell, José; Huerta-Cepas, Jaime; Minguez, Pablo; Alloza, Eva; Al-Shahrour, Fátima; Vegas-Azcárate, Susana; Goetz, Stefan; Escobar, Pablo; Garcia-Garcia, Francisco; Conesa, Ana; Montaner, David; Dopazo, Joaquín
Gene Expression Profile Analysis Suite (GEPAS) is one of the most complete and extensively used web-based packages for microarray data analysis. During its more than 5 years of activity it has continuously been updated to keep pace with the state-of-the-art in the changing microarray data analysis arena. GEPAS offers diverse analysis options that include well established as well as novel algorithms for normalization, gene selection, class prediction, clustering and functional profiling of the experiment. New options for time-course (or dose-response) experiments, microarray-based class prediction, new clustering methods and new tests for differential expression have been included. The new pipeliner module allows automating the execution of sequential analysis steps by means of a simple but powerful graphic interface. An extensive re-engineering of GEPAS has been carried out which includes the use of web services and Web 2.0 technology features, a new user interface with persistent sessions and a new extended database of gene identifiers. GEPAS is currently the most cited web tool in its field and is used extensively by researchers in many countries; its records indicate an average usage rate of 500 experiments per day. GEPAS is available at http://www.gepas.org. PMID:18508806
Molecular analysis of diet overcomes the considerable limitations of traditional techniques for identifying prey remains in bat faeces. We collected faeces from individual Mountain Long-eared Bats Plecotus macrobullaris trapped using mist nets during the summers of 2009 and 2010 in the Pyrenees. We analysed their diet using DNA mini-barcodes to identify prey species. In addition, we inferred some basic features of the bat's foraging ecology that had not yet been addressed. P. macrobullaris fed almost exclusively on moths (97.8%). As prey we detected one dipteran genus (Tipulidae) and 29 moth taxa: 28 were identified at species level (23 Noctuidae, 1 Crambidae, 1 Geometridae, 1 Pyralidae, 1 Sphingidae, 1 Tortricidae), and one at genus level (Rhyacia sp., Noctuidae). Known ecological information about the prey species allowed us to determine that bats had foraged at elevations between 1,500 and 2,500 m amsl (above mean sea level), mostly in subalpine meadows, followed by other open habitats such as orophilous grasslands and alpine meadows. No forest prey species were identified in the diet. As 96.4% of identified prey species were tympanate moths and no evidence of gleaning behaviour was revealed, we suggest P. macrobullaris probably forages by aerial hawking using faint echolocation pulses to avoid detection by hearing moths. As we could identify 87.8% of the analysed sequences (64.1% of the MOTUs, Molecular Operational Taxonomic Units) at species level, we conclude that DNA mini-barcodes are a very useful tool to analyse the diet of moth-specialist bats.
van Hal, N L; Vorst, O; van Houwelingen, A M; Kok, E J; Peijnenburg, A; Aharoni, A; van Tunen, A J; Keijer, J
DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed. These comprise array manufacturing and design, array hybridisation, scanning, and data handling. Furthermore, it is discussed how DNA microarrays can be applied in the working fields of: safety, functionality and health of food and gene discovery and pathway engineering in plants.
Tanabe, Akifumi S; Toju, Hirokazu
Taxonomic identification of biological specimens based on DNA sequence information (a.k.a. DNA barcoding) is becoming increasingly common in biodiversity science. Although several methods have been proposed, many of them are not universally applicable due to the need for prerequisite phylogenetic/machine-learning analyses, the need for huge computational resources, or the lack of a firm theoretical background. Here, we propose two new computational methods of DNA barcoding and show a benchmark for bacterial/archaeal 16S, animal COX1, fungal internal transcribed spacer, and three plant chloroplast (rbcL, matK, and trnH-psbA) barcode loci that can be used to compare the performance of existing and new methods. The benchmark was performed under two alternative situations: query sequences were available in the corresponding reference sequence databases in one, but were not available in the other. In the former situation, the commonly used "1-nearest-neighbor" (1-NN) method, which assigns the taxonomic information of the most similar sequences in a reference database (i.e., BLAST-top-hit reference sequence) to a query, displays the highest rate and highest precision of successful taxonomic identification. However, in the latter situation, the 1-NN method produced extremely high rates of misidentification for all the barcode loci examined. In contrast, one of our new methods, the query-centric auto-k-nearest-neighbor (QCauto) method, consistently produced low rates of misidentification for all the loci examined in both situations. These results indicate that the 1-NN method is most suitable if the reference sequences of all potentially observable species are available in databases; otherwise, the QCauto method returns the most reliable identification results. The benchmark results also indicated that the taxon coverage of reference sequences is far from complete for genus or species level identification in all the barcode loci examined. Therefore, we need to accelerate
Background Statistical analysis of DNA microarray data provides a valuable diagnostic tool for the investigation of genetic components of diseases. To take advantage of the multitude of available data sets and analysis methods, it is desirable to combine both different algorithms and data from different studies. Applying ensemble learning, consensus clustering and cross-study normalization methods for this purpose in an almost fully automated process and linking different analysis modules together under a single interface would simplify many microarray analysis tasks. Results We present ArrayMining.net, a web-application for microarray analysis that provides easy access to a wide choice of feature selection, clustering, prediction, gene set analysis and cross-study normalization methods. In contrast to other microarray-related web-tools, multiple algorithms and data sets for an analysis task can be combined using ensemble feature selection, ensemble prediction, consensus clustering and cross-platform data integration. By interlinking different analysis tools in a modular fashion, new exploratory routes become available, e.g. ensemble sample classification using features obtained from a gene set analysis and data from multiple studies. The analysis is further simplified by automatic parameter selection mechanisms and linkage to web tools and databases for functional annotation and literature mining. Conclusion ArrayMining.net is a free web-application for microarray analysis combining a broad choice of algorithms based on ensemble and consensus methods, using automatic parameter selection and integration with annotation databases.
Weitschek, E.; Velzen, van R.; Felici, G.; Bertolazzi, P.
BLOG (Barcoding with LOGic) is a diagnostic and character-based DNA Barcode analysis method. Its aim is to classify specimens to species based on DNA Barcode sequences and on a supervised machine learning approach, using classification rules that compactly characterize species in terms of DNA
Ramya S Vokuda
In this era of rapid change in medical laboratory technology, everyone aims to take innovations from the laboratory to the bedside. One technique that is highly relevant to the pathology community is Tissue Microarray (TMA) technology. It is becoming popular across this community, from laboratory scientists to clinicians and from residents to technologists. Its popularity is attributed to its cost effectiveness and time-saving protocols. Although every technique has disadvantages, here the benefits outnumber them. The technique is very versatile, as many downstream molecular assays such as immunohistochemistry, cytogenetic studies and Fluorescent In situ Hybridisation (FISH) can be carried out on a single slide carrying many samples. It is a very practical approach that helps to identify novel biomarkers in cancer diagnostics and therapeutics, and it allows molecular markers to be assessed on a large scale very quickly. Quality assurance protocols in pathology laboratories have also exploited TMA to a great extent. However, the application of TMA technology extends beyond oncology. This review focuses on the different aspects of this technology, such as construction of TMA, instrumentation, types, advantages and disadvantages, and utilisation of the technique in various disease conditions.
Mello, Rafael Barrios; Silva, Maria Regina Regis; Alves, Maria Teresa Seixas; Evison, Martin Paul; Guimarães, Marco Aurelio; Francisco, Rafaella Arrabaca; Astolphi, Rafael Dias; Iwamura, Edna Sadayo Miazato
Taphonomic processes affecting bone post mortem are important in forensic, archaeological and palaeontological investigations. In this study, the application of tissue microarray (TMA) analysis to a sample of femoral bone specimens from 20 exhumed individuals of known period of burial and age at death is described. TMA allows multiplexing of subsamples, permitting standardized comparative analysis of adjacent sections in 3-D and of representative cross-sections of a large number of specimens. Standard hematoxylin and eosin, periodic acid-Schiff and silver methenamine, and picrosirius red staining, and CD31 and CD34 immunohistochemistry were applied to TMA sections. Osteocyte and osteocyte lacuna counts, percent bone matrix loss, and fungal spheroid element counts could be measured and collagen fibre bundles observed in all specimens. Decalcification with 7% nitric acid proceeded more rapidly than with 0.5 M EDTA and may offer better preservation of histological and cellular structure. No endothelial cells could be detected using CD31 and CD34 immunohistochemistry. Correlation between osteocytes per lacuna and age at death may reflect reported age-related responses to microdamage. Methodological limitations and caveats, and results of the TMA analysis of post mortem diagenesis in bone are discussed, and implications for DNA survival and recovery considered.
Zebrafish (Danio rerio) is a well-recognized model for the study of vertebrate developmental genetics, yet at the same time little is known about the transcriptional events that underlie zebrafish embryogenesis. Here we have employed microarray analysis to study the temporal activity of developmentally regulated genes during zebrafish embryogenesis. Transcriptome analysis at 12 different embryonic time points covering five different developmental stages (maternal, blastula, gastrula, segmentation, and pharyngula) revealed a highly dynamic transcriptional profile. Hierarchical clustering, stage-specific clustering, and algorithms to detect onset and peak of gene expression revealed clearly demarcated transcript clusters with maximum gene activity at distinct developmental stages as well as co-regulated expression of gene groups involved in dedicated functions such as organogenesis. Our study also revealed a previously unidentified cohort of genes that are transcribed prior to the mid-blastula transition, a time point earlier than when the zygotic genome was traditionally thought to become active. Here we provide, for the first time to our knowledge, a comprehensive list of developmentally regulated zebrafish genes and their expression profiles during embryogenesis, including novel information on the temporal expression of several thousand previously uncharacterized genes. The expression data generated from this study are accessible to all interested scientists from our institute resource database (http://giscompute.gis.a-star.edu.sg/~govind/zebrafish/data_download.html).
Olszak, Andrzej; Jørgensen, Bo Nørregaard
Java programs called Featureous that addresses this issue. Featureous allows a programmer to easily establish feature-code traceability links and to analyze their characteristics using a number of visualizations. Featureous is an extension to the NetBeans IDE, and can itself be extended by third...
Muller, Jean; Mehlen, André; Vetter, Guillaume; Yatskou, Mikalai; Muller, Arnaud; Chalmel, Frédéric; Poch, Olivier; Friederich, Evelyne; Vallar, Laurent
Background The actin cytoskeleton plays a crucial role in supporting and regulating numerous cellular processes. Mutations or alterations in the expression levels affecting the actin cytoskeleton system or related regulatory mechanisms are often associated with complex diseases such as cancer. Understanding how qualitative or quantitative changes in expression of the set of actin cytoskeleton genes are integrated to control actin dynamics and organisation is currently a challenge and should provide insights in identifying potential targets for drug discovery. Here we report the development of a dedicated microarray, the Actichip, containing 60-mer oligonucleotide probes for 327 genes selected for transcriptome analysis of the human actin cytoskeleton. Results Genomic data and sequence analysis features were retrieved from GenBank and stored in an integrative database called Actinome. From these data, probes were designed using a home-made program (CADO4MI) allowing sequence refinement and improved probe specificity by combining the complementary information recovered from the UniGene and RefSeq databases. Actichip performance was analysed by hybridisation with RNAs extracted from epithelial MCF-7 cells and human skeletal muscle. Using thoroughly standardised procedures, we obtained microarray images with excellent quality resulting in high data reproducibility. Actichip displayed a large dynamic range extending over three logs with a limit of sensitivity between one and ten copies of transcript per cell. The array allowed accurate detection of small changes in gene expression and reliable classification of samples based on the expression profiles of tissue-specific genes. When compared to two other oligonucleotide microarray platforms, Actichip showed similar sensitivity and concordant expression ratios. Moreover, Actichip was able to discriminate the highly similar actin isoforms whereas the two other platforms did not. Conclusion Our data demonstrate that
Montagna, Matteo; Mereghetti, Valeria; Lencioni, Valeria; Rossaro, Bruno
Rapid and efficient DNA-based tools are recommended for the evaluation of the insect biodiversity of high-altitude streams. In the present study, focused principally on larvae of the genus Diamesa Meigen 1835 (Diptera: Chironomidae), the congruence between morphological/molecular delimitation of species as well as performances in taxonomic assignments were evaluated. A fragment of the mitochondrial cox1 gene was obtained from 112 larvae, pupae and adults (Diamesinae, Orthocladiinae and Tanypodinae) that were collected in different mountain regions of the Alps and Apennines. On the basis of morphological characters 102 specimens were attributed to 16 species, and the remaining ten specimens were identified to the genus level. Molecular species delimitation was performed using: i) distance-based Automatic Barcode Gap Discovery (ABGD), with no a priori assumptions on species identification; and ii) coalescent tree-based approaches as the Generalized Mixed Yule Coalescent model, its Bayesian implementation and Bayesian Poisson Tree Processes. The ABGD analysis, estimating an optimal intra/interspecific nucleotide distance threshold of 0.7%-1.4%, identified 23 putative species; the tree-based approaches, identified between 25-26 entities, provided nearly identical results. All species belonging to zernyi, steinboecki, latitarsis, bertrami, dampfi and incallida groups, as well as outgroup species, are recovered as separate entities, perfectly matching the identified morphospecies. In contrast, within the cinerella group, cases of discrepancy arose: i) the two morphologically separate species D. cinerella and D. tonsa are neither monophyletic nor diagnosable exhibiting low values of between-taxa nucleotide mean divergence (0.94%); ii) few cases of larvae morphological misidentification were observed. Head capsule color is confirmed to be a valid character able to discriminate larvae of D. zernyi, D. tonsa and D. cinerella, but it is here better defined as a color gradient
Jonathan A. Coddington
The use of unique DNA sequences as a method for taxonomic identification is no longer fundamentally controversial, even though debate continues on the best markers, methods, and technology to use. Although both existing databanks such as GenBank and BOLD, as well as reference taxonomies, are imperfect, in best case scenarios “barcodes” (whether single or multiple, organelle or nuclear, loci) clearly are an increasingly fast and inexpensive method of identification, especially as compared to manual identification of unknowns by increasingly rare expert taxonomists. Because most species on Earth are undescribed, a complete reference database at the species level is impractical in the near term. The question therefore arises whether unidentified species can, using DNA barcodes, be accurately assigned to more inclusive groups such as genera and families, taxonomic ranks of putatively monophyletic groups for which the global inventory is more complete and stable. We used a carefully chosen test library of CO1 sequences from 49 families, 313 genera, and 816 species of spiders to assess the accuracy of genus and family-level assignment. We used BLAST queries of each sequence against the entire library and got the top ten hits. The percent sequence identity was reported from these hits (PIdent, range 75–100%). Accurate assignment of higher taxa (PIdent above which errors totaled less than 5%) occurred for genera at PIdent values >95 and families at PIdent values ≥ 91, suggesting these as heuristic thresholds for accurate generic and familial identifications in spiders. Accuracy of identification increases with numbers of species/genus and genera/family in the library; above five genera per family and fifteen species per genus all higher taxon assignments were correct. We propose that using percent sequence identity between conventional barcode sequences may be a feasible and reasonably accurate method to identify animals to family/genus. However
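The heuristic thresholds reported in this abstract (genus at PIdent >95, family at PIdent ≥ 91) can be illustrated with a minimal Python sketch. The function name `assign_rank` and its parameters are hypothetical and do not come from the study; the sketch only encodes the two cut-offs, which the authors propose for spiders and which may not transfer to other groups.

```python
def assign_rank(pident, genus_cutoff=95.0, family_cutoff=91.0):
    """Return the most specific higher-taxon rank supported by a top-hit PIdent.

    The cut-offs are the heuristic values reported for spiders in the abstract;
    the function itself is an illustrative assumption, not the authors' code.
    """
    if pident > genus_cutoff:       # genus: PIdent strictly above 95
        return "genus"
    if pident >= family_cutoff:     # family: PIdent of 91 or more
        return "family"
    return "unassigned"             # too divergent for either rank


# Example: percent identities of the best BLAST hit for three hypothetical queries.
for value in (98.4, 93.0, 88.5):
    print(value, assign_rank(value))
```

A query whose best hit falls below both cut-offs is left unassigned rather than forced into a family, which mirrors the idea of accepting only assignments in the range where errors stayed low.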
Walther, G; Pawłowska, J; Alastruey-Izquierdo, A; Wrzosek, M; Rodriguez-Tudela, J L; Dolatabadi, S; Chakrabarti, A; de Hoog, G S
The order Mucorales comprises predominantly fast-growing saprotrophic fungi, some of which are used for the fermentation of foodstuffs, but it also includes species known to cause infections in patients with severe immune or metabolic impairments. To inventory biodiversity in Mucorales, ITS barcodes of 668 strains in 203 taxa were generated, covering more than two thirds of the recognised species. Using the ITS sequences, Molecular Operational Taxonomic Units were defined by a similarity threshold of 99%. An LSU sequence was generated for each unit as well. Analysis of the LSU sequences revealed that conventional phenotypic classifications of the Mucoraceae are highly artificial. The LSU- and ITS-based trees suggest that characters, such as rhizoids and sporangiola, traditionally used in mucoralean taxonomy are plesiomorphic traits. The ITS region turned out to be an appropriate barcoding marker in Mucorales. It could be sequenced directly in 82% of the strains and its variability was sufficient to resolve most of the morphospecies. Molecular identification turned out to be problematic only for the species complexes of Mucor circinelloides, M. flavus, M. piriformis and Zygorhynchus moelleri. As many as 12 possibly undescribed species were detected. Intraspecific variability differed widely among mucoralean species, ranging from 0% in Backusella circina to 13.3% in Cunninghamella echinulata. A high proportion of clinical strains was included for molecular identification. Clinical isolates of Cunninghamella elegans were identified molecularly for the first time. As a result of the phylogenetic analyses several taxonomic and nomenclatural changes became necessary. The genus Backusella was emended to include all species with transitorily recurved sporangiophores. Since this matched molecular data all Mucor species possessing this character were transferred to Backusella. The genus Zygorhynchus was shown to be polyphyletic based on ITS and LSU data. Consequently
BACKGROUND: Possible single nucleotide polymorphism (SNP) interactions in breast cancer are usually not investigated in genome-wide association studies. Previously, we proposed a particle swarm optimization (PSO) method to compute these kinds of SNP interactions. However, this PSO does not guarantee finding the best result in every run, especially when high-dimensional data are investigated for SNP-SNP interactions. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we propose the IPSO algorithm to improve the reliability of PSO for the identification of the best protective SNP barcodes (SNP combinations and genotypes with maximum difference between cases and controls) associated with breast cancer. SNP barcodes containing different numbers of SNPs were computed. The top five SNP barcode results are retained for computing the next SNP barcode with a one-SNP increase at each processing step. The performance of the proposed IPSO algorithm is evaluated on simulated data for 23 SNPs of six steroid hormone metabolism and signalling-related genes. Among the 23 SNPs, 13 displayed significant odds ratio (OR) values (1.268 to 0.848; p<0.05) for breast cancer. With the IPSO algorithm, the joint effect in terms of SNP barcodes with two to seven SNPs shows significantly decreasing OR values (0.84 to 0.57; p<0.05 to 0.001). With the PSO algorithm, two to four SNPs show significantly decreasing OR values (0.84 to 0.77; p<0.05 to 0.001). Based on the results of 20 simulations, medians of the maximum differences for each SNP barcode generated by IPSO are higher than those generated by PSO. The interquartile ranges of the boxplot, as well as the upper and lower hinges for each n-SNP barcode (n = 3∼10), are narrower in IPSO than in PSO, suggesting that IPSO is highly reliable for SNP barcode identification. CONCLUSIONS/SIGNIFICANCE: Overall, the proposed IPSO algorithm is robust and provides exact identification of the best protective SNP barcodes for breast cancer.
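The odds ratios quoted in this abstract follow the standard 2x2 case-control definition, where a value below 1 indicates a protective association. The following Python sketch shows that calculation only; the function name and the counts are hypothetical illustrations, not data from the study, and the sketch says nothing about how IPSO searches for barcodes.

```python
def odds_ratio(cases_with, cases_without, controls_with, controls_without):
    """Odds ratio for carrying a given SNP barcode (standard 2x2 definition).

    OR = (cases_with / cases_without) / (controls_with / controls_without).
    An OR below 1 means the barcode is rarer among cases, i.e. protective.
    """
    if cases_without == 0 or controls_with == 0:
        raise ValueError("odds ratio is undefined when a denominator cell is zero")
    return (cases_with * controls_without) / (cases_without * controls_with)


# Hypothetical counts: 30 of 100 cases and 50 of 100 controls carry the barcode.
print(round(odds_ratio(30, 70, 50, 50), 3))
```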
Hussain, Fatma; Ahmed, Nisar; Ghorbani, Abdolbaset
In pursuit of developing fast and accurate species-level molecular identification methods, we tested six DNA barcodes, namely ITS2, matK, rbcLa, ITS2+matK, ITS2+rbcLa, matK+rbcLa and ITS2+matK+rbcLa, for their capacity to identify frequently consumed but geographically isolated medicinal species of Fabaceae and Poaceae indigenous to the desert of Cholistan. Data were analysed by BLASTn sequence similarity, pairwise sequence divergence in TAXONDNA, and phylogenetic (neighbour-joining and maximum-likelihood trees) methods. Comparison of six barcode regions showed that ITS2 has the highest number of variable sites (209/360) for tested Fabaceae and (106/365) Poaceae species, the highest species-level identification (40%) in BLASTn procedure, distinct DNA barcoding gap, 100% correct species identification in BM and BCM functions of TAXONDNA, and clear cladding pattern with high nodal support in phylogenetic trees in both families. ITS2+matK+rbcLa followed ITS2 in its species-level identification capacity. The study was concluded with advocating the DNA barcoding as an effective tool for species identification and ITS2 as the best barcode region in identifying medicinal species of Fabaceae and Poaceae. Current research has practical implementation potential in the fields of pharmaco-vigilance, trade of medicinal plants and biodiversity conservation. PMID:29576968
Charles M Francis
BACKGROUND: Southeast Asia is recognized as a region of very high biodiversity, much of which is currently at risk due to habitat loss and other threats. However, many aspects of this diversity, even for relatively well-known groups such as mammals, are poorly known, limiting ability to develop conservation plans. This study examines the value of DNA barcodes, sequences of the mitochondrial COI gene, to enhance understanding of mammalian diversity in the region and hence to aid conservation planning. METHODOLOGY AND PRINCIPAL FINDINGS: DNA barcodes were obtained from nearly 1900 specimens representing 165 recognized species of bats. All morphologically or acoustically distinct species, based on classical taxonomy, could be discriminated with DNA barcodes except four closely allied species pairs. Many currently recognized species contained multiple barcode lineages, often with deep divergence suggesting unrecognized species. In addition, most widespread species showed substantial genetic differentiation across their distributions. Our results suggest that mammal species richness within the region may be underestimated by at least 50%, and there are higher levels of endemism and greater intra-specific population structure than previously recognized. CONCLUSIONS: DNA barcodes can aid conservation and research by assisting field workers in identifying species, by helping taxonomists determine species groups needing more detailed analysis, and by facilitating the recognition of the appropriate units and scales for conservation planning.
Yan, Hao; Labean, Thomas H.; Feng, Liping; Reif, John H.
The programmed self-assembly of patterned aperiodic molecular structures is a major challenge in nanotechnology and has numerous potential applications for nanofabrication of complex structures and useful devices. Here we report the construction of an aperiodic patterned DNA lattice (barcode lattice) by a self-assembly process of directed nucleation of DNA tiles around a scaffold DNA strand. The input DNA scaffold strand, constructed by ligation of shorter synthetic oligonucleotides, provides layers of the DNA lattice with barcode patterning information represented by the presence or absence of DNA hairpin loops protruding out of the lattice plane. Self-assembly of multiple DNA tiles around the scaffold strand was shown to result in a patterned lattice containing barcode information of 01101. We have also demonstrated the reprogramming of the system to another patterning. An inverted barcode pattern of 10010 was achieved by modifying the scaffold strands and one of the strands composing each tile. A ribbon lattice, consisting of repetitions of the barcode pattern with expected periodicity, was also constructed by the addition of sticky ends. The patterning of both classes of lattices was clearly observable via atomic force microscopy. These results represent a step toward implementation of a visual readout system capable of converting information encoded on a 1D DNA strand into a 2D form readable by advanced microscopic techniques. A functioning visual output method would not only increase the readout speed of DNA-based computers, but may also find use in other sequence identification techniques such as mutation or allele mapping.
Yang, Zhaofu; Landry, Jean-François; Hebert, Paul D N
Although members of the crambid subfamily Pyraustinae are frequently important crop pests, their identification is often difficult because many species lack conspicuous diagnostic morphological characters. DNA barcoding employs sequence diversity in a short standardized gene region to facilitate specimen identifications and species discovery. This study provides a DNA barcode reference library for North American pyraustines based upon the analysis of 1589 sequences recovered from 137 nominal species, 87% of the fauna. Data from 125 species were barcode compliant (>500bp, barcode sharing, creating a total of 155 BINs. Two systems for OTU designation, ABGD and BIN, were examined to check the correspondence between current taxonomy and sequence clusters. The BIN system performed better than ABGD in delimiting closely related species, while OTU counts with ABGD were influenced by the value employed for relative gap width. Different species with low or no interspecific divergence may represent cases of unrecognized synonymy, whereas those with high intraspecific divergence require further taxonomic scrutiny as they may involve cryptic diversity. The barcode library developed in this study will also help to advance understanding of relationships among species of Pyraustinae.
Zhang, Dequan; Jiang, Bei; Duan, Lizhen; Zhou, Nong
DNA barcoding is a technique used to identify species based on species-specific differences in short regions of their DNA. It is widely used in species discrimination of medicinal plants and traditional medicines. In the present study, four potential DNA barcodes, namely rbcL, matK, trnH-psbA and ITS (nuclear ribosomal internal transcribed spacer), were adopted for species discrimination in Crawfurdia Wall (Gentianaceae). The identification ability of these DNA barcodes and their combinations was evaluated using three classic methods (Distance, Blast and Tree-Building). The ITS, trnH-psbA and rbcL regions showed great universality, with a success rate of 100%, whereas matK was disappointing, with only 65% of samples yielding useful DNA sequences. The ITS region, which could clearly and effectively identify the five species in Crawfurdia, performed very well in this study. In contrast, trnH-psbA and rbcL performed poorly in discriminating among these species. The ITS marker was an ideal DNA barcode in Crawfurdia and should be incorporated into the core barcodes for seed plants.
Iftikhar, Romana; Ashfaq, Muhammad; Rasool, Akhtar; Hebert, Paul D N
Although thrips are globally important crop pests and vectors of viral disease, species identifications are difficult because of their small size and inconspicuous morphological differences. Sequence variation in the mitochondrial COI-5' (DNA barcode) region has proven effective for the identification of species in many groups of insect pests. We analyzed barcode sequence variation among 471 thrips from various plant hosts in north-central Pakistan. The Barcode Index Number (BIN) system assigned these sequences to 55 BINs, while the Automatic Barcode Gap Discovery detected 56 partitions, a count that coincided with the number of monophyletic lineages recognized by Neighbor-Joining analysis and Bayesian inference. Congeneric species showed an average of 19% sequence divergence (range = 5.6% - 27%) at COI, while intraspecific distances averaged 0.6% (range = 0.0% - 7.6%). BIN analysis suggested that all intraspecific divergence >3.0% actually involved a species complex. In fact, sequences for three major pest species (Haplothrips reuteri, Thrips palmi, Thrips tabaci), and one predatory thrips (Aeolothrips intermedius) showed deep intraspecific divergences, providing evidence that each is a cryptic species complex. The study compiles the first barcode reference library for the thrips of Pakistan, and examines global haplotype diversity in four important pest thrips.
Dentinger, Bryn T M; Didukh, Maryna Y; Moncalvo, Jean-Marc
DNA barcoding is an approach to rapidly identify species using short, standard genetic markers. The mitochondrial cytochrome oxidase I gene (COI) has been proposed as the universal barcode locus, but its utility for barcoding in mushrooms (ca. 20,000 species) has not been established. We succeeded in generating 167 partial COI sequences (~450 bp) representing ~100 morphospecies from ~650 collections of Agaricomycotina using several sets of new primers. Large introns (~1500 bp) at variable locations were detected in ~5% of the sequences we obtained. We suspect that widespread presence of large introns is responsible for our low PCR success (~30%) with this locus. We also sequenced the nuclear internal transcribed spacer rDNA regions (ITS) to compare with COI. Among the small proportion of taxa for which COI could be sequenced, COI and ITS perform similarly as a barcode. However, in a densely sampled set of closely related taxa, COI was less divergent than ITS and failed to distinguish all terminal clades. Given our results and the wealth of ITS data already available in public databases, we recommend that COI be abandoned in favor of ITS as the primary DNA barcode locus in mushrooms.
Virgilio, Massimiliano; Jordaens, Kurt; Breman, Floris C; Backeljau, Thierry; De Meyer, Marc
We propose a general working strategy to deal with incomplete reference libraries in the DNA barcoding identification of species. Considering that (1) queries with a large genetic distance with their best DNA barcode match are more likely to be misidentified and (2) imposing a distance threshold profitably reduces identification errors, we modelled relationships between identification performances and distance thresholds in four DNA barcode libraries of Diptera (n = 4270), Lepidoptera (n = 7577), Hymenoptera (n = 2067) and Tephritidae (n = 602 DNA barcodes). In all cases, more restrictive distance thresholds produced a gradual increase in the proportion of true negatives, a gradual decrease of false positives and more abrupt variations in the proportions of true positives and false negatives. More restrictive distance thresholds improved precision, yet negatively affected accuracy due to the higher proportions of queries discarded (viz. having a distance query-best match above the threshold). Using a simple linear regression we calculated an ad hoc distance threshold for the tephritid library producing an estimated relative identification error DNA barcodes and should be used as cut-off mark defining whether we can proceed identifying the query with a known estimated error probability (e.g. 5%) or whether we should discard the query and consider alternative/complementary identification methods.
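The effect of a distance threshold on identification outcomes can be sketched in Python as follows. The definitions used here are an assumption made for illustration, since the abstract does not spell them out: a query is accepted when its distance to the best match is at or below the threshold; accepted queries count as true or false positives depending on whether the best match is the right species, and discarded queries count as false or true negatives on the same basis. The function name and the sample data are hypothetical.

```python
def evaluate_threshold(queries, threshold):
    """Tally identification outcomes for one distance threshold.

    queries: iterable of (distance_to_best_match, best_match_is_correct) pairs.
    Returns (counts, precision, accuracy) under the assumed definitions:
      precision = TP / (TP + FP), accuracy = (TP + TN) / all queries.
    """
    counts = {"TP": 0, "FP": 0, "TN": 0, "FN": 0}
    for distance, best_match_is_correct in queries:
        if distance <= threshold:   # query accepted and identified
            key = "TP" if best_match_is_correct else "FP"
        else:                       # query discarded
            key = "FN" if best_match_is_correct else "TN"
        counts[key] += 1
    total = sum(counts.values())
    identified = counts["TP"] + counts["FP"]
    precision = counts["TP"] / identified if identified else float("nan")
    accuracy = (counts["TP"] + counts["TN"]) / total if total else float("nan")
    return counts, precision, accuracy


# Hypothetical queries: (distance to best match, whether that match is correct).
sample = [(0.00, True), (0.01, True), (0.02, False), (0.05, True), (0.08, False)]
for cutoff in (0.10, 0.03, 0.01):
    print(cutoff, evaluate_threshold(sample, cutoff))
```

Running the loop with progressively stricter cut-offs shows the trade-off the abstract describes: false positives fall, while correct matches that happen to be distant are discarded as false negatives.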
Lim, Voon-Ching; Ramli, Rosli; Bhassu, Subha; Wilson, John-James
Several published checklists of bat species have covered Peninsular Malaysia as part of a broader region and/or in combination with other mammal groups. Other researchers have produced comprehensive checklists for specific localities within the peninsula. To our knowledge, a comprehensive checklist of bats specifically for the entire geopolitical region of Peninsular Malaysia has never been published, yet knowing which species are present in Peninsular Malaysia and their distributions across the region are crucial in developing suitable conservation plans. Our literature search revealed that 110 bat species have been documented in Peninsular Malaysia; 105 species have precise locality records while five species lack recent and/or precise locality records. We retrieved 18 species from records dated before the year 2000 and seven species have only ever been recorded once. Our search of Barcode of Life Datasystems (BOLD) found that 86 (of the 110) species have public records of which 48 species have public DNA barcodes available from bats sampled in Peninsular Malaysia. Based on Neighbour-Joining tree analyses and the allocation of DNA barcodes to Barcode Index Number system (BINs) by BOLD, several DNA barcodes recorded under the same species name are likely to represent distinct taxa. We discuss these cases in detail and highlight the importance of further surveys to determine the occurrences and resolve the taxonomy of particular bat species in Peninsular Malaysia, with implications for conservation priorities.
Guo, Shaokun; He, Jia; Zhao, Zihua; Liu, Lijun; Gao, Liyuan; Wei, Shuhua; Guo, Xiaoyu; Zhang, Rong; Li, Zhihong
Neoceratitis asiatica (Becker), which especially infests wolfberry (Lycium barbarum L.), could cause serious economic losses every year in China, especially to organic wolfberry production. In some important wolfberry plantings, it is difficult and time-consuming to rear the larvae or pupae to adults for morphological identification. Molecular identification based on DNA barcode is a solution to the problem. In this study, 15 samples were collected from Ningxia, China. Among them, five adults were identified according to their morphological characteristics. The utility of mitochondrial DNA (mtDNA) cytochrome c oxidase I (COI) gene sequence as DNA barcode in distinguishing N. asiatica was evaluated by analysing Kimura 2-parameter distances and phylogenetic trees. There were significant differences between intra-specific and inter-specific genetic distances according to the barcoding gap analysis. The uncertain larval and pupal samples were within the same cluster as N. asiatica adults and formed sister cluster to N. cyanescens. A combination of morphological and molecular methods enabled accurate identification of N. asiatica. This is the first study using DNA barcode to identify N. asiatica and the obtained DNA sequences will be added to the DNA barcode database.
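The Kimura 2-parameter (K2P) distance mentioned in this abstract has a standard closed form: with P the proportion of transitions and Q the proportion of transversions between two aligned sequences, d = -1/2 ln((1 - 2P - Q) sqrt(1 - 2Q)). The Python sketch below implements that formula for a pair of pre-aligned sequences; the function name and the choice to skip gaps and ambiguous bases are illustrative assumptions, not details taken from the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq_a, seq_b):
    """Kimura 2-parameter distance between two aligned DNA sequences.

    Sites where either sequence has a gap or an ambiguous base are skipped
    (an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue
        compared += 1
        if x == y:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # The logarithm is undefined for saturated pairs; math.log then raises ValueError.
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# Toy example: one transition (A<->G) in ten aligned sites.
print(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"))
```

In a barcoding-gap analysis, distances like this would be computed for all pairs and split into within-species and between-species sets before comparing their ranges.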
Background This study reports progress in assembling a DNA barcode reference library for Ephemeroptera, Plecoptera, and Trichoptera ("EPTs") from a Canadian subarctic site, which is the focus of a comprehensive biodiversity inventory using DNA barcoding. These three groups of aquatic insects exhibit a moderate level of species diversity, making them ideal for testing the feasibility of DNA barcoding for routine biotic surveys. We explore the correlation between the morphological species delineations, DNA barcode-based haplotype clusters delimited by a sequence threshold (2%), and a threshold-free approach to biodiversity quantification--phylogenetic diversity. Results A DNA barcode reference library is built for 112 EPT species for the focal region, consisting of 2277 COI sequences. Close correspondence was found between EPT morphospecies and haplotype clusters as designated using a standard threshold value. Similarly, the shapes of taxon accumulation curves based upon haplotype clusters were very similar to those generated using phylogenetic diversity accumulation curves, but were much more computationally efficient. Conclusion The results of this study will facilitate other lines of research on northern EPTs and also bode well for rapidly conducting initial biodiversity assessments in unknown EPT faunas. PMID:20003245
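Delimiting haplotype clusters with a fixed sequence threshold (2% in this abstract) can be sketched in Python as below. The abstract does not state the distance measure or the linkage rule, so this sketch assumes uncorrected p-distance and single-linkage grouping via union-find; the function names and toy sequences are hypothetical.

```python
def p_distance(seq_a, seq_b):
    """Uncorrected proportion of differing sites between two aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    differences = sum(1 for x, y in zip(seq_a, seq_b) if x != y)
    return differences / len(seq_a)


def cluster_by_threshold(sequences, threshold=0.02):
    """Group sequence indices so that any pair within the threshold shares a cluster.

    Single linkage is an assumption of this sketch: two sequences end up together
    if they are connected by a chain of pairs at or below the threshold.
    """
    parent = list(range(len(sequences)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(sequences)):
        for j in range(i + 1, len(sequences)):
            if p_distance(sequences[i], sequences[j]) <= threshold:
                parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(sequences)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy data: the first two sequences differ at 1% of sites, the third is unrelated.
toy = ["A" * 100, "A" * 99 + "C", "C" * 100]
print(cluster_by_threshold(toy))
```

The number of clusters returned can then be compared with the morphospecies count, which is the kind of correspondence the study reports.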
Ashfaq, Muhammad; Hebert, Paul D N
Many of the arthropod species that are important pests of agriculture and forestry are impossible to discriminate morphologically throughout all of their life stages. Some cannot be differentiated at any life stage. Over the past decade, DNA barcoding has gained increasing adoption as a tool to both identify known species and to reveal cryptic taxa. Although there has not been a focused effort to develop a barcode library for them, reference sequences are now available for 77% of the 409 species of arthropods documented on major pest databases. Aside from developing the reference library needed to guide specimen identifications, past barcode studies have revealed that a significant fraction of arthropod pests are a complex of allied taxa. Because of their importance as pests and disease vectors impacting global agriculture and forestry, DNA barcode results on these arthropods have significant implications for quarantine detection, regulation, and management. The current review discusses these implications in light of the presence of cryptic species in plant pests exposed by DNA barcoding.
Chen, Juan; Zhao, Jietang; Erickson, David L; Xia, Nianhe; Kress, W John
Species of the genus Curcuma L. are commonly used as spices, medicines, dyes and ornamentals. Owing to its economic significance and the lack of clear-cut morphological differences between species, this genus is an ideal case for developing DNA barcodes. In this study, sequences of four chloroplast DNA regions (matK, rbcL, trnH-psbA and trnL-F) and one nuclear region (ITS2) were generated for 44 Curcuma species and five species from closely related genera, represented by 96 samples. PCR amplification success rate, intra- and inter-specific genetic distance variation and the correct identification percentage were taken into account to assess candidate barcode regions. PCR and sequencing success rates were high in the matK (89.7%), rbcL (100%), trnH-psbA (100%), trnL-F (95.7%) and ITS2 (82.6%) regions. The results further showed that the four candidate chloroplast barcoding regions (matK, rbcL, trnH-psbA and trnL-F) yield no barcode gaps, indicating that the genus Curcuma represents a challenging group for DNA barcoding. The ITS2 region presented large interspecific variation and provided the highest correct identification rate (46.7%) among the five regions based on the BLASTClust method. However, ITS2 provided only 7.9% correct identification based on the NJ tree method. Increasing discriminatory power will require the development of more variable markers. © 2014 John Wiley & Sons Ltd.
Herbal drug authentication is an important task in traditional medicine; however, it is challenged by the limitations of traditional authentication methods and the lack of trained experts. DNA barcoding is conspicuous in almost all areas of the biological sciences and has already been added to the British pharmacopeia and Chinese pharmacopeia for routine herbal drug authentication. However, DNA barcoding for the Korean pharmacopeia still requires significant improvements. Here, we present a DNA barcode reference library for herbal drugs in the Korean pharmacopeia and a species identification engine named KP-IDE, developed to facilitate the adoption of this DNA reference library for herbal drug authentication. Using taxonomy records, specimen records, sequence records, and reference records, KP-IDE can identify an unknown specimen. Currently, there are 6,777 taxonomy records, 1,054 specimen records, 30,744 sequence records (ITS2 and psbA-trnH), and 285 reference records. Moreover, 27 herbal drug materials were collected from the Seoul Yangnyeongsi herbal medicine market to provide an example of real herbal drug authentication. Our study demonstrates the prospects of the DNA barcode reference library for the Korean pharmacopeia and provides future directions for the use of DNA barcoding for authenticating herbal drugs listed in other modern pharmacopeias.
Nagpure, Naresh Sahebrao; Rashid, Iliyas; Pathak, Ajey Kumar; Singh, Mahender; Singh, Shri Prakash; Sarkar, Uttam Kumar
DNA barcoding is a new tool for taxon recognition and classification of biological organisms based on the sequence of a fragment of the mitochondrial gene cytochrome c oxidase I (COI). In view of the growing importance of fish DNA barcoding for species identification, molecular taxonomy and fish diversity conservation, we developed a Fish Barcode Information System (FBIS) for Indian fishes, which will serve as a regional DNA barcode archival and analysis system. The database presently contains 2334 sequence records of the COI gene for 472 aquatic species belonging to 39 orders and 136 families, collected from available published data sources. Additionally, it contains information on phenotype, distribution and IUCN Red List status of fishes. The web version of FBIS was designed using MySQL, Perl and PHP under the Linux operating platform to (a) store and manage the acquisition, (b) analyze and explore DNA barcode records, and (c) identify species and estimate genetic divergence. FBIS has also been integrated with appropriate tools for retrieving and viewing information about the database statistics and taxonomy. It is expected that FBIS will be useful as a potent information system in fish molecular taxonomy, phylogeny and genomics. Availability: The database is available for free at http://mail.nbfgr.res.in/fbis/ PMID:22715304
Wilson, John-James; Sing, Kong-Wah; Lee, Ping-Shin; Wee, Alison K S
Over the past 50 years, Tropical East Asia has lost more biodiversity than any tropical region. Tropical East Asia is a megadiverse region with an acute taxonomic impediment. DNA barcodes are short standardized DNA sequences used for taxonomic purposes and have the potential to lessen the challenges of biodiversity inventory and assessments in regions where they are most needed. We reviewed DNA barcoding efforts in Tropical East Asia relative to other tropical regions. We suggest DNA barcodes (or metabarcodes from next-generation sequencers) may be especially useful for characterizing and connecting species-level biodiversity units in inventories encompassing taxa lacking formal description (particularly arthropods) and in large-scale, minimal-impact approaches to vertebrate monitoring and population assessments through secondary sources of DNA (invertebrate derived DNA and environmental DNA). We suggest interest and capacity for DNA barcoding are slowly growing in Tropical East Asia, particularly among the younger generation of researchers who can connect with the barcoding analogy and understand the need for new approaches to the conservation challenges being faced. © 2016 Society for Conservation Biology.
Samerpitak, Kittipan; Gerrits van den Ende, Bert H G; Stielow, J Benjamin; Menken, Steph B J; de Hoog, G Sybren
The genera Ochroconis and Verruconis (Sympoventuriaceae, Venturiales) have remarkably high molecular diversity despite relatively high degrees of phenotypic similarity. Tree topologies, inter-specific and intra-specific heterogeneities, barcoding gaps and reciprocal monophyly of all currently known species were analyzed. It was concluded that all currently used genes, viz. SSU, ITS, LSU, ACT1, BT2, and TEF1, were unable to meet all 'gold standard' criteria of barcoding markers. They could nevertheless be used for reasonably reliable identification of species, because the markers, although variable, were associated with large inter-specific heterogeneity. Of the protein-coding genes, ACT1 showed the highest potential as a barcoding marker in nearly all parts of the investigated sequence. SSU, LSU, ITS, and ACT1 yielded consistent monophyly in all investigated species, but only SSU and LSU generated clear barcoding gaps. For phylogeny, LSU was an informative marker, suitable to reconstruct gene-trees showing correct phylogenetic relationships. Cryptic species were revealed especially in complexes with very high intra-specific variability. Once all these complexes are taxonomically resolved, ACT1 will probably prove to be the most reliable barcoding gene for Ochroconis and Verruconis. Copyright © 2015 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
An, Jeung Hee; Oh, Byung-Keun; Choi, Jeong Woo
Tyrosine hydroxylase, the rate-limiting enzyme of catecholamine biosynthesis, is predominantly expressed in several cell groups within the brain, including the dopaminergic neurons of the substantia nigra and ventral tegmental area. We evaluated the efficacy of this protein-detection method in detecting tyrosine hydroxylase in normal and oxidative stress damaged dopaminergic cells. In this study, a coupling of DNA barcode and bead-based immunoassay for detecting tyrosine hydroxylase with PCR-like sensitivity is reported. The method relies on magnetic nanoparticles with antibodies and nanoparticles that are encoded with DNA and antibodies that can sandwich the target protein captured by the nanoparticle-bound antibodies. The aggregate sandwich structures are magnetically separated from solution, and treated to remove the conjugated barcode DNA. The DNA barcodes were identified by PCR analysis. The concentration of tyrosine hydroxylase in dopaminergic cells can be easily and rapidly detected using the bio-barcode assay. The bio-barcode assay is a rapid and high-throughput screening tool for the detection of neurotransmitters such as dopamine.
Ma, Eddie Y T; Ratnasingham, Sujeevan; Kremer, Stefan C
This study presents a machine learning method that increases the number of identified bases in Sanger Sequencing. The system post-processes a KB basecalled chromatogram. It selects a recoverable subset of N-labels in the KB-called chromatogram to replace with basecalls (A,C,G,T). An N-label correction is defined given an additional read of the same sequence, and a human finished sequence. Corrections are added to the dataset when an alignment determines the additional read and human agree on the identity of the N-label. KB must also rate the replacement with quality value of in the additional read. Corrections are only available during system training. Developing the system, nearly 850,000 N-labels are obtained from Barcode of Life Datasystems, the premier database of genetic markers called DNA Barcodes. Increasing the number of correct bases improves reference sequence reliability, increases sequence identification accuracy, and assures analysis correctness. Keeping with barcoding standards, our system maintains an error rate of percent. Our system only applies corrections when it estimates low rate of error. Tested on this data, our automation selects and recovers: 79 percent of N-labels from COI (animal barcode); 80 percent from matK and rbcL (plant barcodes); and 58 percent from non-protein-coding sequences (across eukaryotes).
Linard, Benjamin; Nguyen, Ngoc Hoan; Prosdocimi, Francisco; Poch, Olivier; Thompson, Julie D
Evolutionary systems biology aims to uncover the general trends and principles governing the evolution of biological networks. An essential part of this process is the reconstruction and analysis of the evolutionary histories of these complex, dynamic networks. Unfortunately, the methodologies for representing and exploiting such complex evolutionary histories in large scale studies are currently limited. Here, we propose a new formalism, called EvoluCode (Evolutionary barCode), which allows the integration of different evolutionary parameters (eg, sequence conservation, orthology, synteny …) in a unifying format and facilitates the multilevel analysis and visualization of complex evolutionary histories at the genome scale. The advantages of the approach are demonstrated by constructing barcodes representing the evolution of the complete human proteome. Two large-scale studies are then described: (i) the mapping and visualization of the barcodes on the human chromosomes and (ii) automatic clustering of the barcodes to highlight protein subsets sharing similar evolutionary histories and their functional analysis. The methodologies developed here open the way to the efficient application of other data mining and knowledge extraction techniques in evolutionary systems biology studies. A database containing all EvoluCode data is available at: http://lbgi.igbmc.fr/barcodes.
Anderson, L.K.; Boor, M.G.; Hurford, J.M.
Over the past seven years, Los Alamos National Laboratory developed several generations of computerized nuclear materials control and accountability (MC and A) systems for tracking and reporting the storage, movement, and management of nuclear materials at domestic and international facilities. During the same period, Oak Ridge National Laboratory was involved with automated data acquisition (ADA) equipment, including installation of numerous bar-code scanning stations at various facilities to serve as input devices to computerized systems. Bar-code readers, as well as other ADA devices, reduce input errors, provide faster input, and allow the capture of data in remote areas where workstations do not exist. Los Alamos National Laboratory and Oak Ridge National Laboratory teamed together to implement the integration of bar-code hardware technology with computerized MC and A systems. With the expertise of both sites, the two technologies were successfully merged with little difficulty. Bar-code input is now available with several functions of the MC and A systems: material movements within material balance areas (MBAs), material movements between MBAs, and physical inventory verification. This paper describes the various components required for the integration of these MC and A systems with the installed bar-code reader devices and the future directions for these technologies
Song, Chao; Wang, Qian; Zhang, Ruilei; Sun, Bingjiao; Wang, Xinhua
In this study, we tested the utility of the mitochondrial gene cytochrome c oxidase subunit 1 (CO1) as the barcode region to deal with taxonomical problems of Polypedilum (Tripodura) non-biting midges (Diptera: Chironomidae). The 114 DNA barcodes representing 27 morphospecies were divided into 33 well-separated clusters based on both Neighbor Joining and Maximum Likelihood methods. DNA barcodes revealed an 82% success rate in matching with morphospecies. The selected DNA barcode data support 37-64 operational taxonomic units (OTUs) based on the methods of Automatic Barcode Gap Discovery (ABGD) and Poisson Tree Process (PTP). Furthermore, a priori species based on consistent phenotypic variations were attested by molecular analysis, and a taxonomical misidentification of barcode sequences from GenBank was found. We did not observe a distinct barcode gap; instead, there was an overlap ranging from 9% to 12%. Our results supported DNA barcoding as an ideal method to detect cryptic species, delimit sibling species, and associate different life stages in non-biting midges.
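The barcode gap mentioned in the abstract above is the separation between the largest within-species distance and the smallest between-species distance. A minimal sketch of that check follows, assuming pre-aligned sequences and a simple uncorrected p-distance; the function names and toy sequences are illustrative only and are not taken from the study, which used ABGD and PTP rather than this calculation.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    # Compare only sites where both sequences carry an unambiguous base.
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper())
             if x in "ACGT" and y in "ACGT"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def barcode_gap(records):
    """records: list of (species_label, aligned_sequence) tuples.

    Returns (max_intraspecific, min_interspecific, gap).
    A positive gap means the two distance ranges do not overlap.
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        (intra if sp1 == sp2 else inter).append(p_distance(s1, s2))
    if not intra or not inter:
        raise ValueError("need at least one within- and one between-species pair")
    max_intra, min_inter = max(intra), min(inter)
    return max_intra, min_inter, min_inter - max_intra


# Hypothetical toy alignment: two species, two specimens each.
toy_records = [
    ("sp_A", "ACGTACGTAC"),
    ("sp_A", "ACGTACGTAT"),
    ("sp_B", "ACGTTCGAAC"),
    ("sp_B", "ACGTTCGAAC"),
]
max_intra, min_inter, gap = barcode_gap(toy_records)
```

A negative or zero gap corresponds to the overlap situation reported for Polypedilum, where intra- and interspecific distances are not cleanly separated.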
Lewis, C.T.; Bilkhu, S.; Robert, V.; Eberhardt, U.; Szoke, S.; Seifert, K.A.; Lévesque, C.A.
Abstract: DNA barcoding is the application of DNA sequences of standardized genetic markers for the identification of eukaryotic organisms. We attempted to identify alternative candidate barcode gene targets for the fungal biota from available fungal genomes using a taxonomy-aware processing
Jisming-See, Shi-Wei; Sing, Kong-Wah; Wilson, John-James
The "rings" belonging to the genus Ypthima are amongst the most common butterflies in Peninsular Malaysia. However, the species can be difficult to tell apart, with keys relying on minor and often non-discrete ring characters found on the hindwing. Seven species have been reported from Peninsular Malaysia, but this is thought to be an underestimate of diversity. DNA barcodes of 165 individuals, and wing and genital morphology, were examined to reappraise species diversity of this genus in Peninsular Malaysia. DNA barcodes collected during citizen science projects (School Butterfly Project and Peninsular Malaysia Butterfly Count) recently conducted in Peninsular Malaysia were included. The new DNA barcodes formed six groups with different Barcode Index Numbers (BINs) representing four species reported in Peninsular Malaysia. When combined with public DNA barcodes from the Barcode Of Life Datasystems, several taxonomic issues arose. We consider the taxon Y. newboldi, formerly treated as a subspecies of Y. baldus, as a distinct species. DNA barcodes also supported an earlier suggestion that Y. nebulosa is a synonym under Y. horsfieldii humei. Two BINs of the genus Ypthima comprising DNA barcodes collected during citizen science projects did not correspond to any species previously reported in Peninsular Malaysia.
DNA/RNA and protein microarrays have proven their outstanding bioanalytical performance throughout the past decades, given the unprecedented level of parallelization by which molecular recognition assays can be performed and analyzed. Cell microarrays (CMAs) make use of similar construction principles. They are applied to profile a given cell population with respect to the expression of specific molecular markers and also to measure functional cell responses to drugs and chemicals. This review focuses on the use of cell-based microarrays for assessing the cytotoxicity of drugs, toxins, or chemicals in general. It also summarizes CMA construction principles with respect to the cell types that are used for such microarrays, the readout parameters to assess toxicity, and the various formats that have been established and applied. The review ends with a critical comparison of CMAs and well-established microtiter plate (MTP) approaches.
Tanackovic, Vanja; Rydahl, Maja Gro; Pedersen, Henriette Lodberg
In this study we introduce the starch-recognising carbohydrate binding module family 20 (CBM20) from Aspergillus niger for screening biological variations in starch molecular structure using high throughput carbohydrate microarray technology. Defined linear, branched and phosphorylated...
黄承志; 李原芳; 黄新华; 范美坤
The microarray of DNA probes with 5′-NH2 and 5′-Tex/3′-NH2 modified termini on the surface of 10 μm carboxylate-functionalized beads in the presence of 1-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC) is characterized in the present paper. It was found that the microarray capacity of DNA probes on the bead surface depends on the pH of the aqueous solution, the concentration of DNA probe and the total surface area of the beads. Under optimal conditions, the minimum distance between 20-mer single-stranded DNA probes microarrayed on the bead surface is about 14 nm, while that between 20-mer double-stranded DNA probes is about 27 nm. If the probe length increases from 20 mer to 35 mer, its microarray density decreases correspondingly. A mechanism study shows that the DNA probes bind nearly parallel to the bead surface.
Conclusion: The microarray method provides a more accurate and rapid diagnostic tool for bacterial meningitis compared to traditional culture methods. Clinical application of this new technique may reduce the potential risk of delay in treatment.
Wang, Yuedong; Ma, Yanyuan; Carroll, Raymond J.
Microarrays are one of the most widely used high throughput technologies. One of the main problems in the area is that conventional estimates of the variances that are required in the t-statistic and other statistics are unreliable owing
The authors developed a novel macro and nanoporous silicon surface for protein microarrays to facilitate high-throughput biomarker discovery, and high-density protein-chip array analyses of complex biological samples...
Wang, Huibin; Zhang, Yiming; Yuan, Xun; Chen, Yi; Yan, Mingdi
A universal photochemical method has been established for the immobilization of intact carbohydrates and their analogues, and for the fabrication of carbohydrate microarrays. The method features the use of perfluorophenyl azide (PFPA)-modified substrates and the photochemical reaction of surface azido groups with printed carbohydrates. Various aldoses, ketoses, and non-reducing sugars such as alditols and their derivatives can be directly arrayed on the PFPA-modified chips. The lectin-recognition ability of arrayed mannose, glucose and their oligo- and polysaccharides was confirmed using surface plasmon resonance imaging and laser-induced fluorescence imaging. PMID:21138274
Background: Obtaining reliable and reproducible two-color microarray gene expression data is critically important for understanding the biological significance of perturbations made on a cellular system. Microarray design, RNA preparation and labeling, hybridization conditions and data acquisition and analysis are variables difficult to simultaneously control. A useful tool for monitoring and controlling intra- and inter-experimental variation is Universal Reference RNA (URR), developed with the goal of providing hybridization signal at each microarray probe location (spot). Measuring signal at each spot as the ratio of experimental RNA to reference RNA targets, rather than relying on absolute signal intensity, decreases variability by normalizing signal output in any two-color hybridization experiment. Results: Human, mouse and rat URR (UHRR, UMRR and URRR, respectively) were prepared from pools of RNA derived from individual cell lines representing different tissues. A variety of microarrays were used to determine the percentage of spots hybridizing with URR and producing signal above a user-defined threshold (microarray coverage). Microarray coverage was consistently greater than 80% for all arrays tested. We confirmed that individual cell lines contribute their own unique set of genes to URR, arguing for a pool of RNA from several cell lines as a better configuration for URR as opposed to a single cell line source for URR. Microarray coverage values comparing two separately prepared batches each of UHRR, UMRR and URRR were highly correlated (Pearson's correlation coefficients of 0.97). Conclusion: Results of this study demonstrate that large quantities of pooled RNA from individual cell lines are reproducibly prepared and possess diverse gene representation. This type of reference provides a standard for reducing variation in microarray experiments and allows more reliable comparison of gene expression data within and between experiments and
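The two quantities this abstract relies on, the per-spot ratio of experimental to reference signal and the microarray coverage (percentage of spots with reference signal above a user-defined threshold), can be illustrated with a short sketch. The function names, the use of a log2 transform, the threshold of 100 and the intensity values below are all assumptions made for illustration; the abstract does not specify how the ratio was transformed or which threshold was used.

```python
import math


def log2_ratios(experimental, reference):
    """Per-spot log2(experimental / reference).

    Spots where either channel is non-positive cannot be log-transformed
    and are reported as None.
    """
    if len(experimental) != len(reference):
        raise ValueError("both channels must cover the same spots")
    return [
        math.log2(e / r) if e > 0 and r > 0 else None
        for e, r in zip(experimental, reference)
    ]


def microarray_coverage(reference, threshold):
    """Percentage of spots whose reference-channel signal exceeds the threshold."""
    if not reference:
        raise ValueError("no spots supplied")
    return 100.0 * sum(signal > threshold for signal in reference) / len(reference)


# Hypothetical intensities for five spots (arbitrary units).
reference_signal = [1200.0, 800.0, 50.0, 400.0, 1600.0]
experimental_signal = [2400.0, 800.0, 100.0, 100.0, 1600.0]

ratios = log2_ratios(experimental_signal, reference_signal)
coverage = microarray_coverage(reference_signal, threshold=100.0)
```

In this toy data the third spot falls below the threshold, so a ratio computed there rests on weak reference signal; that is the situation a high-coverage reference is meant to avoid.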
Salehi-Reyhani, Ali; Burgin, Edward; Ces, Oscar; Willison, Keith R; Klug, David R
Addressable droplet microarrays are potentially attractive as a way to achieve miniaturised, reduced volume, high sensitivity analyses without the need to fabricate microfluidic devices or small volume chambers. We report a practical method for producing oil-encapsulated addressable droplet microarrays which can be used for such analyses. To demonstrate their utility, we undertake a series of single cell analyses, to determine the variation in copy number of p53 proteins in cells of a human cancer cell line.
Nicolaisen, Mogens; Nyskjold, Henriette; Bertaccini, Assunta
Detection and identification of phytoplasmas is a laborious process often involving nested PCR followed by restriction enzyme analysis and fine-resolution gel electrophoresis. To improve throughput, other methods are needed. Microarray technology offers a generic assay that can potentially detect and differentiate all types of phytoplasmas in one assay. The present protocol describes a microarray-based method for identification of phytoplasmas to 16Sr group level.
Wullschleger, Stan D; Difazio, Stephen P
Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide high-throughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.
Lodha, T D; Basak, J
Plant defense responses are mediated by elementary regulatory proteins that affect expression of thousands of genes. Over the last decade, microarray technology has played a key role in deciphering the underlying networks of gene regulation in plants that lead to a wide variety of defence responses. The microarray is an important tool to quantify and profile the expression of thousands of genes simultaneously, with two main aims: (1) gene discovery and (2) global expression profiling. Several microarray technologies are currently in use; most include a glass slide platform with spotted cDNA or oligonucleotides. To date, microarray technology has been used to identify regulatory genes and end-point defence genes, and to understand the signal transduction processes underlying disease resistance and its intimate links to other physiological pathways. Microarray technology can be used for in-depth, simultaneous profiling of host/pathogen genes as the disease progresses from infection to resistance/susceptibility at different developmental stages of the host, which can be done in different environments, for clearer understanding of the processes involved. A thorough knowledge of plant disease resistance, gained through a successful combination of microarray and other high-throughput techniques as well as biochemical, genetic, and cell biological experiments, is needed for practical application to secure and stabilize the yield of many crop plants. This review starts with a brief introduction to microarray technology, followed by the basics of plant-pathogen interaction and the use of DNA microarrays over the last decade to unravel the mysteries of plant-pathogen interaction, and ends with the future prospects of this technology.
This vital new resource offers engineers and researchers a window on important new technology that will supersede the barcode and is destined to change the face of logistics and product data handling. In the last two decades, radio-frequency identification has grown fast, with accelerated take-up of RFID into the mainstream through its adoption by key users such as Wal-Mart, K-Mart and the US Department of Defense. RFID has many potential applications due to its flexibility, capability to operate out of line of sight, and its high data-carrying capacity. Yet despite optimistic projections of a market worth $25 billion by 2018, potential users are concerned about costs and investment returns. Clearly demonstrating the need for a fully printable chipless RFID tag as well as a powerful and efficient reader to assimilate the tag’s data, this book moves on to describe both. Introducing the general concepts in the field including technical data, it then describes how a chipless RFID tag can be made using a planar...
Strain-specific genomic diversity in the Mycobacterium tuberculosis complex (MTBC) is an important factor in pathogenesis that may affect virulence, transmissibility, host response and emergence of drug resistance. Several systems have been proposed to classify MTBC strains into distinct lineages and families. Here, we investigate single-nucleotide polymorphisms (SNPs) as robust (stable) markers of genetic variation for phylogenetic analysis. We identify ∼92k SNP across a global collection of 1,601 genomes. The SNP-based phylogeny is consistent with the gold-standard regions of difference (RD) classification system. Of the ∼7k strain-specific SNPs identified, 62 markers are proposed to discriminate known circulating strains. This SNP-based barcode is the first to cover all main lineages, and classifies a greater number of sublineages than current alternatives. It may be used to classify clinical isolates to evaluate tools to control the disease, including therapeutics and vaccines whose effectiveness may vary by strain type. © 2014 Macmillan Publishers Limited.
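The idea of a SNP-based barcode described above is that a small set of lineage-defining positions is enough to assign an isolate to a lineage. A minimal sketch of such a lookup follows; the lineage names, genome positions and alleles are entirely hypothetical placeholders and do not reproduce any of the 62 markers proposed in the study.

```python
def classify_by_snp_barcode(genotype, barcode):
    """Return the lineages whose marker alleles are all observed in the sample.

    genotype: dict mapping genome position -> observed base
    barcode:  dict mapping lineage name -> {position: lineage-defining base}
    """
    return sorted(
        lineage
        for lineage, markers in barcode.items()
        if all(genotype.get(pos) == allele for pos, allele in markers.items())
    )


# Hypothetical barcode: positions and alleles are placeholders, not real markers.
toy_barcode = {
    "lineage_1": {1001: "A", 2002: "T"},
    "lineage_2": {1001: "G", 3003: "C"},
}

# Hypothetical sample genotype at the barcode positions.
sample = {1001: "A", 2002: "T", 3003: "C"}

assigned = classify_by_snp_barcode(sample, toy_barcode)
```

Requiring every marker of a lineage to match is one possible design choice; a real classifier might instead tolerate missing calls or report the best partial match.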
Accurate identification of fish and fish products, from eggs to adults, is important in many areas. Grey mullets of the family Mugilidae are distributed worldwide and inhabit marine, estuarine, and freshwater environments in all tropical and temperate regions. Various Mugilid species are commercially important species in fishery and aquaculture of many countries. For the present study we have chosen two Mugilid genes with different phylogenetic signals: relatively variable mitochondrial cytochrome oxidase subunit I (COI and conservative nuclear rhodopsin (RHO. We examined their diversity within and among 9 Mugilid species belonging to 4 genera, many of which have been examined from multiple specimens, with the goal of determining whether DNA barcoding can achieve unambiguous species recognition of Mugilid species. The data obtained showed that information based on COI sequences was diagnostic not only for species-level identification but also for recognition of intraspecific units, e.g., allopatric populations of circumtropical Mugil cephalus, or even native and acclimatized specimens of Chelon haematocheila. All RHO sequences appeared strictly species specific. Based on the data obtained, we conclude that COI, as well as RHO sequencing can be used to unambiguously identify fish species. Topologies of phylogeny based on RHO and COI sequences coincided with each other, while together they had a good phylogenetic signal.
DNA barcoding is a promising tool to facilitate a rapid and unambiguous identification of sponge species. Demosponges of the order Dictyoceratida are particularly challenging to identify, but are of ecological as well as biochemical importance. Here we apply DNA barcoding with the standard CO1-barcoding marker on selected Indo-Pacific specimens of two genera, Ircinia and Psammocinia of the family Irciniidae. We show that the CO1 marker identifies several species new to science, reveals separate radiation patterns of deep-sea Ircinia sponges and indicates dispersal patterns of Psammocinia species. However, some species cannot be unambiguously barcoded by this marker alone due to low evolutionary rates. We support previous suggestions for a combination of the standard CO1 fragment with an additional fragment for sponge DNA barcoding.
Abstract Background Veterinary drugs such as clenbuterol (CL) and sulfamethazine (SM2) are low molecular weight ( Results The artificial antigens were spotted on microarray slides. Standard concentrations of the compounds were added to compete with the spotted antigens for binding to the antisera to determine the IC50. Our microarray assay showed that the IC50 values were 39.6 ng/ml for CL and 48.8 ng/ml for SM2, while the traditional competitive indirect-ELISA (ci-ELISA) showed that the IC50 values were 190.7 ng/ml for CL and 156.7 ng/ml for SM2. We further validated the two methods with CL-fortified chicken muscle tissues, and the protein microarray assay showed 90% recovery while the ci-ELISA had a 76% recovery rate. When tested with CL-fed chicken muscle tissues, the protein microarray assay had higher sensitivity (0.9 ng/g) than the ci-ELISA (0.1 ng/g) for detection of CL residues. Conclusions The protein microarrays showed 4.5 and 3.5 times lower IC50 than the ci-ELISA detection for CL and SM2, respectively, suggesting that immunodetection of small molecules with protein microarray is a better approach than the traditional ELISA technique.
Carbohydrates play a crucial role in host-microorganism interactions and many host glycoconjugates are receptors or co-receptors for microbial binding. Host glycosylation varies with species and location in the body, and this contributes to species specificity and tropism of commensal and pathogenic bacteria. Additionally, bacterial glycosylation is often the first bacterial molecular species encountered and responded to by the host system. Accordingly, characterising and identifying the exact structures involved in these critical interactions is an important priority in deciphering microbial pathogenesis. Carbohydrate-based microarray platforms have been an underused tool for screening bacterial interactions with specific carbohydrate structures, but they have been growing in popularity in recent years. In this review, we discuss carbohydrate-based microarrays that have been profiled with whole bacteria, recombinantly expressed adhesins or serum antibodies. Three main types of carbohydrate-based microarray platform are considered: (i) conventional carbohydrate or glycan microarrays; (ii) whole mucin microarrays; and (iii) microarrays constructed from bacterial polysaccharides or their components. Determining the nature of the interactions between bacteria and host can help clarify the molecular mechanisms of carbohydrate-mediated interactions in microbial pathogenesis, infectious disease and host immune response and may lead to new strategies to boost therapeutic treatments.
Erickson, A; Fisher, M; Furukawa-Stoffer, T; Ambagala, A; Hodko, D; Pasick, J; King, D P; Nfon, C; Ortega Polo, R; Lung, O
Microarray technology can be useful for pathogen detection as it allows simultaneous interrogation of the presence or absence of a large number of genetic signatures. However, most microarray assays are labour-intensive and time-consuming to perform. This study describes the development and initial evaluation of a multiplex reverse transcription (RT)-PCR and novel accompanying automated electronic microarray assay for simultaneous detection and differentiation of seven important viruses that affect swine (foot-and-mouth disease virus [FMDV], swine vesicular disease virus [SVDV], vesicular exanthema of swine virus [VESV], African swine fever virus [ASFV], classical swine fever virus [CSFV], porcine respiratory and reproductive syndrome virus [PRRSV] and porcine circovirus type 2 [PCV2]). The novel electronic microarray assay utilizes a single, user-friendly instrument that integrates and automates capture probe printing, hybridization, washing and reporting on a disposable electronic microarray cartridge with 400 features. This assay accurately detected and identified a total of 68 isolates of the seven targeted virus species including 23 samples of FMDV, representing all seven serotypes, and 10 CSFV strains, representing all three genotypes. The assay successfully detected viruses in clinical samples from the field and from experimentally infected animals (as early as 1 day post-infection (dpi) for FMDV and SVDV, 4 dpi for ASFV, 5 dpi for CSFV), as well as in biological materials that were spiked with target viruses. The limit of detection was 10 copies/μl for ASFV, PCV2 and PRRSV, 100 copies/μl for SVDV, CSFV, VESV and 1,000 copies/μl for FMDV. The electronic microarray component had reduced analytical sensitivity for several of the target viruses when compared with the multiplex RT-PCR. The integration of capture probe printing allows custom onsite array printing as needed, while electrophoretically driven hybridization generates results faster than conventional
Tan, Ji; Lim, Phaik-Eem; Phang, Siew-Moi; Hong, Dang Diem; Sunarpi, H; Hurtado, Anicia Q
DNA barcoding has been a major advancement in the field of taxonomy, seeing much effort put into the barcoding of wide taxa of organisms, macro and microalgae included. The mitochondrial-encoded cox1 and plastid-encoded rbcL have been proposed as potential DNA barcodes for rhodophytes, but are yet to be tested on the commercially important carrageenophytes Kappaphycus and Eucheuma. This study gauges the effectiveness of four markers, namely the mitochondrial cox1, cox2, cox2-3 spacer and the plastid rbcL, in DNA barcoding of selected Kappaphycus and Eucheuma from Southeast Asia. Marker assessments were performed using established distance and tree-based identification criteria from earlier studies. Barcoding patterns on a larger scale were simulated by empirically testing on the commonly used cox2-3 spacer. The phylogeny of these rhodophytes was also briefly described. In this study, the cox2 marker, which satisfies the prerequisites of DNA barcodes, was found to exhibit moderately high interspecific divergences with no intraspecific variations, and is thus a promising marker for the DNA barcoding of Kappaphycus and Eucheuma. However, the already extensively used cox2-3 spacer was deemed to be overall more appropriate as a DNA barcode for these two genera. On a wider scale, cox1 and rbcL were still better DNA barcodes across the rhodophyte taxa when practicality and cost-efficiency were taken into account. The phylogeny of Kappaphycus and Eucheuma was generally similar to that reported earlier. Still, the application of DNA barcoding has demonstrated our relatively poor taxonomic comprehension of these seaweeds, thus suggesting more in-depth efforts in taxonomic restructuring as well as establishment.
Nowadays, due to the increasing importance of quality care, organizations focus on improving the provision, management and distribution of health. On one hand, the incremental costs of new technologies and, on the other hand, the increased knowledge of health care recipients and their expectations of high quality services have doubled the need to make changes in order to respond to resource constraints (financial, human, material). For this purpose, several technologies, such as barcodes, have been used in hospitals to improve services and staff productivity; but various factors affect the adoption of new technologies and, despite good implementation of a technology and its benefits, sometimes personnel do not accept and do not use it. This is an applied descriptive cross-sectional study in which all the barcode users in the health information management departments of the three academic hospitals (Feiz, Al-Zahra, Ayatollah Kashani) affiliated to Isfahan University of Medical Sciences were surveyed with the barcode technology acceptance questionnaire in six areas: barcode ease of learning, capabilities, perception of its usefulness and its ease of use, users' attitudes towards its use, and users' intention. The findings showed that total acceptance of barcode technology was relatively desirable (76.9%); the greatest compliance with the TAM model was related to user perceptions about the ease of use of barcode technology and the least compliance was related to the ease of learning barcode technology (83.7% and 71.5%, respectively). Ease of learning and barcode capability affect the perceived usefulness and perceived ease of use of barcode technology. Users' perceptions affect their attitudes toward greater use of the technology, their attitudes affect their intention to use the technology and, finally, their intention leads to actual use of the technology (acceptance). Therefore, considering the six elements related to technology implementation can be important in the barcode
Jelacic, Srdjan; Bowdle, Andrew; Nair, Bala G; Kusulos, Dolly; Bower, Lynnette; Togashi, Kei
Many anesthetic drug errors result from vial or syringe swaps. Scanning the barcodes on vials before drug preparation, creating syringe labels that include barcodes, and scanning the syringe label barcodes before drug administration may help to prevent errors. In contrast, making syringe labels by hand that comply with the recommendations of regulatory agencies and standards-setting bodies is tedious and time consuming. A computerized system that uses vial barcodes and generates barcoded syringe labels could address both safety issues and labeling recommendations. We measured compliance of syringe labels in multiple operating rooms (ORs) with the recommendations of regulatory agencies and standards-setting bodies before and after the introduction of the Codonics Safe Label System (SLS). The Codonics SLS was then combined with Smart Anesthesia Manager software to create an anesthesia barcode drug administration system, which allowed us to measure the rate of scanning syringe label barcodes at the time of drug administration in 2 cardiothoracic ORs before and after introducing a coffee card incentive. Twelve attending cardiothoracic anesthesiologists and the OR satellite pharmacy participated. The use of the Codonics SLS drug labeling system resulted in >75% compliant syringe labels (95% confidence interval, 75%-98%). All syringe labels made using the Codonics SLS system were compliant. The average rate of scanning barcodes on syringe labels using Smart Anesthesia Manager was 25% (730 of 2976) over 13 weeks but increased to 58% (956 of 1645) over 8 weeks after introduction of a simple (coffee card) incentive (P < 0.001). An anesthesia barcode drug administration system resulted in a moderate rate of scanning syringe label barcodes at the time of drug administration. Further, adaptation of the system will be required to achieve a higher utilization rate.
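The scanning rates quoted in this abstract can be rechecked from the raw counts. The sketch below is a minimal illustration that assumes a pooled two-proportion z-test; the abstract does not say which test the authors used, and the function name `two_proportion_z_test` is hypothetical.

```python
import math


def two_proportion_z_test(x1, n1, x2, n2):
    """Return (p1, p2, z, two-sided p-value) for a pooled two-proportion z-test.

    Assumption: this is an illustrative test choice, not necessarily the
    one used in the study.
    """
    p1 = x1 / n1
    p2 = x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p2 - p1) / standard_error
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return p1, p2, z, p_value


# Counts taken from the abstract: 730 of 2976 scans before the incentive,
# 956 of 1645 scans after it.
before_rate, after_rate, z_score, p_value = two_proportion_z_test(730, 2976, 956, 1645)
print(f"before: {before_rate:.1%}, after: {after_rate:.1%}, p < 0.001: {p_value < 0.001}")
```

Under this assumed test the two proportions round to the 25% and 58% given in the abstract, and the difference is significant at the reported P < 0.001 level.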
Wang, Juan; Zhang, Li; Zhang, Qi-Lin; Zhou, Min-Qiang; Wang, Xiao-Tong; Yang, Xing-Zhuo; Yuan, Ming-Long
The family Miridae is one of the most species-rich families of insects. To better understand the diversity and evolution of mirids, we determined the mitogenome of Lygus pratensis and re-sequenced the mitogenomes of four mirids (i.e., Apolygus lucorum, Adelphocoris suturalis, Ade. fasciaticollis and Ade. lineolatus). We performed a comparative analysis for 15 mitogenomic sequences representing 11 species of five genera within Miridae and evaluated the potential of these mitochondrial genes as molecular markers. Our results showed that the general mitogenomic features (gene content, gene arrangement, base composition and codon usage) were well conserved among these mirids. Four protein-coding genes (PCGs) (cox1, cox3, nad1 and nad3) had no length variability, whereas nad5 showed the largest size variation; no intraspecific length variation was found in PCGs. Two PCGs (nad4 and nad5) showed relatively high substitution rates at the nucleotide and amino acid levels, whereas cox1 had the lowest substitution rate. The Ka/Ks values for all PCGs were far lower than 1 (<0.59), but the Ka/Ks values of cox1-barcode sequences were always larger than 1 (1.34-15.20), indicating that the 658 bp sequences of cox1 may not be the appropriate marker due to positive selection or selection relaxation. Phylogenetic analyses based on two concatenated mitogenomic datasets consistently supported the relationship of Nesidiocoris + (Trigonotylus + (Adelphocoris + (Apolygus + Lygus))), as revealed by nad4, nad5, rrnL and the combined 22 transfer RNA genes (tRNAs), respectively. Taking sequence length, substitution rate and phylogenetic signal together, the individual genes (nad4, nad5 and rrnL) and the combined 22 tRNAs could be used as potential molecular markers for Miridae at various taxonomic levels. Our results suggest that it is essential to evaluate and select suitable markers for different taxa groups when performing phylogenetic, population genetic and species identification studies.
Makarova, Olga; Contaldo, Nicoletta; Paltrinieri, Samanta
Background Phytoplasmas are bacterial phytopathogens responsible for significant losses in agricultural production worldwide. Several molecular markers are available for identification of groups or strains of phytoplasmas. However, they often cannot be used for identification of phytoplasmas from different groups simultaneously or are too long for routine diagnostics. DNA barcoding recently emerged as a convenient tool for species identification. Here, the development of a universal DNA barcode based on the elongation factor Tu (tuf) gene for phytoplasma identification is reported. Methodology/Principal Findings We designed a new set of primers and amplified a 420–444 bp fragment of tuf from all 91 phytoplasma strains tested (16S rRNA groups -I through -VII, -IX through -XII, -XV, and -XX). Comparison of NJ trees constructed from the tuf barcode and a 1.2 kbp fragment of the 16S ribosomal gene revealed...
Goldstein, Paul Z; DeSalle, Rob
DNA barcodes, like traditional sources of taxonomic information, are potentially powerful heuristics in the identification of described species but require mindful analytical interpretation. The role of DNA barcoding in generating hypotheses of new taxa in need of formal taxonomic treatment is discussed, and it is emphasized that the recursive process of character evaluation is both necessary and best served by understanding the empirical mechanics of the discovery process. These undertakings carry enormous ramifications not only for the translation of DNA sequence data into taxonomic information but also for our comprehension of the magnitude of species diversity and its disappearance. This paper examines the potential strengths and pitfalls of integrating DNA sequence data, specifically in the form of DNA barcodes as they are currently generated and analyzed, with taxonomic practice.
In this paper, it is shown that S-shaped split ring resonators (S-SRRs) are useful particles for the implementation of spectral signature (i.e., a class of radiofrequency) barcodes based on coplanar waveguide (CPW) transmission lines loaded with such resonant elements. By virtue of their S shape, these resonators are electrically small. Hence S-SRRs are of interest for the miniaturization of the barcodes, since multiple resonators, each tuned at a different frequency, are used for encoding purposes. In particular, a 10-bit barcode occupying 1 GHz spectral bandwidth centered at 2.5 GHz, with dimensions of 9 cm², is presented in this paper.
Novo, Sergi; Nogués, Carme; Penon, Oriol; Barrios, Leonardo; Santaló, Josep; Gómez-Martínez, Rodrigo; Esteve, Jaume; Errachid, Abdelhamid; Plaza, José Antonio; Pérez-García, Lluïsa; Ibáñez, Elena
Is the attachment of biofunctionalized polysilicon barcodes to the outer surface of the zona pellucida an effective approach for the direct tagging and identification of human oocytes and embryos during assisted reproduction technologies (ARTs)? The direct tagging system based on lectin-biofunctionalized polysilicon barcodes of micrometric dimensions is simple, safe and highly efficient, allowing the identification of human oocytes and embryos during the various procedures typically conducted during an assisted reproduction cycle. Measures to prevent mismatching errors (mix-ups) of the reproductive samples are currently in place in fertility clinics, but none of them are totally effective and several mix-up cases have been reported worldwide. Using a mouse model, our group has previously developed an effective direct embryo tagging system which does not interfere with the in vitro and in vivo development of the tagged embryos. This system has now been tested in human oocytes and embryos. Fresh immature and mature fertilization-failed oocytes (n = 21) and cryopreserved day 1 embryos produced by in vitro fertilization (IVF) or intracytoplasmic sperm injection (ICSI) (n = 205) were donated by patients (n = 76) undergoing ARTs. In vitro development rates, embryo quality and post-vitrification survival were compared between tagged (n = 106) and non-tagged (control) embryos (n = 99). Barcode retention and identification rates were also calculated, both for embryos and for oocytes subjected to a simulated ICSI and parthenogenetic activation. Experiments were conducted from January 2012 to January 2013. Barcodes were fabricated in polysilicon and biofunctionalized with wheat germ agglutinin lectin. Embryos were tagged with 10 barcodes and cultured in vitro until the blastocyst stage, when they were either differentially stained with propidium iodide and Hoechst or vitrified using the Cryotop method. Embryo quality was also analyzed by embryo grading and time
Kebschull, Justus M; Garcia da Silva, Pedro; Reid, Ashlan P; Peikon, Ian D; Albeanu, Dinu F; Zador, Anthony M
Neurons transmit information to distant brain regions via long-range axonal projections. In the mouse, area-to-area connections have only been systematically mapped using bulk labeling techniques, which obscure the diverse projections of intermingled single neurons. Here we describe MAPseq (Multiplexed Analysis of Projections by Sequencing), a technique that can map the projections of thousands or even millions of single neurons by labeling large sets of neurons with random RNA sequences ("barcodes"). Axons are filled with barcode mRNA, each putative projection area is dissected, and the barcode mRNA is extracted and sequenced. Applying MAPseq to the locus coeruleus (LC), we find that individual LC neurons have preferred cortical targets. By recasting neuroanatomy, which is traditionally viewed as a problem of microscopy, as a problem of sequencing, MAPseq harnesses advances in sequencing technology to permit high-throughput interrogation of brain circuits.
Liao, Jing; Chao, Zhi; Zhang, Liang
To identify the common snakes in medicated liquor of Guangdong using COI barcode sequences, and to test the feasibility of this approach. The COI barcode sequences of collected medicinal snakes were amplified and sequenced. The sequences, combined with data from GenBank, were analyzed for divergence and used to build a neighbor-joining (NJ) tree with MEGA 5.0. The genetic distance and NJ tree analyses showed that there were 241 variable sites in these species, and the average (A + T) content of 56.2% was higher than the average (G + C) content of 43.7%. The maximum interspecific genetic distance was 0.2568, and the minimum was 0.1519. In the NJ tree, each species formed a monophyletic clade with bootstrap support of 100%. The DNA barcoding identification method based on the COI sequence is accurate and can be applied to identify the common medicinal snakes.
The zebra mussel (Dreissena polymorpha) and the quagga mussel (Dreissena rostriformis bugensis) are considered the most competitive invaders in the freshwaters of Europe and North America. Although shell characteristics exist to differentiate the two species, phenotypic plasticity in the genus Dreissena does not always allow a clear identification. Therefore, an accurate identification method is essential. DNA barcoding has been proven to be an adequate procedure to discriminate species. The cytochrome c oxidase subunit 1 mitochondrial gene (COI) is considered the standard barcode for animals. We tested the use of this gene as an efficient DNA barcode and found that it allows rapid and accurate identification of adult Dreissena individuals.
Schirripa Spagnolo, Giuseppe; Cozzella, Lorenzo; Simonetti, Carla
Nowadays all the National Central Banks are continuously studying innovative anti-counterfeiting systems for banknotes. In this note, an innovative solution is proposed, which combines the potentiality of a hylemetric approach (methodology conceptually similar to biometry), based on notes' intrinsic characteristics, with a well-known and consolidated 2D barcode identification system. In particular, in this note we propose to extract from the banknotes a univocal binary control sequence (template) and insert an encrypted version of it in a barcode printed on the same banknote. For a more acceptable look and feel of a banknote, the superposed barcode can be stamped using IR ink that is visible to near-IR image sensors. This makes the banknote verification simpler.
Littlefair, Joanne E; Clare, Elizabeth L
Society faces the complex challenge of supporting biodiversity and ecosystem functioning, while ensuring food security by providing safe traceable food through an ever-more-complex global food chain. The increase in human mobility brings the added threat of pests, parasites, and invaders that further complicate our agro-industrial efforts. DNA barcoding technologies allow researchers to identify both individual species, and, when combined with universal primers and high-throughput sequencing techniques, the diversity within mixed samples (metabarcoding). These tools are already being employed to detect market substitutions, trace pests through the forensic evaluation of trace "environmental DNA", and to track parasitic infections in livestock. The potential of DNA barcoding to contribute to increased security of the food chain is clear, but challenges remain in regulation and the need for validation of experimental analysis. Here, we present an overview of the current uses and challenges of applied DNA barcoding in agriculture, from agro-ecosystems within farmland to the kitchen table.
Matthew T Aliota
Defining the complex dynamics of Zika virus (ZIKV) infection in pregnancy and during transmission between vertebrate hosts and mosquito vectors is critical for a thorough understanding of viral transmission, pathogenesis, immune evasion, and potential reservoir establishment. Within-host viral diversity in ZIKV infection is low, which makes it difficult to evaluate infection dynamics. To overcome this biological hurdle, we constructed a molecularly barcoded ZIKV. This virus stock consists of a "synthetic swarm" whose members are genetically identical except for a run of eight consecutive degenerate codons, which creates approximately 64,000 theoretical nucleotide combinations that all encode the same amino acids. Deep sequencing this region of the ZIKV genome enables counting of individual barcodes to quantify the number and relative proportions of viral lineages present within a host. Here we used these molecularly barcoded ZIKV variants to study the dynamics of ZIKV infection in pregnant and non-pregnant macaques as well as during mosquito infection/transmission. The barcoded virus had no discernible fitness defects in vivo, and the proportions of individual barcoded virus templates remained stable throughout the duration of acute plasma viremia. ZIKV RNA also was detected in maternal plasma from a pregnant animal infected with barcoded virus for 67 days. The complexity of the virus population declined precipitously 8 days following infection of the dam, consistent with the timing of typical resolution of ZIKV in non-pregnant macaques, and remained low for the subsequent duration of viremia. Our approach showed that synthetic swarm viruses can be used to probe the composition of ZIKV populations over time in vivo to understand vertical transmission, persistent reservoirs, bottlenecks, and evolutionary dynamics.
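The size of such a barcode space follows from simple counting. The sketch below assumes that each of the eight codons is four-fold degenerate at its third position, which the abstract does not state explicitly; the codon prefixes in `EXAMPLE_PREFIXES` are illustrative placeholders from four-fold degenerate codon families in the standard genetic code, not the actual ZIKV barcode region.

```python
from itertools import product

BASES = "ACGT"

# Hypothetical two-base prefixes of four-fold degenerate codon families
# (e.g. GCN = Ala, GGN = Gly); NOT taken from the ZIKV genome.
EXAMPLE_PREFIXES = ["GC", "GG", "CC", "AC", "GT", "TC", "CT", "CG"]


def barcode_count(n_codons, variants_per_codon=4):
    """Number of synonymous sequences when each codon has the given number of variants."""
    return variants_per_codon ** n_codons


def enumerate_barcodes(prefixes):
    """Yield every sequence obtained by varying the third base of each codon."""
    for third_bases in product(BASES, repeat=len(prefixes)):
        yield "".join(prefix + base for prefix, base in zip(prefixes, third_bases))


print(barcode_count(8))  # 4 ** 8 combinations under the stated assumption
print(next(enumerate_barcodes(EXAMPLE_PREFIXES)))  # first synonymous variant
```

Under this assumption the count is 4^8 = 65,536, which is on the order of the "approximately 64,000" combinations quoted in the abstract.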
Robin van Velzen
Recently diverged species are challenging for identification, yet they are frequently of special interest scientifically as well as from a regulatory perspective. DNA barcoding has proven instrumental in species identification, especially in insects and vertebrates, but for the identification of recently diverged species it has been reported to be problematic in some cases. Problems are mostly due to incomplete lineage sorting or simply lack of a 'barcode gap' and probably related to large effective population size and/or low mutation rate. Our objective was to compare six methods in their ability to correctly identify recently diverged species with DNA barcodes: neighbor joining and parsimony (both tree-based), nearest neighbor and BLAST (similarity-based), and the diagnostic methods DNA-BAR and BLOG. We analyzed simulated data assuming three different effective population sizes as well as three selected empirical data sets from published studies. Results show, as expected, that success rates are significantly lower for recently diverged species (∼75%) than for older species (∼97%) (P<0.00001). Similarity-based and diagnostic methods significantly outperform tree-based methods when applied to simulated DNA barcode data (P<0.00001). The diagnostic method BLOG had the highest correct query identification rate based on simulated (86.2%) as well as empirical data (93.1%), indicating that it is a consistently better method overall. Another advantage of BLOG is that it offers species-level information that can be used outside the realm of DNA barcoding, for instance in species description or molecular detection assays. Even though we can confirm that identification success based on DNA barcoding is generally high in our data, recently diverged species remain difficult to identify. Nevertheless, our results contribute to improved solutions for their accurate identification.
Saddhe, Ankush Ashok; Jamdade, Rahul Arvind; Kumar, Kundan
Mangroves are salt-tolerant forest ecosystems of tropical and subtropical intertidal regions. They are among the most productive, diverse and biologically important ecosystems, and are increasingly threatened. Identification of mangrove species is of critical importance in conserving and utilizing biodiversity, but is apparently hindered by a lack of taxonomic expertise. In recent years, DNA barcoding using the plastid markers rbcL and matK has been suggested as an effective method to enrich traditional taxonomic expertise for rapid species identification and biodiversity inventories. In the present study, we assessed the 14 available mangrove species of Goa, on the west coast of India, based on the core DNA barcode markers rbcL and matK. PCR amplification success rate, intra- and inter-specific genetic distance variation and the correct identification percentage were taken into account to assess candidate barcode regions. PCR and sequencing success rates were high in the rbcL (97.7 %) and matK (95.5 %) regions. The two candidate chloroplast barcoding regions (rbcL, matK) yielded barcode gaps. Our results clearly demonstrated that the matK locus gave the highest correct identification rate (72.09 %) based on the TaxonDNA Best Match criteria. The concatenated rbcL + matK loci were able to adequately discriminate all mangrove genera, and species to some extent, except those in Rhizophora, Sonneratia and Avicennia. Our study provides the first endorsement of species resolution among mangroves using plastid genes, with few exceptions. Our future work will focus on evaluating other barcode markers to achieve complete resolution of mangrove species and identification of putative hybrids.
Dhar, Bishal; Ghosh, Sankar Kumar
Ornamental fishes are exported under trade names or generic names, thus creating problems in species identification. In this regard, DNA barcoding could effectively elucidate the actual species status. However, problems arise if the specimen is subject to taxonomic disputes, is falsified by trade or generic names, etc. On the other hand, barcoding archival museum specimens would be of great benefit in addressing such issues, as it would create a firm, error-free reference database for rapid identification of any species. This can be achieved only by generating short sequences, as DNA from chemically preserved specimens is mostly degraded. Here we aimed to identify a short stretch of informative sites within the full-length barcode segment, capable of delineating a diverse group of ornamental fish species commonly traded from NE India. We analyzed 287 full-length barcode sequences from the major fish orders and compared the interspecific K2P distance with nucleotide substitution patterns and found a strong correlation of interspecies distance with transversions (0.95, pbarcode. The proposed segment was compared with the full-length barcodes and found to delineate the species effectively. Successful PCR amplification and sequencing of the 171 bp segment using designed primers for different orders validated it as a mini-barcode for ornamental fishes. Thus, our findings would be helpful in strengthening the global database with the sequences of archived fish species, as well as providing an effective identification tool for traded ornamental fish species as a less time-consuming, cost-effective field-based application.
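For readers unfamiliar with the K2P distance mentioned above, the Kimura 2-parameter model treats transitions and transversions separately: with P the proportion of transitional differences and Q the proportion of transversional differences, d = -(1/2) ln[(1 - 2P - Q) sqrt(1 - 2Q)]. The following is a minimal sketch for two pre-aligned sequences; the function name and the handling of gaps and ambiguous bases (such sites are skipped) are assumptions made for illustration, not the authors' pipeline.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned DNA sequences.

    Sites where either sequence has a gap or an ambiguous base are skipped
    (an assumption of this sketch). Raises ValueError for saturated pairs,
    where the logarithm is undefined.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; every other change is a transversion.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# One transition (A->G) in ten sites.
print(round(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"), 4))
```

Separating the two substitution classes is what makes it possible to relate interspecific distance to transversions specifically, as the abstract describes.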
Zeng, Zhaoqing; Zhao, Peng; Luo, Jing; Zhuang, Wenying; Yu, Zhihe
A DNA barcode is a short segment of sequence that is able to distinguish species. A barcode must ideally contain enough variation to distinguish every individual species and be easily obtained. Fungi of Nectriaceae are economically important and show high species diversity. To establish a standard DNA barcode for this group of fungi, the genomes of Neurospora crassa and 30 other filamentous fungi were compared. The expect value was treated as a criterion to recognize homologous sequences. Four candidate markers, Hsp90, AAC, CDC48, and EF3, were tested for their feasibility as barcodes in the identification of 34 well-established species belonging to 13 genera of Nectriaceae. Two hundred and fifteen sequences were analyzed. Intra- and inter-specific variations and the success rate of PCR amplification and sequencing were considered as important criteria for estimation of the candidate markers. Ultimately, the partial EF3 gene met the requirements for a good DNA barcode: No overlap was found between the intra- and inter-specific pairwise distances. The smallest inter-specific distance of EF3 gene was 3.19%, while the largest intra-specific distance was 1.79%. In addition, there was a high success rate in PCR and sequencing for this gene (96.3%). CDC48 showed sufficiently high sequence variation among species, but the PCR and sequencing success rate was 84% using a single pair of primers. Although the Hsp90 and AAC genes had higher PCR and sequencing success rates (96.3% and 97.5%, respectively), overlapping occurred between the intra- and inter-specific variations, which could lead to misidentification. Therefore, we propose the EF3 gene as a possible DNA barcode for the nectriaceous fungi.
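The "no overlap" criterion used above can be expressed as a simple check: a barcode gap exists when the smallest inter-specific distance exceeds the largest intra-specific distance. The sketch below uses only the two extreme values reported in the abstract for EF3; the function names are hypothetical, and a real analysis would pass in the full sets of pairwise distances.

```python
def barcode_gap(intra_distances, inter_distances):
    """Return min(inter-specific) minus max(intra-specific) distance.

    A positive value means the two distributions do not overlap,
    i.e. a barcode gap exists.
    """
    return min(inter_distances) - max(intra_distances)


def has_barcode_gap(intra_distances, inter_distances):
    """True when every inter-specific distance exceeds every intra-specific one."""
    return barcode_gap(intra_distances, inter_distances) > 0


# Extremes reported for EF3 (in percent): largest intra-specific distance
# 1.79, smallest inter-specific distance 3.19.
ef3_gap = barcode_gap([1.79], [3.19])
print(f"EF3 gap: {ef3_gap:.2f} percentage points, gap present: {has_barcode_gap([1.79], [3.19])}")
```

For EF3 this gives a gap of about 1.40 percentage points, whereas markers such as Hsp90 and AAC, whose intra- and inter-specific distances overlap, would yield a non-positive value.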
Lobo, Jorge; Teixeira, Marcos A L; Borges, Luisa M S; Ferreira, Maria S G; Hollatz, Claudia; Gomes, Pedro T; Sousa, Ronaldo; Ravara, Ascensão; Costa, Maria H; Costa, Filipe O
Annelid polychaetes have seldom been the focus of dedicated DNA barcoding studies, despite their ecological relevance and frequent dominance, particularly in soft-bottom estuarine and coastal marine ecosystems. Here, we report the first assessment of the performance of DNA barcodes in the discrimination of shallow water polychaete species from the southern European Atlantic coast, focusing on specimens collected in estuaries and coastal ecosystems of Portugal. We analysed cytochrome oxidase I DNA barcodes (COI-5P) from 164 specimens, which were assigned to 51 morphospecies. To our data set from Portugal, we added available published sequences selected from the same species, genus or family, to inspect for taxonomic congruence among studies and collection locations. The final data set comprised 290 specimens and 79 morphospecies, which generated 99 Barcode Index Numbers (BINs) within Barcode of Life Data Systems (BOLD). Among these, 22 BINs were singletons, 47 other BINs were concordant, confirming the initial identification based on morphological characters, and 30 were discordant, most of which consisted of multiple BINs found for the same morphospecies. Some of the most prominent cases in the latter category include Hediste diversicolor (O.F. Müller, 1776) (7), Eulalia viridis (Linnaeus, 1767) (2) and Owenia fusiformis (delle Chiaje, 1844) (5), all of them reported from Portugal and frequently used in ecological studies as environmental quality indicators. Our results for these species showed discordance between molecular lineages and morphospecies, or revealed additional, relatively divergent lineages. The potential inaccuracies in environmental assessments, where the underpinning polychaete species diversity is poorly resolved or clarified, demand additional and extensive investigation of the DNA barcode diversity in this group, in parallel with alpha taxonomy efforts. © 2015 John Wiley & Sons Ltd.
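The singleton / concordant / discordant tally above (22 + 47 + 30 = 99 BINs) can be illustrated with a small sketch. The classification rules used here are a simplifying assumption based on the wording of the abstract, not BOLD's own implementation: a BIN with one specimen is a singleton, a BIN is concordant when it holds a single morphospecies that occurs in no other BIN, and everything else is discordant. The function name and the sample records are hypothetical.

```python
from collections import defaultdict


def classify_bins(records):
    """Label each BIN as 'singleton', 'concordant' or 'discordant'.

    records: iterable of (bin_id, morphospecies) pairs, one per specimen.
    The rules are an illustrative assumption (see the text above).
    """
    species_in_bin = defaultdict(set)
    bins_of_species = defaultdict(set)
    specimens_in_bin = defaultdict(int)

    for bin_id, species in records:
        species_in_bin[bin_id].add(species)
        bins_of_species[species].add(bin_id)
        specimens_in_bin[bin_id] += 1

    labels = {}
    for bin_id, species in species_in_bin.items():
        if specimens_in_bin[bin_id] == 1:
            labels[bin_id] = "singleton"
        elif len(species) == 1 and len(bins_of_species[next(iter(species))]) == 1:
            labels[bin_id] = "concordant"
        else:
            # Several morphospecies in one BIN, or one morphospecies split
            # across several BINs.
            labels[bin_id] = "discordant"
    return labels


# Hypothetical specimen records.
records = [
    ("BIN1", "sp. A"), ("BIN1", "sp. A"),
    ("BIN2", "sp. B"), ("BIN2", "sp. B"),
    ("BIN3", "sp. B"), ("BIN3", "sp. B"),
    ("BIN4", "sp. C"),
]
labels = classify_bins(records)
```

In this toy input, "sp. B" is split across two BINs, the pattern the abstract describes as the most common source of discordance.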
Wirta, H; Várkonyi, G; Rasmussen, C; Kaartinen, R; Schmidt, N M; Hebert, P D N; Barták, M; Blagoev, G; Disney, H; Ertl, S; Gjelstrup, P; Gwiazdowicz, D J; Huldén, L; Ilmonen, J; Jakovlev, J; Jaschhof, M; Kahanpää, J; Kankaanpää, T; Krogh, P H; Labbee, R; Lettner, C; Michelsen, V; Nielsen, S A; Nielsen, T R; Paasivirta, L; Pedersen, S; Pohjoismäki, J; Salmela, J; Vilkamaa, P; Väre, H; von Tschirnhaus, M; Roslin, T
DNA sequences offer powerful tools for describing the members and interactions of natural communities. In this study, we establish the to-date most comprehensive library of DNA barcodes for a terrestrial site, including all known macroscopic animals and vascular plants of an intensively studied area of the High Arctic, the Zackenberg Valley in Northeast Greenland. To demonstrate its utility, we apply the library to identify nearly 20 000 arthropod individuals from two Malaise traps, each operated for two summers. Drawing on this material, we estimate the coverage of previous morphology-based species inventories, derive a snapshot of faunal turnover in space and time and describe the abundance and phenology of species in the rapidly changing arctic environment. Overall, 403 terrestrial animal and 160 vascular plant species were recorded by morphology-based techniques. DNA barcodes (CO1) offered high resolution in discriminating among the local animal taxa, with 92% of morphologically distinguishable taxa assigned to unique Barcode Index Numbers (BINs) and 93% to monophyletic clusters. For vascular plants, resolution was lower, with 54% of species forming monophyletic clusters based on barcode regions rbcLa and ITS2. Malaise catches revealed 122 BINs not detected by previous sampling and DNA barcoding. The insect community was dominated by a few highly abundant taxa. Even closely related taxa differed in phenology, emphasizing the need for species-level resolution when describing ongoing shifts in arctic communities and ecosystems. The DNA barcode library now established for Zackenberg offers new scope for such explorations, and for the detailed dissection of interspecific interactions throughout the community. © 2015 John Wiley & Sons Ltd.
A DNA barcode is a preferably short and highly variable region of DNA intended to facilitate rapid identification of species. In many protistan lineages, a lack of species-specific morphological characters hampers identification of species by light or electron microscopy, and difficulties in performing mating experiments in laboratory cultures also prevent identification of biological species. Thus, testing candidate barcode markers and establishing accurately working species identification systems are more challenging than in multicellular organisms. In cryptic species complexes, the performance of a potential barcode marker cannot be monitored using morphological characters as feedback, and an inappropriate choice of DNA region may result in artifactual species trees for several reasons. Therefore, a priori knowledge of the systematics of a group is required. In addition to identification of known species, methods for automatic delimitation of species with DNA barcodes have been proposed. The Cryptophyceae provide a mixture of systematically well-characterized and badly characterized groups and are used in this study to test the suitability of some of these methods for protists. As a species identification method, the performance of BLAST in searches against badly to well-sampled reference databases has been tested with COI-5P and 5′-partial LSU rDNA (domains A to D of the nuclear LSU rRNA gene). In addition, the performance of two different methods for automatic species delimitation, fixed thresholds of genetic divergence and the general mixed Yule-coalescent model (GMYC), has been examined. The study demonstrates some pitfalls of barcoding methods that have to be taken care of. A best-practice approach towards establishing a DNA barcode system in protists is also proposed. PMID:22970104
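Of the two delimitation methods named above, the fixed-threshold approach is simple enough to sketch: sequences whose pairwise divergence falls at or below a cutoff are placed in the same putative species. The version below uses single-linkage clustering with a union-find structure. The function name, the input layout, the distances, and the threshold values are all hypothetical; the abstract does not state which thresholds or clustering rule the study used.

```python
def delimit_by_threshold(names, distance, threshold):
    """Group sequences into putative species by a fixed divergence threshold.

    names:     list of sequence ids
    distance:  dict mapping (id_a, id_b) -> pairwise genetic distance
    threshold: pairs with distance <= threshold are merged (single linkage)
    All names and values are assumptions for this sketch.
    """
    parent = {n: n for n in names}

    def find(n):
        # Follow parent links to the cluster root, compressing the path.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for (a, b), d in distance.items():
        if d <= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)
    return sorted(sorted(c) for c in clusters.values())


# Hypothetical distances between four sequences.
names = ["a", "b", "c", "d"]
distance = {
    ("a", "b"): 0.01, ("b", "c"): 0.02, ("a", "c"): 0.04,
    ("a", "d"): 0.10, ("b", "d"): 0.11, ("c", "d"): 0.12,
}
loose = delimit_by_threshold(names, distance, 0.03)
strict = delimit_by_threshold(names, distance, 0.015)
```

The toy data also hints at one pitfall of fixed thresholds: at 0.03, "a" and "c" end up in the same cluster through "b" even though their own distance (0.04) exceeds the cutoff, and changing the cutoff changes the number of putative species.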
Magnacca Karl N
Background: The past several years have seen a flurry of papers seeking to clarify the utility and limits of DNA barcoding, particularly in areas such as species discovery and paralogy due to nuclear pseudogenes. Heteroplasmy, the coexistence of multiple mitochondrial haplotypes in a single organism, has been cited as a potentially serious problem for DNA barcoding, but its effect on identification accuracy has not been tested. In addition, few studies of barcoding have tested a large group of closely related species with a well-established morphological taxonomy. In this study we examine both of these issues by densely sampling the Hawaiian Hylaeus bee radiation. Results: Individuals from 21 of the 49 a priori morphologically defined species exhibited coding sequence heteroplasmy at levels of 1-6% or more. All homoplasmic species were successfully identified by COI using standard methods of analysis, but only 71% of heteroplasmic species were. The success rate in identifying heteroplasmic species was increased to 86% by treating polymorphisms as character states rather than ambiguities. Nuclear pseudogenes (numts) were also present in four species, and were distinguishable from heteroplasmic sequences by patterns of nucleotide and amino acid change. Conclusions: Heteroplasmy significantly decreased the reliability of species identification. In addition, the practical issue of dealing with large numbers of polymorphisms, and the resulting increase in time and labor required, makes the development of DNA barcode databases considerably more complex than has previously been suggested. The impact of heteroplasmy on the utility of DNA barcoding as a bulk specimen identification tool will depend upon its frequency across populations, which remains unknown. However, DNA barcoding is still likely to remain an important identification tool for those species that are difficult or impossible to identify through morphology, as is the case for the ecologically
Clement Wendy L
Background: The chloroplast genes matK and rbcL have been proposed as a “core” DNA barcode for identifying plant species. Published estimates of successful species identification using these loci (70-80%) may be inflated because they may have involved comparisons among distantly related species within target genera. To assess the ability of the proposed two-locus barcode to discriminate closely related species, we carried out a hierarchically structured set of comparisons within Viburnum, a clade of woody angiosperms containing ca. 170 species (some 70 of which are currently used in horticulture). For 112 Viburnum species, we evaluated rbcL + matK, as well as the chloroplast regions rpl32-trnL, trnH-psbA, trnK, and the nuclear ribosomal internal transcribed spacer region (nrITS). Results: At most, rbcL + matK could discriminate 53% of all Viburnum species, with only 18% of the comparisons having genetic distances >1%. When comparisons were progressively restricted to species within major Viburnum subclades, there was a significant decrease in both the discriminatory power and the genetic distances. trnH-psbA and nrITS show much higher levels of variation and potential discriminatory power, and their use in plant barcoding should be reconsidered. As barcoding has often been used to discriminate species within local areas, we also compared Viburnum species within two regions, Japan and Mexico and Central America. Greater success in discriminating among the Japanese species reflects the deeper evolutionary history of Viburnum in that area, as compared to the recent radiation of a single clade into the mountains of Latin America. Conclusions: We found very low levels of discrimination among closely related species of Viburnum, and low levels of variation in the proposed barcoding loci may limit success within other clades of long-lived woody plants. Inclusion of the supplementary barcodes trnH-psbA and nrITS increased discrimination rates but<