WorldWideScience

Sample records for genomics approach matching

  1. Functional Associations by Response Overlap (FARO), a functional genomics approach matching gene expression phenotypes

    DEFF Research Database (Denmark)

    Nielsen, Henrik Bjørn; Mundy, J.; Willenbrock, Hanni

    2007-01-01

    The systematic comparison of transcriptional responses of organisms is a powerful tool in functional genomics. For example, mutants may be characterized by comparing their transcript profiles to those obtained in other experiments querying the effects on gene expression of many experimental factors...... including treatments, mutations and pathogen infections. Similarly, drugs may be discovered by the relationship between the transcript profiles effectuated or impacted by a candidate drug and by the target disease. The integration of such data enables systems biology to predict the interplay between...

  2. Approaches for Stereo Matching

    Directory of Open Access Journals (Sweden)

    Takouhi Ozanian

    1995-04-01

    Full Text Available This review focuses on the last decade's development of the computational stereopsis for recovering three-dimensional information. The main components of the stereo analysis are exposed: image acquisition and camera modeling, feature selection, feature matching and disparity interpretation. A brief survey is given of the well known feature selection approaches and the estimation parameters for this selection are mentioned. The difficulties in identifying correspondent locations in the two images are explained. Methods as to how effectively to constrain the search for correct solution of the correspondence problem are discussed, as are strategies for the whole matching process. Reasons for the occurrence of matching errors are considered. Some recently proposed approaches, employing new ideas in the modeling of stereo matching in terms of energy minimization, are described. Acknowledging the importance of computation time for real-time applications, special attention is paid to parallelism as a way to achieve the required level of performance. The development of trinocular stereo analysis as an alternative to the conventional binocular one, is described. Finally a classification based on the test images for verification of the stereo matching algorithms, is supplied.

  3. Review og pattern matching approaches

    DEFF Research Database (Denmark)

    Manfaat, D.; Duffy, Alex; Lee, B. S.

    1996-01-01

    This paper presents a review of pattern matching techniques. The application areas for pattern matching are extensive, ranging from CAD systems to chemical analysis and from manufacturing to image processing. Published techniques and methods are classified and assessed within the context of three...... key issues: pattern classes, similiarity types and mathing methods. It has been shown that the techniques and approaches are as diverse and varied as the applications....

  4. Matched filter based iterative adaptive approach

    Science.gov (United States)

    Nepal, Ramesh; Zhang, Yan Rockee; Li, Zhengzheng; Blake, William

    2016-05-01

    Matched Filter sidelobes from diversified LPI waveform design and sensor resolution are two important considerations in radars and active sensors in general. Matched Filter sidelobes can potentially mask weaker targets, and low sensor resolution not only causes a high margin of error but also limits sensing in target-rich environment/ sector. The improvement in those factors, in part, concern with the transmitted waveform and consequently pulse compression techniques. An adaptive pulse compression algorithm is hence desired that can mitigate the aforementioned limitations. A new Matched Filter based Iterative Adaptive Approach, MF-IAA, as an extension to traditional Iterative Adaptive Approach, IAA, has been developed. MF-IAA takes its input as the Matched Filter output. The motivation here is to facilitate implementation of Iterative Adaptive Approach without disrupting the processing chain of traditional Matched Filter. Similar to IAA, MF-IAA is a user parameter free, iterative, weighted least square based spectral identification algorithm. This work focuses on the implementation of MF-IAA. The feasibility of MF-IAA is studied using a realistic airborne radar simulator as well as actual measured airborne radar data. The performance of MF-IAA is measured with different test waveforms, and different Signal-to-Noise (SNR) levels. In addition, Range-Doppler super-resolution using MF-IAA is investigated. Sidelobe reduction as well as super-resolution enhancement is validated. The robustness of MF-IAA with respect to different LPI waveforms and SNR levels is also demonstrated.

  5. Matching curated genome databases: a non trivial task

    Directory of Open Access Journals (Sweden)

    Labedan Bernard

    2008-10-01

    Full Text Available Abstract Background Curated databases of completely sequenced genomes have been designed independently at the NCBI (RefSeq and EBI (Genome Reviews to cope with non-standard annotation found in the version of the sequenced genome that has been published by databanks GenBank/EMBL/DDBJ. These curation attempts were expected to review the annotations and to improve their pertinence when using them to annotate newly released genome sequences by homology to previously annotated genomes. However, we observed that such an uncoordinated effort has two unwanted consequences. First, it is not trivial to map the protein identifiers of the same sequence in both databases. Secondly, the two reannotated versions of the same genome differ at the level of their structural annotation. Results Here, we propose CorBank, a program devised to provide cross-referencing protein identifiers no matter what the level of identity is found between their matching sequences. Approximately 98% of the 1,983,258 amino acid sequences are matching, allowing instantaneous retrieval of their respective cross-references. CorBank further allows detecting any differences between the independently curated versions of the same genome. We found that the RefSeq and Genome Reviews versions are perfectly matching for only 50 of the 641 complete genomes we have analyzed. In all other cases there are differences occurring at the level of the coding sequence (CDS, and/or in the total number of CDS in the respective version of the same genome. CorBank is freely accessible at http://www.corbank.u-psud.fr. The CorBank site contains also updated publication of the exhaustive results obtained by comparing RefSeq and Genome Reviews versions of each genome. Accordingly, this web site allows easy search of cross-references between RefSeq, Genome Reviews, and UniProt, for either a single CDS or a whole replicon. Conclusion CorBank is very efficient in rapid detection of the numerous differences existing

  6. Local Search Approaches in Stable Matching Problems

    Directory of Open Access Journals (Sweden)

    Toby Walsh

    2013-10-01

    Full Text Available The stable marriage (SM problem has a wide variety of practical applications, ranging from matching resident doctors to hospitals, to matching students to schools or, more generally, to any two-sided market. In the classical formulation, n men and n women express their preferences (via a strict total order over the members of the other sex. Solving an SM problem means finding a stable marriage where stability is an envy-free notion: no man and woman who are not married to each other would both prefer each other to their partners or to being single. We consider both the classical stable marriage problem and one of its useful variations (denoted SMTI (Stable Marriage with Ties and Incomplete lists where the men and women express their preferences in the form of an incomplete preference list with ties over a subset of the members of the other sex. Matchings are permitted only with people who appear in these preference lists, and we try to find a stable matching that marries as many people as possible. Whilst the SM problem is polynomial to solve, the SMTI problem is NP-hard. We propose to tackle both problems via a local search approach, which exploits properties of the problems to reduce the size of the neighborhood and to make local moves efficiently. We empirically evaluate our algorithm for SM problems by measuring its runtime behavior and its ability to sample the lattice of all possible stable marriages. We evaluate our algorithm for SMTI problems in terms of both its runtime behavior and its ability to find a maximum cardinality stable marriage. Experimental results suggest that for SM problems, the number of steps of our algorithm grows only as O(n log(n, and that it samples very well the set of all stable marriages. It is thus a fair and efficient approach to generate stable marriages. Furthermore, our approach for SMTI problems is able to solve large problems, quickly returning stable matchings of large and often optimal size, despite the

  7. Matching of array CGH and gene expression microarray features for the purpose of integrative genomic analyses

    Directory of Open Access Journals (Sweden)

    van Wieringen Wessel N

    2012-05-01

    Full Text Available Abstract Background An increasing number of genomic studies interrogating more than one molecular level is published. Bioinformatics follows biological practice, and recent years have seen a surge in methodology for the integrative analysis of genomic data. Often such analyses require knowledge of which elements of one platform link to those of another. Although important, many integrative analyses do not or insufficiently detail the matching of the platforms. Results We describe, illustrate and discuss six matching procedures. They are implemented in the R-package sigaR (available from Bioconductor. The principles underlying the presented matching procedures are generic, and can be combined to form new matching approaches or be applied to the matching of other platforms. Illustration of the matching procedures on a variety of data sets reveals how the procedures differ in the use of the available data, and may even lead to different results for individual genes. Conclusions Matching of data from multiple genomics platforms is an important preprocessing step for many integrative bioinformatic analysis, for which we present six generic procedures, both old and new. They have been implemented in the R-package sigaR, available from Bioconductor.

  8. Novel genetic matching methods for handling population stratification in genome-wide association studies.

    Science.gov (United States)

    Lacour, André; Schüller, Vitalia; Drichel, Dmitriy; Herold, Christine; Jessen, Frank; Leber, Markus; Maier, Wolfgang; Noethen, Markus M; Ramirez, Alfredo; Vaitsiakhovich, Tatsiana; Becker, Tim

    2015-03-14

    A usually confronted problem in association studies is the occurrence of population stratification. In this work, we propose a novel framework to consider population matchings in the contexts of genome-wide and sequencing association studies. We employ pairwise and groupwise optimal case-control matchings and present an agglomerative hierarchical clustering, both based on a genetic similarity score matrix. In order to ensure that the resulting matches obtained from the matching algorithm capture correctly the population structure, we propose and discuss two stratum validation methods. We also invent a decisive extension to the Cochran-Armitage Trend test to explicitly take into account the particular population structure. We assess our framework by simulations of genotype data under the null hypothesis, to affirm that it correctly controls for the type-1 error rate. By a power study we evaluate that structured association testing using our framework displays reasonable power. We compare our result with those obtained from a logistic regression model with principal component covariates. Using the principal components approaches we also find a possible false-positive association to Alzheimer's disease, which is neither supported by our new methods, nor by the results of a most recent large meta analysis or by a mixed model approach. Matching methods provide an alternative handling of confounding due to population stratification for statistical tests for which covariates are hard to model. As a benchmark, we show that our matching framework performs equally well to state of the art models on common variants.

  9. Matching Alternative Addresses: a Semantic Web Approach

    Science.gov (United States)

    Ariannamazi, S.; Karimipour, F.; Hakimpour, F.

    2015-12-01

    Rapid development of crowd-sourcing or volunteered geographic information (VGI) provides opportunities for authoritatives that deal with geospatial information. Heterogeneity of multiple data sources and inconsistency of data types is a key characteristics of VGI datasets. The expansion of cities resulted in the growing number of POIs in the OpenStreetMap, a well-known VGI source, which causes the datasets to outdate in short periods of time. These changes made to spatial and aspatial attributes of features such as names and addresses might cause confusion or ambiguity in the processes that require feature's literal information like addressing and geocoding. VGI sources neither will conform specific vocabularies nor will remain in a specific schema for a long period of time. As a result, the integration of VGI sources is crucial and inevitable in order to avoid duplication and the waste of resources. Information integration can be used to match features and qualify different annotation alternatives for disambiguation. This study enhances the search capabilities of geospatial tools with applications able to understand user terminology to pursuit an efficient way for finding desired results. Semantic web is a capable tool for developing technologies that deal with lexical and numerical calculations and estimations. There are a vast amount of literal-spatial data representing the capability of linguistic information in knowledge modeling, but these resources need to be harmonized based on Semantic Web standards. The process of making addresses homogenous generates a helpful tool based on spatial data integration and lexical annotation matching and disambiguating.

  10. MATCHING ALTERNATIVE ADDRESSES: A SEMANTIC WEB APPROACH

    Directory of Open Access Journals (Sweden)

    S. Ariannamazi

    2015-12-01

    Full Text Available Rapid development of crowd-sourcing or volunteered geographic information (VGI provides opportunities for authoritatives that deal with geospatial information. Heterogeneity of multiple data sources and inconsistency of data types is a key characteristics of VGI datasets. The expansion of cities resulted in the growing number of POIs in the OpenStreetMap, a well-known VGI source, which causes the datasets to outdate in short periods of time. These changes made to spatial and aspatial attributes of features such as names and addresses might cause confusion or ambiguity in the processes that require feature’s literal information like addressing and geocoding. VGI sources neither will conform specific vocabularies nor will remain in a specific schema for a long period of time. As a result, the integration of VGI sources is crucial and inevitable in order to avoid duplication and the waste of resources. Information integration can be used to match features and qualify different annotation alternatives for disambiguation. This study enhances the search capabilities of geospatial tools with applications able to understand user terminology to pursuit an efficient way for finding desired results. Semantic web is a capable tool for developing technologies that deal with lexical and numerical calculations and estimations. There are a vast amount of literal-spatial data representing the capability of linguistic information in knowledge modeling, but these resources need to be harmonized based on Semantic Web standards. The process of making addresses homogenous generates a helpful tool based on spatial data integration and lexical annotation matching and disambiguating.

  11. Template Matching Approach to Signal Prediction

    Science.gov (United States)

    Mackey, Ryan; Kulikov, Igor

    2010-01-01

    A new approach to signal prediction and prognostic assessment of spacecraft health resolves an inherent difficulty in fusing sensor data with simulated data. This technique builds upon previous work that demonstrated the importance of physics-based transient models to accurate prediction of signal dynamics and system performance. While models can greatly improve predictive accuracy, they are difficult to apply in general because of variations in model type, accuracy, or intended purpose. However, virtually any flight project will have at least some modeling capability at its disposal, whether a full-blown simulation, partial physics models, dynamic look-up tables, a brassboard analogue system, or simple hand-driven calculation by a team of experts. Many models can be used to develop a predict, or an estimate of the next day s or next cycle s behavior, which is typically used for planning purposes. The fidelity of a predict varies from one project to another, depending on the complexity of the simulation (i.e. linearized or full differential equations) and the level of detail in anticipated system operation, but typically any predict cannot be adapted to changing conditions or adjusted spacecraft command execution. Applying a predict blindly, without adapting the predict to current conditions, produces mixed results at best, primarily due to mismatches between assumed execution of spacecraft activities and actual times of execution. This results in the predict becoming useless during periods of complicated behavior, exactly when the predict would be most valuable. Each spacecraft operation tends to show up as a transient in the data, and if the transients are misaligned, using the predict can actually harm forecasting performance. To address this problem, the approach here expresses the predict in terms of a baseline function superposed with one or more transient functions. These transients serve as signal templates, which can be relocated in time and space against

  12. Iris Matching Based On a Stack Like Structure Graph Approach

    Directory of Open Access Journals (Sweden)

    Roushdi Mohamed FAROUK

    2012-12-01

    Full Text Available In this paper, we present the elastic bunch graph matching as a new approach for iris recognition. The task is difficult because of iris variation in terms of position, size, and partial occlusion. We have used the circular Hough transform to determine the iris boundaries. Individual segmented irises are represented as labeled graphs. We have combined a representative set of individual model graphs into a stack like structure called an iris bunch graph (IBG. Finally, a bunch graph similarity function is proposed to compare a test graph with the IBG. Recognition results are given for galleries of irises from CASIA version and UBIRIS databases. The numerical results show that, the elastic bunch graph matching is an effective technique for iris matching. We also compare our results with previous results and find that, the elastic bunch graph matching is an effective matching performance.

  13. SmileFinder: a resampling-based approach to evaluate signatures of selection from genome-wide sets of matching allele frequency data in two or more diploid populations.

    Science.gov (United States)

    Guiblet, Wilfried M; Zhao, Kai; O'Brien, Stephen J; Massey, Steven E; Roca, Alfred L; Oleksyk, Taras K

    2015-01-01

    Adaptive alleles may rise in frequency as a consequence of positive selection, creating a pattern of decreased variation in the neighboring loci, known as a selective sweep. When the region containing this pattern is compared to another population with no history of selection, a rise in variance of allele frequencies between populations is observed. One challenge presented by large genome-wide datasets is the ability to differentiate between patterns that are remnants of natural selection from those expected to arise at random and/or as a consequence of selectively neutral demographic forces acting in the population. SmileFinder is a simple program that looks for diversity and divergence patterns consistent with selection sweeps by evaluating allele frequencies in windows, including neighboring loci from two or more populations of a diploid species against the genome-wide neutral expectation. The program calculates the mean of heterozygosity and FST in a set of sliding windows of incrementally increasing sizes, and then builds a resampled distribution (the baseline) of random multi-locus sets matched to the sizes of sliding windows, using an unrestricted sampling. Percentiles of the values in the sliding windows are derived from the superimposed resampled distribution. The resampling can easily be scaled from 1 K to 100 M; the higher the number, the more precise the percentiles ascribed to the extreme observed values. The output from SmileFinder can be used to plot percentile values to look for population diversity and divergence patterns that may suggest past actions of positive selection along chromosome maps, and to compare lists of suspected candidate genes under random gene sets to test for the overrepresentation of these patterns among gene categories. Both applications of the algorithm have already been used in published studies. Here we present a publicly available, open source program that will serve as a useful tool for preliminary scans of selection

  14. How evolution of genomes is reflected in exact DNA sequence match statistics.

    Science.gov (United States)

    Massip, Florian; Sheinman, Michael; Schbath, Sophie; Arndt, Peter F

    2015-02-01

    Genome evolution is shaped by a multitude of mutational processes, including point mutations, insertions, and deletions of DNA sequences, as well as segmental duplications. These mutational processes can leave distinctive qualitative marks in the statistical features of genomic DNA sequences. One such feature is the match length distribution (MLD) of exactly matching sequence segments within an individual genome or between the genomes of related species. These have been observed to exhibit characteristic power law decays in many species. Here, we show that simple dynamical models consisting solely of duplication and mutation processes can already explain the characteristic features of MLDs observed in genomic sequences. Surprisingly, we find that these features are largely insensitive to details of the underlying mutational processes and do not necessarily rely on the action of natural selection. Our results demonstrate how analyzing statistical features of DNA sequences can help us reveal and quantify the different mutational processes that underlie genome evolution.

  15. Efficient wave-function matching approach for quantum transport calculations

    DEFF Research Database (Denmark)

    Sørensen, Hans Henrik Brandenborg; Hansen, Per Christian; Petersen, Dan Erik;

    2009-01-01

    The wave-function matching (WFM) technique has recently been developed for the calculation of electronic transport in quantum two-probe systems. In terms of efficiency it is comparable to the widely used Green's function approach. The WFM formalism presented so far requires the evaluation of all ...

  16. A Moment Matching Approach for Generating Synthetic Data.

    Science.gov (United States)

    Bogle, Brittany Megan; Mehrotra, Sanjay

    2016-09-01

    Synthetic data are becoming increasingly important mechanisms for sharing data among collaborators and with the public. Multiple methods for the generation of synthetic data have been proposed, but many have short comings with respect to maintaining the statistical properties of the original data. We propose a new method for fully synthetic data generation that leverages linear and integer mathematical programming models in order to match the moments of the original data in the synthetic data. This method has no inherent disclosure risk and does not require parametric or distributional assumptions. We demonstrate this methodology using the Framingham Heart Study. Existing synthetic data methods that use chained equations were compared with our approach. We fit Cox proportional hazards, logistic regression, and nonparametric models to synthetic data and compared with models fitted to the original data. True coverage, the proportion of synthetic data parameter confidence intervals that include the original data's parameter estimate, was 100% for parametric models when up to four moments were matched, and consistently outperformed the chained equations approach. The area under the curve and accuracy of the nonparametric models trained on synthetic data marginally differed when tested on the full original data. Models were also trained on synthetic data and a partition of original data and were tested on a held-out portion of original data. Fourth-order moment matched synthetic data outperformed others with respect to fitted parametric models but did not always outperform other methods with fitted nonparametric models. No single synthetic data method consistently outperformed others when assessing the performance of nonparametric models. The performance of fourth-order moment matched synthetic data in fitting parametric models suggests its use in these cases. Our empirical results also suggest that the performance of synthetic data generation techniques, including the

  17. Matching sensors to missions using a knowledge-based approach

    Science.gov (United States)

    Preece, Alun; Gomez, Mario; de Mel, Geeth; Vasconcelos, Wamberto; Sleeman, Derek; Colley, Stuart; Pearson, Gavin; Pham, Tien; La Porta, Thomas

    2008-04-01

    Making decisions on how best to utilise limited intelligence, surveillance and reconnaisance (ISR) resources is a key issue in mission planning. This requires judgements about which kinds of available sensors are more or less appropriate for specific ISR tasks in a mission. A methodological approach to addressing this kind of decision problem in the military context is the Missions and Means Framework (MMF), which provides a structured way to analyse a mission in terms of tasks, and assess the effectiveness of various means for accomplishing those tasks. Moreover, the problem can be defined as knowledge-based matchmaking: matching the ISR requirements of tasks to the ISR-providing capabilities of available sensors. In this paper we show how the MMF can be represented formally as an ontology (that is, a specification of a conceptualisation); we also represent knowledge about ISR requirements and sensors, and then use automated reasoning to solve the matchmaking problem. We adopt the Semantic Web approach and the Web Ontology Language (OWL), allowing us to import elements of existing sensor knowledge bases. Our core ontologies use the description logic subset of OWL, providing efficient reasoning. We describe a prototype tool as a proof-of-concept for our approach. We discuss the various kinds of possible sensor-mission matches, both exact and inexact, and how the tool helps mission planners consider alternative choices of sensors.

  18. [Genomic approach to pathophysiology of rheumatoid arthritis].

    Science.gov (United States)

    Yamada, Ryo

    2012-11-01

    Genetic studies identified multiple genes and polymorphisms that increase risk to develop rheumatoid arthritis. Genomic approach is characterized with its integrative style using mathematical and statistical models. Its main targets include (1)combinatorial effect of multiple genetic and environmental factors, (2)heterogeneity of pathological states and its individuality, and (3)their chronological heterogeneity. Genomic approach will clarify pathophysiology of various diseases along with the progresses in molecular biology and other researches on individual molecules.

  19. Privacy‐Preserving Friend Matching Protocol approach for Pre‐match in Social Networks

    DEFF Research Database (Denmark)

    Ople, Shubhangi S.; Deshmukh, Aaradhana A.; Mihovska, Albena Dimitrova

    2016-01-01

    that a secure match can achieve at least one order of accuracy and better computational performance than the techniques that use homomorphic encryption.It can handle and tackle new characteristics and an environment for a particular application in a mobile social network....... for use in social networks due to its data sharing problems and information leakage. In this paper, we propose a novel framework for privacy–preserving profile matching. We implement both the client and server portion of the secure match and evaluate its performance network dataset. The results show......Social services make the most use of the user profile matching to help the users to discover friends with similar social attributes (e.g. interests, location, age). However, there are many privacy concerns that prevent to enable this functionality. Privacy preserving encryption is not suitable...

  20. Mapping topographic plant location properties using a dense matching approach

    Science.gov (United States)

    Niederheiser, Robert; Rutzinger, Martin; Lamprecht, Andrea; Bardy-Durchhalter, Manfred; Pauli, Harald; Winkler, Manuela

    2017-04-01

    Within the project MEDIALPS (Disentangling anthropogenic drivers of climate change impacts on alpine plant species: Alps vs. Mediterranean mountains) six regions in Alpine and in Mediterranean mountain regions are investigated to assess how plant species respond to climate change. The project is embedded in the Global Observation Research Initiative in Alpine Environments (GLORIA), which is a well-established global monitoring initiative for systematic observation of changes in the plant species composition and soil temperature on mountain summits worldwide to discern accelerating climate change pressures on these fragile alpine ecosystems. Close-range sensing techniques such as terrestrial photogrammetry are well suited for mapping terrain topography of small areas with high resolution. Lightweight equipment, flexible positioning for image acquisition in the field, and independence on weather conditions (i.e. wind) make this a feasible method for in-situ data collection. New developments of dense matching approaches allow high quality 3D terrain mapping with less requirements for field set-up. However, challenges occur in post-processing and required data storage if many sites have to be mapped. Within MEDIALPS dense matching is used for mapping high resolution topography for 284 3x3 meter plots deriving information on vegetation coverage, roughness, slope, aspect and modelled solar radiation. This information helps identifying types of topography-dependent ecological growing conditions and evaluating the potential for existing refugial locations for specific plant species under climate change. This research is conducted within the project MEDIALPS - Disentangling anthropogenic drivers of climate change impacts on alpine plant species: Alps vs. Mediterranean mountains funded by the Earth System Sciences Programme of the Austrian Academy of Sciences.

  1. Genomic approaches in aquaculture and fisheries

    DEFF Research Database (Denmark)

    Cancela, M. Leonor; Bargelloni, Luca; Boudry, Pierre

    2010-01-01

    Despite the enormous input into the worldwide development of fish and shellfish farming in the recent decades, in part as an attempt to minimize the impact of fishing on already overexploited natural populations, the application of genomics to aquaculture and fisheries remains poorly developed....... Improving state-of-the-art genomics research in various aquaculture systems, as well as its industrial applications, remains one of the major challenges in this area and should be the focus of well developed strategies to be implemented in the next generation of projects. This chapter will first provide...... an overview of the genomic tools and resources available, then discuss the application of genomic approaches to the improvement of fish and shellfish farming (e.g. breeding, reproduction, growth, nutrition and product quality), including the evaluation of stock diversity and the use of selection procedures...

  2. (Post-)genomics approaches in fungal research.

    Science.gov (United States)

    Aguilar-Pontes, María Victoria; de Vries, Ronald P; Zhou, Miaomiao

    2014-11-01

    To date, hundreds of fungal genomes have been sequenced and many more are in progress. This wealth of genomic information has provided new directions to study fungal biodiversity. However, to further dissect and understand the complicated biological mechanisms involved in fungal life styles, functional studies beyond genomes are required. Thanks to the developments of current -omics techniques, it is possible to produce large amounts of fungal functional data in a high-throughput fashion (e.g. transcriptome, proteome, etc.). The increasing ease of creating -omics data has also created a major challenge for downstream data handling and analysis. Numerous databases, tools and software have been created to meet this challenge. Facing such a richness of techniques and information, hereby we provide a brief roadmap on current wet-lab and bioinformatics approaches to study functional genomics in fungi.

  3. genomic and transcriptomic approaches towards the genetic ...

    African Journals Online (AJOL)

    USER

    to the complex nature of these stresses, and the genotype x environment interaction (GxE). .... collection (Azam-Ali et al., 2001); (vi) biological .... Integrative platform to study gene function and gene evolution in legumes ..... a powerful dissection of the genetic control of ... complemented by a new approach called genomic.

  4. Pattern classification approaches to matching building polygons at multiple scales

    NARCIS (Netherlands)

    Zhang, X; Zhao, X.; Molenaar, M.; Stoter, J.; Kraak M-J.; Ai, T.

    2012-01-01

    Matching of building polygons with different levels of detail is crucial in the maintenance and quality assessment of multi-representation databases. Two general problems need to be addressed in the matching process: (1) Which criteria are suitable? (2) How to effectively combine different criteria

  5. Pattern classification approaches to matching building polygons at multiple scales

    NARCIS (Netherlands)

    Zhang, X; Zhao, X.; Molenaar, M.; Stoter, J.; Kraak M-J.; Ai, T.

    2012-01-01

    Matching of building polygons with different levels of detail is crucial in the maintenance and quality assessment of multi-representation databases. Two general problems need to be addressed in the matching process: (1) Which criteria are suitable? (2) How to effectively combine different criteria

  6. Genomic approaches to research in pulmonary hypertension

    Directory of Open Access Journals (Sweden)

    Tuder Rubin M

    2001-05-01

    Full Text Available Abstract Genomics, or the study of genes and their function, is a burgeoning field with many new technologies. In the present review, we explore the application of genomic approaches to the study of pulmonary hypertension (PH. Candidate genes, important to the pathobiology of the disease, have been investigated. Rodent models enable the manipulation of selected genes, either by transgenesis or targeted disruption. Mutational analysis of genes in the transforming growth factor-β family have proven pivotal in both familial and sporadic forms of primary PH. Finally, microarray gene expression analysis is a robust molecular tool to aid in delineating the pathobiology of this disease.

  7. A reference pan-genome approach to comparative bacterial genomics: identification of novel epidemiological markers in pathogenic Campylobacter.

    Directory of Open Access Journals (Sweden)

    Guillaume Méric

    Full Text Available The increasing availability of hundreds of whole bacterial genomes provides opportunities for enhanced understanding of the genes and alleles responsible for clinically important phenotypes and how they evolved. However, it is a significant challenge to develop easy-to-use and scalable methods for characterizing these large and complex data and relating it to disease epidemiology. Existing approaches typically focus on either homologous sequence variation in genes that are shared by all isolates, or non-homologous sequence variation--focusing on genes that are differentially present in the population. Here we present a comparative genomics approach that simultaneously approximates core and accessory genome variation in pathogen populations and apply it to pathogenic species in the genus Campylobacter. A total of 7 published Campylobacter jejuni and Campylobacter coli genomes were selected to represent diversity across these species, and a list of all loci that were present at least once was compiled. After filtering duplicates a 7-isolate reference pan-genome, of 3,933 loci, was defined. A core genome of 1,035 genes was ubiquitous in the sample accounting for 59% of the genes in each isolate (average genome size of 1.68 Mb. The accessory genome contained 2,792 genes. A Campylobacter population sample of 192 genomes was screened for the presence of reference pan-genome loci with gene presence defined as a BLAST match of ≥ 70% identity over ≥ 50% of the locus length--aligned using MUSCLE on a gene-by-gene basis. A total of 21 genes were present only in C. coli and 27 only in C. jejuni, providing information about functional differences associated with species and novel epidemiological markers for population genomic analyses. Homologs of these genes were found in several of the genomes used to define the pan-genome and, therefore, would not have been identified using a single reference strain approach.

  8. iPiG: integrating peptide spectrum matches into genome browser visualizations.

    Directory of Open Access Journals (Sweden)

    Mathias Kuhring

    Full Text Available Proteogenomic approaches have gained increasing popularity, however it is still difficult to integrate mass spectrometry identifications with genomic data due to differing data formats. To address this difficulty, we introduce iPiG as a tool for the integration of peptide identifications from mass spectrometry experiments into existing genome browser visualizations. Thereby, the concurrent analysis of proteomic and genomic data is simplified and proteomic results can directly be compared to genomic data. iPiG is freely available from https://sourceforge.net/projects/ipig/. It is implemented in Java and can be run as a stand-alone tool with a graphical user-interface or integrated into existing workflows. Supplementary data are available at PLOS ONE online.

  9. A bayesian approach to deformed pattern matching of iris images.

    Science.gov (United States)

    Thornton, Jason; Savvides, Marios; Vijaya Kumar, B V K

    2007-04-01

    We describe a general probabilistic framework for matching patterns that experience in-plane nonlinear deformations, such as iris patterns. Given a pair of images, we derive a maximum a posteriori probability (MAP) estimate of the parameters of the relative deformation between them. Our estimation process accomplishes two things simultaneously: It normalizes for pattern warping and it returns a distortion-tolerant similarity metric which can be used for matching two nonlinearly deformed image patterns. The prior probability of the deformation parameters is specific to the pattern-type and, therefore, should result in more accurate matching than an arbitrary general distribution. We show that the proposed method is very well suited for handling iris biometrics, applying it to two databases of iris images which contain real instances of warped patterns. We demonstrate a significant improvement in matching accuracy using the proposed deformed Bayesian matching methodology. We also show that the additional computation required to estimate the deformation is relatively inexpensive, making it suitable for real-time applications.

  10. Synoname: The Getty's New Approach to Pattern Matching for Personal Names.

    Science.gov (United States)

    Siegfried, Susan L.; Bernstein, Julie

    1991-01-01

    Describes "Synoname," the Getty Museum's computer program that matches varying versions of personal names for research purposes. Reports that the program uses an ordered algorithm sequence for pattern matching that includes both character- and word-matching techniques. Concludes that the technique can approach near-total accuracy at the…

  11. An Aerial-Image Dense Matching Approach Based on Optical Flow Field

    Science.gov (United States)

    Yuan, Wei; Chen, Shiyu; Zhang, Yong; Gong, Jianya; Shibasaki, Ryosuke

    2016-06-01

    Dense matching plays an important role in many fields, such as DEM (digital evaluation model) producing, robot navigation and 3D environment reconstruction. Traditional approaches may meet the demand of accuracy. But the calculation time and out puts density is hardly be accepted. Focus on the matching efficiency and complex terrain surface matching feasibility an aerial image dense matching method based on optical flow field is proposed in this paper. First, some high accurate and uniformed control points are extracted by using the feature based matching method. Then the optical flow is calculated by using these control points, so as to determine the similar region between two images. Second, the optical flow field is interpolated by using the multi-level B-spline interpolation in the similar region and accomplished the pixel by pixel coarse matching. Final, the results related to the coarse matching refinement based on the combined constraint, which recognizes the same points between images. The experimental results have shown that our method can achieve per-pixel dense matching points, the matching accuracy achieves sub-pixel level, and fully meet the three-dimensional reconstruction and automatic generation of DSM-intensive matching's requirements. The comparison experiments demonstrated that our approach's matching efficiency is higher than semi-global matching (SGM) and Patch-based multi-view stereo matching (PMVS) which verifies the feasibility and effectiveness of the algorithm.

  12. A network approach in analysis of the matching hypothesis

    Science.gov (United States)

    Jia, Tao; Spivey, Robert; Korniss, Gyorgy; Szymanski, Boleslaw

    2014-03-01

    The matching hypothesis in social psychology claimed that people are more likely to form a committed relationship with someone who is equally attractive. This phenomenon can be well interpreted by the principle of homophily that people are apt to get in touch with others similar to them. Yet, social experiments indicate that people in general tend to prefer more attractive individuals regardless of their own attractiveness. Here study the stochastic matching process for different underlying networks and different attractiveness distributions. We showed that the correlation of attractiveness within couples could purely due to the limited number of acquaintance each person has and such correlation decreases as the network becomes more sparse. We also analyzed the effect of the degree distribution and the attractiveness on the number of individuals that can not find their partners. This work is supported by ARL NS-CTA, ARO, and ONR.

  13. Mutual Fund Style, Characteristic-Matched Performance Benchmarks and Activity Measures: A New Approach

    OpenAIRE

    Daniel Buncic; Jon E. Eggins; Robert J. Hill

    2010-01-01

    We propose a new approach for measuring mutual fund style and constructing characteristic-matched performance benchmarks that requires only portfolio holdings and two reference portfolios in each style dimension. The characteristic-matched performance benchmark literature typically follows a bottom-up approach by first matching individual stocks with benchmarks and then obtaining a portfolio’s excess return as a weighted average of the excess returns on each of its constituent stocks. Our app...

  14. Active network alignment: a matching-based approach

    CERN Document Server

    Malmi, Eric; Gionis, Aristides

    2016-01-01

    Network alignment is the problem of matching the nodes of two graphs, maximizing the similarity of the matched nodes and the edges between them. This problem is encountered in a wide array of applications - from biological networks to social networks to ontologies - where multiple networked data sources need to be integrated. Due to the difficulty of the task, an accurate alignment can rarely be found without human assistance. Thus, it is of great practical importance to develop network alignment algorithms that can optimally leverage experts who are able to provide the correct alignment for a small number of nodes. Yet, only a handful of existing works address this active network alignment setting. The majority of the existing active methods focus on absolute queries ("are nodes $a$ and $b$ the same or not?"), whereas we argue that it is generally easier for a human expert to answer relative queries ("which node in the set $\\{b_1, \\ldots, b_n\\}$ is the most similar to node $a$?"). This paper introduces a nov...

  15. Pattern matching approach to pseudosymmetry problems in electron backscatter diffraction.

    Science.gov (United States)

    Nolze, Gert; Winkelmann, Aimo; Boyle, Alan P

    2016-01-01

    We demonstrate an approach to overcome Kikuchi pattern misindexing problems caused by crystallographic pseudosymmetry in electron backscatter diffraction (EBSD) measurements. Based on the quantitative comparison of experimentally measured Kikuchi patterns with dynamical electron diffraction simulations, the algorithm identifies the best-fit orientation from a set of pseudosymmetric candidates. Using measurements on framboidal pyrite (FeS2) as an example, we also show the improvement of the orientation precision using this approach.

  16. AN AERIAL-IMAGE DENSE MATCHING APPROACH BASED ON OPTICAL FLOW FIELD

    Directory of Open Access Journals (Sweden)

    W. Yuan

    2016-06-01

    Full Text Available Dense matching plays an important role in many fields, such as DEM (digital evaluation model producing, robot navigation and 3D environment reconstruction. Traditional approaches may meet the demand of accuracy. But the calculation time and out puts density is hardly be accepted. Focus on the matching efficiency and complex terrain surface matching feasibility an aerial image dense matching method based on optical flow field is proposed in this paper. First, some high accurate and uniformed control points are extracted by using the feature based matching method. Then the optical flow is calculated by using these control points, so as to determine the similar region between two images. Second, the optical flow field is interpolated by using the multi-level B-spline interpolation in the similar region and accomplished the pixel by pixel coarse matching. Final, the results related to the coarse matching refinement based on the combined constraint, which recognizes the same points between images. The experimental results have shown that our method can achieve per-pixel dense matching points, the matching accuracy achieves sub-pixel level, and fully meet the three-dimensional reconstruction and automatic generation of DSM-intensive matching’s requirements. The comparison experiments demonstrated that our approach’s matching efficiency is higher than semi-global matching (SGM and Patch-based multi-view stereo matching (PMVS which verifies the feasibility and effectiveness of the algorithm.

  17. A computationally efficient approach for template matching-based image registration

    Indian Academy of Sciences (India)

    Vilas H Gaidhane; Yogesh V Hote; Vijander Singh

    2014-04-01

    Image registration using template matching is an important step in image processing. In this paper, a simple, robust and computationally efficient approach is presented. The proposed approach is based on the properties of a normalized covariance matrix. The main advantage of the proposed approach is that the image matching can be achieved without calculating eigenvalues and eigenvectors of a covariance matrix, hence reduces the computational complexity. The experimental results show that the proposed approach performs better in the presence of various noises and rigid geometric transformations.

  18. The Use of Evolutionary Approaches to Understand Single Cell Genomes

    Directory of Open Access Journals (Sweden)

    Haiwei eLuo

    2015-03-01

    Full Text Available The vast majority of environmental bacteria and archaea remain uncultivated, yet their genome sequences are rapidly becoming available through single cell sequencing technologies. Reconstructing metabolism is one common way to make use of genome sequences of ecologically important bacteria, but molecular evolutionary analysis is another approach that, while currently underused, can reveal important insights into the function of these uncultivated microbes in nature. Because genome sequences from single cells are often incomplete, metabolic reconstruction based on genome content can be compromised. However, this problem does not necessarily impede the use of phylogenomic and population genomic approaches that are based on patterns of polymorphisms and substitutions at nucleotide and amino acid sites. These approaches explore how various evolutionary forces act to assemble genetic diversity within and between lineages. In this mini-review, I present examples illustrating the benefits of analyzing single cell genomes using evolutionary approaches.

  19. Matching theory

    CERN Document Server

    Plummer, MD

    1986-01-01

    This study of matching theory deals with bipartite matching, network flows, and presents fundamental results for the non-bipartite case. It goes on to study elementary bipartite graphs and elementary graphs in general. Further discussed are 2-matchings, general matching problems as linear programs, the Edmonds Matching Algorithm (and other algorithmic approaches), f-factors and vertex packing.

  20. A matching approach to communicate through the plasma sheath surrounding a hypersonic vehicle

    Energy Technology Data Exchange (ETDEWEB)

    Gao, Xiaotian; Jiang, Binhao, E-mail: jiangbh@hit.edu.cn [Harbin Institute of Technology, 92 West Dazhi Street, Nan Gang District, Harbin (China)

    2015-06-21

    In order to overcome the communication blackout problem suffered by hypersonic vehicles, a matching approach has been proposed for the first time in this paper. It utilizes a double-positive (DPS) material layer surrounding a hypersonic vehicle antenna to match with the plasma sheath enclosing the vehicle. Analytical analysis and numerical results indicate a resonance between the matched layer and the plasma sheath will be formed to mitigate the blackout problem in some conditions. The calculated results present a perfect radiated performance of the antenna, when the match is exactly built between these two layers. The effects of the parameters of the plasma sheath have been researched by numerical methods. Based on these results, the proposed approach is easier to realize and more flexible to the varying radiated conditions in hypersonic flight comparing with other methods.

  1. (Post-)genomics approaches in fungal research

    NARCIS (Netherlands)

    Aguilar-Pontes, María Victoria; de Vries, Ronald P; Zhou, M.; van den Brink, J.

    2014-01-01

    To date, hundreds of fungal genomes have been sequenced and many more are in progress. This wealth of genomic information has provided new directions to study fungal biodiversity. However, to further dissect and understand the complicated biological mechanisms involved in fungal life styles, functio

  2. A Parameterized Approach to Personalized Variable Length Summarization of Soccer Matches

    OpenAIRE

    Sukhwani, Mohak; Kothari, Ravi

    2017-01-01

    We present a parameterized approach to produce personalized variable length summaries of soccer matches. Our approach is based on temporally segmenting the soccer video into 'plays', associating a user-specifiable 'utility' for each type of play and using 'bin-packing' to select a subset of the plays that add up to the desired length while maximizing the overall utility (volume in bin-packing terms). Our approach systematically allows a user to override the default weights assigned to each ty...

  3. Genome-wide approaches to understanding behaviour in Drosophila melanogaster.

    Science.gov (United States)

    Neville, Megan; Goodwin, Stephen F

    2012-09-01

    Understanding how an organism exhibits specific behaviours remains a major and important biological question. Studying behaviour in a simple model organism like the fruit fly Drosophila melanogaster has the advantages of advanced molecular genetics approaches along with well-defined anatomy and physiology. With advancements in functional genomic technologies, researchers are now attempting to uncover genes and pathways involved in complex behaviours on a genome-wide scale. A systems-level network approach, which will include genomic approaches, to study behaviour will be key to understanding the regulation and modulation of behaviours and the importance of context in regulating them.

  4. AN INTEGRATED RANSAC AND GRAPH BASED MISMATCH ELIMINATION APPROACH FOR WIDE-BASELINE IMAGE MATCHING

    Directory of Open Access Journals (Sweden)

    M. Hasheminasab

    2015-12-01

    Full Text Available In this paper we propose an integrated approach in order to increase the precision of feature point matching. Many different algorithms have been developed as to optimizing the short-baseline image matching while because of illumination differences and viewpoints changes, wide-baseline image matching is so difficult to handle. Fortunately, the recent developments in the automatic extraction of local invariant features make wide-baseline image matching possible. The matching algorithms which are based on local feature similarity principle, using feature descriptor as to establish correspondence between feature point sets. To date, the most remarkable descriptor is the scale-invariant feature transform (SIFT descriptor , which is invariant to image rotation and scale, and it remains robust across a substantial range of affine distortion, presence of noise, and changes in illumination. The epipolar constraint based on RANSAC (random sample consensus method is a conventional model for mismatch elimination, particularly in computer vision. Because only the distance from the epipolar line is considered, there are a few false matches in the selected matching results based on epipolar geometry and RANSAC. Aguilariu et al. proposed Graph Transformation Matching (GTM algorithm to remove outliers which has some difficulties when the mismatched points surrounded by the same local neighbor structure. In this study to overcome these limitations, which mentioned above, a new three step matching scheme is presented where the SIFT algorithm is used to obtain initial corresponding point sets. In the second step, in order to reduce the outliers, RANSAC algorithm is applied. Finally, to remove the remained mismatches, based on the adjacent K-NN graph, the GTM is implemented. Four different close range image datasets with changes in viewpoint are utilized to evaluate the performance of the proposed method and the experimental results indicate its robustness and

  5. Whole genome approaches to quantitative genetics.

    Science.gov (United States)

    Visscher, Peter M

    2009-06-01

    Apart from parent-offspring pairs and clones, relative pairs vary in the proportion of the genome that they share identical by descent. In the past, quantitative geneticists have used the expected value of sharing genes by descent to estimate genetic parameters and predict breeding values. With the possibility to genotype individuals for many markers across the genome it is now possible to empirically estimate the actual relationship between relatives. We review some of the theory underlying the variation in genetic identity, show applications to estimating genetic variance for height in humans and discuss other applications.

  6. Genome and exome sequencing in the clinic: unbiased genomic approaches with a high diagnostic yield

    NARCIS (Netherlands)

    Nelen, M.; Veltman, J.A.

    2012-01-01

    For the reasons discussed here, we think whole-genome- or exome-based approaches are currently most suited for diagnostic implementation in genetically heterogeneous diseases, initially to complement and later to replace Sanger sequencing, qPCR and genomic microarrays. Patients do need to be counsel

  7. Genome Editing: A New Approach to Human Therapeutics.

    Science.gov (United States)

    Porteus, Matthew

    2016-01-01

    The ability to manipulate the genome with precise spatial and nucleotide resolution (genome editing) has been a powerful research tool. In the past decade, the tools and expertise for using genome editing in human somatic cells and pluripotent cells have increased to such an extent that the approach is now being developed widely as a strategy to treat human disease. The fundamental process depends on creating a site-specific DNA double-strand break (DSB) in the genome and then allowing the cell's endogenous DSB repair machinery to fix the break such that precise nucleotide changes are made to the DNA sequence. With the development and discovery of several different nuclease platforms and increasing knowledge of the parameters affecting different genome editing outcomes, genome editing frequencies now reach therapeutic relevance for a wide variety of diseases. Moreover, there is a series of complementary approaches to assessing the safety and toxicity of any genome editing process, irrespective of the underlying nuclease used. Finally, the development of genome editing has raised the issue of whether it should be used to engineer the human germline. Although such an approach could clearly prevent the birth of people with devastating and destructive genetic diseases, questions remain about whether human society is morally responsible enough to use this tool.

  8. An approach to improve the match-on-card fingerprint authentication system security

    CSIR Research Space (South Africa)

    Nair, Kishor Krishnan

    2016-07-01

    Full Text Available -on-Card (TOC), Match-on- Card (MOC), Work-Sharing On-Card (WSOC), and System-on-Card (SOC). Out of these four approaches, the SOC is considered as the most secure and expensive, whereas the TOC is considered as the least secure and least expensive. The MOC...

  9. Analytical Structure Matching and Very Precise Approach to the Coulombic Quantum Three-Body Problem

    Institute of Scientific and Technical Information of China (English)

    TAN Shi-Na

    2001-01-01

    A powerful approach to solve the Coulombic quantum three-body problem is proposed. The approach is exponentially convergent and more efficient than the hypcrsphcrical coordinate method and the correlation-function hyperspherical harmonic method. This approach is numerically competitive with the variational methods, such as that using the Hylleraas-type basis functions. Numerical comparisons are made to demonstrate the efficiency of this approach, by calculating the nonrelativistic and infinite-nuclear-mass limit of the ground state energy of the helium atom. The exponential convergency of this approach is due to the full matching between the analytical structure of the basis functions that are used in this paper and the true wavefunction. This full matching was not reached by most other methods. For example, the variational method using the Hylleraas-type basis does not reflects the logarithmic singularity of the true wavefunction at the origin as predicted by Bartlett and Fock. Two important approaches are proposed in this work to reach this full matching: the coordinate transformation method and the asymptotic series method. Besides these, this work makes use of the lcast square method to substitute complicated numerical integrations in solving the Schrodinger equation without much loss of accuracy, which is routinely used by people to fit a theoretical curve with discrete experimental data, but here is used to simplify thc computation.``

  10. Tailoring science outreach through E-matching using a community-based participatory approach.

    Directory of Open Access Journals (Sweden)

    Bernice B Rumala

    2011-03-01

    Full Text Available In an effort to increase science exposure for pre-college (K-12 students and as part of the science education reform agenda, many biomedical research institutions have established university-community partnerships. Typically, these science outreach programs consist of pre-structured, generic exposure for students, with little community engagement. However, the use of a medium that is accessible to both teachers and scientists, electronic web-based matchmaking (E-matching provides an opportunity for tailored outreach utilizing a community-based participatory approach (CBPA, which involves all stakeholders in the planning and implementation of the science outreach based on the interests of teachers/students and scientists. E-matching is a timely and urgent endeavor that provides a rapid connection for science engagement between teachers/students and experts in an effort to fill the science outreach gap. National Lab Network (formerly National Lab Day, an ongoing initiative to increase science equity and literacy, provides a model for engaging the public in science via an E-matching and hands-on learning approach. We argue that science outreach should be a dynamic endeavor that changes according to the needs of a target school. We will describe a case study of a tailored science outreach activity in which a public school that serves mostly under-represented minority students from disadvantaged backgrounds were E-matched with a university, and subsequently became equal partners in the development of the science outreach plan. In addition, we will show how global science outreach endeavors may utilize a CBPA, like E-matching, to support a pipeline to science among under-represented minority students and students from disadvantaged backgrounds. By merging the CBPA concept with a practical case example, we hope to inform science outreach practices via the lens of a tailored E-matching approach.

  11. Impact of genomics approaches on plant genetics and physiology.

    Science.gov (United States)

    Tabata, Satoshi

    2002-08-01

    Comprehensive analysis of genetic information in higher plants is under way for several plants of biological and agronomical importance. Among them, Arabidopsis thaliana, a member of Brassica family, and Oryza sativa(rice) have been chosen as model plants most suitable for genome analysis. Sequencing of the genome of A. thaliana was completed in December 2000, and rice genome sequencing is in progress. The accumulated genome sequences, together with the hundreds of thousands of ESTs from several tens of plant species, have drastically changed the strategy of plant genetics. By utilizing the information on the genome and gene structures, comprehensive approaches for genome-wide functional analysis of the genes, including transcriptome analysis using microarray systems and a comprehensive analysis of a large number of insertion mutant lines, have been widely adopted. As a consequence, a large quantity of information on both the structure and function of genes in these model plants has been accumulated. However, other plant species may have their own characteristics and advantages to study individual phenomena. Application of knowledge from the model plants to other plant species and vice versa through the common language, namely the genome information, should facilitate understanding of the genetic systems underlying a variety of biological phenomena. Introduction of this common language may not be very simple, especially in the case of complex pathways such as a process of cell-covering formation. Nevertheless, it should be emphasized that genomics approaches are the most promising way to understand these processes.

  12. Mystery behind the match: an undergraduate medical education–graduate medical education collaborative approach to understanding match goals and outcomes

    Directory of Open Access Journals (Sweden)

    Alisa Nagler

    2016-09-01

    Full Text Available Background: There is a paucity of information regarding institutional targets for the number of undergraduate medical education (UME graduates being matched to graduate medical education (GME programs at their home institutions. At our institution, the Duke University, the number of UME graduates matched to GME programs declined dramatically in 2011. To better understand why this decline may have happened, we sought to identify perceived quality metrics for UME and GME learners, evaluate trends in match outcomes and educational program characteristics, and explore whether there is an ideal retention rate for UME graduates in their home institutions’ GME programs. Methods: We analyzed the number of Duke University UME graduates remaining at Duke for GME training over the past 5 years. We collected data to assess for changing characteristics of UME and GME, and performed descriptive analysis of trends over time to investigate the potential impact on match outcomes. Results: A one-sample t-test analysis showed no statistically significant difference in the number of Duke UME graduates who stayed for GME training. For both UME and GME, no significant changes in the characteristics of either program were found. Discussion: We created a process for monitoring data related to the characteristics or perceived quality of UME and GME programs and developed a shared understanding of what may impact match lists for both UME graduates and GME programs, leaving the Match somewhat less mysterious. While we understand the trend of graduates remaining at their home institutions for GME training, we are uncertain whether setting a goal for retention is reasonable, and so some mystery remains. We believe there is an invaluable opportunity for collaboration between UME and GME stakeholders to facilitate discussion about setting shared institutional goals.

  13. A Bayesian approach to matched field processing in uncertain ocean environments

    Institute of Scientific and Technical Information of China (English)

    LI Jianlong; PAN Xiang

    2008-01-01

    An approach of Bayesian Matched Field Processing(MFP)was discussed in the uncertain ocean environment.In this approach,uncertainty knowledge is modeled and spatial and temporal data Received by the array are fully used.Therefore,a mechanism for MFP is found.which well combines model-based and data-driven methods of uncertain field processing.By theoretical derivation,simulation analysis and the validation of the experimental array data at sea,we find that(1)the basic components of Bayesian matched field processors are the corresponding sets of Bartlett matched field processor,MVDR(minimum variance distortionless response)matched field processor,etc.;(2)Bayesian MVDR/Bartlett MFP are the weighted sum of the MVDR/Bartlett MFP,where the weighted coefficients are the values of the a posteriori probability;(3)with the uncertain ocean environment,Bayesian MFP can more correctly locate the source than MVDR MFP or Bartlett MFP;(4)Bayesian MFP call better suppress sidelobes of the ambiguity surfaces.

  14. Comparative genomic and proteomic analysis of high grade glioma primary cultures and matched tumor in situ.

    LENUS (Irish Health Repository)

    Howley, R

    2012-10-15

    Developing targeted therapies for high grade gliomas (HGG), the most common primary brain tumor in adults, relies largely on glioma cultures. However, it is unclear if HGG tumorigenic signaling pathways are retained under in-vitro conditions. Using array comparative genomic hybridization and immunohistochemical profiling, we contrasted the epidermal and platelet-derived growth factor receptor (EGFR\\/PDGFR) in-vitro pathway status of twenty-six primary HGG cultures with the pathway status of their original HGG biopsies. Genomic gains or amplifications were lost during culturing while genomic losses were more likely to be retained. Loss of EGFR amplification was further verified immunohistochemically when EGFR over expression was decreased in the majority of cultures. Conversely, PDGFRα and PDGFRβ were more abundantly expressed in primary cultures than in the original tumor (p<0.05). Despite these genomic and proteomic differences, primary HGG cultures retained key aspects of dysregulated tumorigenic signaling. Both in-vivo and in-vitro the presence of EGFR resulted in downstream activation of P70s6K while reduced downstream activation was associated with the presence of PDGFR and the tumor suppressor, PTEN. The preserved pathway dysregulation make this glioma model suitable for further studies of glioma tumorigenesis, however individual culture related differences must be taken into consideration when testing responsiveness to chemotherapeutic agents.

  15. Public-Private Wage Gap In Latin America (1999-2007): A Matching Approach

    OpenAIRE

    Alejandra Mizala; Pilar Romaguera; Sebastian Gallegos

    2010-01-01

    Using matching methods, we estimate the public-private wage gap in seven Latin American countries—Argentina, Bolivia, Brazil, Chile, Costa Rica, Paraguay and Uruguay—for the years 1999 and 2007. These methods do not require any estimation of earnings equations and hence no validity-out-of-the-support assumptions; furthermore, this approach allows us to estimate not only the average wage gap but also its distribution. Our main findings indicate that the average public sector worker earns more ...

  16. Contemporary approaches for modifying the mouse genome

    Science.gov (United States)

    Adams, David J.; van der Weyden, Louise

    2008-01-01

    The mouse is a premiere experimental organism that has contributed significantly to our understanding of vertebrate biology. Manipulation of the mouse genome via embryonic stem (ES) cell technology makes it possible to engineer an almost limitless repertoire of mutations to model human disease and assess gene function. In this review we outline recent advances in mouse experimental genetics and provide a “how-to” guide for those people wishing to access this technology. We also discuss new technologies, such as transposon-mediated mutagenesis, and resources of targeting vectors and ES cells, which are likely to dramatically accelerate the pace with which we can assess gene function in vivo, and the progress of forward and reverse genetic screens in mice. PMID:18559964

  17. A polytomous conditional likelihood approach for combining matched and unmatched case-control studies.

    Science.gov (United States)

    Gebregziabher, Mulugeta; Guimaraes, Paulo; Cozen, Wendy; Conti, David V

    2010-04-30

    In genetic association studies it is becoming increasingly imperative to have large sample sizes to identify and replicate genetic effects. To achieve these sample sizes, many research initiatives are encouraging the collaboration and combination of several existing matched and unmatched case-control studies. Thus, it is becoming more common to compare multiple sets of controls with the same case group or multiple case groups to validate or confirm a positive or negative finding. Usually, a naive approach of fitting separate models for each case-control comparison is used to make inference about disease-exposure association. But, this approach does not make use of all the observed data and hence could lead to inconsistent results. The problem is compounded when a common case group is used in each case-control comparison. An alternative to fitting separate models is to use a polytomous logistic model but, this model does not combine matched and unmatched case-control data. Thus, we propose a polytomous logistic regression approach based on a latent group indicator and a conditional likelihood to do a combined analysis of matched and unmatched case-control data. We use simulation studies to evaluate the performance of the proposed method and a case-control study of multiple myeloma and Inter-Leukin-6 as an example. Our results indicate that the proposed method leads to a more efficient homogeneity test and a pooled estimate with smaller standard error.

  18. Single Cell HLA Matching Feasibility by Whole Genomic Amplification and Nested PCR

    Institute of Scientific and Technical Information of China (English)

    Xiao-hong Li; Fang-yin Meng

    2004-01-01

    @@ PCR based single-cell DNA analysis has been widely used in forensic science, preimplantation genetic diagnosis and so on. However, the original sample cannot be efficiently retrieved following single cell PCR, consequently the amount of information gained is limited. HLA system is too sophisticated that it is very hard to complete HLA typing by single cell. A Taq polymerase-based method using random primers to amplify whole genome termed as whole genome amplification (WGA) has demonstrated to be a useful method in increasing the copies of minimum sample. We establish a technique in this study to amplify HLA-A and HLA-B loci at same time in a single cell using WGA.

  19. HANDBOOK OF SOCCER MATCH ANALYSIS: A SYSTEMATIC APPROACH TO IMPROVING PERFORMANCE

    Directory of Open Access Journals (Sweden)

    Christopher Carling

    2006-03-01

    Analysis Tells Us about Successful Strategy and Tactics in Soccer, 8. From Technical and Tactical Performance Analysis to Training Drills, 9. The Future of Soccer Match Analysis. ASSESSMENT The authors have assembled an essential reading for all who are interested in understanding and doing better coaching and improving the performance in soccer. To this purpose, there is a strong practical approach in the book by giving plenty of examples along with a satisfactory scientific analysis of the subject area. It is concise and well organized in its presentation, creating an effective textbook. I believe, therefore, the book will serve as a first-rate teaching tool and reference for coaches, athletes and professionals in the human performance sciences.

  20. Genome classification by gene distribution: An overlapping subspace clustering approach

    Directory of Open Access Journals (Sweden)

    Halgamuge Saman K

    2008-04-01

    Full Text Available Abstract Background Genomes of lower organisms have been observed with a large amount of horizontal gene transfers, which cause difficulties in their evolutionary study. Bacteriophage genomes are a typical example. One recent approach that addresses this problem is the unsupervised clustering of genomes based on gene order and genome position, which helps to reveal species relationships that may not be apparent from traditional phylogenetic methods. Results We propose the use of an overlapping subspace clustering algorithm for such genome classification problems. The advantage of subspace clustering over traditional clustering is that it can associate clusters with gene arrangement patterns, preserving genomic information in the clusters produced. Additionally, overlapping capability is desirable for the discovery of multiple conserved patterns within a single genome, such as those acquired from different species via horizontal gene transfers. The proposed method involves a novel strategy to vectorize genomes based on their gene distribution. A number of existing subspace clustering and biclustering algorithms were evaluated to identify the best framework upon which to develop our algorithm; we extended a generic subspace clustering algorithm called HARP to incorporate overlapping capability. The proposed algorithm was assessed and applied on bacteriophage genomes. The phage grouping results are consistent overall with the Phage Proteomic Tree and showed common genomic characteristics among the TP901-like, Sfi21-like and sk1-like phage groups. Among 441 phage genomes, we identified four significantly conserved distribution patterns structured by the terminase, portal, integrase, holin and lysin genes. We also observed a subgroup of Sfi21-like phages comprising a distinctive divergent genome organization and identified nine new phage members to the Sfi21-like genus: Staphylococcus 71, phiPVL108, Listeria A118, 2389, Lactobacillus phi AT3, A2

  1. Genome-Wide Approaches to Drosophila Heart Development

    Directory of Open Access Journals (Sweden)

    Manfred Frasch

    2016-05-01

    Full Text Available The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level.

  2. A New Approach towards Bibliographic Reference Identification, Parsing and Inline Citation Matching

    Science.gov (United States)

    Gupta, Deepank; Morris, Bob; Catapano, Terry; Sautter, Guido

    A number of algorithms and approaches have been proposed towards the problem of scanning and digitizing research papers. We can classify work done in the past into three major approaches: regular expression based heuristics, learning based algorithm and knowledge based systems. Our findings point to the inadequacy of existing open-source solutions such as Paracite for papers with “micro-citations” in various European Languages. This paper describes the work done as part of the Google Summer of Code 2008 using a combination of regular-expression based heuristics and knowledge-based systems to develop a system which matches inline citations to their corresponding bibliographic references and identifies and extracts metadata from references. The description, implementation and results of our approach have been presented here. Our approach enhances the accuracy and provides better recognition rates.

  3. A validation of a posture matching approach for the determination of 3D cumulative back loads.

    Science.gov (United States)

    Sutherland, Chad A; Albert, Wayne J; Wrigley, Allan T; Callaghan, Jack P

    2008-03-01

    The purpose of this project was to investigate the amount of error in calculating cumulative lumbar spine kinetics using a posture matching approach (3DMatch) compared to a 3D coordinate electromagnetic tracking approach (FASTRAK). Six subjects were required to perform five repeats each of two symmetrical and two asymmetrical lifts while being simultaneously recorded from 4 camera views at viewing angles of 0 degrees , 45 degrees , 60 degrees and 90 degrees to the sagittal plane while wearing eight FASTRAK sensors to define an 8 segment rigid link model (RLM) of the head, arms, and trunk. Four hundred and eighty lifts (6 subjects x20 lifts x4 camera views) were analyzed using the 3DMatch posture-matching program to calculate the following cumulative loads at the L4/L5 joint: compression, anterior shear, posterior shear, reaction shear and extension moment. The errors in cumulative load calculation were determined as the difference between the values calculated for the same lifts using a 3D RLM that used electromagnetic motion tracking sensors (FASTRAK) positioned at the segment center of masses as model inputs. No significant difference (pposture matching by trained users can provide reasonable 3D data to calculate cumulative low back loads with a biomechanical model.

  4. Face recognition using elastic grid matching through photoshop: A new approach

    Directory of Open Access Journals (Sweden)

    Manavpreet Kaur

    2015-12-01

    Full Text Available Computing grids propose to be a very efficacious, economic and ascendable way of image identification. In this paper, we propose a grid based face recognition overture employing a general template matching method to solve the timeconsuming face recognition problem. A new approach has been employed in which the grid was prepared for a specific individual over his photograph using Adobe Photoshop CS5 software. The background was later removed and the grid prepared by merging layers was used as a template for image matching or comparison. This overture is computationally efficient, has high recognition rates and is able to identify a person with minimal efforts and in short time even from photographs taken at different magnifications and from different distances.

  5. A robust approach to optimal matched filter design in ultrasonic non-destructive evaluation (NDE)

    Science.gov (United States)

    Li, Minghui; Hayward, Gordon

    2017-02-01

    The matched filter was demonstrated to be a powerful yet efficient technique to enhance defect detection and imaging in ultrasonic non-destructive evaluation (NDE) of coarse grain materials, provided that the filter was properly designed and optimized. In the literature, in order to accurately approximate the defect echoes, the design utilized the real excitation signals, which made it time consuming and less straightforward to implement in practice. In this paper, we present a more robust and flexible approach to optimal matched filter design using the simulated excitation signals, and the control parameters are chosen and optimized based on the real scenario of array transducer, transmitter-receiver system response, and the test sample, as a result, the filter response is optimized and depends on the material characteristics. Experiments on industrial samples are conducted and the results confirm the great benefits of the method.

  6. Bioagent Sample Matching using Elemental Composition Data: an Approach to Validation

    Energy Technology Data Exchange (ETDEWEB)

    Velsko, S P

    2006-04-21

    Sample matching is a fundamental capability that can have high probative value in a forensic context if proper validation studies are performed. In this report we discuss the potential utility of using the elemental composition of two bioagent samples to decide if they were produced in the same batch, or by the same process. Using guidance from the recent NRC study of bullet lead analysis and other sources, we develop a basic likelihood ratio framework for evaluating the evidentiary weight of elemental analysis data for sample matching. We define an objective metric for comparing two samples, and propose a method for constructing an unbiased population of test samples. We illustrate the basic methodology with some existing data on dry Bacillus thuringiensis preparations, and outline a comprehensive plan for experimental validation of this approach.

  7. A Genomic View of Lactobacilli and Pediococci Demonstrates that Phylogeny Matches Ecology and Physiology.

    Science.gov (United States)

    Zheng, Jinshui; Ruan, Lifang; Sun, Ming; Gänzle, Michael

    2015-10-01

    Lactobacilli are used widely in food, feed, and health applications. The taxonomy of the genus Lactobacillus, however, is confounded by the apparent lack of physiological markers for phylogenetic groups of lactobacilli and the unclear relationships between the diverse phylogenetic groups. This study used the core and pan-genomes of 174 type strains of Lactobacillus and Pediococcus to establish phylogenetic relationships and to identify metabolic properties differentiating phylogenetic groups. The core genome phylogenetic tree separated homofermentative lactobacilli and pediococci from heterofermentative lactobacilli. Aldolase and phosphofructokinase were generally present in homofermentative but not in heterofermentative lactobacilli; a two-domain alcohol dehydrogenase and mannitol dehydrogenase were present in most heterofermentative lactobacilli but absent in most homofermentative organisms. Other genes were predominantly present in homofermentative lactobacilli (pyruvate formate lyase) or heterofermentative lactobacilli (lactaldehyde dehydrogenase and glycerol dehydratase). Cluster analysis of the phylogenomic tree and the average nucleotide identity grouped the genus Lactobacillus sensu lato into 24 phylogenetic groups, including pediococci, with stable intra- and intergroup relationships. Individual groups may be differentiated by characteristic metabolic properties. The link between phylogeny and physiology that is proposed in this study facilitates future studies on the ecology, physiology, and industrial applications of lactobacilli.

  8. Genetic and genomic approaches to understanding macrophage identity and function.

    Science.gov (United States)

    Glass, Christopher K

    2015-04-01

    A major goal of our laboratory is to understand the molecular mechanisms that underlie the development and functions of diverse macrophage phenotypes in health and disease. Recent studies using genetic and genomic approaches suggest a relatively simple model of collaborative and hierarchical interactions between lineage-determining and signal-dependent transcription factors that enable selection and activation of transcriptional enhancers that specify macrophage identity and function. In addition, we have found that it is possible to use natural genetic variation as a powerful tool for advancing our understanding of how the macrophage deciphers the information encoded by the genome to attain specific phenotypes in a context-dependent manner. Here, I will describe our recent efforts to extend genetic and genomic approaches to investigate the roles of distinct tissue environments in determining the phenotypes of different resident populations of macrophages.

  9. Patient-controlled encrypted genomic data: an approach to advance clinical genomics

    Directory of Open Access Journals (Sweden)

    Trakadis Yannis J

    2012-07-01

    Full Text Available Abstract Background The revolution in DNA sequencing technologies over the past decade has made it feasible to sequence an individual’s whole genome at a relatively low cost. The potential value of the information generated by genomic technologies for medicine and society is enormous. However, in order for exome sequencing, and eventually whole genome sequencing, to be implemented clinically, a number of major challenges need to be overcome. For instance, obtaining meaningful informed-consent, managing incidental findings and the great volume of data generated (including multiple findings with uncertain clinical significance, re-interpreting the genomic data and providing additional counselling to patients as genetic knowledge evolves are issues that need to be addressed. It appears that medical genetics is shifting from the present “phenotype-first” medical model to a “data-first” model which leads to multiple complexities. Discussion This manuscript discusses the different challenges associated with integrating genomic technologies into clinical practice and describes a “phenotype-first” approach, namely, “Individualized Mutation-weighed Phenotype Search”, and its benefits. The proposed approach allows for a more efficient prioritization of the genes to be tested in a clinical lab based on both the patient’s phenotype and his/her entire genomic data. It simplifies “informed-consent” for clinical use of genomic technologies and helps to protect the patient’s autonomy and privacy. Overall, this approach could potentially render widespread use of genomic technologies, in the immediate future, practical, ethical and clinically useful. Summary The “Individualized Mutation-weighed Phenotype Search” approach allows for an incremental integration of genomic technologies into clinical practice. It ensures that we do not over-medicalize genomic data but, rather, continue our current medical model which is based on serving

  10. A pattern matching approach for the estimation of alignment between any two given DNA sequences.

    Science.gov (United States)

    Basu, K; Sriraam, N; Richard, R J A

    2007-08-01

    For a given DNA sequence, it is well known that pair wise alignment schemes are used to determine the similarity with the DNA sequences available in the databanks. The efficiency of the alignment decides the type of amino acids and its corresponding proteins. In order to evaluate the given DNA sequence for its proteomic identity, a pattern matching approach is proposed in this paper. A block based semi-global alignment scheme is introduced to determine the similarity between the DNA sequences (known and given). The two DNA sequences are divided into blocks of equal length and alignment is performed which minimizes the computational complexity. The efficiency of the alignment scheme is evaluated using the parameter, percentage of similarity (POS). Four essential DNA version of the amino acids that emphasize the importance of proteomic functionalities are chosen as patterns and matching is performed with the known and given DNA sequences to determine the similarity between them. The ratio of amino acid counts between the two sequences is estimated and the results are compared with that of the POS value. It is found from the experimental results that higher the POS value and the pattern matching higher are the similarity between the two DNA sequences. The optimal block is also identified based on the POS value and amino acids count.

  11. Histogram Bins Matching Approach for CBIR Based on Linear grouping for Dimensionality Reduction

    Directory of Open Access Journals (Sweden)

    H. B. Kekre

    2013-11-01

    Full Text Available This paper describes the histogram bins matching approach for CBIR. Histogram bins are reduced from 256 to 32 and 16 by linear grouping and effect of this dimensionality reduction is analyzed, compared, and evaluated. Work presented in this paper contributes in all three main phases of CBIR that are feature extraction, similarity matching and performance evaluation. Feature extraction explores the idea of histogram bins matching for three colors R, G and B. Histogram bin contents are used to represent the feature vector in three forms. First form of feature is count of pixels, and then other forms are obtained by computing the total and mean of intensities for the pixels falling in each of the histogram bins. Initially the size of the feature vector is 256 components as histogram with the all 256 bins. Further the size of the feature vector is reduced to 32 bins and then 16 bins by simple linear grouping of the bins. Feature extraction processes for each size and type of the feature vector is executed over the database of 2000 BMP images having 20 different classes. It prepares the feature vector databases as preprocessing part of this work. Similarity matching between query and database image feature vectors is carried out by means of first five orders of Minkowski distance and also with the cosine correlation distance. Same set of 200 query images are executed for all types of feature vector and for all similarity measures. Performance of all aspects addressed in this paper are evaluated using three parameters PRCP (Precision Recall Cross over Point, LS (longest string, LSRR (Length of String to Retrieve all Relevant images.

  12. Deciphering Squamous Cell Carcinoma Using Multidimensional Genomic Approaches

    Directory of Open Access Journals (Sweden)

    Ewan A. Gibb

    2011-01-01

    Full Text Available Squamous cell carcinomas (SqCCs arise in a wide range of tissues including skin, lung, and oral mucosa. Although all SqCCs are epithelial in origin and share common nomenclature, these cancers differ greatly with respect to incidence, prognosis, and treatment. Current knowledge of genetic similarities and differences between SqCCs is insufficient to describe the biology of these cancers, which arise from diverse tissue origins. In this paper we provide a general overview of whole genome approaches for gene and pathway discovery and highlight the advancement of integrative genomics as a state-of-the-art technology in the study of SqCC genetics.

  13. A new approach for using genome scans to detect recent positive selection in the human genome.

    Directory of Open Access Journals (Sweden)

    Kun Tang

    2007-07-01

    Full Text Available Genome-wide scanning for signals of recent positive selection is essential for a comprehensive and systematic understanding of human adaptation. Here, we present a genomic survey of recent local selective sweeps, especially aimed at those nearly or recently completed. A novel approach was developed for such signals, based on contrasting the extended haplotype homozygosity (EHH profiles between populations. We applied this method to the genome single nucleotide polymorphism (SNP data of both the International HapMap Project and Perlegen Sciences, and detected widespread signals of recent local selection across the genome, consisting of both complete and partial sweeps. A challenging problem of genomic scans of recent positive selection is to clearly distinguish selection from neutral effects, given the high sensitivity of the test statistics to departures from neutral demographic assumptions and the lack of a single, accurate neutral model of human history. We therefore developed a new procedure that is robust across a wide range of demographic and ascertainment models, one that indicates that certain portions of the genome clearly depart from neutrality. Simulations of positive selection showed that our tests have high power towards strong selection sweeps that have undergone fixation. Gene ontology analysis of the candidate regions revealed several new functional groups that might help explain some important interpopulation differences in phenotypic traits.

  14. Matching the proteome to the genome : the microbody of penicillin-producing Penicillium chrysogenum cells

    NARCIS (Netherlands)

    Kiel, Jan A. K. W.; van den Berg, Marco A.; Fusetti, Fabrizia; Poolman, Bert; Bovenberg, Roel A. L.; Veenhuis, Marten; van der Klei, Ida J.

    2009-01-01

    In the filamentous fungus Penicillium chrysogenum, microbodies are essential for penicillin biosynthesis. To better understand the role of these organelles in antibiotics production, we determined the matrix enzyme contents of P. chrysogenum microbodies. Using a novel in silico approach, we first ob

  15. An evaluation of the genetic-matched pair study design using genome-wide SNP data from the European population

    DEFF Research Database (Denmark)

    Lu, Timothy Tehua; Lao, Oscar; Nothnagel, Michael;

    2009-01-01

    of cases (76.0%), the BOM of a given individual, based on the complete marker set, came from a different recruitment site than the individual itself. A second marker set, specifically selected for ancestry sensitivity using singular value decomposition, performed even more poorly and was no more capable......Chip Human Mapping 500K Array) from 2457 individuals, sampled at 23 different recruitment sites across Europe. Using pair-wise identity-by-state (IBS) as a matching criterion, we tried to derive a subset of markers that would allow identification of the best overall matching (BOM) partner for a given...... individual, based on the IBS status for the subset alone. However, our results suggest that, by following this approach, the prediction accuracy is only notably improved by the first 20 markers selected, and increases proportionally to the marker number thereafter. Furthermore, in a considerable proportion...

  16. Combining genomic and proteomic approaches for epigenetics research

    Science.gov (United States)

    Han, Yumiao; Garcia, Benjamin A

    2014-01-01

    Epigenetics is the study of changes in gene expression or cellular phenotype that do not change the DNA sequence. In this review, current methods, both genomic and proteomic, associated with epigenetics research are discussed. Among them, chromatin immunoprecipitation (ChIP) followed by sequencing and other ChIP-based techniques are powerful techniques for genome-wide profiling of DNA-binding proteins, histone post-translational modifications or nucleosome positions. However, mass spectrometry-based proteomics is increasingly being used in functional biological studies and has proved to be an indispensable tool to characterize histone modifications, as well as DNA–protein and protein–protein interactions. With the development of genomic and proteomic approaches, combination of ChIP and mass spectrometry has the potential to expand our knowledge of epigenetics research to a higher level. PMID:23895656

  17. Demons deformable registration of CT and cone-beam CT using an iterative intensity matching approach

    Energy Technology Data Exchange (ETDEWEB)

    Nithiananthan, Sajendra; Schafer, Sebastian; Uneri, Ali [Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21205 (United States); and others

    2011-04-15

    Purpose: A method of intensity-based deformable registration of CT and cone-beam CT (CBCT) images is described, in which intensity correction occurs simultaneously within the iterative registration process. The method preserves the speed and simplicity of the popular Demons algorithm while providing robustness and accuracy in the presence of large mismatch between CT and CBCT voxel values (''intensity''). Methods: A variant of the Demons algorithm was developed in which an estimate of the relationship between CT and CBCT intensity values for specific materials in the image is computed at each iteration based on the set of currently overlapping voxels. This tissue-specific intensity correction is then used to estimate the registration output for that iteration and the process is repeated. The robustness of the method was tested in CBCT images of a cadaveric head exhibiting a broad range of simulated intensity variations associated with x-ray scatter, object truncation, and/or errors in the reconstruction algorithm. The accuracy of CT-CBCT registration was also measured in six real cases, exhibiting deformations ranging from simple to complex during surgery or radiotherapy guided by a CBCT-capable C-arm or linear accelerator, respectively. Results: The iterative intensity matching approach was robust against all levels of intensity variation examined, including spatially varying errors in voxel value of a factor of 2 or more, as can be encountered in cases of high x-ray scatter. Registration accuracy without intensity matching degraded severely with increasing magnitude of intensity error and introduced image distortion. A single histogram match performed prior to registration alleviated some of these effects but was also prone to image distortion and was quantifiably less robust and accurate than the iterative approach. Within the six case registration accuracy study, iterative intensity matching Demons reduced mean TRE to (2.5{+-}2.8) mm

  18. The benefits of a laparoscopic approach in ileal pouch anal anastomosis formation: a single institutional retrospective case-matched experience.

    LENUS (Irish Health Repository)

    Kelly, J

    2010-06-01

    A laparoscopic approach to ileoanal pouch formation is novel. By using prospectively gathered data, laparoscopic and open restorative proctocolectomy procedures in mucosal ulcerative colitis (UC) and familial adenomatous polyposis (FAP) patients were compared using a case-matched design.

  19. Precursor-centric genome-mining approach for lasso peptide discovery.

    Science.gov (United States)

    Maksimov, Mikhail O; Pelczer, István; Link, A James

    2012-09-18

    Lasso peptides are a class of ribosomally synthesized posttranslationally modified natural products found in bacteria. Currently known lasso peptides have a diverse set of pharmacologically relevant activities, including inhibition of bacterial growth, receptor antagonism, and enzyme inhibition. The biosynthesis of lasso peptides is specified by a cluster of three genes encoding a precursor protein and two enzymes. Here we develop a unique genome-mining algorithm to identify lasso peptide gene clusters in prokaryotes. Our approach involves pattern matching to a small number of conserved amino acids in precursor proteins, and thus allows for a more global survey of lasso peptide gene clusters than does homology-based genome mining. Of more than 3,000 currently sequenced prokaryotic genomes, we found 76 organisms that are putative lasso peptide producers. These organisms span nine bacterial phyla and an archaeal phylum. To provide validation of the genome-mining method, we focused on a single lasso peptide predicted to be produced by the freshwater bacterium Asticcacaulis excentricus. Heterologous expression of an engineered, minimal gene cluster in Escherichia coli led to the production of a unique lasso peptide, astexin-1. At 23 aa, astexin-1 is the largest lasso peptide isolated to date. It is also highly polar, in contrast to many lasso peptides that are primarily hydrophobic. Astexin-1 has modest antimicrobial activity against its phylogenetic relative Caulobacter crescentus. The solution structure of astexin-1 was determined revealing a unique topology that is stabilized by hydrogen bonding between segments of the peptide.

  20. Whole genome phylogeny of Prochlorococcus marinus group of cyanobacteria: genome alignment and overlapping gene approach.

    Science.gov (United States)

    Prabha, Ratna; Singh, Dhananjaya P; Gupta, Shailendra K; Rai, Anil

    2014-06-01

    Prochlorococcus is the smallest known oxygenic phototrophic marine cyanobacterium dominating the mid-latitude oceans. Physiologically and genetically distinct P. marinus isolates from many oceans in the world were assigned two different groups, a tightly clustered high-light (HL)-adapted and a divergent low-light (LL-) adapted clade. Phylogenetic analysis of this cyanobacterium on the basis of 16S rRNA and other conserved genes did not show consistency with its phenotypic behavior. We analyzed phylogeny of this genus on the basis of complete genome sequences through genome alignment, overlapping-gene content and gene-order approach. Phylogenetic tree of P. marinus obtained by comparing whole genome sequences in contrast to that based on 16S rRNA gene, corresponded well with the HL/LL ecotypic distinction of twelve strains and showed consistency with phenotypic classification of P. marinus. Evidence for the horizontal descent and acquisition of genes within and across the genus was observed. Many genes involved in metabolic functions were found to be conserved across these genomes and many were continuously gained by different strains as per their needs during the course of their evolution. Consistency in the physiological and genetic phylogeny based on whole genome sequence is established. These observations improve our understanding about the adaptation and diversification of these organisms under evolutionary pressure.

  1. A new approach of QRS complex detection based on matched filtering and triangle character analysis.

    Science.gov (United States)

    Li, Yanjun; Yan, Hong; Hong, Feng; Song, Jinzhong

    2012-09-01

    QRS complex detection usually provides the fundamentals to automated electrocardiogram (ECG) analysis. In this paper, a new approach of QRS complex detection without the stage of noise suppression was developed and evaluated, which was based on the combination of two techniques: matched filtering and triangle character analysis. Firstly, a template of QRS complex was selected automatically by the triangle character in ECG, and then it was time-reversed after removing its direct current component. Secondly, matched filtering was implemented at low computational cost by finite impulse response, which further enhanced QRS complex and attenuated non-QRS regions containing P-wave, T-wave and various noise components. Subsequently, triangle structure-based threshold decision was processed to detect QRS complexes. And RR intervals and triangle structures were further analyzed for the reduction of false-positive and false-negative detections. Finally, the performance of the proposed algorithm was tested on all 48 records of the MIT-BIH Arrhythmia Database. The results demonstrated that the detection rate reached 99.62 %, the sensitivity got 99.78 %, and the positive prediction was 99.85 %. In addition, the proposed method was able to identify QRS complexes reliably even under the condition of poor signal quality.

  2. Matching the proteome to the genome: the microbody of penicillin-producing Penicillium chrysogenum cells.

    Science.gov (United States)

    Kiel, Jan A K W; van den Berg, Marco A; Fusetti, Fabrizia; Poolman, Bert; Bovenberg, Roel A L; Veenhuis, Marten; van der Klei, Ida J

    2009-05-01

    In the filamentous fungus Penicillium chrysogenum, microbodies are essential for penicillin biosynthesis. To better understand the role of these organelles in antibiotics production, we determined the matrix enzyme contents of P. chrysogenum microbodies. Using a novel in silico approach, we first obtained a catalogue of 200 P. chrysogenum proteins with putative microbody targeting signals (PTSs). This included two orthologs of proteins involved in cephalosporin biosynthesis, which we demonstrate to be bona fide microbody matrix constituents. Subsequently, we performed a proteomics based inventory of P. chrysogenum microbody matrix proteins using nano-LC-MS/MS analysis. We identified 89 microbody proteins, 79 with a PTS, including the two known microbody-borne penicillin biosynthesis enzymes, isopenicillin N:acyl CoA acyltransferase and phenylacetyl-CoA ligase. Comparative analysis revealed that 69 out of 79 PTS proteins identified experimentally were in the reference list. A prominent microbody protein was identified as a novel fumarate reductase-cytochrome b5 fusion protein, which contains an internal PTS2 between the two functional domains. We show that this protein indeed localizes to P. chrysogenum microbodies.

  3. CLARREO Approach for Reference Intercalibration of Reflected Solar Sensors: On-Orbit Data Matching and Sampling

    Science.gov (United States)

    Roithmayr, Carlos; Lukashin, Constantine; Speth, Paul W.; Kopp, Gregg; Thome, Kurt; Wielicki, Bruce A.; Young, David F.

    2014-01-01

    The implementation of the Climate Absolute Radiance and Refractivity Observatory (CLARREO) mission was recommended by the National Research Council in 2007 to provide an on-orbit intercalibration standard with accuracy of 0.3% (k = 2) for relevant Earth observing sensors. The goal of reference intercalibration, as established in the Decadal Survey, is to enable rigorous high-accuracy observations of critical climate change parameters, including reflected broadband radiation [Clouds and Earth's Radiant Energy System (CERES)], cloud properties [Visible Infrared Imaging Radiometer Suite (VIIRS)], and changes in surface albedo, including snow and ice albedo feedback. In this paper, we describe the CLARREO approach for performing intercalibration on orbit in the reflected solar (RS) wavelength domain. It is based on providing highly accurate spectral reflectance and reflected radiance measurements from the CLARREO Reflected Solar Spectrometer (RSS) to establish an on-orbit reference for existing sensors, namely, CERES and VIIRS on Joint Polar Satellite System satellites, Advanced Very High Resolution Radiometer and follow-on imagers on MetOp, Landsat imagers, and imagers on geostationary platforms. One of two fundamental CLARREO mission goals is to provide sufficient sampling of high-accuracy observations that are matched in time, space, and viewing angles with measurements made by existing instruments, to a degree that overcomes the random error sources from imperfect data matching and instrument noise. The data matching is achieved through CLARREO RSS pointing operations on orbit that align its line of sight with the intercalibrated sensor. These operations must be planned in advance; therefore, intercalibration events must be predicted by orbital modeling. If two competing opportunities are identified, one target sensor must be given priority over the other. The intercalibration method is to monitor changes in targeted sensor response function parameters: effective

  4. IMC-PID design based on model matching approach and closed-loop shaping.

    Science.gov (United States)

    Jin, Qi B; Liu, Q

    2014-03-01

    Motivated by the limitations of the conventional internal model control (IMC), this communication addresses the design of IMC-based PID in terms of the robust performance of the control system. The IMC controller form is obtained by solving an H-infinity problem based on the model matching approach, and the parameters are determined by closed-loop shaping. The shaping of the closed-loop transfer function is considered both for the set-point tracking and for the load disturbance rejection. The design procedure is formulated as a multi-objective optimization problem which is solved by a specific optimization algorithm. A nice feature of this design method is that it permits a clear tradeoff between robustness and performance. Simulation examples show that the proposed method is effective and has a wide applicability.

  5. The impact of the Indonesian health card program: a matching estimator approach.

    Science.gov (United States)

    Johar, Meliyanni

    2009-01-01

    This study evaluates the effectiveness of a pro-poor nation-wide health card program, which provides free basic health care at public health facilities in Indonesia. To quantify the effect of the program, it departs from the traditional regression-based approach in the literature. It employs propensity score matching to reduce the selection bias due to non-random health card distribution. The setting of the program and the richness of the data set support this strategy in providing accurate estimates of the program's effect on its recipients. The results indicate that, in general, the health card program only has limited impact on the consumption of primary health care by its recipients. This finding suggests the presence of other factors counteracting the generous demand incentive.

  6. A Vocabulary Approach to Partial Streamline Matching and Exploratory Flow Visualization.

    Science.gov (United States)

    Tao, Jun; Wang, Chaoli; Shene, Ching-Kuang; Shaw, Raymond A

    2016-05-01

    Measuring the similarity of integral curves is fundamental to many important flow data analysis and visualization tasks such as feature detection, pattern querying, streamline clustering, and hierarchical exploration. In this paper, we introduce FlowString, a novel vocabulary approach that extracts shape invariant features from streamlines and utilizes a string-based method for exploratory streamline analysis and visualization. Our solution first resamples streamlines by considering their local feature scales. We then classify resampled points along streamlines based on the shape similarity around their local neighborhoods. We encode each streamline into a string of well-selected shape characters, from which we construct meaningful words for querying and retrieval. A unique feature of our approach is that it captures intrinsic streamline similarity that is invariant under translation, rotation and scaling. We design an intuitive interface and user interactions to support flexible querying, allowing exact and approximate searches for partial streamline matching. Users can perform queries at either the character level or the word level, and define their own characters or words conveniently for customized search. We demonstrate the effectiveness of FlowString with several flow field data sets of different sizes and characteristics. We also extend FlowString to handle multiple data sets and perform an empirical expert evaluation to confirm the usefulness of this approach.

  7. Generalized Coupled Dictionary Learning Approach With Applications to Cross-Modal Matching.

    Science.gov (United States)

    Mandal, Devraj; Biswas, Soma

    2016-08-01

    Coupled dictionary learning (CDL) has recently emerged as a powerful technique with wide variety of applications ranging from image synthesis to classification tasks. In this paper, we extend the existing CDL approaches in two aspects to make them more suitable for the task of cross-modal matching. Data coming from different modalities may or may not be paired. For example, for image-text retrieval problem, 100 images of a class are available as opposed to only 50 samples of text data for training. Current CDL approaches are not designed to handle such scenarios, where classes of data points in one modality correspond to classes of data points in the other modality. Given the data from the two modalities, first two dictionaries are learnt for the respective modalities, so that the data have a sparse representation with respect to their own dictionaries. Then, the sparse coefficients from the two modalities are transformed in such a manner that data from the same class are maximally correlated, while that from different classes have very less correlation. This way of modeling the coupling between the sparse representations of the two modalities makes this approach work seamlessly for paired as well as unpaired data. The discriminative coupling term also makes the approach better suited for classification tasks. Experiments on different publicly available cross-modal data sets, namely, CUHK photosketch face data set, HFB visible and near-infrared facial images data set, IXMAS multiview action recognition data set, wiki image and text data set and Multiple Features data set, show that this generalized CDL approach performs better than the state-of-the-art for both paired as well as unpaired data.

  8. An improved probability mapping approach to assess genome mosaicism

    Directory of Open Access Journals (Sweden)

    Gogarten J Peter

    2003-09-01

    Full Text Available Abstract Background Maximum likelihood and posterior probability mapping are useful visualization techniques that are used to ascertain the mosaic nature of prokaryotic genomes. However, posterior probabilities, especially when calculated for four-taxon cases, tend to overestimate the support for tree topologies. Furthermore, because of poor taxon sampling four-taxon analyses suffer from sensitivity to the long branch attraction artifact. Here we extend the probability mapping approach by improving taxon sampling of the analyzed datasets, and by using bootstrap support values, a more conservative tool to assess reliability. Results Quartets of orthologous proteins were complemented with homologs from selected reference genomes. The mapping of bootstrap support values from these extended datasets gives results similar to the original maximum likelihood and posterior probability mapping. The more conservative nature of the plotted support values allows to focus further analyses on those protein families that strongly disagree with the majority or plurality of genes present in the analyzed genomes. Conclusion Posterior probability is a non-conservative measure for support, and posterior probability mapping only provides a quick estimation of phylogenetic information content of four genomes. This approach can be utilized as a pre-screen to select genes that might have been horizontally transferred. Better taxon sampling combined with subtree analyses prevents the inconsistencies associated with four-taxon analyses, but retains the power of visual representation. Nevertheless, a case-by-case inspection of individual multi-taxon phylogenies remains necessary to differentiate unrecognized paralogy and shared phylogenetic reconstruction artifacts from horizontal gene transfer events.

  9. Integrative genome-wide approaches in embryonic stem cell research.

    Science.gov (United States)

    Zhang, Xinyue; Huang, Jing

    2010-10-01

    Embryonic stem (ES) cells are derived from blastocysts. They can differentiate into the three embryonic germ layers and essentially any type of somatic cells. They therefore hold great potential in tissue regeneration therapy. The ethical issues associated with the use of human embryonic stem cells are resolved by the technical break-through of generating induced pluripotent stem (iPS) cells from various types of somatic cells. However, how ES and iPS cells self-renew and maintain their pluripotency is still largely unknown in spite of the great progress that has been made in the last two decades. Integrative genome-wide approaches, such as the gene expression microarray, chromatin immunoprecipitation based microarray (ChIP-chip) and chromatin immunoprecipitation followed by massive parallel sequencing (ChIP-seq) offer unprecedented opportunities to elucidate the mechanism of the pluripotency, reprogramming and DNA damage response of ES and iPS cells. This frontier article summarizes the fundamental biological questions about ES and iPS cells and reviews the recent advances in ES and iPS cell research using genome-wide technologies. To this end, we offer our perspectives on the future of genome-wide studies on stem cells.

  10. Assessing the hydrologic alteration of the Yangtze River using the histogram matching approach

    Science.gov (United States)

    Huang, F.; Zhang, N.; Guo, L. D.; Xia, Z. Q.

    2016-08-01

    Hydrologic changes of the Yangtze River, an important river with abundant water resources in China, were investigated using the Histogram Matching Approach. Daily streamflow data spanning the time interval from 1955 to 2013 was collected from Yichang and Datong stations, which monitor the hydrologic processes of the upper and lower reach of the Yangtze River, respectively. The Gezhouba Dam, the first dam constructed at the main stream of the Yangtze River, started operations in 1981. 1981 was used to differentiate the pre-dam (1955-1980) and post-dam (1981-2013) hydrologic regimes. The hydrologic regime was quantified by the Indicators of Hydrologic Alteration. The overall alteration degree of the upper Yangtze River was 31% and the alteration degree of every hydrologic indicator ranged from 10% to 81%. Only 1, 5 and 26 hydrologic indicators were altered at high, moderate and low degrees, respectively. The overall alteration degree of the lower Yangtze River was 30%, and the alteration degree of every hydrologic indicator ranged from 8% to 49%. No high alteration degree was detected at the Datong station. Ten hydrologic indicators were altered at moderate degrees and 22 hydrologic indicators were altered at low degrees. Significant increases could be observed for the low-flow relevant indicators, including the monthly flow from January-March, the annual minimum 1, 3, 7, 30 and 90-day flows, and the base flow index.

  11. Based on Regular Expression Matching of Evaluation of the Task Performance in WSN: A Queue Theory Approach

    Directory of Open Access Journals (Sweden)

    Jie Wang

    2014-01-01

    Full Text Available Due to the limited resources of wireless sensor network, low efficiency of real-time communication scheduling, poor safety defects, and so forth, a queuing performance evaluation approach based on regular expression match is proposed, which is a method that consists of matching preprocessing phase, validation phase, and queuing model of performance evaluation phase. Firstly, the subset of related sequence is generated in preprocessing phase, guiding the validation phase distributed matching. Secondly, in the validation phase, the subset of features clustering, the compressed matching table is more convenient for distributed parallel matching. Finally, based on the queuing model, the sensor networks of task scheduling dynamic performance are evaluated. Experiments show that our approach ensures accurate matching and computational efficiency of more than 70%; it not only effectively detects data packets and access control, but also uses queuing method to determine the parameters of task scheduling in wireless sensor networks. The method for medium scale or large scale distributed wireless node has a good applicability.

  12. Based on regular expression matching of evaluation of the task performance in WSN: a queue theory approach.

    Science.gov (United States)

    Wang, Jie; Cui, Kai; Zhou, Kuanjiu; Yu, Yanshuo

    2014-01-01

    Due to the limited resources of wireless sensor network, low efficiency of real-time communication scheduling, poor safety defects, and so forth, a queuing performance evaluation approach based on regular expression match is proposed, which is a method that consists of matching preprocessing phase, validation phase, and queuing model of performance evaluation phase. Firstly, the subset of related sequence is generated in preprocessing phase, guiding the validation phase distributed matching. Secondly, in the validation phase, the subset of features clustering, the compressed matching table is more convenient for distributed parallel matching. Finally, based on the queuing model, the sensor networks of task scheduling dynamic performance are evaluated. Experiments show that our approach ensures accurate matching and computational efficiency of more than 70%; it not only effectively detects data packets and access control, but also uses queuing method to determine the parameters of task scheduling in wireless sensor networks. The method for medium scale or large scale distributed wireless node has a good applicability.

  13. New algorithmic approaches to protein spot detection and pattern matching in two-dimensional electrophoresis gel databases.

    Science.gov (United States)

    Pleissner, K P; Hoffmann, F; Kriegel, K; Wenk, C; Wegner, S; Sahlström, A; Oswald, H; Alt, H; Fleck, E

    1999-01-01

    Protein spot identification in two-dimensional electrophoresis gels can be supported by the comparison of gel images accessible in different World Wide Web two-dimensional electrophoresis (2-DE) gel protein databases. The comparison may be performed either by visual cross-matching between gel images or by automatic recognition of similar protein spot patterns. A prerequisite for the automatic point pattern matching approach is the detection of protein spots yielding the x(s),y(s) coordinates and integrated spot intensities i(s). For this purpose an algorithm is developed based on a combination of hierarchical watershed transformation and feature extraction methods. This approach reduces the strong over-segmentation of spot regions normally produced by watershed transformation. Measures for the ellipticity and curvature are determined as features of spot regions. The resulting spot lists containing x(s),y(s),i(s)-triplets are calculated for a source as well as for a target gel image accessible in 2-DE gel protein databases. After spot detection a matching procedure is applied. Both the matching of a local pattern vs. a full 2-DE gel image and the global matching between full images are discussed. Preset slope and length tolerances of pattern edges serve as matching criteria. The local matching algorithm relies on a data structure derived from the incremental Delaunay triangulation of a point set and a two-step hashing technique. For the incremental construction of triangles the spot intensities are considered in decreasing order. The algorithm needs neither landmarks nor an a priori image alignment. A graphical user interface for spot detection and gel matching is written in the Java programming language for the Internet. The software package called CAROL (http://gelmatching.inf.fu-berlin.de) is realized in a client-server architecture.

  14. A hybrid approach for de novo human genome sequence assembly and phasing.

    Science.gov (United States)

    Mostovoy, Yulia; Levy-Sakin, Michal; Lam, Jessica; Lam, Ernest T; Hastie, Alex R; Marks, Patrick; Lee, Joyce; Chu, Catherine; Lin, Chin; Džakula, Željko; Cao, Han; Schlebusch, Stephen A; Giorda, Kristina; Schnall-Levin, Michael; Wall, Jeffrey D; Kwok, Pui-Yan

    2016-07-01

    Despite tremendous progress in genome sequencing, the basic goal of producing a phased (haplotype-resolved) genome sequence with end-to-end contiguity for each chromosome at reasonable cost and effort is still unrealized. In this study, we describe an approach to performing de novo genome assembly and experimental phasing by integrating the data from Illumina short-read sequencing, 10X Genomics linked-read sequencing, and BioNano Genomics genome mapping to yield a high-quality, phased, de novo assembled human genome.

  15. Development and clinical application of an integrative genomic approach to personalized cancer therapy.

    Science.gov (United States)

    Uzilov, Andrew V; Ding, Wei; Fink, Marc Y; Antipin, Yevgeniy; Brohl, Andrew S; Davis, Claire; Lau, Chun Yee; Pandya, Chetanya; Shah, Hardik; Kasai, Yumi; Powell, James; Micchelli, Mark; Castellanos, Rafael; Zhang, Zhongyang; Linderman, Michael; Kinoshita, Yayoi; Zweig, Micol; Raustad, Katie; Cheung, Kakit; Castillo, Diane; Wooten, Melissa; Bourzgui, Imane; Newman, Leah C; Deikus, Gintaras; Mathew, Bino; Zhu, Jun; Glicksberg, Benjamin S; Moe, Aye S; Liao, Jun; Edelmann, Lisa; Dudley, Joel T; Maki, Robert G; Kasarskis, Andrew; Holcombe, Randall F; Mahajan, Milind; Hao, Ke; Reva, Boris; Longtine, Janina; Starcevic, Daniela; Sebra, Robert; Donovan, Michael J; Li, Shuyu; Schadt, Eric E; Chen, Rong

    2016-06-01

    Personalized therapy provides the best outcome of cancer care and its implementation in the clinic has been greatly facilitated by recent convergence of enormous progress in basic cancer research, rapid advancement of new tumor profiling technologies, and an expanding compendium of targeted cancer therapeutics. We developed a personalized cancer therapy (PCT) program in a clinical setting, using an integrative genomics approach to fully characterize the complexity of each tumor. We carried out whole exome sequencing (WES) and single-nucleotide polymorphism (SNP) microarray genotyping on DNA from tumor and patient-matched normal specimens, as well as RNA sequencing (RNA-Seq) on available frozen specimens, to identify somatic (tumor-specific) mutations, copy number alterations (CNAs), gene expression changes, gene fusions, and also germline variants. To provide high sensitivity in known cancer mutation hotspots, Ion AmpliSeq Cancer Hotspot Panel v2 (CHPv2) was also employed. We integrated the resulting data with cancer knowledge bases and developed a specific workflow for each cancer type to improve interpretation of genomic data. We returned genomics findings to 46 patients and their physicians describing somatic alterations and predicting drug response, toxicity, and prognosis. Mean 17.3 cancer-relevant somatic mutations per patient were identified, 13.3-fold, 6.9-fold, and 4.7-fold more than could have been detected using CHPv2, Oncomine Cancer Panel (OCP), and FoundationOne, respectively. Our approach delineated the underlying genetic drivers at the pathway level and provided meaningful predictions of therapeutic efficacy and toxicity. Actionable alterations were found in 91 % of patients (mean 4.9 per patient, including somatic mutations, copy number alterations, gene expression alterations, and germline variants), a 7.5-fold, 2.0-fold, and 1.9-fold increase over what could have been uncovered by CHPv2, OCP, and FoundationOne, respectively. The findings altered

  16. Genomic Insights into Geothermal Spring Community Members using a 16S Agnostic Single-Cell Approach

    Science.gov (United States)

    Bowers, R. M.

    2016-12-01

    INSTUTIONS (ALL): DOE Joint Genome Institute, Walnut Creek, CA USA. Bigelow Laboratory for Ocean Sciences, East Boothbay, ME USA. Department of Biological Sciences, University of Calgary, Calgary, Alberta, Canada. ABSTRACT BODY: With recent advances in DNA sequencing, rapid and affordable screening of single-cell genomes has become a reality. Single-cell sequencing is a multi-step process that takes advantage of any number of single-cell sorting techniques, whole genome amplification (WGA), and 16S rRNA gene based PCR screening to identify the microbes of interest prior to shotgun sequencing. However, the 16S PCR based screening step is costly and may lead to unanticipated losses of microbial diversity, as cells that do not produce a clean 16S amplicon are typically omitted from downstream shotgun sequencing. While many of the sorted cells that fail the 16S PCR step likely originate from poor quality amplified DNA, some of the cells with good WGA kinetics may instead represent bacteria or archaea with 16S genes that fail to amplify due to primer mis-matches or the presence of intervening sequences. Using cell material from Dewar Creek, a hot spring in British Columbia, we sequenced all sorted cells with good WGA kinetics irrespective of their 16S amplification success. We show that this high-throughput approach to single-cell sequencing (i) can reduce the overall cost of single-cell genome production, and (ii). may lead to the discovery of previously unknown branches on the microbial tree of life.

  17. Artificial intelligence (AI)-based relational matching and multimodal medical image fusion: generalized 3D approaches

    Science.gov (United States)

    Vajdic, Stevan M.; Katz, Henry E.; Downing, Andrew R.; Brooks, Michael J.

    1994-09-01

    A 3D relational image matching/fusion algorithm is introduced. It is implemented in the domain of medical imaging and is based on Artificial Intelligence paradigms--in particular, knowledge base representation and tree search. The 2D reference and target images are selected from 3D sets and segmented into non-touching and non-overlapping regions, using iterative thresholding and/or knowledge about the anatomical shapes of human organs. Selected image region attributes are calculated. Region matches are obtained using a tree search, and the error is minimized by evaluating a `goodness' of matching function based on similarities of region attributes. Once the matched regions are found and the spline geometric transform is applied to regional centers of gravity, images are ready for fusion and visualization into a single 3D image of higher clarity.

  18. Log-Spiral Keypoint: A Robust Approach toward Image Patch Matching

    Directory of Open Access Journals (Sweden)

    Kangho Paek

    2015-01-01

    Full Text Available Matching of keypoints across image patches forms the basis of computer vision applications, such as object detection, recognition, and tracking in real-world images. Most of keypoint methods are mainly used to match the high-resolution images, which always utilize an image pyramid for multiscale keypoint detection. In this paper, we propose a novel keypoint method to improve the matching performance of image patches with the low-resolution and small size. The location, scale, and orientation of keypoints are directly estimated from an original image patch using a Log-Spiral sampling pattern for keypoint detection without consideration of image pyramid. A Log-Spiral sampling pattern for keypoint description and two bit-generated functions are designed for generating a binary descriptor. Extensive experiments show that the proposed method is more effective and robust than existing binary-based methods for image patch matching.

  19. A Search and Matching Approach to Labor Markets: Did the Natural Rate of Unemployment Rise?

    National Research Council Canada - National Science Library

    Mary C. Daly; Bart Hobijn; Ayşcegül Şahin; Robert G. Valletta

    2012-01-01

    .... Relying on a standard job search and matching framework and empirical evidence from a wide array of labor market indicators, we examine whether the natural rate of unemployment has increased since...

  20. REVERSE DESIGN APPROACH FOR MECHANISM TRAJECTORY BASED ON CODE-CHAINS MATCHING

    Institute of Scientific and Technical Information of China (English)

    ZHANG Shuyou; YI Guodong; XU Xiaofeng

    2007-01-01

    Aiming at the problem of reverse-design of mechanism, a method based on the matching of trajectory code-chains is presented. The motion trajectory of mechanism is described with code-chain,which is normalized to simplify the operation of geometric transformation. The geometric transformation formulas of scale, mirror and rotation for trajectory code-chain are defined, and the reverse design for mechanism trajectory is realized through the analysis and solution of similarity matching between the desired trajectory and the predefined trajectory. The algorithm program and prototype system of reverse design for mechanism trajectory are developed. Application samples show that the method can break the restriction of trajectory patterns in matching, meet the demand of partial matching, and overcome the influence of geometric transformation of trajectory on the reverse design for mechanism.

  1. New approach to navigation: matching sequential images to 3D terrain maps

    Science.gov (United States)

    Zhang, Tianxu; Hu, Bo; Li, Wei

    1998-03-01

    In this paper an efficient image matching algorithm is presented for use in aircraft navigation. A sequence images with each two successive images partially overlapped is sensed by a monocular optical system. 3D undulation features are recovered from the image pairs, and then matched against a reference undulation feature map. Finally, the aircraft position is estimated by minimizing Hausdorff distance measure. The simulation experiment using real terrain data is reported.

  2. Predicting disease trait with genomic data: a composite kernel approach.

    Science.gov (United States)

    Yang, Haitao; Li, Shaoyu; Cao, Hongyan; Zhang, Chichen; Cui, Yuehua

    2016-06-02

    With the advancement of biotechniques, a vast amount of genomic data is generated with no limit. Predicting a disease trait based on these data offers a cost-effective and time-efficient way for early disease screening. Here we proposed a composite kernel partial least squares (CKPLS) regression model for quantitative disease trait prediction focusing on genomic data. It can efficiently capture nonlinear relationships among features compared with linear learning algorithms such as Least Absolute Shrinkage and Selection Operator or ridge regression. We proposed to optimize the kernel parameters and kernel weights with the genetic algorithm (GA). In addition to improved performance for parameter optimization, the proposed GA-CKPLS approach also has better learning capacity and generalization ability compared with single kernel-based KPLS method as well as other nonlinear prediction models such as the support vector regression. Extensive simulation studies demonstrated that GA-CKPLS had better prediction performance than its counterparts under different scenarios. The utility of the method was further demonstrated through two case studies. Our method provides an efficient quantitative platform for disease trait prediction based on increasing volume of omics data.

  3. An integrative computational approach for prioritization of genomic variants.

    Directory of Open Access Journals (Sweden)

    Inna Dubchak

    Full Text Available An essential step in the discovery of molecular mechanisms contributing to disease phenotypes and efficient experimental planning is the development of weighted hypotheses that estimate the functional effects of sequence variants discovered by high-throughput genomics. With the increasing specialization of the bioinformatics resources, creating analytical workflows that seamlessly integrate data and bioinformatics tools developed by multiple groups becomes inevitable. Here we present a case study of a use of the distributed analytical environment integrating four complementary specialized resources, namely the Lynx platform, VISTA RViewer, the Developmental Brain Disorders Database (DBDB, and the RaptorX server, for the identification of high-confidence candidate genes contributing to pathogenesis of spina bifida. The analysis resulted in prediction and validation of deleterious mutations in the SLC19A placental transporter in mothers of the affected children that causes narrowing of the outlet channel and therefore leads to the reduced folate permeation rate. The described approach also enabled correct identification of several genes, previously shown to contribute to pathogenesis of spina bifida, and suggestion of additional genes for experimental validations. The study demonstrates that the seamless integration of bioinformatics resources enables fast and efficient prioritization and characterization of genomic factors and molecular networks contributing to the phenotypes of interest.

  4. Experimental Approaches to Study Genome Packaging of Influenza A Viruses

    Directory of Open Access Journals (Sweden)

    Catherine Isel

    2016-08-01

    Full Text Available The genome of influenza A viruses (IAV consists of eight single-stranded negative sense viral RNAs (vRNAs encapsidated into viral ribonucleoproteins (vRNPs. It is now well established that genome packaging (i.e., the incorporation of a set of eight distinct vRNPs into budding viral particles, follows a specific pathway guided by segment-specific cis-acting packaging signals on each vRNA. However, the precise nature and function of the packaging signals, and the mechanisms underlying the assembly of vRNPs into sub-bundles in the cytoplasm and their selective packaging at the viral budding site, remain largely unknown. Here, we review the diverse and complementary methods currently being used to elucidate these aspects of the viral cycle. They range from conventional and competitive reverse genetics, single molecule imaging of vRNPs by fluorescence in situ hybridization (FISH and high-resolution electron microscopy and tomography of budding viral particles, to solely in vitro approaches to investigate vRNA-vRNA interactions at the molecular level.

  5. An Integrative Computational Approach for Prioritization of Genomic Variants

    Science.gov (United States)

    Wang, Sheng; Meyden, Cem; Sulakhe, Dinanath; Poliakov, Alexander; Börnigen, Daniela; Xie, Bingqing; Taylor, Andrew; Ma, Jianzhu; Paciorkowski, Alex R.; Mirzaa, Ghayda M.; Dave, Paul; Agam, Gady; Xu, Jinbo; Al-Gazali, Lihadh; Mason, Christopher E.; Ross, M. Elizabeth; Maltsev, Natalia; Gilliam, T. Conrad

    2014-01-01

    An essential step in the discovery of molecular mechanisms contributing to disease phenotypes and efficient experimental planning is the development of weighted hypotheses that estimate the functional effects of sequence variants discovered by high-throughput genomics. With the increasing specialization of the bioinformatics resources, creating analytical workflows that seamlessly integrate data and bioinformatics tools developed by multiple groups becomes inevitable. Here we present a case study of a use of the distributed analytical environment integrating four complementary specialized resources, namely the Lynx platform, VISTA RViewer, the Developmental Brain Disorders Database (DBDB), and the RaptorX server, for the identification of high-confidence candidate genes contributing to pathogenesis of spina bifida. The analysis resulted in prediction and validation of deleterious mutations in the SLC19A placental transporter in mothers of the affected children that causes narrowing of the outlet channel and therefore leads to the reduced folate permeation rate. The described approach also enabled correct identification of several genes, previously shown to contribute to pathogenesis of spina bifida, and suggestion of additional genes for experimental validations. The study demonstrates that the seamless integration of bioinformatics resources enables fast and efficient prioritization and characterization of genomic factors and molecular networks contributing to the phenotypes of interest. PMID:25506935

  6. Approaches for Comparative Genomics in Aspergillus and Penicillium

    DEFF Research Database (Denmark)

    Rasmussen, Jane Lind Nybo; Theobald, Sebastian; Brandl, Julian

    2016-01-01

    The number of available genomes in the closely related fungal genera Aspergillus and Penicillium is rapidly increasing. At the time of writing, the genomes of 62 species are available, and an even higher number is being prepared. Fungal comparative genomics is thus becoming steadily more powerful...... and applicable for many types of studies. In this chapter, we provide an overview of the state-of-the-art of comparative genomics in these fungi, along with recommended methods. The chapter describes databases for fungal comparative genomics. Based on experience, we suggest strategies for multiple types...... of comparative genomics, ranging from analysis of single genes, over gene clusters and CaZymes to genome-scale comparative genomics. Furthermore, we have examined published comparative genomics papers to summarize the preferred bioinformatic methods and parameters for a given type of analysis, highly useful...

  7. A hybrid clustering approach to recognition of protein families in 114 microbial genomes

    Directory of Open Access Journals (Sweden)

    Gogarten J Peter

    2004-04-01

    Full Text Available Abstract Background Grouping proteins into sequence-based clusters is a fundamental step in many bioinformatic analyses (e.g., homology-based prediction of structure or function. Standard clustering methods such as single-linkage clustering capture a history of cluster topologies as a function of threshold, but in practice their usefulness is limited because unrelated sequences join clusters before biologically meaningful families are fully constituted, e.g. as the result of matches to so-called promiscuous domains. Use of the Markov Cluster algorithm avoids this non-specificity, but does not preserve topological or threshold information about protein families. Results We describe a hybrid approach to sequence-based clustering of proteins that combines the advantages of standard and Markov clustering. We have implemented this hybrid approach over a relational database environment, and describe its application to clustering a large subset of PDB, and to 328577 proteins from 114 fully sequenced microbial genomes. To demonstrate utility with difficult problems, we show that hybrid clustering allows us to constitute the paralogous family of ATP synthase F1 rotary motor subunits into a single, biologically interpretable hierarchical grouping that was not accessible using either single-linkage or Markov clustering alone. We describe validation of this method by hybrid clustering of PDB and mapping SCOP families and domains onto the resulting clusters. Conclusion Hybrid (Markov followed by single-linkage clustering combines the advantages of the Markov Cluster algorithm (avoidance of non-specific clusters resulting from matches to promiscuous domains and single-linkage clustering (preservation of topological information as a function of threshold. Within the individual Markov clusters, single-linkage clustering is a more-precise instrument, discerning sub-clusters of biological relevance. Our hybrid approach thus provides a computationally efficient

  8. Computational approaches to identify functional genetic variants in cancer genomes

    DEFF Research Database (Denmark)

    Gonzalez-Perez, Abel; Mustonen, Ville; Reva, Boris

    2013-01-01

    The International Cancer Genome Consortium (ICGC) aims to catalog genomic abnormalities in tumors from 50 different cancer types. Genome sequencing reveals hundreds to thousands of somatic mutations in each tumor but only a minority of these drive tumor progression. We present the result of discu......The International Cancer Genome Consortium (ICGC) aims to catalog genomic abnormalities in tumors from 50 different cancer types. Genome sequencing reveals hundreds to thousands of somatic mutations in each tumor but only a minority of these drive tumor progression. We present the result...

  9. Approaches for Comparative Genomics in Aspergillus and Penicillium

    DEFF Research Database (Denmark)

    Rasmussen, Jane Lind Nybo; Theobald, Sebastian; Brandl, Julian

    2016-01-01

    of comparative genomics, ranging from analysis of single genes, over gene clusters and CaZymes to genome-scale comparative genomics. Furthermore, we have examined published comparative genomics papers to summarize the preferred bioinformatic methods and parameters for a given type of analysis, highly useful...... for new fungal geneticists. Moreover, the chapter contains a detailed overview of comparative genomics studies of key fungal traits such as primary metabolism, secondary metabolism, and secretome analysis. Finally, we gaze into a possible future of the field by comparing the current state of fungal......The number of available genomes in the closely related fungal genera Aspergillus and Penicillium is rapidly increasing. At the time of writing, the genomes of 62 species are available, and an even higher number is being prepared. Fungal comparative genomics is thus becoming steadily more powerful...

  10. Joint Genome Institute's Automation Approach and History

    Energy Technology Data Exchange (ETDEWEB)

    Roberts, Simon

    2006-07-05

    Department of Energy/Joint Genome Institute (DOE/JGI) collaborates with DOE national laboratories and community users, to advance genome science in support of the DOE missions of clean bio-energy, carbon cycling, and bioremediation.

  11. Using ancestry matching to combine family-based and unrelated samples for genome-wide association studies‡

    Science.gov (United States)

    Crossett, Andrew; Kent, Brian P.; Klei, Lambertus; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn; Devlin, Bernie

    2015-01-01

    We propose a method to analyze family-based samples together with unrelated cases and controls. The method builds on the idea of matched case–control analysis using conditional logistic regression (CLR). For each trio within the family, a case (the proband) and matched pseudo-controls are constructed, based upon the transmitted and untransmitted alleles. Unrelated controls, matched by genetic ancestry, supplement the sample of pseudo-controls; likewise unrelated cases are also paired with genetically matched controls. Within each matched stratum, the case genotype is contrasted with control pseudo-control genotypes via CLR, using a method we call matched-CLR (mCLR). Eigenanalysis of numerous SNP genotypes provides a tool for mapping genetic ancestry. The result of such an analysis can be thought of as a multidimensional map, or eigenmap, in which the relative genetic similarities and differences amongst individuals is encoded in the map. Once constructed, new individuals can be projected onto the ancestry map based on their genotypes. Successful differentiation of individuals of distinct ancestry depends on having a diverse, yet representative sample from which to construct the ancestry map. Once samples are well-matched, mCLR yields comparable power to competing methods while ensuring excellent control over Type I error. PMID:20862653

  12. Using ancestry matching to combine family-based and unrelated samples for genome-wide association studies.

    Science.gov (United States)

    Crossett, Andrew; Kent, Brian P; Klei, Lambertus; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn; Devlin, Bernie

    2010-12-10

    We propose a method to analyze family-based samples together with unrelated cases and controls. The method builds on the idea of matched case-control analysis using conditional logistic regression (CLR). For each trio within the family, a case (the proband) and matched pseudo-controls are constructed, based upon the transmitted and untransmitted alleles. Unrelated controls, matched by genetic ancestry, supplement the sample of pseudo-controls; likewise unrelated cases are also paired with genetically matched controls. Within each matched stratum, the case genotype is contrasted with control/pseudo-control genotypes via CLR, using a method we call matched-CLR (mCLR). Eigenanalysis of numerous SNP genotypes provides a tool for mapping genetic ancestry. The result of such an analysis can be thought of as a multidimensional map, or eigenmap, in which the relative genetic similarities and differences amongst individuals is encoded in the map. Once constructed, new individuals can be projected onto the ancestry map based on their genotypes. Successful differentiation of individuals of distinct ancestry depends on having a diverse, yet representative sample from which to construct the ancestry map. Once samples are well-matched, mCLR yields comparable power to competing methods while ensuring excellent control over Type I error.

  13. A Match-based approach to the estimation of polar stratospheric ozone loss using Aura Microwave Limb Sounder observations

    Directory of Open Access Journals (Sweden)

    N. J. Livesey

    2015-04-01

    Full Text Available The well-established "Match" approach to quantifying chemical destruction of ozone in the polar lower stratosphere is applied to ozone observations from the Microwave Limb Sounder (MLS on NASA's Aura spacecraft. Quantification of ozone loss requires distinguishing transport- and chemically induced changes in ozone abundance. This is accomplished in the Match approach by examining cases where trajectories indicate that the same airmass has been observed on multiple occasions. The method was pioneered using ozone sonde observations, for which hundreds of matched ozone observations per winter are typically available. The dense coverage of the MLS measurements, particularly at polar latitudes, allows matches to be made to thousands of observations each day. This study is enabled by recently developed MLS Lagrangian Trajectory Diagnostic (LTD support products. Sensitivity studies indicate that the largest influence on the ozone loss estimates are the value of potential vorticity (PV used to define the edge of the polar vortex (within which matched observations must lie and the degree to which the PV of an airmass is allowed to vary between matched observations. Applying Match calculations to MLS observations of nitrous oxide, a long-lived tracer whose expected rate of change on these timescales is negligible, enables quantification of the impact of transport errors on the Match-based ozone loss estimates. Our loss estimates are generally in agreement with previous estimates for selected Arctic winters, though indicating smaller losses than many other studies. Arctic ozone losses are greatest during the 2010/11 winter, as seen in prior studies, with 2.0 ppmv (parts per million by volume loss estimated at 450 K potential temperature. As expected, Antarctic winter ozone losses are consistently greater than those for the Arctic, with less interannual variability (e.g., ranging between 2.3 and 3.0 ppmv at 450 K. This study exemplifies the insights into

  14. A Match-based approach to the estimation of polar stratospheric ozone loss using Aura Microwave Limb Sounder observations

    Science.gov (United States)

    Livesey, N. J.; Santee, M. L.; Manney, G. L.

    2015-09-01

    The well-established "Match" approach to quantifying chemical destruction of ozone in the polar lower stratosphere is applied to ozone observations from the Microwave Limb Sounder (MLS) on NASA's Aura spacecraft. Quantification of ozone loss requires distinguishing transport- and chemically induced changes in ozone abundance. This is accomplished in the Match approach by examining cases where trajectories indicate that the same air mass has been observed on multiple occasions. The method was pioneered using ozonesonde observations, for which hundreds of matched ozone observations per winter are typically available. The dense coverage of the MLS measurements, particularly at polar latitudes, allows matches to be made to thousands of observations each day. This study is enabled by recently developed MLS Lagrangian trajectory diagnostic (LTD) support products. Sensitivity studies indicate that the largest influence on the ozone loss estimates are the value of potential vorticity (PV) used to define the edge of the polar vortex (within which matched observations must lie) and the degree to which the PV of an air mass is allowed to vary between matched observations. Applying Match calculations to MLS observations of nitrous oxide, a long-lived tracer whose expected rate of change is negligible on the weekly to monthly timescales considered here, enables quantification of the impact of transport errors on the Match-based ozone loss estimates. Our loss estimates are generally in agreement with previous estimates for selected Arctic winters, though indicating smaller losses than many other studies. Arctic ozone losses are greatest during the 2010/11 winter, as seen in prior studies, with 2.0 ppmv (parts per million by volume) loss estimated at 450 K potential temperature (~ 18 km altitude). As expected, Antarctic winter ozone losses are consistently greater than those for the Arctic, with less interannual variability (e.g., ranging between 2.3 and 3.0 ppmv at 450 K). This

  15. A Match-based approach to the estimation of polar stratospheric ozone loss using Aura Microwave Limb Sounder observations

    Directory of Open Access Journals (Sweden)

    N. J. Livesey

    2015-09-01

    Full Text Available The well-established "Match" approach to quantifying chemical destruction of ozone in the polar lower stratosphere is applied to ozone observations from the Microwave Limb Sounder (MLS on NASA's Aura spacecraft. Quantification of ozone loss requires distinguishing transport- and chemically induced changes in ozone abundance. This is accomplished in the Match approach by examining cases where trajectories indicate that the same air mass has been observed on multiple occasions. The method was pioneered using ozonesonde observations, for which hundreds of matched ozone observations per winter are typically available. The dense coverage of the MLS measurements, particularly at polar latitudes, allows matches to be made to thousands of observations each day. This study is enabled by recently developed MLS Lagrangian trajectory diagnostic (LTD support products. Sensitivity studies indicate that the largest influence on the ozone loss estimates are the value of potential vorticity (PV used to define the edge of the polar vortex (within which matched observations must lie and the degree to which the PV of an air mass is allowed to vary between matched observations. Applying Match calculations to MLS observations of nitrous oxide, a long-lived tracer whose expected rate of change is negligible on the weekly to monthly timescales considered here, enables quantification of the impact of transport errors on the Match-based ozone loss estimates. Our loss estimates are generally in agreement with previous estimates for selected Arctic winters, though indicating smaller losses than many other studies. Arctic ozone losses are greatest during the 2010/11 winter, as seen in prior studies, with 2.0 ppmv (parts per million by volume loss estimated at 450 K potential temperature (~ 18 km altitude. As expected, Antarctic winter ozone losses are consistently greater than those for the Arctic, with less interannual variability (e.g., ranging between 2.3 and 3.0 ppmv at

  16. Combining Generalized Phase Contrast with matched filtering into a versatile beam shaping approach

    DEFF Research Database (Denmark)

    Glückstad, Jesper; Palima, Darwin

    2010-01-01

    We adapt concepts from matched filtering to propose a method for generating reconfigurable multiple beams. Combined with the Generalized Phase Contrast (GPC) technique, the proposed method coined mGPC can yield dynamically reconfigurable optical beam arrays with high light efficiency for optical ...... manipulation, high-speed sorting and other parallel spatial light applications [1].......We adapt concepts from matched filtering to propose a method for generating reconfigurable multiple beams. Combined with the Generalized Phase Contrast (GPC) technique, the proposed method coined mGPC can yield dynamically reconfigurable optical beam arrays with high light efficiency for optical...

  17. Genomic insights into ayurvedic and western approaches to personalized medicine

    Indian Academy of Sciences (India)

    Bhavana Prasher; Greg Gibson; Mitali Mukerji

    2016-03-01

    Ayurveda, an ancient Indian system of medicine documented and practised since 1500 B.C., follows a systems approach that has interesting parallels with contemporary personalized genomic medicine approaches to the understanding and management of health and disease. It is based on the trisutra, which are the three aspects of causes, features and therapeutics that are interconnected through a common organizing principle termed ‘tridosha’. Tridosha comprise three ascertainable physiological entities; vata (kinetic), pitta (metabolic) and kapha (potential) that are pervasive across systems, work in conjunction with each other, respond to the external environment and maintain homeostasis. Each individual is born with a specific proportion of tridosha that are not only genetically determined but also influenced by the environment during foetal development. Jointly they determine a person’s basic constitution, which is termed their ‘prakriti’. Development and progression of different diseases with their subtypes are thought to depend on the origin and mechanism of perturbation of the doshas, and the aim of therapeutic practice is to ensure that the doshas retain their homeostatic state. Similarly, western systems biology epitomized by translational P4 medicine envisages the integration of multiscalar genetic, cellular, physiological and environmental networks to predict phenotypic outcomes of perturbations. In this perspective article, we aim to outline the shape of a unifying scaffold that may allow the two intellectual traditions to enhance one another. Specifically, we illustrate how a unique integrative ‘Ayurgenomics’ approach can be used to integrate the trisutra concept of Ayurveda with genomics. We observe biochemical and molecular correlates of prakriti and show how these differ significantly in processes that are linked to intermediate patho-phenotypes, known to take different course in diseases. We also observe a significant enrichment of the highly

  18. Genomic insights into ayurvedic and western approaches to personalized medicine.

    Science.gov (United States)

    Prasher, Bhavana; Gibson, Greg; Mukerji, Mitali

    2016-03-01

    Ayurveda, an ancient Indian system of medicine documented and practised since 1500 B.C., follows a systems approach that has interesting parallels with contemporary personalized genomic medicine approaches to the understanding and management of health and disease. It is based on the trisutra, which are the three aspects of causes, features and therapeutics that are interconnected through a common organizing principle termed 'tridosha'. Tridosha comprise three ascertainable physiological entities; vata (kinetic), pitta (metabolic) and kapha (potential) that are pervasive across systems, work in conjunction with each other, respond to the external environment and maintain homeostasis. Each individual is born with a specific proportion of tridosha that are not only genetically determined but also influenced by the environment during foetal development. Jointly they determine a person's basic constitution, which is termed their 'prakriti'. Development and progressi on of different diseases with their subtypes are thought to depend on the origin and mechanism of perturbation of the doshas, and the aim of therapeutic practice is to ensure that the doshas retain their homeostatic state. Similarly, western systems biology epitomized by translational P4 medicine envisages the integration of multiscalar genetic, cellular, physiological and environmental networks to predict phenotypic outcomes of perturbations. In this perspective article, we aim to outline the shape of a unifying scaffold that may allow the two intellectual traditions to enhance one another. Specifically, we illustrate how a unique integrative 'Ayurgenomics' approach can be used to integrate the trisutra concept of Ayurveda with genomics. We observe biochemical and molecular correlates of prakriti and show how these differ significantly in processes that are linked to intermediate patho-phenotypes, known to take different course in diseases. We also observe a significant enr ichment of the highly connected

  19. Genomics, Physiology, and Molecular Breeding Approaches for Improving Salt Tolerance.

    Science.gov (United States)

    Ismail, Abdelbagi M; Horie, Tomoaki

    2017-02-22

    Salt stress reduces land and water productivity and contributes to poverty and food insecurity. Increased salinization caused by human practices and climate change is progressively reducing agriculture productivity despite escalating calls for more food. Plant responses to salt stress are fairly well understood, involving numerous critical processes that are each controlled by multiple genes. Knowledge of the critical mechanisms controlling salt uptake and exclusion from functioning tissues, signaling of salt stress, and the arsenal of protective metabolites is advancing. However, little progress has been made in developing salt-tolerant varieties of crop species using standard (but slow) breeding approaches. The genetic diversity available within cultivated crops and their wild relatives provides rich sources for trait and gene discovery that has yet to be sufficiently utilized. Transforming this knowledge into modern approaches using genomics and molecular tools for precision breeding will accelerate the development of tolerant cultivars and help sustain food production. Expected final online publication date for the Annual Review of Plant Biology Volume 68 is April 29, 2017. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

  20. Labor tax reform and equilibrium unemployment : a search and matching approach

    NARCIS (Netherlands)

    Heijdra, Ben J.; Ligthart, Jenny E.

    2004-01-01

    The paper studies simple strategies of labor tax reform in a search and matching model of the labor market featuring endogenous labor supply. Changing the composition of the tax wedge---that is, reducing a payroll tax and increasing a progressive wage tax such that the marginal tax wedge remains una

  1. Perfecting the Frankenstein Approach: Improved asymptotically matched initial data for non-spinning black hole binaries

    Science.gov (United States)

    Yunes, Nicolas; Tichy, Wolfgang

    2006-04-01

    The accuracy of gravitational wave templates produced by numerical simulations is partially determined by the initial data chosen. A promising method to construct accurate data employs asymptotic matching to construct an approximate global 4-metric. In this talk, we will apply this method to a binary system of non-spinning black holes and discuss improvements. A global metric can be constructed by asymptotically matching two tidally perturbed Schwarzschild metrics in isotropic coordinates valid near each hole to an ADMTT post-Newtonian metric valid far from them. As a result, adjacent metrics agree in the matching region up to uncontrolled remainders in the approximations. We build a smooth global 4-metric with transition functions, carefully constructed to avoid introducing errors larger than those in the approximations. The main improvement arises by using metrics in similar coordinates before performing the matching. This similarity leads to adjacent metrics that are similar even near the horizons, thus allowing for a smoother transition and constraint violations. We also construct a map that takes this metric to Kerr-Schild coordinates near each hole.

  2. A zero-one programming approach to Gulliksen's matched random subtests method

    NARCIS (Netherlands)

    van der Linden, Willem J.; Boekkooi-Timminga, Ellen

    1986-01-01

    In order to estimate the classical coefficient of test reliability, parallel measurements are needed. H. Gulliksen's matched random subtests method, which is a graphical method for splitting a test into parallel test halves, has practical relevance because it maximizes the alpha coefficient as a low

  3. A zero-one programming approach to Gulliksen's matched random subtests method

    NARCIS (Netherlands)

    Linden, van der Wim J.; Boekkooi-Timminga, Ellen

    1988-01-01

    Gulliksen’s matched random subtests method is a graphical method to split a test into parallel test halves. The method has practical relevance because it maximizes coefficient α as a lower bound to the classical test reliability coefficient. In this paper the same problem is formulated as a zero-one

  4. Pigeons ("Columba Livia") Approach Nash Equilibrium in Experimental Matching Pennies Competitions

    Science.gov (United States)

    Sanabria, Federico; Thrailkill, Eric

    2009-01-01

    The game of Matching Pennies (MP), a simplified version of the more popular Rock, Papers, Scissors, schematically represents competitions between organisms with incentives to predict each other's behavior. Optimal performance in iterated MP competitions involves the production of random choice patterns and the detection of nonrandomness in the…

  5. Labor Tax Reform and Equilibrium Unemployment : A Search and Matching Approach

    NARCIS (Netherlands)

    Heijdra, B.J.; Ligthart, J.E.

    2004-01-01

    The paper studies simple strategies of labor tax reform in a search and matching model of the labor market featuring endogenous labor supply.Changing the composition of the tax wedge|that is, reducing a payroll tax and increasing a progressive wage tax such that the marginal tax wedge remains unaffe

  6. The matching pursuit approach based on the modulated Gaussian pulse for efficient guided-wave damage inspection

    Science.gov (United States)

    Hong, Jin-Chul; Sun, Kyung Ho; Kim, Yoon Young

    2005-08-01

    The success of the guided-wave damage inspection technology depends not only on the generation and measurement of desired waveforms but also on the signal processing of the measured waves, but less attention has been paid to the latter. This research aims to develop an efficient signal processing technique especially suitable for the current guided-wave technology. To achieve this objective, the use of a two-stage matching pursuit approach based on the Gabor dictionary is proposed. Instead of truncated sine pulses commonly used in waveguide inspection, Gabor pulses, the modulated Gaussian pulses, are chosen as the elastic energy carrier to facilitate the matching pursuit algorithm. To extract meaningful waves out of noisy signals, a two-stage matching pursuit strategy is developed, which consists of the following: rough approximations with a set of predetermined parameters characterizing the Gabor pulse, and fine adjustments of the parameters by optimization. The parameters estimated from measured longitudinal elastic waves can be then directly used to assess not only the location but also the size of a crack in a rod. For the estimation of the crack size, in particular, Love's theory is incorporated in the matching pursuit analysis. Several experiments were conducted to verify the validity of the proposed approach in damage assessment.

  7. (Far) Outside the box: genomic approach to acute porphyria.

    Science.gov (United States)

    Thunell, S

    2006-01-01

    If I were living in Caucasus I would be writing fairy tales there Chekov, 1888 The question of the reasons for the extreme variation in morbidity among the gene carriers of acute porphyria and the great diversity of the precipitating factors are approached by the aid of a model of interacting genomic circuits. It is based on the current paradigm of the acute porphyric attack as a result of a toxic proximal overload of the enzyme-deficient heme-biosynthetic patway. Porphyrogenic influx of precursors is seen as a consequence of uncontrolled induction of its gate-keeping enzyme, ubiquitous 5-aminolevulinate synthase (ALAS1), due to attenuated post-translational control of the enzyme combined with activated gene transcription. Focus is directed on the genomic control of the master-regulator of ALAS1-transcription, the nuclear receptor pair constitutively active receptor (CAR) and pregnane xenobiotic receptor (PXR). On activation by their ligands, i.e. lipophilic drugs, solvents, alcohols, hormonal steroids and biocides, these DNA-binding proteins transform xenobiotic or steroid stimuli to coordinated activations of gene transcription-programs for ALAS1 and apo-cytochromes P450 (apo-CYPs), thus effecting the formation of xenobiotic-metabolizing cytochrome P450 enzymes. The potency of the CAR/PXR-transduction axis is enhanced by co-activators generated in at least four other genomic circuits, each triggered by different external and internal stimuli clinically experienced to be porphyrogenic, and each controlled by co-activating and co-repressing modulators. The expressions of the genes for CAR and PXR are thus augmented by binding glucocorticoid receptor (GR) activated by a steroid hormone, e.g, cortisol generated in fasting, infection or different forms of stress. The promotor regions of ALAS1 and apoCYPs contain binding sites for at least three co-activating transcription factors enhancing CAR/PXR transduction: i.e. the ligand-independent growth hormone (GH

  8. The active disturbance rejection control approach to stabilisation of coupled heat and ODE system subject to boundary control matched disturbance

    Science.gov (United States)

    Guo, Bao-Zhu; Liu, Jun-Jun; AL-Fhaid, A. S.; Younas, Arshad Mahmood M.; Asiri, Asim

    2015-08-01

    We consider stabilisation for a linear ordinary differential equation system with input dynamics governed by a heat equation, subject to boundary control matched disturbance. The active disturbance rejection control approach is applied to estimate, in real time, the disturbance with both constant high gain and time-varying high gain. The disturbance is cancelled in the feedback loop. The closed-loop systems with constant high gain and time-varying high gain are shown, respectively, to be practically stable and asymptotically stable.

  9. Offering a New Approach for Approximate Pattern Matching in Example-Based Machine Translation

    Directory of Open Access Journals (Sweden)

    Reza Akbari

    2015-01-01

    Full Text Available In this article, a new model is proposed in order to measure the degree of similarity between two sentences in machine translation based on example. The proposed model has applied genetic algorithm beside a new fitness function which is based on semantic load matching between the two sentences. Here, verbs are considered as the heart of a sentence because they are the main part of a sentence and carry the major part of the semantic load in the sentence; therefore more attention is paid to the verbs in the fitness function. It is noteworthy that the proposed model is largely dependent on the verbal part and the extracted synonyms from WordNet as well as the arrangement of words. The results are promising by precision and recall, indicating that the proposed method improves the quality of the retrieved matched sentences.

  10. Self-reported hand washing behaviors and foodborne illness: a propensity score matching approach.

    Science.gov (United States)

    Ali, Mir M; Verrill, Linda; Zhang, Yuanting

    2014-03-01

    Hand washing is a simple and effective but easily overlooked way to reduce cross-contamination and the transmission of foodborne pathogens. In this study, we used the propensity score matching methodology to account for potential selection bias to explore our hypothesis that always washing hands before food preparation tasks is associated with a reduction in the probability of reported foodborne illness. Propensity score matching can simulate random assignment to a condition so that pretreatment observable differences between a treatment group and a control group are homogenous on all the covariates except the treatment variable. Using the U.S. Food and Drug Administration's 2010 Food Safety Survey, we estimated the effect of self-reported hand washing behavior on the probability of self-reported foodborne illness. Our results indicate that reported washing of hands with soap always before food preparation leads to a reduction in the probability of reported foodborne illness.

  11. Output feedback model matching in linear impulsive systems with control feedthrough: a structural approach

    Science.gov (United States)

    Zattoni, Elena

    2017-01-01

    This paper investigates the problem of structural model matching by output feedback in linear impulsive systems with control feedthrough. Namely, given a linear impulsive plant, possibly featuring an algebraic link from the control input to the output, and given a linear impulsive model, the problem consists in finding a linear impulsive regulator that achieves exact matching between the respective forced responses of the linear impulsive plant and of the linear impulsive model, for all the admissible input functions and all the admissible sequences of jump times, by means of a dynamic feedback of the plant output. The problem solvability is characterized by a necessary and sufficient condition. The regulator synthesis is outlined through the proof of sufficiency, which is constructive.

  12. Different approaches to generate matching effects using arrays in contact with superconducting films.

    Science.gov (United States)

    del Valle, J.; Gomez, A.; Luis-Hita, J.; Rollano, V.; Gonzalez, E. M.; Vicent, J. L.

    2017-02-01

    Superconducting films in contact with non-superconducting regular arrays can exhibit commensurability effects between the vortex lattice and the unit cell of the pinning array. These matching effects yield a slowdown of the vortex flow and the corresponding dissipation decrease. The superconducting samples are Nb films grown on Si substrates. We have studied these matching effects with the array on top, embedded or threading the Nb superconducting films and using different materials (Si, Cu, Ni, Py dots and dots fabricated with Co/Pd multilayers). These hybrids allow for studying the contribution of different pinning potentials to the matching effects. The main findings are: (i) Periodic roughness induced in the superconducting film is enough to generate resistivity minima; (ii) A minor effect is achieved by magnetic pinning from periodic magnetic field potentials obtained by dots with out of plane magnetization grown on top of the superconducting film, (iii) In the case of array of magnetic dots embedded in the films, vortex flow probes the magnetic state; i.e. magnetoresistance measurements detect the magnetic state of very small nanomagnets. In addition, we have studied the role played by the local order in the commensurability effects. This was attained using an array that mimics a smectic crystal. We have found that preserving the local order is crucial. If the local order is not retained the magnetoresistance minima vanish.

  13. A Bioinformatics Approach for Detecting Repetitive Nested Motifs using Pattern Matching

    Science.gov (United States)

    Romero, José R.; Carballido, Jessica A.; Garbus, Ingrid; Echenique, Viviana C.; Ponzoni, Ignacio

    2016-01-01

    The identification of nested motifs in genomic sequences is a complex computational problem. The detection of these patterns is important to allow the discovery of transposable element (TE) insertions, incomplete reverse transcripts, deletions, and/or mutations. In this study, a de novo strategy for detecting patterns that represent nested motifs was designed based on exhaustive searches for pairs of motifs and combinatorial pattern analysis. These patterns can be grouped into three categories, motifs within other motifs, motifs flanked by other motifs, and motifs of large size. The methodology used in this study, applied to genomic sequences from the plant species Aegilops tauschii and Oryza sativa, revealed that it is possible to identify putative nested TEs by detecting these three types of patterns. The results were validated through BLAST alignments, which revealed the efficacy and usefulness of the new method, which is called Mamushka. PMID:27812277

  14. Critical social theory approach to disclosure of genomic incidental findings.

    Science.gov (United States)

    Bevan, Jeffrey L; Senn-Reeves, Julia N; Inventor, Ben R; Greiner, Shawna M; Mayer, Karen M; Rivard, Mary T; Hamilton, Rebekah J

    2012-11-01

    Technology has expanded genomic research and the complexity of extracted gene-related information. Health-related genomic incidental findings pose new dilemmas for nurse researchers regarding the ethical application of disclosure to participants. Consequently, informed consent specific to incidental findings is recommended. Critical Social Theory is used as a guide in recognition of the changing meaning of informed consent and to serve as a framework to inform nursing of the ethical application of disclosure consent in genomic nursing research practices.

  15. Emerging trends in genomic approaches for microbial bioprospecting.

    Science.gov (United States)

    Akondi, K B; Lakshmi, V V

    2013-02-01

    Microorganisms constitute two out of the three domains of life on earth. They exhibit vast biodiversity and metabolic versatility. This enables the microorganisms to inhabit and thrive in even the most extreme environmental conditions, making them all pervading. The magnitude of biodiversity observed among microorganisms substantially supersedes that exhibited by the eukaryotes. These characteristics make the microbial world a very lucrative and inexhaustible resource for prospecting novel bioactive molecules. Despite their vast potential, over 99% of the microbial world still remains to be explored. The primary reason for this is that the culture-dependent methods used in the laboratories are grossly insufficient, as they support the growth of under 1% of the microorganisms found in nature. This limitation necessitated the development of techniques to circumvent culture dependency and gain access to the outstanding majority of the microorganisms. The development of culture-independent techniques has essentially reshaped the study of microbial diversity and community dynamics. Application of genomic and metagenomic approaches is contributing substantially towards characterization of the real microbial diversity. The amenability of these techniques to high throughput has opened the doors to explore the vast number of "uncultivable" microbial forms in substantially lesser time. The present article provides an update on the recent technological advances and emerging trends in exploring microbial community.

  16. Online tuning of impedance matching circuit for long pulse inductively coupled plasma source operation--an alternate approach.

    Science.gov (United States)

    Sudhir, Dass; Bandyopadhyay, M; Kraus, W; Gahlaut, A; Bansal, G; Chakraborty, A

    2014-01-01

    Impedance matching circuit between radio frequency (RF) generator and the plasma load, placed between them, determines the RF power transfer from RF generator to the plasma load. The impedance of plasma load depends on the plasma parameters through skin depth and plasma conductivity or resistivity. Therefore, for long pulse operation of inductively coupled plasmas, particularly for high power (∼100 kW or more) where plasma load condition may vary due to different reasons (e.g., pressure, power, and thermal), online tuning of impedance matching circuit is necessary through feedback. In fusion grade ion source operation, such online methodology through feedback is not present but offline remote tuning by adjusting the matching circuit capacitors and tuning the driving frequency of the RF generator between the ion source operation pulses is envisaged. The present model is an approach for remote impedance tuning methodology for long pulse operation and corresponding online impedance matching algorithm based on RF coil antenna current measurement or coil antenna calorimetric measurement may be useful in this regard.

  17. Approaching the Sequential and Three-Dimensional Organization of Genomes

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2006-01-01

    textabstractGenomes are one of the major foundations of life due to their role in information storage, process regulation and evolution. To achieve a deeper unterstanding of the human genome the three-dimensional organization of the human cell nucleus, the structural-, scaling- and dynamic prope

  18. Matching Matters!

    CERN Document Server

    Freitas, Ayres; Plehn, Tilman

    2016-01-01

    Effective Lagrangians are a useful tool for a data-driven approach to physics beyond the Standard Model at the LHC. However, for the new physics scales accessible at the LHC, the effective operator expansion is only relatively slowly converging at best. For tree-level processes, it has been found that the agreement between the effective Lagrangian and a range of UV-complete models depends sensitively on the appropriate definition of the matching. We extend this analysis to the one-loop level, which is relevant for electroweak precision data and Higgs decay to photons. We show that near the scale of electroweak symmetry breaking the validity of the effective theory description can be systematically improved through an appropriate matching procedure. In particular, we find a significant increase in accuracy when including suitable terms suppressed by the Higgs vacuum expectation value in the matching.

  19. Genotyping-by-sequencing for Populus population genomics: an assessment of genome sampling patterns and filtering approaches.

    Directory of Open Access Journals (Sweden)

    Martin P Schilling

    Full Text Available Continuing advances in nucleotide sequencing technology are inspiring a suite of genomic approaches in studies of natural populations. Researchers are faced with data management and analytical scales that are increasing by orders of magnitude. With such dramatic advances comes a need to understand biases and error rates, which can be propagated and magnified in large-scale data acquisition and processing. Here we assess genomic sampling biases and the effects of various population-level data filtering strategies in a genotyping-by-sequencing (GBS protocol. We focus on data from two species of Populus, because this genus has a relatively small genome and is emerging as a target for population genomic studies. We estimate the proportions and patterns of genomic sampling by examining the Populus trichocarpa genome (Nisqually-1, and demonstrate a pronounced bias towards coding regions when using the methylation-sensitive ApeKI restriction enzyme in this species. Using population-level data from a closely related species (P. tremuloides, we also investigate various approaches for filtering GBS data to retain high-depth, informative SNPs that can be used for population genetic analyses. We find a data filter that includes the designation of ambiguous alleles resulted in metrics of population structure and Hardy-Weinberg equilibrium that were most consistent with previous studies of the same populations based on other genetic markers. Analyses of the filtered data (27,910 SNPs also resulted in patterns of heterozygosity and population structure similar to a previous study using microsatellites. Our application demonstrates that technically and analytically simple approaches can readily be developed for population genomics of natural populations.

  20. Random matrix approach to the distribution of genomic distance.

    Science.gov (United States)

    Alexeev, Nikita; Zograf, Peter

    2014-08-01

    The cycle graph introduced by Bafna and Pevzner is an important tool for evaluating the distance between two genomes, that is, the minimal number of rearrangements needed to transform one genome into another. We interpret this distance in topological terms and relate it to the random matrix theory. Namely, the number of genomes at a given 2-break distance from a fixed one (the Hultman number) is represented by a coefficient in the genus expansion of a matrix integral over the space of complex matrices with the Gaussian measure. We study generating functions for the Hultman numbers and prove that the two-break distance distribution is asymptotically normal.

  1. A Cost-Effective Approach to Sequence Hundreds of Complete Mitochondrial Genomes.

    Science.gov (United States)

    Nunez, Joaquin C B; Oleksiak, Marjorie F

    2016-01-01

    We present a cost-effective approach to sequence whole mitochondrial genomes for hundreds of individuals. Our approach uses small reaction volumes and unmodified (non-phosphorylated) barcoded adaptors to minimize reagent costs. We demonstrate our approach by sequencing 383 Fundulus sp. mitochondrial genomes (192 F. heteroclitus and 191 F. majalis). Prior to sequencing, we amplified the mitochondrial genomes using 4-5 custom-made, overlapping primer pairs, and sequencing was performed on an Illumina HiSeq 2500 platform. After removing low quality and short sequences, 2.9 million and 2.8 million reads were generated for F. heteroclitus and F. majalis respectively. Individual genomes were assembled for each species by mapping barcoded reads to a reference genome. For F. majalis, the reference genome was built de novo. On average, individual consensus sequences had high coverage: 61-fold for F. heteroclitus and 57-fold for F. majalis. The approach discussed in this paper is optimized for sequencing mitochondrial genomes on an Illumina platform. However, with the proper modifications, this approach could be easily applied to other small genomes and sequencing platforms.

  2. Meeting your match: How attractiveness similarity affects approach behavior in mixed-sex dyads

    NARCIS (Netherlands)

    Straaten, I. van; Engels, R.C.M.E.; Finkenauer, C.; Holland, R.W.

    2009-01-01

    This experimental study investigated approach behavior toward opposite-sex others of similar versus dissimilar physical attractiveness. Furthermore, it tested the moderating effects of sex. Single participants interacted with confederates of high and low attractiveness. Observers rated their behavio

  3. Meeting your match: How attractiveness similarity affects approach behavior in mixed-sex dyads

    NARCIS (Netherlands)

    Straaten, I. van; Engels, R.C.M.E.; Finkenauer, C.; Holland, R.W.

    2009-01-01

    This experimental study investigated approach behavior toward opposite-sex others of similar versus dissimilar physical attractiveness. Furthermore, it tested the moderating effects of sex. Single participants interacted with confederates of high and low attractiveness. Observers rated their

  4. Ethical considerations of research policy for personal genome analysis: the approach of the Genome Science Project in Japan.

    Science.gov (United States)

    Minari, Jusaku; Shirai, Tetsuya; Kato, Kazuto

    2014-12-01

    As evidenced by high-throughput sequencers, genomic technologies have recently undergone radical advances. These technologies enable comprehensive sequencing of personal genomes considerably more efficiently and less expensively than heretofore. These developments present a challenge to the conventional framework of biomedical ethics; under these changing circumstances, each research project has to develop a pragmatic research policy. Based on the experience with a new large-scale project-the Genome Science Project-this article presents a novel approach to conducting a specific policy for personal genome research in the Japanese context. In creating an original informed-consent form template for the project, we present a two-tiered process: making the draft of the template following an analysis of national and international policies; refining the draft template in conjunction with genome project researchers for practical application. Through practical use of the template, we have gained valuable experience in addressing challenges in the ethical review process, such as the importance of sharing details of the latest developments in genomics with members of research ethics committees. We discuss certain limitations of the conventional concept of informed consent and its governance system and suggest the potential of an alternative process using information technology.

  5. PRISM offers a comprehensive genomic approach to transcription factor function prediction

    KAUST Repository

    Wenger, A. M.

    2013-02-04

    The human genome encodes 1500-2000 different transcription factors (TFs). ChIP-seq is revealing the global binding profiles of a fraction of TFs in a fraction of their biological contexts. These data show that the majority of TFs bind directly next to a large number of context-relevant target genes, that most binding is distal, and that binding is context specific. Because of the effort and cost involved, ChIP-seq is seldom used in search of novel TF function. Such exploration is instead done using expression perturbation and genetic screens. Here we propose a comprehensive computational framework for transcription factor function prediction. We curate 332 high-quality nonredundant TF binding motifs that represent all major DNA binding domains, and improve cross-species conserved binding site prediction to obtain 3.3 million conserved, mostly distal, binding site predictions. We combine these with 2.4 million facts about all human and mouse gene functions, in a novel statistical framework, in search of enrichments of particular motifs next to groups of target genes of particular functions. Rigorous parameter tuning and a harsh null are used to minimize false positives. Our novel PRISM (predicting regulatory information from single motifs) approach obtains 2543 TF function predictions in a large variety of contexts, at a false discovery rate of 16%. The predictions are highly enriched for validated TF roles, and 45 of 67 (67%) tested binding site regions in five different contexts act as enhancers in functionally matched cells.

  6. Match filtering approach for signal acquisition in radio-pulsar navigation

    NARCIS (Netherlands)

    Heusdens, R.; Engelen, S.; Buist, P.J.; Noroozi, A.; Sundaramoorthy, P.P.; Verhoeven, C.J.M.; Bentum, M.; Gill, E.K.A.

    2012-01-01

    Pulsars with their periodic pulses and known positions are ideal beacons for navigation. The challenge, however, is the detection of the very weak pulsar signals that are submerged in noise. Radio based approaches allow the use of advanced techniques and methods for the detection and acquisition of

  7. Candidate genes, pathways and mechanisms for alcoholism: an expanded convergent functional genomics approach.

    Science.gov (United States)

    Rodd, Z A; Bertsch, B A; Strother, W N; Le-Niculescu, H; Balaraman, Y; Hayden, E; Jerome, R E; Lumeng, L; Nurnberger, J I; Edenberg, H J; McBride, W J; Niculescu, A B

    2007-08-01

    We describe a comprehensive translational approach for identifying candidate genes for alcoholism. The approach relies on the cross-matching of animal model brain gene expression data with human genetic linkage data, as well as human tissue data and biological roles data, an approach termed convergent functional genomics. An analysis of three animal model paradigms, based on inbred alcohol-preferring (iP) and alcohol-non-preferring (iNP) rats, and their response to treatments with alcohol, was used. A comprehensive analysis of microarray gene expression data from five key brain regions (frontal cortex, amygdala, caudate-putamen, nucleus accumbens and hippocampus) was carried out. The Bayesian-like integration of multiple independent lines of evidence, each by itself lacking sufficient discriminatory power, led to the identification of high probability candidate genes, pathways and mechanisms for alcoholism. These data reveal that alcohol has pleiotropic effects on multiple systems, which may explain the diverse neuropsychiatric and medical pathology in alcoholism. Some of the pathways identified suggest avenues for pharmacotherapy of alcoholism with existing agents, such as angiotensin-converting enzyme (ACE) inhibitors. Experiments we carried out in alcohol-preferring rats with an ACE inhibitor show a marked modulation of alcohol intake. Other pathways are new potential targets for drug development. The emergent overall picture is that physical and physiological robustness may permit alcohol-preferring individuals to withstand the aversive effects of alcohol. In conjunction with a higher reactivity to its rewarding effects, they may able to ingest enough of this nonspecific drug for a strong hedonic and addictive effect to occur.

  8. A Functional Genomic Approach to Chlorinated Ethenes Bioremediation

    Science.gov (United States)

    Lee, P. K.; Brodie, E. L.; MacBeth, T. W.; Deeb, R. A.; Sorenson, K. S.; Andersen, G. L.; Alvarez-Cohen, L.

    2007-12-01

    With the recent advances in genomic sciences, a knowledge-based approach can now be taken to optimize the bioremediation of trichloroethene (TCE). During the bioremediation of a heterogeneous subsurface, it is vital to identify and quantify the functionally important microorganisms present, characterize the microbial community and measure their physiological activity. In our field experiments, quantitative PCR (qPCR) was coupled with reverse-transcription (RT) to analyze both copy numbers and transcripts expressed by the 16S rRNA gene and three reductive dehalogenase (RDase) genes as biomarkers of Dehalococcoides spp. in the groundwater of a TCE-DNAPL site at Ft. Lewis (WA) that was serially subjected to biostimulation and bioaugmentation. Genes in the Dehalococcoides genus were targeted as they are the only known organisms that can completely dechlorinate TCE to the innocuous product ethene. Biomarker quantification revealed an overall increase of more than three orders of magnitude in the total Dehalococcoides population and quantification of the more liable and stringently regulated mRNAs confirmed that Dehalococcoides spp. were active. Parallel with our field experiments, laboratory studies were conducted to explore the physiology of Dehalococcoides isolates in order to develop relevant biomarkers that are indicative of the metabolic state of cells. Recently, we verified the function of the nitrogenase operon in Dehalococcoides sp. strain 195 and nitrogenase-encoding genes are ideal biomarker targets to assess cellular nitrogen requirement. To characterize the microbial community, we applied a high-density phylogenetic microarray (16S PhyloChip) that simultaneous monitors over 8,700 unique taxa to track the bacterial and archaeal populations through different phases of treatment. As a measure of species richness, 1,300 to 1,520 taxa were detected in groundwater samples extracted during different stages of treatment as well as in the bioaugmentation culture. We

  9. Mining a database of single amplified genomes from Red Sea brine pool extremophiles-improving reliability of gene function prediction using a profile and pattern matching algorithm (PPMA).

    KAUST Repository

    Grötzinger, Stefan W.

    2014-04-07

    Reliable functional annotation of genomic data is the key-step in the discovery of novel enzymes. Intrinsic sequencing data quality problems of single amplified genomes (SAGs) and poor homology of novel extremophile\\'s genomes pose significant challenges for the attribution of functions to the coding sequences identified. The anoxic deep-sea brine pools of the Red Sea are a promising source of novel enzymes with unique evolutionary adaptation. Sequencing data from Red Sea brine pool cultures and SAGs are annotated and stored in the Integrated Data Warehouse of Microbial Genomes (INDIGO) data warehouse. Low sequence homology of annotated genes (no similarity for 35% of these genes) may translate into false positives when searching for specific functions. The Profile and Pattern Matching (PPM) strategy described here was developed to eliminate false positive annotations of enzyme function before progressing to labor-intensive hyper-saline gene expression and characterization. It utilizes InterPro-derived Gene Ontology (GO)-terms (which represent enzyme function profiles) and annotated relevant PROSITE IDs (which are linked to an amino acid consensus pattern). The PPM algorithm was tested on 15 protein families, which were selected based on scientific and commercial potential. An initial list of 2577 enzyme commission (E.C.) numbers was translated into 171 GO-terms and 49 consensus patterns. A subset of INDIGO-sequences consisting of 58 SAGs from six different taxons of bacteria and archaea were selected from six different brine pool environments. Those SAGs code for 74,516 genes, which were independently scanned for the GO-terms (profile filter) and PROSITE IDs (pattern filter). Following stringent reliability filtering, the non-redundant hits (106 profile hits and 147 pattern hits) are classified as reliable, if at least two relevant descriptors (GO-terms and/or consensus patterns) are present. Scripts for annotation, as well as for the PPM algorithm, are available

  10. Mining a database of single amplified genomes from Red Sea brine pool extremophiles-improving reliability of gene function prediction using a profile and pattern matching algorithm (PPMA).

    Science.gov (United States)

    Grötzinger, Stefan W; Alam, Intikhab; Ba Alawi, Wail; Bajic, Vladimir B; Stingl, Ulrich; Eppinger, Jörg

    2014-01-01

    Reliable functional annotation of genomic data is the key-step in the discovery of novel enzymes. Intrinsic sequencing data quality problems of single amplified genomes (SAGs) and poor homology of novel extremophile's genomes pose significant challenges for the attribution of functions to the coding sequences identified. The anoxic deep-sea brine pools of the Red Sea are a promising source of novel enzymes with unique evolutionary adaptation. Sequencing data from Red Sea brine pool cultures and SAGs are annotated and stored in the Integrated Data Warehouse of Microbial Genomes (INDIGO) data warehouse. Low sequence homology of annotated genes (no similarity for 35% of these genes) may translate into false positives when searching for specific functions. The Profile and Pattern Matching (PPM) strategy described here was developed to eliminate false positive annotations of enzyme function before progressing to labor-intensive hyper-saline gene expression and characterization. It utilizes InterPro-derived Gene Ontology (GO)-terms (which represent enzyme function profiles) and annotated relevant PROSITE IDs (which are linked to an amino acid consensus pattern). The PPM algorithm was tested on 15 protein families, which were selected based on scientific and commercial potential. An initial list of 2577 enzyme commission (E.C.) numbers was translated into 171 GO-terms and 49 consensus patterns. A subset of INDIGO-sequences consisting of 58 SAGs from six different taxons of bacteria and archaea were selected from six different brine pool environments. Those SAGs code for 74,516 genes, which were independently scanned for the GO-terms (profile filter) and PROSITE IDs (pattern filter). Following stringent reliability filtering, the non-redundant hits (106 profile hits and 147 pattern hits) are classified as reliable, if at least two relevant descriptors (GO-terms and/or consensus patterns) are present. Scripts for annotation, as well as for the PPM algorithm, are available

  11. Appraisal, coping, emotion, and performance during elite fencing matches: a random coefficient regression model approach.

    Science.gov (United States)

    Doron, J; Martinent, G

    2016-06-23

    Understanding more about the stress process is important for the performance of athletes during stressful situations. Grounded in Lazarus's (1991, 1999, 2000) CMRT of emotion, this study tracked longitudinally the relationships between cognitive appraisal, coping, emotions, and performance in nine elite fencers across 14 international matches (representing 619 momentary assessments) using a naturalistic, video-assisted methodology. A series of hierarchical linear modeling analyses were conducted to: (a) explore the relationships between cognitive appraisals (challenge and threat), coping strategies (task- and disengagement oriented coping), emotions (positive and negative) and objective performance; (b) ascertain whether the relationship between appraisal and emotion was mediated by coping; and (c) examine whether the relationship between appraisal and objective performance was mediated by emotion and coping. The results of the random coefficient regression models showed: (a) positive relationships between challenge appraisal, task-oriented coping, positive emotions, and performance, as well as between threat appraisal, disengagement-oriented coping and negative emotions; (b) that disengagement-oriented coping partially mediated the relationship between threat and negative emotions, whereas task-oriented coping partially mediated the relationship between challenge and positive emotions; and (c) that disengagement-oriented coping mediated the relationship between threat and performance, whereas task-oriented coping and positive emotions partially mediated the relationship between challenge and performance. As a whole, this study furthered knowledge during sport performance situations of Lazarus's (1999) claim that these psychological constructs exist within a conceptual unit. Specifically, our findings indicated that the ways these constructs are inter-related influence objective performance within competitive settings.

  12. A knowledge based approach to matching human neurodegenerative disease and animal models

    Directory of Open Access Journals (Sweden)

    Maryann E Martone

    2013-05-01

    Full Text Available Neurodegenerative diseases present a wide and complex range of biological and clinical features. Animal models are key to translational research, yet typically only exhibit a subset of disease features rather than being precise replicas of the disease. Consequently, connecting animal to human conditions using direct data-mining strategies has proven challenging, particularly for diseases of the nervous system, with its complicated anatomy and physiology. To address this challenge we have explored the use of ontologies to create formal descriptions of structural phenotypes across scales that are machine processable and amenable to logical inference. As proof of concept, we built a Neurodegenerative Disease Phenotype Ontology and an associated Phenotype Knowledge Base using an entity-quality model that incorporates descriptions for both human disease phenotypes and those of animal models. Entities are drawn from community ontologies made available through the Neuroscience Information Framework and qualities are drawn from the Phenotype and Trait Ontology. We generated ~1200 structured phenotype statements describing structural alterations at the subcellular, cellular and gross anatomical levels observed in 11 human neurodegenerative conditions and associated animal models. PhenoSim, an open source tool for comparing phenotypes, was used to issue a series of competency questions to compare individual phenotypes among organisms and to determine which animal models recapitulate phenotypic aspects of the human disease in aggregate. Overall, the system was able to use relationships within the ontology to bridge phenotypes across scales, returning non-trivial matches based on common subsumers that were meaningful to a neuroscientist with an advanced knowledge of neuroanatomy. The system can be used both to compare individual phenotypes and also phenotypes in aggregate. This proof of concept suggests that expressing complex phenotypes using formal

  13. A knowledge based approach to matching human neurodegenerative disease and animal models

    Science.gov (United States)

    Maynard, Sarah M.; Mungall, Christopher J.; Lewis, Suzanna E.; Imam, Fahim T.; Martone, Maryann E.

    2013-01-01

    Neurodegenerative diseases present a wide and complex range of biological and clinical features. Animal models are key to translational research, yet typically only exhibit a subset of disease features rather than being precise replicas of the disease. Consequently, connecting animal to human conditions using direct data-mining strategies has proven challenging, particularly for diseases of the nervous system, with its complicated anatomy and physiology. To address this challenge we have explored the use of ontologies to create formal descriptions of structural phenotypes across scales that are machine processable and amenable to logical inference. As proof of concept, we built a Neurodegenerative Disease Phenotype Ontology (NDPO) and an associated Phenotype Knowledge Base (PKB) using an entity-quality model that incorporates descriptions for both human disease phenotypes and those of animal models. Entities are drawn from community ontologies made available through the Neuroscience Information Framework (NIF) and qualities are drawn from the Phenotype and Trait Ontology (PATO). We generated ~1200 structured phenotype statements describing structural alterations at the subcellular, cellular and gross anatomical levels observed in 11 human neurodegenerative conditions and associated animal models. PhenoSim, an open source tool for comparing phenotypes, was used to issue a series of competency questions to compare individual phenotypes among organisms and to determine which animal models recapitulate phenotypic aspects of the human disease in aggregate. Overall, the system was able to use relationships within the ontology to bridge phenotypes across scales, returning non-trivial matches based on common subsumers that were meaningful to a neuroscientist with an advanced knowledge of neuroanatomy. The system can be used both to compare individual phenotypes and also phenotypes in aggregate. This proof of concept suggests that expressing complex phenotypes using formal

  14. Genome-wide approaches to understanding human ageing

    Directory of Open Access Journals (Sweden)

    Kaeberlein Matt

    2006-06-01

    Full Text Available Abstract The use of genomic technologies in biogerontology has the potential to greatly enhance our understanding of human ageing. High-throughput screens for alleles correlated with survival in long-lived people have uncovered novel genes involved in age-associated disease. Genome-wide longevity studies in simple eukaryotes are identifying evolutionarily conserved pathways that determine longevity. It is hoped that validation of these 'public' aspects of ageing in mice, along with analyses of variation in candidate human ageing genes, will provide targets for future interventions to slow the ageing process and retard the onset of age-associated pathologies.

  15. An Alternative Methodological Approach for Cost-Effectiveness Analysis and Decision Making in Genomic Medicine.

    Science.gov (United States)

    Fragoulakis, Vasilios; Mitropoulou, Christina; van Schaik, Ron H; Maniadakis, Nikolaos; Patrinos, George P

    2016-05-01

    Genomic Medicine aims to improve therapeutic interventions and diagnostics, the quality of life of patients, but also to rationalize healthcare costs. To reach this goal, careful assessment and identification of evidence gaps for public health genomics priorities are required so that a more efficient healthcare environment is created. Here, we propose a public health genomics-driven approach to adjust the classical healthcare decision making process with an alternative methodological approach of cost-effectiveness analysis, which is particularly helpful for genomic medicine interventions. By combining classical cost-effectiveness analysis with budget constraints, social preferences, and patient ethics, we demonstrate the application of this model, the Genome Economics Model (GEM), based on a previously reported genome-guided intervention from a developing country environment. The model and the attendant rationale provide a practical guide by which all major healthcare stakeholders could ensure the sustainability of funding for genome-guided interventions, their adoption and coverage by health insurance funds, and prioritization of Genomic Medicine research, development, and innovation, given the restriction of budgets, particularly in developing countries and low-income healthcare settings in developed countries. The implications of the GEM for the policy makers interested in Genomic Medicine and new health technology and innovation assessment are also discussed.

  16. Genomic characterization of patient-derived xenograft models established from fine needle aspirate biopsies of a primary pancreatic ductal adenocarcinoma and from patient-matched metastatic sites

    OpenAIRE

    Allaway, Robert J.; Fischer, Dawn A.; de Abreu, Francine B.; Gardner, Timothy B.; Gordon, Stuart R.; Barth, Richard J.; Colacchio, Thomas A.; Wood, Matthew; Kacsoh, Balint Z.; Bouley, Stephanie J.; Cui, Jingxuan; Hamilton, Joanna; Choi, Jungbin A.; Lange, Joshua T.; Peterson, Jason D.

    2016-01-01

    N-of-1 trials target actionable mutations, yet such approaches do not test genomically-informed therapies in patient tumor models prior to patient treatment. To address this, we developed patient-derived xenograft (PDX) models from fine needle aspiration (FNA) biopsies (FNA-PDX) obtained from primary pancreatic ductal adenocarcinoma (PDAC) at the time of diagnosis. Here, we characterize PDX models established from one primary and two metastatic sites of one patient. We identified an activatin...

  17. Insect-Inspired Self-Motion Estimation with Dense Flow Fields--An Adaptive Matched Filter Approach.

    Science.gov (United States)

    Strübbe, Simon; Stürzl, Wolfgang; Egelhaaf, Martin

    2015-01-01

    The control of self-motion is a basic, but complex task for both technical and biological systems. Various algorithms have been proposed that allow the estimation of self-motion from the optic flow on the eyes. We show that two apparently very different approaches to solve this task, one technically and one biologically inspired, can be transformed into each other under certain conditions. One estimator of self-motion is based on a matched filter approach; it has been developed to describe the function of motion sensitive cells in the fly brain. The other estimator, the Koenderink and van Doorn (KvD) algorithm, was derived analytically with a technical background. If the distances to the objects in the environment can be assumed to be known, the two estimators are linear and equivalent, but are expressed in different mathematical forms. However, for most situations it is unrealistic to assume that the distances are known. Therefore, the depth structure of the environment needs to be determined in parallel to the self-motion parameters and leads to a non-linear problem. It is shown that the standard least mean square approach that is used by the KvD algorithm leads to a biased estimator. We derive a modification of this algorithm in order to remove the bias and demonstrate its improved performance by means of numerical simulations. For self-motion estimation it is beneficial to have a spherical visual field, similar to many flying insects. We show that in this case the representation of the depth structure of the environment derived from the optic flow can be simplified. Based on this result, we develop an adaptive matched filter approach for systems with a nearly spherical visual field. Then only eight parameters about the environment have to be memorized and updated during self-motion.

  18. Insect-Inspired Self-Motion Estimation with Dense Flow Fields--An Adaptive Matched Filter Approach.

    Directory of Open Access Journals (Sweden)

    Simon Strübbe

    Full Text Available The control of self-motion is a basic, but complex task for both technical and biological systems. Various algorithms have been proposed that allow the estimation of self-motion from the optic flow on the eyes. We show that two apparently very different approaches to solve this task, one technically and one biologically inspired, can be transformed into each other under certain conditions. One estimator of self-motion is based on a matched filter approach; it has been developed to describe the function of motion sensitive cells in the fly brain. The other estimator, the Koenderink and van Doorn (KvD algorithm, was derived analytically with a technical background. If the distances to the objects in the environment can be assumed to be known, the two estimators are linear and equivalent, but are expressed in different mathematical forms. However, for most situations it is unrealistic to assume that the distances are known. Therefore, the depth structure of the environment needs to be determined in parallel to the self-motion parameters and leads to a non-linear problem. It is shown that the standard least mean square approach that is used by the KvD algorithm leads to a biased estimator. We derive a modification of this algorithm in order to remove the bias and demonstrate its improved performance by means of numerical simulations. For self-motion estimation it is beneficial to have a spherical visual field, similar to many flying insects. We show that in this case the representation of the depth structure of the environment derived from the optic flow can be simplified. Based on this result, we develop an adaptive matched filter approach for systems with a nearly spherical visual field. Then only eight parameters about the environment have to be memorized and updated during self-motion.

  19. A hybrid approach for the automated finishing of bacterial genomes.

    Science.gov (United States)

    Bashir, Ali; Klammer, Aaron A; Robins, William P; Chin, Chen-Shan; Webster, Dale; Paxinos, Ellen; Hsu, David; Ashby, Meredith; Wang, Susana; Peluso, Paul; Sebra, Robert; Sorenson, Jon; Bullard, James; Yen, Jackie; Valdovino, Marie; Mollova, Emilia; Luong, Khai; Lin, Steven; LaMay, Brianna; Joshi, Amruta; Rowe, Lori; Frace, Michael; Tarr, Cheryl L; Turnsek, Maryann; Davis, Brigid M; Kasarskis, Andrew; Mekalanos, John J; Waldor, Matthew K; Schadt, Eric E

    2012-07-01

    Advances in DNA sequencing technology have improved our ability to characterize most genomic diversity. However, accurate resolution of large structural events is challenging because of the short read lengths of second-generation technologies. Third-generation sequencing technologies, which can yield longer multikilobase reads, have the potential to address limitations associated with genome assembly. Here we combine sequencing data from second- and third-generation DNA sequencing technologies to assemble the two-chromosome genome of a recent Haitian cholera outbreak strain into two nearly finished contigs at >99.9% accuracy. Complex regions with clinically relevant structure were completely resolved. In separate control assemblies on experimental and simulated data for the canonical N16961 cholera reference strain, we obtained 14 scaffolds of greater than 1 kb for the experimental data and 8 scaffolds of greater than 1 kb for the simulated data, which allowed us to correct several errors in contigs assembled from the short-read data alone. This work provides a blueprint for the next generation of rapid microbial identification and full-genome assembly.

  20. Single Step, a general approach for genomic selection

    DEFF Research Database (Denmark)

    Legarra, Andres; Christensen, Ole Fredslund; Aguilar, Ignacio;

    2014-01-01

    Genomic evaluation methods assume that the reference population is genotyped and phenotyped. This is most often false and the generation of pseudo-phenotypes is uncertain and inaccurate. However, markers obey transmission rules and therefore the covariances of marker genotypes across individuals ...

  1. Ecology of marine Bacteroidetes: a comparative genomics approach.

    Science.gov (United States)

    Fernández-Gómez, Beatriz; Richter, Michael; Schüler, Margarete; Pinhassi, Jarone; Acinas, Silvia G; González, José M; Pedrós-Alió, Carlos

    2013-05-01

    Bacteroidetes are commonly assumed to be specialized in degrading high molecular weight (HMW) compounds and to have a preference for growth attached to particles, surfaces or algal cells. The first sequenced genomes of marine Bacteroidetes seemed to confirm this assumption. Many more genomes have been sequenced recently. Here, a comparative analysis of marine Bacteroidetes genomes revealed a life strategy different from those of other important phyla of marine bacterioplankton such as Cyanobacteria and Proteobacteria. Bacteroidetes have many adaptations to grow attached to particles, have the capacity to degrade polymers, including a large number of peptidases, glycoside hydrolases (GHs), glycosyl transferases, adhesion proteins, as well as the genes for gliding motility. Several of the polymer degradation genes are located in close association with genes for TonB-dependent receptors and transducers, suggesting an integrated regulation of adhesion and degradation of polymers. This confirmed the role of this abundant group of marine bacteria as degraders of particulate matter. Marine Bacteroidetes had a significantly larger number of proteases than GHs, while non-marine Bacteroidetes had equal numbers of both. Proteorhodopsin containing Bacteroidetes shared two characteristics: small genome size and a higher number of genes involved in CO2 fixation per Mb. The latter may be important in order to survive when floating freely in the illuminated, but nutrient-poor, ocean surface.

  2. Accounting for Linkage Disequilibrium in genome scans for selection without individual genotypes: the local score approach.

    Science.gov (United States)

    Fariello, María Inés; Boitard, Simon; Mercier, Sabine; Robelin, David; Faraut, Thomas; Arnould, Cécile; Recoquillay, Julien; Bouchez, Olivier; Salin, Gérald; Dehais, Patrice; Gourichon, David; Leroux, Sophie; Pitel, Frédérique; Leterrier, Christine; SanCristobal, Magali

    2017-04-10

    Detecting genomic footprints of selection is an important step in the understanding of evolution. Accounting for linkage disequilibrium in genome scans increases detection power, but haplotype-based methods require individual genotypes and are not applicable on pool-sequenced samples. We propose to take advantage of the local score approach to account for linkage disequilibrium in genome scans for selection, cumulating (possibly small) signals from single markers over a genomic segment, to clearly pinpoint a selection signal. Using computer simulations, we demonstrate that this approach detects selection with higher power than several state-of-the-art single marker, windowing or haplotype-based approaches. We illustrate this on two benchmark data sets including individual genotypes, for which we obtain similar results with the local score and one haplotype-based approach. Finally, we apply the local score approach to Pool-Seq data obtained from a divergent selection experiment on behavior in quail, and obtain precise and biologically coherent selection signals: while competing methods fail to highlight any clear selection signature, our method detects several regions involving genes known to act on social responsiveness or autistic traits. Although we focus here on the detection of positive selection from multiple population data, the local score approach is general and can be applied to other genome scans for selection or other genome-wide analyses such as GWAS. This article is protected by copyright. All rights reserved.

  3. Truecluster matching

    CERN Document Server

    Oehlschlägel, Jens

    2007-01-01

    Cluster matching by permuting cluster labels is important in many clustering contexts such as cluster validation and cluster ensemble techniques. The classic approach is to minimize the euclidean distance between two cluster solutions which induces inappropriate stability in certain settings. Therefore, we present the truematch algorithm that introduces two improvements best explained in the crisp case. First, instead of maximizing the trace of the cluster crosstable, we propose to maximize a chi-square transformation of this crosstable. Thus, the trace will not be dominated by the cells with the largest counts but by the cells with the most non-random observations, taking into account the marginals. Second, we suggest a probabilistic component in order to break ties and to make the matching algorithm truly random on random data. The truematch algorithm is designed as a building block of the truecluster framework and scales in polynomial time. First simulation results confirm that the truematch algorithm give...

  4. Sequencing of the mitochondrial genome of the avocado lace bug Pseudacysta perseae (Heteroptera, Tingidae) using a genome skimming approach.

    Science.gov (United States)

    Kocher, Arthur; Guilbert, Éric; Lhuillier, Émeline; Murienne, Jerôme

    2015-03-01

    Lace bugs (Tingidae) are a family of phytophagous heteropterans, some of which are important agricultural and forestry pests. They currently comprise around 2500 species distributed worldwide, for which only one mitochondrial genome has been described so far. We sequenced the complete mitochondrial genome and the nuclear ribosomal gene segment of the avocado lace bug Pseudacysta perseae using a genome skimming approach on an Illumina Hiseq 2000 platform. Fifty-four additional heteropteran mitogenomes, including the one of the sycamore lace bug Corythucha ciliata, were retrieved to allow for comparisons and phylogenetic analyses. P. perseae mitochondrial genome was determined to be 15,850 bp long, and presented the typical organisation of insect mitogenomes. The phylogenetic analysis placed P. perseae as a sister to C. ciliata but did not confirm the monophyly of Miroidae including Tingidae. Our results contradicted widely accepted phylogenetic hypothesis, which highlights the limits of analyses based on mitochondrial data only. Shotgun sequencing approaches should provide substantial improvements in harmonizing mitochondrial and nuclear databases.

  5. Meeting your match: how attractiveness similarity affects approach behavior in mixed-sex dyads.

    Science.gov (United States)

    van Straaten, Ischa; Engels, Rutger C M E; Finkenauer, Catrin; Holland, Rob W

    2009-06-01

    This experimental study investigated approach behavior toward opposite-sex others of similar versus dissimilar physical attractiveness. Furthermore, it tested the moderating effects of sex. Single participants interacted with confederates of high and low attractiveness. Observers rated their behavior in terms of relational investment (i.e., behavioral efforts related to the improvement of interaction fluency, communication of positive interpersonal affect, and positive self-presentation). As expected, men displayed more relational investment behavior if their own physical attractiveness was similar to that of the confederate. For women, no effects of attractiveness similarity on relational investment behavior were found. Results are discussed in the light of positive assortative mating, preferences for physically attractive mates, and sex differences in attraction-related interpersonal behaviors.

  6. Matching diagnosis and management of diabetes in pregnancy to local priorities and resources: an international approach.

    Science.gov (United States)

    McIntyre, H David; Oats, Jeremy J N; Zeck, Willibald; Seshiah, V; Hod, Moshe

    2011-11-01

    The International Association of the Diabetes and Pregnancy Study Groups' (IADPSG) criteria for the diagnosis and classification of hyperglycemia in pregnancy are described and application of these in differing healthcare contexts on a worldwide basis is reported. Existing local protocols and known epidemiologic and clinical data regarding the detection and management of overt diabetes and gestational diabetes in the context of human pregnancy are considered. Although the IADPSG criteria are uniform, their introduction poses a variety of practical and technical challenges in differing healthcare contexts, both between and within countries. Knowledge of local factors will be vital in the implementation of the new guidelines and will require extensive liaison with local clinical and health policy groups. Resource availability will be critical in determining the type of treatment available in this context. The IADPSG criteria offer an important opportunity for a uniform approach to diabetes in pregnancy. Scaled implementation of these criteria adapted to a variety of local healthcare contexts should improve both research endeavors and patient care.

  7. Statistical Approaches in Genome-Wide Association Studies

    OpenAIRE

    Yazdani, Akram

    2014-01-01

    Genome-wide association studies, GWAS, typically contain hundreds of thousands single nucleotide polymorphisms, SNPs, genotyped for few numbers of samples. The aim of these studies is to identify regions harboring SNPs or to predict the outcomes of interest. Since the number of predictors in the GWAS far exceeds the number of samples, it is impossible to analyze the data with classical statistical methods. In the current GWAS, the widely applied methods are based on single marker analysis th...

  8. Functional genomic approach to the study of biodiversitywithin Trichoderma

    Institute of Scientific and Technical Information of China (English)

    Monte E; Hermosa M R; González F J; Rey M; Cardoza R E; Gutiérrez S; Delgado Jarana J; Llobell A

    2004-01-01

    @@ Trichoderma is a fungal genus of great and demonstrable biotechnological value, but its genome is poorly surveyed compared with other model microorganisms. Due to their ubiquity and rapid substrate colonization, Trichoderma species have been widely used as biocontrol organisms for agriculture, and their enzyme systems are widely used in industry. Therefore, there is a clear interest to explore beyond the phenotype to exploit the underlying genetic systems using functional genomics tools. The great diversity of species within the Trichoderma genus, the absence of optimized systems for its exploration, and the great variety of genes expressed under a wide range of ambient conditions are the main challenges to consider when starting a comprehensive functional genomics study. An initial project started by three Spanish groups has been extended into the project TRICHOEST, funded by the EU (FP5, QLRT-2001-02032) to target the transcriptome analysis of selected Trichoderma strains with biocontrol potential, in conditions related to antagonism, nutrient stress and plant interactions. Once specific conditions were defined, cDNA libraries were produced and used for EST sequencing. Nine strains from seven Trichoderma species have been considered in this study and an important amount of gene sequence data has been generated, analyzed and used to compare the gene expression in different strains.In parallel to sequencing, genomic expression studies were carried out by means of macro-arrays to identify genes expressed in specific conditions. In silico analysis of DNA sequencing data together with macro-array expression results have lead to a selection based on the potential use of the gene sequences.The selected clone sequences were completed and cloned in appropriate vectors to initiate functional analysis by means of expression studies in homologous and heterologous systems.

  9. Genomic sister-disorders of neurodevelopment: an evolutionary approach.

    Science.gov (United States)

    Crespi, Bernard; Summers, Kyle; Dorus, Steve

    2009-02-01

    Genomic sister-disorders are defined here as diseases mediated by duplications versus deletions of the same region. Such disorders can provide unique information concerning the genomic underpinnings of human neurodevelopment because effects of diametric variation in gene copy number on cognitive and behavioral phenotypes can be inferred. We describe evidence from the literature on deletions versus duplications for the regions underlying the best-known human neurogenetic sister-disorders, including Williams syndrome, Velocardiofacial syndrome, and Smith-Magenis syndrome, as well as the X-chromosomal conditions Klinefelter and Turner syndromes. These data suggest that diametric copy-number alterations can, like diametric alterations to imprinted genes, generate contrasting phenotypes associated with autistic-spectrum and psychotic-spectrum conditions. Genomically based perturbations to the development of the human social brain are thus apparently mediated to a notable degree by effects of variation in gene copy number. We also conducted the first analyses of positive selection for genes in the regions affected by these disorders. We found evidence consistent with adaptive evolution of protein-coding genes, or selective sweeps, for three of the four sets of sister-syndromes analyzed. These studies of selection facilitate identification of candidate genes for the phenotypes observed and lend a novel evolutionary dimension to the analysis of human cognitive architecture and neurogenetic disorders.

  10. Mining a database of single amplified genomes from Red Sea brine pool extremophiles – Improving reliability of gene function prediction using a profile and pattern matching algorithm (PPMA

    Directory of Open Access Journals (Sweden)

    Stefan Wolfgang Grötzinger

    2014-04-01

    Full Text Available Reliable functional annotation of genomic data is the key-step in the discovery of novel enzymes. Intrinsic sequencing data quality problems of single amplified genomes (SAGs and poor homology of novel extremophile’s genomes pose significant challenges for the attribution of functions to the coding sequences identified. The anoxic deep-sea brine pools of the Red Sea are a promising source of novel enzymes with unique evolutionary adaptation. Sequencing data from Red Sea brine pool cultures and SAGs are annotated and stored in the INDIGO data warehouse. Low sequence homology of annotated genes (no similarity for 35% of these genes may translate into false positives when searching for specific functions. The Profile & Pattern Matching (PPM strategy described here was developed to eliminate false positive annotations of enzyme function before progressing to labor-intensive hyper-saline gene expression and characterization. It utilizes InterPro-derived Gene Ontology (GO-terms (which represent enzyme function profiles and annotated relevant PROSITE IDs (which are linked to an amino acid consensus pattern. The PPM algorithm was tested on 15 protein families, which were selected based on scientific and commercial potential. An initial list of 2,577 E.C. numbers was translated into 171 GO-terms and 49 consensus patterns. A subset of INDIGO-sequences consisting of 58 SAGs from six different taxons of bacteria and archaea were selected from 6 different brine pool environments. Those SAGs code for 74,516 genes, which were independently scanned for the GO-terms (profile filter and PROSITE IDs (pattern filter. Following stringent reliability filtering, the non-redundant hits (106 profile hits and 147 pattern hits are classified as reliable, if at least two relevant descriptors (GO-terms and/or consensus patterns are present. Scripts for annotation, as well as for the PPM algorithm, are available through the INDIGO website.

  11. Pattern matching

    NARCIS (Netherlands)

    A. Hak (Tony); J. Dul (Jan)

    2009-01-01

    textabstractPattern matching is comparing two patterns in order to determine whether they match (i.e., that they are the same) or do not match (i.e., that they differ). Pattern matching is the core procedure of theory-testing with cases. Testing consists of matching an “observed pattern” (a pattern

  12. Integrating Genomic Data Sets for Knowledge Discovery: An Informed Approach to Management of Captive Endangered Species.

    Science.gov (United States)

    Irizarry, Kristopher J L; Bryant, Doug; Kalish, Jordan; Eng, Curtis; Schmidt, Peggy L; Barrett, Gini; Barr, Margaret C

    2016-01-01

    Many endangered captive populations exhibit reduced genetic diversity resulting in health issues that impact reproductive fitness and quality of life. Numerous cost effective genomic sequencing and genotyping technologies provide unparalleled opportunity for incorporating genomics knowledge in management of endangered species. Genomic data, such as sequence data, transcriptome data, and genotyping data, provide critical information about a captive population that, when leveraged correctly, can be utilized to maximize population genetic variation while simultaneously reducing unintended introduction or propagation of undesirable phenotypes. Current approaches aimed at managing endangered captive populations utilize species survival plans (SSPs) that rely upon mean kinship estimates to maximize genetic diversity while simultaneously avoiding artificial selection in the breeding program. However, as genomic resources increase for each endangered species, the potential knowledge available for management also increases. Unlike model organisms in which considerable scientific resources are used to experimentally validate genotype-phenotype relationships, endangered species typically lack the necessary sample sizes and economic resources required for such studies. Even so, in the absence of experimentally verified genetic discoveries, genomics data still provides value. In fact, bioinformatics and comparative genomics approaches offer mechanisms for translating these raw genomics data sets into integrated knowledge that enable an informed approach to endangered species management.

  13. A detection of the integrated Sachs-Wolfe imprint of cosmic superstructures using a matched-filter approach

    CERN Document Server

    Nadathur, Seshadri

    2016-01-01

    We present a new method for detection of the integrated Sachs-Wolfe (ISW) imprints of cosmic superstructures on the cosmic microwave background, based on a matched filtering approach. The expected signal-to-noise ratio for this method is comparable to that obtained from the full cross-correlation, and unlike other stacked filtering techniques it is not subject to an a posteriori bias. We apply this method to Planck CMB data using voids and superclusters identified in the CMASS galaxy data from the Sloan Digital Sky Survey Data Release 12, and measure the ISW amplitude to be $A_\\mathrm{ISW}=1.64\\pm0.53$ relative to the $\\Lambda$CDM expectation, corresponding to a $3.1\\sigma$ detection. In contrast to some previous measurements of the ISW effect of superstructures, our result is in agreement with the $\\Lambda$CDM model.

  14. A Detection of the Integrated Sachs–Wolfe Imprint of Cosmic Superstructures Using a Matched-filter Approach

    Science.gov (United States)

    Nadathur, Seshadri; Crittenden, Robert

    2016-10-01

    We present a new method for detection of the integrated Sachs–Wolfe (ISW) imprints of cosmic superstructures on the cosmic microwave background (CMB), based on a matched-filtering approach. The expected signal-to-noise ratio for this method is comparable to that obtained from the full cross-correlation, and unlike other stacked filtering techniques it is not subject to an a posteriori bias. We apply this method to Planck CMB data using voids and superclusters identified in the CMASS galaxy data from the Sloan Digital Sky Survey Data Release 12, and measure the ISW amplitude to be {A}{ISW}=1.64+/- 0.53 relative to the ΛCDM expectation, corresponding to a 3.1σ detection. In contrast to some previous measurements of the ISW effect of superstructures, our result is in agreement with the ΛCDM model.

  15. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast

    Science.gov (United States)

    Oud, Bart; Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-01-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095

  16. Genomic instability in pancreatic adenocarcinoma: a new step towards precision medicine and novel therapeutic approaches.

    Science.gov (United States)

    Sahin, Ibrahim H; Lowery, Maeve A; Stadler, Zsofia K; Salo-Mullen, Erin; Iacobuzio-Donahue, Christine A; Kelsen, David P; O'Reilly, Eileen M

    2016-08-01

    Pancreatic cancer is one of the most challenging cancers. Whole genome sequencing studies have been conducted to elucidate the underlying fundamentals underscoring disease behavior. Studies have identified a subgroup of pancreatic cancer patients with distinct molecular and clinical features. Genetic fingerprinting of these tumors is consistent with an unstable genome and defective DNA repair pathways, which creates unique susceptibility to agents inducing DNA damage. BRCA1/2 mutations, both germline and somatic, which lead to impaired DNA repair, are found to be important biomarkers of genomic instability as well as of response to DNA damaging agents. Recent studies have elucidated that PARP inhibitors and platinum agents may be effective to induce tumor regression in solid tumors bearing an unstable genome including pancreatic cancer. In this review we discuss the characteristics of genomic instability in pancreatic cancer along with its clinical implications and the utility of DNA targeting agents particularly PARP inhibitors as a novel treatment approach.

  17. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast.

    Science.gov (United States)

    Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-03-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages.

  18. Crowdfunding the Azolla fern genome project: a grassroots approach.

    Science.gov (United States)

    Li, Fay-Wei; Pryer, Kathleen M

    2014-01-01

    Much of science progresses within the tight boundaries of what is often seen as a "black box". Though familiar to funding agencies, researchers and the academic journals they publish in, it is an entity that outsiders rarely get to peek into. Crowdfunding is a novel means that allows the public to participate in, as well as to support and witness advancements in science. Here we describe our recent crowdfunding efforts to sequence the Azolla genome, a little fern with massive green potential. Crowdfunding is a worthy platform not only for obtaining seed money for exploratory research, but also for engaging directly with the general public as a rewarding form of outreach.

  19. Implications of structural genomics target selection strategies: Pfam5000, whole genome, and random approaches

    Energy Technology Data Exchange (ETDEWEB)

    Chandonia, John-Marc; Brenner, Steven E.

    2004-07-14

    The structural genomics project is an international effort to determine the three-dimensional shapes of all important biological macromolecules, with a primary focus on proteins. Target proteins should be selected according to a strategy which is medically and biologically relevant, of good value, and tractable. As an option to consider, we present the Pfam5000 strategy, which involves selecting the 5000 most important families from the Pfam database as sources for targets. We compare the Pfam5000 strategy to several other proposed strategies that would require similar numbers of targets. These include including complete solution of several small to moderately sized bacterial proteomes, partial coverage of the human proteome, and random selection of approximately 5000 targets from sequenced genomes. We measure the impact that successful implementation of these strategies would have upon structural interpretation of the proteins in Swiss-Prot, TrEMBL, and 131 complete proteomes (including 10 of eukaryotes) from the Proteome Analysis database at EBI. Solving the structures of proteins from the 5000 largest Pfam families would allow accurate fold assignment for approximately 68 percent of all prokaryotic proteins (covering 59 percent of residues) and 61 percent of eukaryotic proteins (40 percent of residues). More fine-grained coverage which would allow accurate modeling of these proteins would require an order of magnitude more targets. The Pfam5000 strategy may be modified in several ways, for example to focus on larger families, bacterial sequences, or eukaryotic sequences; as long as secondary consideration is given to large families within Pfam, coverage results vary only slightly. In contrast, focusing structural genomics on a single tractable genome would have only a limited impact in structural knowledge of other proteomes: a significant fraction (about 30-40 percent of the proteins, and 40-60 percent of the residues) of each proteome is classified in small

  20. Flexible approaches for teaching computational genomics in a health information management program.

    Science.gov (United States)

    Zhou, Leming; Watzlaf, Valerie; Abdelhak, Mervat

    2013-01-01

    The astonishing improvement of high-throughput biotechnologies in recent years makes it possible to access a huge amount of genomic data. The association between genomic data and genetic disease has already been and will continue to be applied to personalized healthcare. Health information management (HIM) professionals are the ones who will handle personal genetic information and provide solid evidence to support physicians' diagnoses and personalized treatment strategies, and therefore they will need to have the knowledge and skills to process genomic data. In this paper, we describe flexible approaches for teaching a computational genomics course in the HIM program at the University of Pittsburgh. HIM programs at other universities may choose an appropriate approach to fit into their own curriculum.

  1. A New Approach to Dissect Nuclear Organization: TALE-Mediated Genome Visualization (TGV).

    Science.gov (United States)

    Miyanari, Yusuke

    2016-01-01

    Spatiotemporal organization of chromatin within the nucleus has so far remained elusive. Live visualization of nuclear remodeling could be a promising approach to understand its functional relevance in genome functions and mechanisms regulating genome architecture. Recent technological advances in live imaging of chromosomes begun to explore the biological roles of the movement of the chromatin within the nucleus. Here I describe a new technique, called TALE-mediated genome visualization (TGV), which allows us to visualize endogenous repetitive sequence including centromeric, pericentromeric, and telomeric repeats in living cells.

  2. Matching Through Position Auctions

    OpenAIRE

    Terence Johnson

    2009-01-01

    This paper studies how an intermediary should design two-sided matching markets when agents are privately informed about their quality as a partner and can make payments to the intermediary. Using a mechanism design approach, I derive sufficient conditions for assortative matching to be profit- or welfare-maximizing, and then show how to implement the optimal match and payments through two-sided position auctions. This sharpens our understanding of intermediated matching markets by clarifying...

  3. Statistical Approaches Accomodating Uncertainty in Modern Genomic Data

    DEFF Research Database (Denmark)

    Skotte, Line

    Due to recent technological advances the research fields of human genetics are poised as never before to provide valuable insights on the molecular basis of disease. The technological advances has made it possible to genotype hundreds of thousands known genetic variants, re-sequence entire genomes...... the contributed method applicable to case-control studies as well as mapping of quantitative traits. The contributed method provides a needed association test for quantitative traits in the presence of uncertain genotypes and it further allows correction for population structure in association tests for disease...... imbalanced allelic transcription from RNA sequencing data. The method differs from previous methods in that is accounts for the well-known inherent over-dispersion in re-sequencing data and that it combines information across individuals to form a population-based measure of allelic imbalance. Our...

  4. Statistical Approaches Accomodating Uncertainty in Modern Genomic Data

    DEFF Research Database (Denmark)

    Skotte, Line

    to discover new variants and perform detailed profiling of an individual's entire transcriptome. However, uncertainties pervades every level of modern genome-wide data analyses and timely development of statistical methods that accommodates these uncertainties are therefore necessary to fully exploit...... the potential of the technological advances. The first of the four papers included in this thesis describes a new method for association mapping that accommodates uncertain genotypes from low-coverage re-sequencing data. The method allows uncertain genotypes using a score statistic based on the joint likelihood...... states and quantitative traits in the presence of uncertain genotypes, neither were possible prior to the development of the method. Our simulations show that the contributed method have higher statistical power than methods based on genotypes inferred from the sequencing data. The second paper presents...

  5. Whole genome sequencing: an efficient approach to ensuring food safety

    Science.gov (United States)

    Lakicevic, B.; Nastasijevic, I.; Dimitrijevic, M.

    2017-09-01

    Whole genome sequencing is an effective, powerful tool that can be applied to a wide range of public health and food safety applications. A major difference between WGS and the traditional typing techniques is that WGS allows all genes to be included in the analysis, instead of a well-defined subset of genes or variable intergenic regions. Also, the use of WGS can facilitate the understanding of contamination/colonization routes of foodborne pathogens within the food production environment, and can also afford efficient tracking of pathogens’ entry routes and distribution from farm-to-consumer. Tracking foodborne pathogens in the food processing-distribution-retail-consumer continuum is of the utmost importance for facilitation of outbreak investigations and rapid action in controlling/preventing foodborne outbreaks. Therefore, WGS likely will replace most of the numerous workflows used in public health laboratories to characterize foodborne pathogens into one consolidated, efficient workflow.

  6. Pattern matching

    OpenAIRE

    Hak, Tony; Dul, Jan

    2009-01-01

    textabstractPattern matching is comparing two patterns in order to determine whether they match (i.e., that they are the same) or do not match (i.e., that they differ). Pattern matching is the core procedure of theory-testing with cases. Testing consists of matching an “observed pattern” (a pattern of measured values) with an “expected pattern” (a hypothesis), and deciding whether these patterns match (resulting in a confirmation of the hypothesis) or do not match (resulting in a disconfirmat...

  7. CRISPR/Cas9: A Practical Approach in Date Palm Genome Editing

    Directory of Open Access Journals (Sweden)

    Muhammad N. Sattar

    2017-08-01

    Full Text Available The genetic modifications through breeding of crop plants have long been used to improve the yield and quality. However, precise genome editing (GE could be a very useful supplementary tool for improvement of crop plants by targeted genome modifications. Various GE techniques including ZFNs (zinc finger nucleases, TALENs (transcription activator-like effector nucleases, and most recently clustered regularly interspaced short palindromic repeats (CRISPR/Cas9 (CRISPR-associated protein 9-based approaches have been successfully employed for various crop plants including fruit trees. CRISPR/Cas9-based approaches hold great potential in GE due to their simplicity, competency, and versatility over other GE techniques. However, to the best of our knowledge no such genetic improvement has ever been developed in date palm—an important fruit crop in Oasis agriculture. The applications of CRISPR/Cas9 can be a challenging task in date palm GE due to its large and complex genome, high rate of heterozygosity and outcrossing, in vitro regeneration and screening of mutants, high frequency of single-nucleotide polymorphism in the genome and ultimately genetic instability. In this review, we addressed the potential application of CRISPR/Cas9-based approaches in date palm GE to improve the sustainable date palm production. The availability of the date palm whole genome sequence has made it feasible to use CRISPR/Cas9 GE approach for genetic improvement in this species. Moreover, the future prospects of GE application in date palm are also addressed in this review.

  8. Integrating landscape genomics and spatially explicit approaches to detect loci under selection in clinal populations.

    Science.gov (United States)

    Jones, Matthew R; Forester, Brenna R; Teufel, Ashley I; Adams, Rachael V; Anstett, Daniel N; Goodrich, Betsy A; Landguth, Erin L; Joost, Stéphane; Manel, Stéphanie

    2013-12-01

    Uncovering the genetic basis of adaptation hinges on the ability to detect loci under selection. However, population genomics outlier approaches to detect selected loci may be inappropriate for clinal populations or those with unclear population structure because they require that individuals be clustered into populations. An alternate approach, landscape genomics, uses individual-based approaches to detect loci under selection and reveal potential environmental drivers of selection. We tested four landscape genomics methods on a simulated clinal population to determine their effectiveness at identifying a locus under varying selection strengths along an environmental gradient. We found all methods produced very low type I error rates across all selection strengths, but elevated type II error rates under "weak" selection. We then applied these methods to an AFLP genome scan of an alpine plant, Campanula barbata, and identified five highly supported candidate loci associated with precipitation variables. These loci also showed spatial autocorrelation and cline patterns indicative of selection along a precipitation gradient. Our results suggest that landscape genomics in combination with other spatial analyses provides a powerful approach for identifying loci potentially under selection and explaining spatially complex interactions between species and their environment.

  9. Layers of epistasis: genome-wide regulatory networks and network approaches to genome-wide association studies

    Science.gov (United States)

    Cowper-Sal·lari, Richard; Cole, Michael D.; Karagas, Margaret R.; Lupien, Mathieu; Moore, Jason H.

    2010-01-01

    The conceptual foundation of the genome-wide association study (GWAS) has advanced unchecked since its conception. A revision might seem premature as the potential of GWAS has not been fully realized. Multiple technical and practical limitations need to be overcome before GWAS can be fairly criticized. But with the completion of hundreds of studies and a deeper understanding of the genetic architecture of disease, warnings are being raised. The results compiled to date indicate that risk-associated variants lie predominantly in non-coding regions of the genome. Additionally, alternative methodologies are uncovering large and heterogeneous sets of rare variants underlying disease. The fear is that, even in its fulfilment, the current GWAS paradigm might be incapable of dissecting all kinds of phenotypes. In the following text we review several initiatives that aim to overcome these limitations. The overarching theme of these studies is the inclusion of biological knowledge to both the analysis and interpretation of genotyping data. GWAS is uninformed of biology by design and although there is some virtue in its simplicity it is also its most conspicuous deficiency. We propose a framework in which to integrate these novel approaches, both empirical and theoretical, in the form of a genome-wide regulatory network (GWRN). By processing experimental data into networks, emerging data types based on chromatin-immunoprecipitation are made computationally tractable. This will give GWAS re-analysis efforts the most current and relevant substrates, and root them firmly on our knowledge of human disease. PMID:21197657

  10. A Fast Hybrid Algorithm Approach for the Exact String Matching Problem Via Berry Ravindran and Alpha Skip Search Algorithms

    Directory of Open Access Journals (Sweden)

    A. A. Almazroi

    2011-01-01

    Full Text Available Problem statement: String matching algorithm had been an essential means for searching biological sequence database. With the constant expansion in scientific data such as DNA and Protein; the development of enhanced algorithms have even become more critical as the major concern had always been how to raise the performances of these search algorithms to meet challenges of scientific information. Approach: Therefore a new hybrid algorithm comprising Berry Ravindran (BR and Alpha Skip Search (ASS is presented. The concept is based on BR shift function and combines with ASS to ensure improved performance. Results: The results obtained in percentages from the proposed hybrid algorithm displayed superior results in terms of number of attempts and number of character comparisons than the original algorithms when various types of data namely DNA, Protein and English text are applied to appraise the hybrid performances. The enhancement of the proposed hybrid algorithm performs better at 71%, 60% and 63% when compared to Berry-Ravindran in DNA, Protein and English text correspondingly. Moreover the rate of enhancement over Alpha Skip Search algorithm in DNA, Protein and English text are 48%, 28% and 36% respectively. Conclusion: The new proposed hybrid algorithm is relevant for searching biological science sequence database and also other string search systems.

  11. The Role of Serotype Interactions and Seasonality in Dengue Model Selection and Control: Insights from a Pattern Matching Approach.

    Science.gov (United States)

    Ten Bosch, Quirine A; Singh, Brajendra K; Hassan, Muhammad R A; Chadee, Dave D; Michael, Edwin

    2016-05-01

    The epidemiology of dengue fever is characterized by highly seasonal, multi-annual fluctuations, and the irregular circulation of its four serotypes. It is believed that this behaviour arises from the interplay between environmental drivers and serotype interactions. The exact mechanism, however, is uncertain. Constraining mathematical models to patterns characteristic to dengue epidemiology offers a means for detecting such mechanisms. Here, we used a pattern-oriented modelling (POM) strategy to fit and assess a range of dengue models, driven by combinations of temporary cross protective-immunity, cross-enhancement, and seasonal forcing, on their ability to capture the main characteristics of dengue dynamics. We show that all proposed models reproduce the observed dengue patterns across some part of the parameter space. Which model best supports the dengue dynamics is determined by the level of seasonal forcing. Further, when tertiary and quaternary infections are allowed, the inclusion of temporary cross-immunity alone is strongly supported, but the addition of cross-enhancement markedly reduces the parameter range at which dengue dynamics are produced, irrespective of the strength of seasonal forcing. The implication of these structural uncertainties on predicted vulnerability to control is also discussed. With ever expanding spread of dengue, greater understanding of dengue dynamics and control efforts (e.g. a near-future vaccine introduction) has become critically important. This study highlights the capacity of multi-level pattern-matching modelling approaches to offer an analytic tool for deeper insights into dengue epidemiology and control.

  12. Two heuristic approaches to describe periodicities in genomic microarrays

    Directory of Open Access Journals (Sweden)

    Jörg Aßmus

    2009-09-01

    Full Text Available In the first part we discuss the filtering of panels of time series based on singular value decomposition. The discussion is based on an approach where this filtering is used to normalize microarray data. We point out effects on the periodicity and phases for time series panels. In the second part we investigate time dependent periodic panels with different phases. We align the time series in the panel and discuss the periodogram of the aligned time series with the purpose of describing the periodic structure of the panel. The method is quite powerful assuming known phases in the model, but it deteriorates rapidly for noisy data.  

  13. [MATCHE: Management Approach to Teaching Consumer and Homemaking Education.] Consumer Approach Strand: Textiles and Clothing. Module I-D-1: Consumer Approach to Textiles and Clothing.

    Science.gov (United States)

    California State Univ., Fresno. Dept. of Home Economics.

    This competency-based preservice home economics teacher education module on consumer approach to textiles and clothing is the first in a set of four modules on consumer education related to textiles and clothing. (This set is part of a larger series of sixty-seven modules on the Management Approach to Teaching Consumer and Homemaking Education…

  14. A physical approach to segregation and folding of the Caulobacter crescentus genome.

    Science.gov (United States)

    Dame, Remus T; Tark-Dame, Mariliis; Schiessel, Helmut

    2011-12-01

    Bacterial genomes are functionally organized. This organization is dynamic and globally changing throughout the cell cycle. Upon initiation of replication of the chromosome, the two origins segregate and move towards their new location taking along the newly replicated genome. Caulobacter crescentus employs a dedicated active partitioning (Par) system to move one copy of the parS centromere to the distal pole, while the other stays at the stalked pole. In this issue of Molecular Microbiology, Hong and McAdams describe studies on the speed of segregation of parS and regions up to 150 kb away. They show clear differences in segregation rates between parS and 50 kb flanking regions versus regions further away. To assess segregation rates the authors track fluorescent markers during movement using time-lapse microscopy. The relation between genomic and physical distance of pairs of markers reflects how the genome is folded. This relation permits testing experimental data against models from polymer physics. Such models are helpful in understanding principles of genome folding. Although long used in studies on eukaryotes, this approach has rarely been applied to bacteria. Finally, the authors give the first direct evidence for a role of the bacterial chromatin protein HU in folding the genome in vivo.

  15. Identifying contamination with advanced visualization and analysis practices: metagenomic approaches for eukaryotic genome assemblies

    Directory of Open Access Journals (Sweden)

    Tom O. Delmont

    2016-03-01

    Full Text Available High-throughput sequencing provides a fast and cost-effective mean to recover genomes of organisms from all domains of life. However, adequate curation of the assembly results against potential contamination of non-target organisms requires advanced bioinformatics approaches and practices. Here, we re-analyzed the sequencing data generated for the tardigrade Hypsibius dujardini, and created a holistic display of the eukaryotic genome assembly using DNA data originating from two groups and eleven sequencing libraries. By using bacterial single-copy genes, k-mer frequencies, and coverage values of scaffolds we could identify and characterize multiple near-complete bacterial genomes from the raw assembly, and curate a 182 Mbp draft genome for H. dujardini supported by RNA-Seq data. Our results indicate that most contaminant scaffolds were assembled from Moleculo long-read libraries, and most of these contaminants have differed between library preparations. Our re-analysis shows that visualization and curation of eukaryotic genome assemblies can benefit from tools designed to address the needs of today’s microbiologists, who are constantly challenged by the difficulties associated with the identification of distinct microbial genomes in complex environmental metagenomes.

  16. Identifying contamination with advanced visualization and analysis practices: metagenomic approaches for eukaryotic genome assemblies

    Science.gov (United States)

    Delmont, Tom O.

    2016-01-01

    High-throughput sequencing provides a fast and cost-effective mean to recover genomes of organisms from all domains of life. However, adequate curation of the assembly results against potential contamination of non-target organisms requires advanced bioinformatics approaches and practices. Here, we re-analyzed the sequencing data generated for the tardigrade Hypsibius dujardini, and created a holistic display of the eukaryotic genome assembly using DNA data originating from two groups and eleven sequencing libraries. By using bacterial single-copy genes, k-mer frequencies, and coverage values of scaffolds we could identify and characterize multiple near-complete bacterial genomes from the raw assembly, and curate a 182 Mbp draft genome for H. dujardini supported by RNA-Seq data. Our results indicate that most contaminant scaffolds were assembled from Moleculo long-read libraries, and most of these contaminants have differed between library preparations. Our re-analysis shows that visualization and curation of eukaryotic genome assemblies can benefit from tools designed to address the needs of today’s microbiologists, who are constantly challenged by the difficulties associated with the identification of distinct microbial genomes in complex environmental metagenomes. PMID:27069789

  17. Identifying contamination with advanced visualization and analysis practices: metagenomic approaches for eukaryotic genome assemblies.

    Science.gov (United States)

    Delmont, Tom O; Eren, A Murat

    2016-01-01

    High-throughput sequencing provides a fast and cost-effective mean to recover genomes of organisms from all domains of life. However, adequate curation of the assembly results against potential contamination of non-target organisms requires advanced bioinformatics approaches and practices. Here, we re-analyzed the sequencing data generated for the tardigrade Hypsibius dujardini, and created a holistic display of the eukaryotic genome assembly using DNA data originating from two groups and eleven sequencing libraries. By using bacterial single-copy genes, k-mer frequencies, and coverage values of scaffolds we could identify and characterize multiple near-complete bacterial genomes from the raw assembly, and curate a 182 Mbp draft genome for H. dujardini supported by RNA-Seq data. Our results indicate that most contaminant scaffolds were assembled from Moleculo long-read libraries, and most of these contaminants have differed between library preparations. Our re-analysis shows that visualization and curation of eukaryotic genome assemblies can benefit from tools designed to address the needs of today's microbiologists, who are constantly challenged by the difficulties associated with the identification of distinct microbial genomes in complex environmental metagenomes.

  18. Integrated genomics and molecular breeding approaches for dissecting the complex quantitative traits in crop plants.

    Science.gov (United States)

    Kujur, Alice; Saxena, Maneesha S; Bajaj, Deepak; Laxmi; Parida, Swarup K

    2013-12-01

    The enormous population growth, climate change and global warming are now considered major threats to agriculture and world's food security. To improve the productivity and sustainability of agriculture, the development of highyielding and durable abiotic and biotic stress-tolerant cultivars and/climate resilient crops is essential. Henceforth, understanding the molecular mechanism and dissection of complex quantitative yield and stress tolerance traits is the prime objective in current agricultural biotechnology research. In recent years, tremendous progress has been made in plant genomics and molecular breeding research pertaining to conventional and next-generation whole genome, transcriptome and epigenome sequencing efforts, generation of huge genomic, transcriptomic and epigenomic resources and development of modern genomics-assisted breeding approaches in diverse crop genotypes with contrasting yield and abiotic stress tolerance traits. Unfortunately, the detailed molecular mechanism and gene regulatory networks controlling such complex quantitative traits is not yet well understood in crop plants. Therefore, we propose an integrated strategies involving available enormous and diverse traditional and modern -omics (structural, functional, comparative and epigenomics) approaches/resources and genomics-assisted breeding methods which agricultural biotechnologist can adopt/utilize to dissect and decode the molecular and gene regulatory networks involved in the complex quantitative yield and stress tolerance traits in crop plants. This would provide clues and much needed inputs for rapid selection of novel functionally relevant molecular tags regulating such complex traits to expedite traditional and modern marker-assisted genetic enhancement studies in target crop species for developing high-yielding stress-tolerant varieties.

  19. BG7: A New Approach for Bacterial Genome Annotation Designed for Next Generation Sequencing Data

    Science.gov (United States)

    Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Pareja, Eduardo; Tobes, Raquel

    2012-01-01

    BG7 is a new system for de novo bacterial, archaeal and viral genome annotation based on a new approach specifically designed for annotating genomes sequenced with next generation sequencing technologies. The system is versatile and able to annotate genes even in the step of preliminary assembly of the genome. It is especially efficient detecting unexpected genes horizontally acquired from bacterial or archaeal distant genomes, phages, plasmids, and mobile elements. From the initial phases of the gene annotation process, BG7 exploits the massive availability of annotated protein sequences in databases. BG7 predicts ORFs and infers their function based on protein similarity with a wide set of reference proteins, integrating ORF prediction and functional annotation phases in just one step. BG7 is especially tolerant to sequencing errors in start and stop codons, to frameshifts, and to assembly or scaffolding errors. The system is also tolerant to the high level of gene fragmentation which is frequently found in not fully assembled genomes. BG7 current version – which is developed in Java, takes advantage of Amazon Web Services (AWS) cloud computing features, but it can also be run locally in any operating system. BG7 is a fast, automated and scalable system that can cope with the challenge of analyzing the huge amount of genomes that are being sequenced with NGS technologies. Its capabilities and efficiency were demonstrated in the 2011 EHEC Germany outbreak in which BG7 was used to get the first annotations right the next day after the first entero-hemorrhagic E. coli genome sequences were made publicly available. The suitability of BG7 for genome annotation has been proved for Illumina, 454, Ion Torrent, and PacBio sequencing technologies. Besides, thanks to its plasticity, our system could be very easily adapted to work with new technologies in the future. PMID:23185310

  20. Safety evaluation of continuous green T intersections: A propensity scores-genetic matching-potential outcomes approach.

    Science.gov (United States)

    Wood, Jonathan; Donnell, Eric T

    2016-08-01

    The continuous green T intersection is characterized by a channelized left-turn movement from the minor street approach onto the major street, along with a continuous through movement on the major street. The continuous flow through movement is not controlled by the three-phase traffic signal that is used to separate all other movements at the intersection. Rather, the continuous through movement typically has a green through arrow indicator to inform drivers that they do not have to stop. Past research has consistently shown that there are operational and environmental benefits to implementing this intersection form at three-leg locations, when compared to a conventional signalized intersection. These benefits include reduced delay, fuel consumption, and emissions. The safety effects of the conventional green T intersection are less clear. Past research has been limited to small sample sizes, or utilized only statistical comparisons reported crashes to evaluate the safety performance relative to similar intersection types. The present study overcomes past safety research evaluations by using a propensity scores-potential outcomes framework, with genetic matching, to compare the safety performance of the continuous green T to conventional signalized intersections, using treatment and comparison site data from Florida and South Carolina. The results show that the expected total, fatal and injury, and target crash (rear-end, angle, and sideswipe) frequencies are lower at the continuous green T intersection relative to the conventional signalized intersection (CMFs of 0.958 [95% CI=0.772-1.189], 0.846 [95% CI=0.651-1.099], and 0.920 [95% CI=0.714-1.185], respectively).

  1. SEX-RELATED FUNCTIONAL ASYMMETRY OF THE AMGDALA: PRELIMINARY EVIDENCE USING A CASE-MATCHED LESION APPROACH

    Science.gov (United States)

    Tranel, Daniel; Bechara, Antoine

    2008-01-01

    We have reported previously that there appears to be an intriguing sex-related functional asymmetry of the prefrontal cortices, especially the ventromedial sector, in regard to social conduct, emotional processing, and decision-making, whereby the right-sided sector is important in men but not women and the left-sided sector is important in women but not men. The amygdala is another structure that has been widely implicated in emotion processing and social decision-making, and the question arises as to whether the amygdala, in a manner akin to what has been observed for the prefrontal cortex, might have sex-related functional asymmetry in regard to social and emotional functions. A preliminary test of this question was carried out in the current study, where we used a case-matched lesion approach and contrasted a pair of men cases and a pair of women cases, where in each pair one patient had left amygdala damage and the other had right amygdala damage. We investigated the domains of social conduct, emotional processing and personality, and decision-making. The results provide support for the notion that there is sex-related functional asymmetry of the amygdala in regard to these functions— in the men pair, the patient with right-sided amygdala damage was impaired in these functions, and the patient with left-sided amygdala damage was not, whereas in the women pair, the opposite pattern obtained, with the left-sided woman being impaired and the right-sided woman being unimpaired. These data provide preliminary support for the notion that sex-related functional asymmetry of the amygdala may entail functions such as social conduct, emotional processing, and decision-making, a finding that in turn could reflect (as either a cause or effect) differences in the manner in which men and women apprehend, process, and execute emotion-related information. PMID:19308794

  2. [MATCHE: Management Approach to Teaching Consumer and Homemaking Education.] Occupational Strand: Textiles and Clothing. Module II-D-2: Assembly Line Garment Construction.

    Science.gov (United States)

    Henry, Nina

    This competency-based preservice home economics teacher education module on assembly line garment construction is the second in a set of three modules on occupational aspects of textiles and clothing. (This set is part of a larger series of sixty-seven modules on the Management Approach to Teaching Consumer and Homemaking Education [MATCHE]--see…

  3. Resolving the question of trypanosome monophyly: a comparative genomics approach using whole genome data sets with low taxon sampling.

    Science.gov (United States)

    Leonard, Guy; Soanes, Darren M; Stevens, Jamie R

    2011-07-01

    Since the first attempts to classify the evolutionary history of trypanosomes, there have been conflicting reports regarding their true phylogenetic relationships and, in particular, their relationships with other vertebrate trypanosomatids, e.g. Leishmania sp., as well as with the many insect parasitising trypanosomatids. Perhaps the issue that has provided most debate is that concerning the monophyly (or otherwise) of genus Trypanosoma and, even with the advent of molecular methods, the findings of numerous studies have varied significantly depending on the gene sequences analysed, number of taxa included, choice of outgroup and phylogenetic methodology. While of arguably limited applied importance, resolution of the question as to whether or not trypanosomes are monophyletic is critical to accurate evaluation of competing, mutually exclusive evolutionary scenarios for these parasites, namely the 'vertebrate-first' or 'insect-first' hypotheses. Therefore, a new approach, which could overcome previous limitations was needed. At its most simple, the problem can be defined within the framework of a trifurcated tree with three hypothetical positions at which the root can be placed. Using BLASTp and whole-genome gene-by-gene phylogenetic analyses of Trypanosoma brucei, Trypanosoma cruzi, Leishmania major and Naegleria gruberi, we have identified 599 gene markers--putative homologues--that were shared between the genomes of these four taxa. Of these, 75 homologous gene families that demonstrate monophyly of the kinetoplastids were identified. We then used these data sets in combination with an additional outgroup, Euglena gracilis, coupled with large-scale gene concatenation and diverse phylogenetic techniques to investigate the relative branching order of T. brucei, T. cruzi and L. major. Our findings confirm the monophyly of genus Trypanosoma and demonstrate that <1% of the analysed gene markers shared between the genomes of T. brucei, T. cruzi and L. major reject

  4. Germ line genome editing in clinics: the approaches, objectives and global society.

    Science.gov (United States)

    Ishii, Tetsuya

    2017-01-01

    Genome editing allows for the versatile genetic modification of somatic cells, germ cells and embryos. In particular, CRISPR/Cas9 is worldwide used in biomedical research. Although the first report on Cas9-mediated gene modification in human embryos focused on the prevention of a genetic disease in offspring, it raised profound ethical and social concerns over the safety of subsequent generations and the potential misuse of genome editing for human enhancement. The present article considers germ line genome editing approaches from various clinical and ethical viewpoints and explores its objectives. The risks and benefits of the following three likely objectives are assessed: the prevention of monogenic diseases, personalized assisted reproductive technology (ART) and genetic enhancement. Although genetic enhancement should be avoided, the international regulatory landscape suggests the inevitability of this misuse at ART centers. Under these circumstances, possible regulatory responses and the potential roles of public dialogue are discussed. © The Author 2015. Published by Oxford University Press.

  5. Regulatory hurdles for genome editing: process- vs. product-based approaches in different regulatory contexts.

    Science.gov (United States)

    Sprink, Thorben; Eriksson, Dennis; Schiemann, Joachim; Hartung, Frank

    2016-07-01

    Novel plant genome editing techniques call for an updated legislation regulating the use of plants produced by genetic engineering or genome editing, especially in the European Union. Established more than 25 years ago and based on a clear distinction between transgenic and conventionally bred plants, the current EU Directives fail to accommodate the new continuum between genetic engineering and conventional breeding. Despite the fact that the Directive 2001/18/EC contains both process- and product-related terms, it is commonly interpreted as a strictly process-based legislation. In view of several new emerging techniques which are closer to the conventional breeding than common genetic engineering, we argue that it should be actually interpreted more in relation to the resulting product. A legal guidance on how to define plants produced by exploring novel genome editing techniques in relation to the decade-old legislation is urgently needed, as private companies and public researchers are waiting impatiently with products and projects in the pipeline. We here outline the process in the EU to develop a legislation that properly matches the scientific progress. As the process is facing several hurdles, we also compare with existing frameworks in other countries and discuss ideas for an alternative regulatory system.

  6. A pipeline for automated annotation of yeast genome sequences by a conserved-synteny approach

    Directory of Open Access Journals (Sweden)

    Proux-Wéra Estelle

    2012-09-01

    Full Text Available Abstract Background Yeasts are a model system for exploring eukaryotic genome evolution. Next-generation sequencing technologies are poised to vastly increase the number of yeast genome sequences, both from resequencing projects (population studies and from de novo sequencing projects (new species. However, the annotation of genomes presents a major bottleneck for de novo projects, because it still relies on a process that is largely manual. Results Here we present the Yeast Genome Annotation Pipeline (YGAP, an automated system designed specifically for new yeast genome sequences lacking transcriptome data. YGAP does automatic de novo annotation, exploiting homology and synteny information from other yeast species stored in the Yeast Gene Order Browser (YGOB database. The basic premises underlying YGAP's approach are that data from other species already tells us what genes we should expect to find in any particular genomic region and that we should also expect that orthologous genes are likely to have similar intron/exon structures. Additionally, it is able to detect probable frameshift sequencing errors and can propose corrections for them. YGAP searches intelligently for introns, and detects tRNA genes and Ty-like elements. Conclusions In tests on Saccharomyces cerevisiae and on the genomes of Naumovozyma castellii and Tetrapisispora blattae newly sequenced with Roche-454 technology, YGAP outperformed another popular annotation program (AUGUSTUS. For S. cerevisiae and N. castellii, 91-93% of YGAP's predicted gene structures were identical to those in previous manually curated gene sets. YGAP has been implemented as a webserver with a user-friendly interface at http://wolfe.gen.tcd.ie/annotation.

  7. RegPredict: an integrated system for regulon inference in prokaryotes by comparative genomics approach

    Energy Technology Data Exchange (ETDEWEB)

    Novichkov, Pavel S.; Rodionov, Dmitry A.; Stavrovskaya, Elena D.; Novichkova, Elena S.; Kazakov, Alexey E.; Gelfand, Mikhail S.; Arkin, Adam P.; Mironov, Andrey A.; Dubchak, Inna

    2010-05-26

    RegPredict web server is designed to provide comparative genomics tools for reconstruction and analysis of microbial regulons using comparative genomics approach. The server allows the user to rapidly generate reference sets of regulons and regulatory motif profiles in a group of prokaryotic genomes. The new concept of a cluster of co-regulated orthologous operons allows the user to distribute the analysis of large regulons and to perform the comparative analysis of multiple clusters independently. Two major workflows currently implemented in RegPredict are: (i) regulon reconstruction for a known regulatory motif and (ii) ab initio inference of a novel regulon using several scenarios for the generation of starting gene sets. RegPredict provides a comprehensive collection of manually curated positional weight matrices of regulatory motifs. It is based on genomic sequences, ortholog and operon predictions from the MicrobesOnline. An interactive web interface of RegPredict integrates and presents diverse genomic and functional information about the candidate regulon members from several web resources. RegPredict is freely accessible at http://regpredict.lbl.gov.

  8. Evaluation of somatic genomic imbalances in thyroid carcinomas of follicular origin by CGH-based approaches.

    Science.gov (United States)

    Baldan, Federica; Mio, Catia; Allegri, Lorenzo; Passon, Nadia; Lepore, Saverio M; Russo, Diego; Damante, Giuseppe

    2017-09-07

    Application of distinct technologies of cancer genome analysis has provided important information for the molecular characterization of several human neoplasia, including follicular cell-derived thyroid carcinoma. Among them, comparative genomic hybridization (CGH)-based procedures have been extensively applied to evaluate genomic imbalances present in these tumours, obtaining data leading to an increase in the understanding of their complexity and diversity. In this review, after a brief overview of the most commonly used CGH-based technichs, we will describe the major results deriving from the most influential studies in the literature which used this approach to investigate the genomic aberrations of thyroid cancer cells. In most studies a small number of patients have been analyzed. Deletions and duplications at different chromosomal regions were detected in all investigated cohorts. A higher number of genomic imbalances has been detected in anaplastic or poorly differentiated thyroid carcinomas compared to well differentiated ones. Limitations in the interpretation of the results, as well the potential impact in the clinical practice are discussed. Though a quite heterogeneous picture arises from results so far available, CGH array, combined with other methodologies as well as an accurate clinical management, may offer novel opportunities for a better stratification of thyroid cancer patients.

  9. Genome Investigations of Vector Competence in Aedes aegypti to Inform Novel Arbovirus Disease Control Approaches

    Directory of Open Access Journals (Sweden)

    David W. Severson

    2016-10-01

    Full Text Available Dengue (DENV, yellow fever, chikungunya, and Zika virus transmission to humans by a mosquito host is confounded by both intrinsic and extrinsic variables. Besides virulence factors of the individual arboviruses, likelihood of virus transmission is subject to variability in the genome of the primary mosquito vector, Aedes aegypti. The “vectorial capacity” of A. aegypti varies depending upon its density, biting rate, and survival rate, as well as its intrinsic ability to acquire, host and transmit a given arbovirus. This intrinsic ability is known as “vector competence”. Based on whole transcriptome analysis, several genes and pathways have been predicated to have an association with a susceptible or refractory response in A. aegypti to DENV infection. However, the functional genomics of vector competence of A. aegypti is not well understood, primarily due to lack of integrative approaches in genomic or transcriptomic studies. In this review, we focus on the present status of genomics studies of DENV vector competence in A. aegypti as limited information is available relative to the other arboviruses. We propose future areas of research needed to facilitate the integration of vector and virus genomics and environmental factors to work towards better understanding of vector competence and vectorial capacity in natural conditions.

  10. Genome Investigations of Vector Competence in Aedes aegypti to Inform Novel Arbovirus Disease Control Approaches.

    Science.gov (United States)

    Severson, David W; Behura, Susanta K

    2016-10-30

    Dengue (DENV), yellow fever, chikungunya, and Zika virus transmission to humans by a mosquito host is confounded by both intrinsic and extrinsic variables. Besides virulence factors of the individual arboviruses, likelihood of virus transmission is subject to variability in the genome of the primary mosquito vector, Aedes aegypti. The "vectorial capacity" of A. aegypti varies depending upon its density, biting rate, and survival rate, as well as its intrinsic ability to acquire, host and transmit a given arbovirus. This intrinsic ability is known as "vector competence". Based on whole transcriptome analysis, several genes and pathways have been predicated to have an association with a susceptible or refractory response in A. aegypti to DENV infection. However, the functional genomics of vector competence of A. aegypti is not well understood, primarily due to lack of integrative approaches in genomic or transcriptomic studies. In this review, we focus on the present status of genomics studies of DENV vector competence in A. aegypti as limited information is available relative to the other arboviruses. We propose future areas of research needed to facilitate the integration of vector and virus genomics and environmental factors to work towards better understanding of vector competence and vectorial capacity in natural conditions.

  11. Signature-Discovery Approach for Sample Matching of a Nerve-Agent Precursor using Liquid Chromatography–Mass Spectrometry, XCMS, and Chemometrics

    Energy Technology Data Exchange (ETDEWEB)

    Fraga, Carlos G.; Clowers, Brian H.; Moore, Ronald J.; Zink, Erika M.

    2010-05-15

    This report demonstrates the use of bioinformatic and chemometric tools on liquid chromatography mass spectrometry (LC-MS) data for the discovery of ultra-trace forensic signatures for sample matching of various stocks of the nerve-agent precursor known as methylphosphonic dichloride (dichlor). The use of the bioinformatic tool known as XCMS was used to comprehensively search and find candidate LC-MS peaks in a known set of dichlor samples. These candidate peaks were down selected to a group of 34 impurity peaks. Hierarchal cluster analysis and factor analysis demonstrated the potential of these 34 impurities peaks for matching samples based on their stock source. Only one pair of dichlor stocks was not differentiated from one another. An acceptable chemometric approach for sample matching was determined to be variance scaling and signal averaging of normalized duplicate impurity profiles prior to classification by k-nearest neighbors. Using this approach, a test set of dichlor samples were all correctly matched to their source stock. The sample preparation and LC-MS method permitted the detection of dichlor impurities presumably in the parts-per-trillion (w/w). The detection of a common impurity in all dichlor stocks that were synthesized over a 14-year period and by different manufacturers was an unexpected discovery. Our described signature-discovery approach should be useful in the development of a forensic capability to help in criminal investigations following chemical attacks.

  12. Anaplasma marginale: Diversity, Virulence, and Vaccine Landscape through a Genomics Approach

    Science.gov (United States)

    Amaro-Estrada, Itzel; Rodríguez-Camarillo, Sergio Darío

    2016-01-01

    In order to understand the genetic diversity of A. marginale, several efforts have been made around the world. This rickettsia affects a significant number of ruminants, causing bovine anaplasmosis, so the interest in its virulence and how it is transmitted have drawn interest not only from a molecular point of view but also, recently, some genomics research have been performed to elucidate genes and proteins with potential as antigens. Unfortunately, so far, we still do not have a recombinant anaplasmosis vaccine. In this review, we present a landscape of the multiple approaches carried out from the genomic perspective to generate valuable information that could be used in a holistic way to finally develop an anaplasmosis vaccine. These approaches include the analysis of the genetic diversity of A. marginale and how this affects control measures for the disease. Anaplasmosis vaccine development is also reviewed from the conventional vaccinomics to genome-base vaccinology approach based on proteomics, metabolomics, and transcriptomics analyses reported. The use of these new omics approaches will undoubtedly reveal new targets of interest in the near future, comprising information of potential antigens and the immunogenic effect of A. marginale proteins. PMID:27610385

  13. Anaplasma marginale: Diversity, Virulence, and Vaccine Landscape through a Genomics Approach

    Directory of Open Access Journals (Sweden)

    Rosa Estela Quiroz-Castañeda

    2016-01-01

    Full Text Available In order to understand the genetic diversity of A. marginale, several efforts have been made around the world. This rickettsia affects a significant number of ruminants, causing bovine anaplasmosis, so the interest in its virulence and how it is transmitted have drawn interest not only from a molecular point of view but also, recently, some genomics research have been performed to elucidate genes and proteins with potential as antigens. Unfortunately, so far, we still do not have a recombinant anaplasmosis vaccine. In this review, we present a landscape of the multiple approaches carried out from the genomic perspective to generate valuable information that could be used in a holistic way to finally develop an anaplasmosis vaccine. These approaches include the analysis of the genetic diversity of A. marginale and how this affects control measures for the disease. Anaplasmosis vaccine development is also reviewed from the conventional vaccinomics to genome-base vaccinology approach based on proteomics, metabolomics, and transcriptomics analyses reported. The use of these new omics approaches will undoubtedly reveal new targets of interest in the near future, comprising information of potential antigens and the immunogenic effect of A. marginale proteins.

  14. Phylogeny-guided (meta)genome mining approach for the targeted discovery of new microbial natural products.

    Science.gov (United States)

    Kang, Hahk-Soo

    2017-02-01

    Genomics-based methods are now commonplace in natural products research. A phylogeny-guided mining approach provides a means to quickly screen a large number of microbial genomes or metagenomes in search of new biosynthetic gene clusters of interest. In this approach, biosynthetic genes serve as molecular markers, and phylogenetic trees built with known and unknown marker gene sequences are used to quickly prioritize biosynthetic gene clusters for their metabolites characterization. An increase in the use of this approach has been observed for the last couple of years along with the emergence of low cost sequencing technologies. The aim of this review is to discuss the basic concept of a phylogeny-guided mining approach, and also to provide examples in which this approach was successfully applied to discover new natural products from microbial genomes and metagenomes. I believe that the phylogeny-guided mining approach will continue to play an important role in genomics-based natural products research.

  15. Evaluating the Cassandra NoSQL Database Approach for Genomic Data Persistency.

    Science.gov (United States)

    Aniceto, Rodrigo; Xavier, Rene; Guimarães, Valeria; Hondo, Fernanda; Holanda, Maristela; Walter, Maria Emilia; Lifschitz, Sérgio

    2015-01-01

    Rapid advances in high-throughput sequencing techniques have created interesting computational challenges in bioinformatics. One of them refers to management of massive amounts of data generated by automatic sequencers. We need to deal with the persistency of genomic data, particularly storing and analyzing these large-scale processed data. To find an alternative to the frequently considered relational database model becomes a compelling task. Other data models may be more effective when dealing with a very large amount of nonconventional data, especially for writing and retrieving operations. In this paper, we discuss the Cassandra NoSQL database approach for storing genomic data. We perform an analysis of persistency and I/O operations with real data, using the Cassandra database system. We also compare the results obtained with a classical relational database system and another NoSQL database approach, MongoDB.

  16. Evaluating the Cassandra NoSQL Database Approach for Genomic Data Persistency

    Directory of Open Access Journals (Sweden)

    Rodrigo Aniceto

    2015-01-01

    Full Text Available Rapid advances in high-throughput sequencing techniques have created interesting computational challenges in bioinformatics. One of them refers to management of massive amounts of data generated by automatic sequencers. We need to deal with the persistency of genomic data, particularly storing and analyzing these large-scale processed data. To find an alternative to the frequently considered relational database model becomes a compelling task. Other data models may be more effective when dealing with a very large amount of nonconventional data, especially for writing and retrieving operations. In this paper, we discuss the Cassandra NoSQL database approach for storing genomic data. We perform an analysis of persistency and I/O operations with real data, using the Cassandra database system. We also compare the results obtained with a classical relational database system and another NoSQL database approach, MongoDB.

  17. Genomic approaches for understanding dengue: insights from the virus, vector, and host.

    Science.gov (United States)

    Sim, Shuzhen; Hibberd, Martin L

    2016-03-02

    The incidence and geographic range of dengue have increased dramatically in recent decades. Climate change, rapid urbanization and increased global travel have facilitated the spread of both efficient mosquito vectors and the four dengue virus serotypes between population centers. At the same time, significant advances in genomics approaches have provided insights into host-pathogen interactions, immunogenetics, and viral evolution in both humans and mosquitoes. Here, we review these advances and the innovative treatment and control strategies that they are inspiring.

  18. Joint signal extraction from galaxy clusters in X-ray and SZ surveys: A matched-filter approach

    CERN Document Server

    Tarrío, Paula; Arnaud, Monique; Pratt, Gabriel W

    2016-01-01

    The hot ionized gas of the intra-cluster medium emits thermal radiation in the X-ray band and also distorts the cosmic microwave radiation through the Sunyaev-Zel'dovich (SZ) effect. Combining these two complementary sources of information through innovative techniques can therefore potentially improve the cluster detection rate when compared to using only one of the probes. Our aim is to build such a joint X-ray-SZ analysis tool, which will allow us to detect fainter or more distant clusters while maintaining high catalogue purity. We present a method based on matched multifrequency filters (MMF) for extracting cluster catalogues from SZ and X-ray surveys. We first designed an X-ray matched-filter method, analogous to the classical MMF developed for SZ observations. Then, we built our joint X-ray-SZ algorithm by combining our X-ray matched filter with the classical SZ-MMF, for which we used the physical relation between SZ and X-ray observations. We show that the proposed X-ray matched filter provides correc...

  19. A computational approach for ordering signal transduction pathway components from genomics and proteomics Data

    Directory of Open Access Journals (Sweden)

    Zhao Hongyu

    2004-10-01

    Full Text Available Abstract Background Signal transduction is one of the most important biological processes by which cells convert an external signal into a response. Novel computational approaches to mapping proteins onto signaling pathways are needed to fully take advantage of the rapid accumulation of genomic and proteomics information. However, despite their importance, research on signaling pathways reconstruction utilizing large-scale genomics and proteomics information has been limited. Results We have developed an approach for predicting the order of signaling pathway components, assuming all the components on the pathways are known. Our method is built on a score function that integrates protein-protein interaction data and microarray gene expression data. Compared to the individual datasets, either protein interactions or gene transcript abundance measurements, the integrated approach leads to better identification of the order of the pathway components. Conclusions As demonstrated in our study on the yeast MAPK signaling pathways, the integration analysis of high-throughput genomics and proteomics data can be a powerful means to infer the order of pathway components, enabling the transformation from molecular data into knowledge of cellular mechanisms.

  20. Liver resection for hepatocellular carcinoma within a fast-track management:a propensity-score matched analysis between open and laparoscopic approach

    Institute of Scientific and Technical Information of China (English)

    Francesca Ratti; Federica Cipriani; Raffaella Reineke; Marco Catena; Michele Paganelli; Luigi Beretta; Luca Aldrighetti

    2016-01-01

    Aim: The study was designed to assess the implications of enhanced recovery after surgery (ERAS) approach in patients submitted to open liver resection for hepatocellular carcinoma (HCC) comparing their short term outcome with patients treated by laparoscopic approach, in a case-matched design.Methods: The open-group (n = 60) was matched in a ratio of 1:1 with patients undergoing laparoscopic liver resection for HCC (Lap-group,n= 60), with a matching achieved on a basis of propensity scores including 6 covariates representing patients characteristics and severity of the disease. Primary outcome analysis was performed in terms of ERAS-speciifc items and postoperative morbidity and mortality.Results: Overall morbidity and mortality were comparable between groups. Incidence of ascites was slightly higher in the open- compared with the Lap-group (respectively 11.7% and 13.3%), without statistical signiifcance. The need for introduction or increase of chronic diuretic therapy was signiifcantly higher in the open-compared with the Lap-group (16.7%vs. 11.7%,P = 0.046). Furthermore, ascites more frequently required percutaneous drainage in the open-compared with the Lap-group (5%vs. 1.7% respectively,P = 0.041).Conclusion: In patients who can’t beneift from minimally-invasive approach because of disease characteristics, ERAS management seems to be associated with an improved postoperative functional recovery and postoperative outcomes, comparable to those of the minimally invasive approach.

  1. ChromaSig: a probabilistic approach to finding common chromatin signatures in the human genome.

    Directory of Open Access Journals (Sweden)

    Gary Hon

    2008-10-01

    Full Text Available Computational methods to identify functional genomic elements using genetic information have been very successful in determining gene structure and in identifying a handful of cis-regulatory elements. But the vast majority of regulatory elements have yet to be discovered, and it has become increasingly apparent that their discovery will not come from using genetic information alone. Recently, high-throughput technologies have enabled the creation of information-rich epigenetic maps, most notably for histone modifications. However, tools that search for functional elements using this epigenetic information have been lacking. Here, we describe an unsupervised learning method called ChromaSig to find, in an unbiased fashion, commonly occurring chromatin signatures in both tiling microarray and sequencing data. Applying this algorithm to nine chromatin marks across a 1% sampling of the human genome in HeLa cells, we recover eight clusters of distinct chromatin signatures, five of which correspond to known patterns associated with transcriptional promoters and enhancers. Interestingly, we observe that the distinct chromatin signatures found at enhancers mark distinct functional classes of enhancers in terms of transcription factor and coactivator binding. In addition, we identify three clusters of novel chromatin signatures that contain evolutionarily conserved sequences and potential cis-regulatory elements. Applying ChromaSig to a panel of 21 chromatin marks mapped genomewide by ChIP-Seq reveals 16 classes of genomic elements marked by distinct chromatin signatures. Interestingly, four classes containing enrichment for repressive histone modifications appear to be locally heterochromatic sites and are enriched in quickly evolving regions of the genome. The utility of this approach in uncovering novel, functionally significant genomic elements will aid future efforts of genome annotation via chromatin modifications.

  2. A Novel Multi-Purpose Matching Representation of Local 3D Surfaces: A Rotationally Invariant, Efficient, and Highly Discriminative Approach With an Adjustable Sensitivity.

    Science.gov (United States)

    Al-Osaimi, Faisal R

    2016-02-01

    In this paper, a novel approach to local 3D surface matching representation suitable for a range of 3D vision applications is introduced. Local 3D surface patches around key points on the 3D surface are represented by 2D images such that the representing 2D images enjoy certain characteristics which positively impact the matching accuracy, robustness, and speed. First, the proposed representation is complete, in the sense, there is no information loss during their computation. Second, the 3DoF 2D representations are strictly invariant to all the 3DoF rotations. To optimally avail surface information, the sensitivity of the representations to surface information is adjustable. This also provides the proposed matching representation with the means to optimally adjust to a particular class of problems/applications or an acquisition technology. Each 2D matching representation is a sequence of adjustable integral kernels, where each kernel is efficiently computed from a triple of precise 3D curves (profiles) formed by intersecting three concentric spheres with the 3D surface. Robust techniques for sampling the profiles and establishing correspondences among them were devised. Based on the proposed matching representation, two techniques for the detection of key points were presented. The first is suitable for static images, while the second is suitable for 3D videos. The approach was tested on the face recognition grand challenge v2.0, the 3D twins expression challenge, and the Bosphorus data sets, and a superior face recognition performance was achieved. In addition, the proposed approach was used in object class recognition and tested on a Kinect data set.

  3. Joint signal extraction from galaxy clusters in X-ray and SZ surveys: A matched-filter approach

    Science.gov (United States)

    Tarrío, P.; Melin, J.-B.; Arnaud, M.; Pratt, G. W.

    2016-06-01

    The hot ionized gas of the intra-cluster medium emits thermal radiation in the X-ray band and also distorts the cosmic microwave radiation through the Sunyaev-Zel'dovich (SZ) effect. Combining these two complementary sources of information through innovative techniques can therefore potentially improve the cluster detection rate when compared to using only one of the probes. Our aim is to build such a joint X-ray-SZ analysis tool, which will allow us to detect fainter or more distant clusters while maintaining high catalogue purity. We present a method based on matched multifrequency filters (MMF) for extracting cluster catalogues from SZ and X-ray surveys. We first designed an X-ray matched-filter method, analogous to the classical MMF developed for SZ observations. Then, we built our joint X-ray-SZ algorithm by combining our X-ray matched filter with the classical SZ-MMF, for which we used the physical relation between SZ and X-ray observations. We show that the proposed X-ray matched filter provides correct photometry results, and that the joint matched filter also provides correct photometry when the FX/Y500 relation of the clusters is known. Moreover, the proposed joint algorithm provides a better signal-to-noise ratio than single-map extractions, which improves the detection rate even if we do not exactly know the FX/Y500 relation. The proposed methods were tested using data from the ROSAT all-sky survey and from the Planck survey.

  4. An integrative genomic approach to uncover molecular mechanisms of prokaryotic traits.

    Directory of Open Access Journals (Sweden)

    Yang Liu

    2006-11-01

    Full Text Available With mounting availability of genomic and phenotypic databases, data integration and mining become increasingly challenging. While efforts have been put forward to analyze prokaryotic phenotypes, current computational technologies either lack high throughput capacity for genomic scale analysis, or are limited in their capability to integrate and mine data across different scales of biology. Consequently, simultaneous analysis of associations among genomes, phenotypes, and gene functions is prohibited. Here, we developed a high throughput computational approach, and demonstrated for the first time the feasibility of integrating large quantities of prokaryotic phenotypes along with genomic datasets for mining across multiple scales of biology (protein domains, pathways, molecular functions, and cellular processes. Applying this method over 59 fully sequenced prokaryotic species, we identified genetic basis and molecular mechanisms underlying the phenotypes in bacteria. We identified 3,711 significant correlations between 1,499 distinct Pfam and 63 phenotypes, with 2,650 correlations and 1,061 anti-correlations. Manual evaluation of a random sample of these significant correlations showed a minimal precision of 30% (95% confidence interval: 20%-42%; n = 50. We stratified the most significant 478 predictions and subjected 100 to manual evaluation, of which 60 were corroborated in the literature. We furthermore unveiled 10 significant correlations between phenotypes and KEGG pathways, eight of which were corroborated in the evaluation, and 309 significant correlations between phenotypes and 166 GO concepts evaluated using a random sample (minimal precision = 72%; 95% confidence interval: 60%-80%; n = 50. Additionally, we conducted a novel large-scale phenomic visualization analysis to provide insight into the modular nature of common molecular mechanisms spanning multiple biological scales and reused by related phenotypes (metaphenotypes. We propose

  5. Integrated genomics and molecular breeding approaches for dissecting the complex quantitative traits in crop plants

    Indian Academy of Sciences (India)

    Alice Kujur; Maneesha S Saxena; Deepak Bajaj; Laxmi; Swarup K Parida

    2013-12-01

    The enormous population growth, climate change and global warming are now considered major threats to agriculture and world’s food security. To improve the productivity and sustainability of agriculture, the development of high-yielding and durable abiotic and biotic stress-tolerant cultivars and/climate resilient crops is essential. Henceforth, understanding the molecular mechanism and dissection of complex quantitative yield and stress tolerance traits is the prime objective in current agricultural biotechnology research. In recent years, tremendous progress has been made in plant genomics and molecular breeding research pertaining to conventional and next-generation whole genome, transcriptome and epigenome sequencing efforts, generation of huge genomic, transcriptomic and epigenomic resources and development of modern genomics-assisted breeding approaches in diverse crop genotypes with contrasting yield and abiotic stress tolerance traits. Unfortunately, the detailed molecular mechanism and gene regulatory networks controlling such complex quantitative traits is not yet well understood in crop plants. Therefore, we propose an integrated strategies involving available enormous and diverse traditional and modern –omics (structural, functional, comparative and epigenomics) approaches/resources and genomics-assisted breeding methods which agricultural biotechnologist can adopt/utilize to dissect and decode the molecular and gene regulatory networks involved in the complex quantitative yield and stress tolerance traits in crop plants. This would provide clues and much needed inputs for rapid selection of novel functionally relevant molecular tags regulating such complex traits to expedite traditional and modern marker-assisted genetic enhancement studies in target crop species for developing high-yielding stress-tolerant varieties.

  6. Whole-genome sequencing approaches for conservation biology: advantages, limitations, and practical recommendations.

    Science.gov (United States)

    Fuentes-Pardo, Angela P; Ruzzante, Daniel E

    2017-07-26

    Whole-genome resequencing (WGR) is a powerful method for addressing fundamental evolutionary biology questions that have not been fully resolved using traditional methods. WGR includes four approaches: the sequencing of individuals to a high depth of coverage with either unresolved (huWGR) or resolved haplotypes (hrWGR), the sequencing of population genomes to a high depth by mixing equimolar amounts of unlabelled-individual DNA (Pool-seq), and the sequencing of multiple individuals from a population to a low depth (lcWGR). These techniques require the availability of a reference genome. This, along with the still high cost of shotgun sequencing and the large demand for computing resources and storage, has limited their implementation in non-model species with scarce genomic resources and in fields such as conservation biology. Our goal here is to describe the various WGR methods, their pros and cons, and potential applications in conservation biology. WGR offers an unprecedented marker density and surveys a wide diversity of genetic variations not limited to single nucleotide polymorphisms (e.g. structural variants and mutations in regulatory elements), increasing their power for the detection of signatures of selection and local adaptation as well as for the identification of the genetic basis of phenotypic traits and diseases. Currently though, no single WGR approach fulfills all requirements of conservation genetics, and each method has its own limitations and sources of potential bias. We discuss proposed ways to minimize such biases. We envision a not distant future where the analysis of whole genomes becomes a routine task in many non-model species and fields including conservation biology. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  7. Development of disease-resistant walnut rootstocks: Integration of conventional and genomic approaches (SCRI-match Year 3)

    Science.gov (United States)

    Walnuts are grown on almost every continent with total world-wide production estimated at over 4 billion in-shell pounds. California walnut growers, who produce 99% of the US walnut crop, produced an estimated 1.2 billion pounds on approximately 310,000 bearing acres with a farm gate value of approx...

  8. Genome-wide approaches to identify pharmacogenetic contributions to adverse drug reactions.

    Science.gov (United States)

    Nelson, M R; Bacanu, S-A; Mosteller, M; Li, L; Bowman, C E; Roses, A D; Lai, E H; Ehm, M G

    2009-02-01

    Adverse drug reactions (ADRs) have a major impact on patients, physicians, health care providers, regulatory agencies and pharmaceutical companies. Identifying the genetic contributions to ADR risk may lead to a better understanding of the underlying mechanisms, identification of patients at risk and a decrease in the number of events. Technological advances have made the routine monitoring and investigation of the genetic basis of ADRs during clinical trials possible. We demonstrate through simulation that genome-wide genotyping, coupled with the use of clinically matched or population controls, can yield sufficient statistical power to permit the identification of strong genetic predictors of ADR risk in a prospective manner with modest numbers of ADR cases. The results of a 500,000 single nucleotide polymorphism analysis of abacavir-associated hypersensitivity reaction suggest that the known HLA-B gene region could be identified with as few as 15 cases and 200 population controls in a sequential analysis.

  9. Integrated proteo-genomic approach for early diagnosis and prognosis of cancer.

    Science.gov (United States)

    Shukla, Hem D; Mahmood, Javed; Vujaskovic, Zeljko

    2015-12-01

    Cancer is the leading cause of mortality among men and women worldwide. Despite the availability of numerous diagnostic techniques for various cancers, the overall survival rate remains low and the majority of patients die due to late diagnosis and advanced stage of the disease. Diagnosing and treating cancer at its early stages ideally during the precancerous phase could significantly increase survival rate with the possibility of cure and prolong survival. Cancer is a genetic disease and it is illicitly activated by the acquisition of somatic DNA lesions and aberrations in genome structure and defects in maintenance and repair. These somatic DNA mutations known as driver mutations seem to be the prime cause in initiating tumorigenesis. The advances in genomic technologies have immensely facilitated the understanding of cancer progression and metastasis, and the discovery of novel biomarkers. However, changes in somatic mutational landscape of the oncogenome are translated into aberrantly regulated oncoproteome which drives the cancer initiation. Thus, combination of proteomic and genomic technologies is urgently required to discover biomarkers for early diagnosis. The recent advances in human genome based detection of cancer using advanced genomic technologies like NextGen Sequencing, digital PCR, cfDNA technology have shown promise; for example oncogenic somatic mutation variants, transcriptomic analysis, copy number variant, and methylation data from the Cancer Genome Atlas. Similarly, oncoproteomics has the potential to revolutionize clinical management of the disease, including cancer diagnosis and screening based on new proteomic database which embodies somatic variants and post translational modifications, thus devising proteomic technologies as a complement to histopathology. Further, the use of multiple proteomic and genomic biomarkers rather than a single gene or protein could greatly improve diagnostic accuracy and enhance the predictive power for

  10. A novel approach for determining cancer genomic breakpoints in the presence of normal DNA.

    Directory of Open Access Journals (Sweden)

    Yu-Tsueng Liu

    Full Text Available CDKN2A (encodes p16(INK4A and p14(ARF deletion, which results in both Rb and p53 inactivation, is the most common chromosomal anomaly in human cancers. To precisely map the deletion breakpoints is important to understanding the molecular mechanism of genomic rearrangement and may also be useful for clinical applications. However, current methods for determining the breakpoint are either of low resolution or require the isolation of relatively pure cancer cells, which can be difficult for clinical samples that are typically contaminated with various amounts of normal host cells. To overcome this hurdle, we have developed a novel approach, designated Primer Approximation Multiplex PCR (PAMP, for enriching breakpoint sequences followed by genomic tiling array hybridization to locate the breakpoints. In a series of proof-of-concept experiments, we were able to identify cancer-derived CDKN2A genomic breakpoints when more than 99.9% of wild type genome was present in a model system. This design can be scaled up with bioinformatics support and can be applied to validate other candidate cancer-associated loci that are revealed by other more systemic but lower throughput assays.

  11. Single-molecule approach to bacterial genomic comparisons via optical mapping.

    Energy Technology Data Exchange (ETDEWEB)

    Zhou, Shiguo [Univ. Wisc.-Madison; Kile, A. [Univ. Wisc.-Madison; Bechner, M. [Univ. Wisc.-Madison; Kvikstad, E. [Univ. Wisc.-Madison; Deng, W. [Univ. Wisc.-Madison; Wei, J. [Univ. Wisc.-Madison; Severin, J. [Univ. Wisc.-Madison; Runnheim, R. [Univ. Wisc.-Madison; Churas, C. [Univ. Wisc.-Madison; Forrest, D. [Univ. Wisc.-Madison; Dimalanta, E. [Univ. Wisc.-Madison; Lamers, C. [Univ. Wisc.-Madison; Burland, V. [Univ. Wisc.-Madison; Blattner, F. R. [Univ. Wisc.-Madison; Schwartz, David C. [Univ. Wisc.-Madison

    2004-01-01

    Modern comparative genomics has been established, in part, by the sequencing and annotation of a broad range of microbial species. To gain further insights, new sequencing efforts are now dealing with the variety of strains or isolates that gives a species definition and range; however, this number vastly outstrips our ability to sequence them. Given the availability of a large number of microbial species, new whole genome approaches must be developed to fully leverage this information at the level of strain diversity that maximize discovery. Here, we describe how optical mapping, a single-molecule system, was used to identify and annotate chromosomal alterations between bacterial strains represented by several species. Since whole-genome optical maps are ordered restriction maps, sequenced strains of Shigella flexneri serotype 2a (2457T and 301), Yersinia pestis (CO 92 and KIM), and Escherichia coli were aligned as maps to identify regions of homology and to further characterize them as possible insertions, deletions, inversions, or translocations. Importantly, an unsequenced Shigella flexneri strain (serotype Y strain AMC[328Y]) was optically mapped and aligned with two sequenced ones to reveal one novel locus implicated in serotype conversion and several other loci containing insertion sequence elements or phage-related gene insertions. Our results suggest that genomic rearrangements and chromosomal breakpoints are readily identified and annotated against a prototypic sequenced strain by using the tools of optical mapping.

  12. A functional genomics approach to establish the complement of carbohydrate transporters in Streptococcus pneumoniae.

    Directory of Open Access Journals (Sweden)

    Alessandro Bidossi

    Full Text Available The aerotolerant anaerobe Streptococcus pneumoniae is part of the normal nasopharyngeal microbiota of humans and one of the most important invasive pathogens. A genomic survey allowed establishing the occurrence of twenty-one phosphotransferase systems, seven carbohydrate uptake ABC transporters, one sodium:solute symporter and a permease, underlining an exceptionally high capacity for uptake of carbohydrate substrates. Despite high genomic variability, combined phenotypic and genomic analysis of twenty sequenced strains did assign the substrate specificity only to two uptake systems. Systematic analysis of mutants for most carbohydrate transporters enabled us to assign a phenotype and substrate specificity to twenty-three transport systems. For five putative transporters for galactose, pentoses, ribonucleosides and sulphated glycans activity was inferred, but not experimentally confirmed and only one transport system remains with an unknown substrate and lack of any functional annotation. Using a metabolic approach, 80% of the thirty-two fermentable carbon substrates were assigned to the corresponding transporter. The complexity and robustness of sugar uptake is underlined by the finding that many transporters have multiple substrates, and many sugars are transported by more than one system. The present work permits to draw a functional map of the complete arsenal of carbohydrate utilisation proteins of pneumococci, allows re-annotation of genomic data and might serve as a reference for related species. These data provide tools for specific investigation of the roles of the different carbon substrates on pneumococcal physiology in the host during carriage and invasive infection.

  13. Characterization of the Genomic Diversity of Norovirus in Linked Patients Using a Metagenomic Deep Sequencing Approach

    Science.gov (United States)

    Nasheri, Neda; Petronella, Nicholas; Ronholm, Jennifer; Bidawid, Sabah; Corneau, Nathalie

    2017-01-01

    Norovirus (NoV) is the leading cause of gastroenteritis worldwide. A robust cell culture system does not exist for NoV and therefore detailed characterization of outbreak and sporadic strains relies on molecular techniques. In this study, we employed a metagenomic approach that uses non-specific amplification followed by next-generation sequencing to whole genome sequence NoV genomes directly from clinical samples obtained from 8 linked patients. Enough sequencing depth was obtained for each sample to use a de novo assembly of near-complete genome sequences. The resultant consensus sequences were then used to identify inter-host nucleotide variations that occur after direct transmission, analyze amino acid variations in the major capsid protein, and provide evidence of recombination events. The analysis of intra-host quasispecies diversity was possible due to high coverage-depth. We also observed a linear relationship between NoV viral load in the clinical sample and the number of sequence reads that could be attributed to NoV. The method demonstrated here has the potential for future use in whole genome sequence analyses of other RNA viruses isolated from clinical, environmental, and food specimens. PMID:28197136

  14. Design and Simulation for Producing Two Amplitude Matched Anti-phase Sine Waveforms Using ±2.5 V CMOS Current-Mode Approach

    OpenAIRE

    Anil Kumar Sharma; Dipankar Pal

    2010-01-01

    In this paper the current mode approach called “Current Conveyor (CCII+)” has been incorporated to design and simulate the circuit for producing two amplitude matched anti-phase sine waveforms which are frequently used in various communication and instrumentation systems. PSpice simulation has been used to depict the output waveforms. The power supply used is ±2.5 V which can be easily incorporated with CMOS IC technology. The designed circuit has been simulated at variousfrequency ranges and...

  15. Functional genomic screening approaches in mechanistic toxicology and potential future applications of CRISPR-Cas9.

    Science.gov (United States)

    Shen, Hua; McHale, Cliona M; Smith, Martyn T; Zhang, Luoping

    2015-01-01

    Characterizing variability in the extent and nature of responses to environmental exposures is a critical aspect of human health risk assessment. Chemical toxicants act by many different mechanisms, however, and the genes involved in adverse outcome pathways (AOPs) and AOP networks are not yet characterized. Functional genomic approaches can reveal both toxicity pathways and susceptibility genes, through knockdown or knockout of all non-essential genes in a cell of interest, and identification of genes associated with a toxicity phenotype following toxicant exposure. Screening approaches in yeast and human near-haploid leukemic KBM7 cells have identified roles for genes and pathways involved in response to many toxicants but are limited by partial homology among yeast and human genes and limited relevance to normal diploid cells. RNA interference (RNAi) suppresses mRNA expression level but is limited by off-target effects (OTEs) and incomplete knockdown. The recently developed gene editing approach called clustered regularly interspaced short palindrome repeats-associated nuclease (CRISPR)-Cas9, can precisely knock-out most regions of the genome at the DNA level with fewer OTEs than RNAi, in multiple human cell types, thus overcoming the limitations of the other approaches. It has been used to identify genes involved in the response to chemical and microbial toxicants in several human cell types and could readily be extended to the systematic screening of large numbers of environmental chemicals. CRISPR-Cas9 can also repress and activate gene expression, including that of non-coding RNA, with near-saturation, thus offering the potential to more fully characterize AOPs and AOP networks. Finally, CRISPR-Cas9 can generate complex animal models in which to conduct preclinical toxicity testing at the level of individual genotypes or haplotypes. Therefore, CRISPR-Cas9 is a powerful and flexible functional genomic screening approach that can be harnessed to provide

  16. A New Approach for Design of Model Matching Controllers for Time Delay Systems by Using GA Technique

    Directory of Open Access Journals (Sweden)

    K. K. D Priyanka

    2015-01-01

    Full Text Available Modeling of physical systems usually results in complex high order dynamic representation. The simulation and design of controller for higher order system is a difficult problem. Normally the cost and complexity of the controller increases with the system order. Hence it is desirable to approximate these models to reduced order model such that these lower order models preserves all salient features of higher order model. Lower order models simplify the understanding of the original higher order system. Modern controller design methods such as Model Matching Technique, LQG produce controllers of order at least equal to that of the plant, usually higher order. These control laws are may be too complex with regards to practical implementation and simpler designs are then sought. For this purpose, one can either reduce the order the plant model prior to controller design, or reduce the controller in the final stage, or both. In the present work, a controller is designed such that the closed loop system which includes a delay response(s matches with those of the chosen model with same time delay as close as possible. Based on desired model, a controller(of higher order is designed using model matching method and is approximated to a lower order one using Approximate Generalized Time Moments (AGTM / Approximate Generalized Markov Moments (AGMM matching technique and Optimal Pade Approximation technique. Genetic Algorithm (GA optimization technique is used to obtain the expansion points one which yields similar response as that of model, minimizing the error between the response of the model and that of designed closed loop system.

  17. Intraoperative cone-beam CT for correction of periaxial malrotation of the femoral shaft: a surface-matching approach.

    Science.gov (United States)

    Khoury, Amal; Whyne, Cari M; Daly, Michael; Moseley, Douglas; Bootsma, Greg; Skrinskas, Tomas; Siewerdsen, Jeffrey; Jaffray, David

    2007-04-01

    Limb length, alignment and rotation can be difficult to determine in femoral shaft fractures. Shaft axis rotation is particularly difficult to assess intraoperatively. Femoral malpositioning can cause deformity, pain and secondary degenerative joint damage. The aim of this study is to develop an intraoperative method based on cone-beam computed tomography (CBCT) to guide alignment of femoral shaft fractures. We hypothesize that bone surface matching can predict malrotation even with severe comminution. A cadaveric femur was imaged at 16 femoral periaxial malrotations (-51.2 degrees to 60.1 degrees). The images were processed resulting in an unwrapped bone surface plot consisting of a pattern of ridges and valleys. Fracture gaps were simulated by removing midline CT slices. The gaps were reconstituted by extrapolating the existing proximal and distal fragments to the midline of the fracture. The two bone surfaces were then shifted to align bony features. Periaxial malrotation was accurately assessed using surface matching (r2 = 0.99, slope 1.0). The largest mean error was 2.20 degrees and the average difference between repeated measurements was 0.49 degrees. CBCT can provide intraoperative high-resolution images with a large field of view. This quality of imaging enables surface matching algorithms to be utilized even with large areas of comminution.

  18. A new navigation approach of terrain contour matching based on 3-D terrain reconstruction from onboard image sequence

    Institute of Scientific and Technical Information of China (English)

    2010-01-01

    This article presents a passive navigation method of terrain contour matching by reconstructing the 3-D terrain from the image sequence(acquired by the onboard camera).To achieve automation and simultaneity of the image sequence processing for navigation,a correspondence registration method based on control points tracking is proposed which tracks the sparse control points through the whole image sequence and uses them as correspondence in the relation geometry solution.Besides,a key frame selection method based on the images overlapping ratio and intersecting angles is explored,thereafter the requirement for the camera system configuration is provided.The proposed method also includes an optimal local homography estimating algorithm according to the control points,which helps correctly predict points to be matched and their speed corresponding.Consequently,the real-time 3-D terrain of the trajectory thus reconstructed is matched with the referenced terrain map,and the result of which provides navigating information.The digital simulation experiment and the real image based experiment have verified the proposed method.

  19. MIRAGE: a functional genomics-based approach for metabolic network model reconstruction and its application to cyanobacteria networks.

    Science.gov (United States)

    Vitkin, Edward; Shlomi, Tomer

    2012-11-29

    Genome-scale metabolic network reconstructions are considered a key step in quantifying the genotype-phenotype relationship. We present a novel gap-filling approach, MetabolIc Reconstruction via functionAl GEnomics (MIRAGE), which identifies missing network reactions by integrating metabolic flux analysis and functional genomics data. MIRAGE's performance is demonstrated on the reconstruction of metabolic network models of E. coli and Synechocystis sp. and validated via existing networks for these species. Then, it is applied to reconstruct genome-scale metabolic network models for 36 sequenced cyanobacteria amenable for constraint-based modeling analysis and specifically for metabolic engineering. The reconstructed network models are supplied via standard SBML files.

  20. Pan-Genome Analysis of Human Gastric Pathogen H. pylori: Comparative Genomics and Pathogenomics Approaches to Identify Regions Associated with Pathogenicity and Prediction of Potential Core Therapeutic Targets

    Directory of Open Access Journals (Sweden)

    Amjad Ali

    2015-01-01

    Full Text Available Helicobacter pylori is a human gastric pathogen implicated as the major cause of peptic ulcer and second leading cause of gastric cancer (~70% around the world. Conversely, an increased resistance to antibiotics and hindrances in the development of vaccines against H. pylori are observed. Pan-genome analyses of the global representative H. pylori isolates consisting of 39 complete genomes are presented in this paper. Phylogenetic analyses have revealed close relationships among geographically diverse strains of H. pylori. The conservation among these genomes was further analyzed by pan-genome approach; the predicted conserved gene families (1,193 constitute ~77% of the average H. pylori genome and 45% of the global gene repertoire of the species. Reverse vaccinology strategies have been adopted to identify and narrow down the potential core-immunogenic candidates. Total of 28 nonhost homolog proteins were characterized as universal therapeutic targets against H. pylori based on their functional annotation and protein-protein interaction. Finally, pathogenomics and genome plasticity analysis revealed 3 highly conserved and 2 highly variable putative pathogenicity islands in all of the H. pylori genomes been analyzed.

  1. Systems biology approach reveals genome to phenome correlation in type 2 diabetes.

    Science.gov (United States)

    Jain, Priyanka; Vig, Saurabh; Datta, Malabika; Jindel, Dinesh; Mathur, Ashok Kumar; Mathur, Sandeep Kumar; Sharma, Abhay

    2013-01-01

    Genome-wide association studies (GWASs) have discovered association of several loci with Type 2 diabetes (T2D), a common complex disease characterized by impaired insulin secretion by pancreatic β cells and insulin signaling in target tissues. However, effect of genetic risk variants on continuous glycemic measures in nondiabetic subjects mainly elucidates perturbation of insulin secretion. Also, the disease associated genes do not clearly converge on functional categories consistent with the known aspects of T2D pathophysiology. We used a systems biology approach to unravel genome to phenome correlation in T2D. We first examined enrichment of pathways in genes identified in T2D GWASs at genome-wide or lower levels of significance. Genes at lower significance threshold showed enrichment of insulin secretion related pathway. Notably, physical and genetic interaction network of these genes showed robust enrichment of insulin signaling and other T2D pathophysiology related pathways including insulin secretion. The network also overrepresented genes reported to interact with insulin secretion and insulin action targeting antidiabetic drugs. The drug interacting genes themselves showed overrepresentation of insulin signaling and other T2D relevant pathways. Next, we generated genome-wide expression profiles of multiple insulin responsive tissues from nondiabetic and diabetic patients. Remarkably, the differentially expressed genes showed significant overlap with the network genes, with the intersection showing enrichment of insulin signaling and other pathways consistent with T2D pathophysiology. Literature search led our genomic, interactomic, transcriptomic and toxicogenomic evidence to converge on TGF-beta signaling, a pathway known to play a crucial role in pancreatic islets development and function, and insulin signaling. Cumulatively, we find that GWAS genes relate directly to insulin secretion and indirectly, through collaborating with other genes, to insulin

  2. Systems biology approach reveals genome to phenome correlation in type 2 diabetes.

    Directory of Open Access Journals (Sweden)

    Priyanka Jain

    Full Text Available Genome-wide association studies (GWASs have discovered association of several loci with Type 2 diabetes (T2D, a common complex disease characterized by impaired insulin secretion by pancreatic β cells and insulin signaling in target tissues. However, effect of genetic risk variants on continuous glycemic measures in nondiabetic subjects mainly elucidates perturbation of insulin secretion. Also, the disease associated genes do not clearly converge on functional categories consistent with the known aspects of T2D pathophysiology. We used a systems biology approach to unravel genome to phenome correlation in T2D. We first examined enrichment of pathways in genes identified in T2D GWASs at genome-wide or lower levels of significance. Genes at lower significance threshold showed enrichment of insulin secretion related pathway. Notably, physical and genetic interaction network of these genes showed robust enrichment of insulin signaling and other T2D pathophysiology related pathways including insulin secretion. The network also overrepresented genes reported to interact with insulin secretion and insulin action targeting antidiabetic drugs. The drug interacting genes themselves showed overrepresentation of insulin signaling and other T2D relevant pathways. Next, we generated genome-wide expression profiles of multiple insulin responsive tissues from nondiabetic and diabetic patients. Remarkably, the differentially expressed genes showed significant overlap with the network genes, with the intersection showing enrichment of insulin signaling and other pathways consistent with T2D pathophysiology. Literature search led our genomic, interactomic, transcriptomic and toxicogenomic evidence to converge on TGF-beta signaling, a pathway known to play a crucial role in pancreatic islets development and function, and insulin signaling. Cumulatively, we find that GWAS genes relate directly to insulin secretion and indirectly, through collaborating with other

  3. Multi-omic data integration and analysis using systems genomics approaches

    DEFF Research Database (Denmark)

    Suravajhala, Prashanth; Kogelman, Lisette; Kadarmideen, Haja

    2016-01-01

    , health and welfare. We conclude that there are big challenges in multi-omic data integration, modelling and systems-level analyses, particularly with the fast emerging HTO technologies. We highlight existing and emerging systems genomics approaches and discuss how they contribute to our understanding...... on animal production and health traits. However, notwithstanding these new HTO technologies, there remains an emerging challenge in data analysis. On the one hand, different HTO technologies judged on their own merit are appropriate for the identification of disease-causing genes, biomarkers for prevention...... and drug targets for the treatment of diseases and for individualized genomic predictions of performance or disease risks. On the other hand, integration of multi-omic data and joint modelling and analyses are very powerful and accurate to understand the systems biology of healthy and sustainable...

  4. A Scalable Genome-Editing-Based Approach for Mapping Multiprotein Complexes in Human Cells

    Directory of Open Access Journals (Sweden)

    Mathieu Dalvai

    2015-10-01

    Full Text Available Conventional affinity purification followed by mass spectrometry (AP-MS analysis is a broadly applicable method used to decipher molecular interaction networks and infer protein function. However, it is sensitive to perturbations induced by ectopically overexpressed target proteins and does not reflect multilevel physiological regulation in response to diverse stimuli. Here, we developed an interface between genome editing and proteomics to isolate native protein complexes produced from their natural genomic contexts. We used CRISPR/Cas9 and TAL effector nucleases (TALENs to tag endogenous genes and purified several DNA repair and chromatin-modifying holoenzymes to near homogeneity. We uncovered subunits and interactions among well-characterized complexes and report the isolation of MCM8/9, highlighting the efficiency and robustness of the approach. These methods improve and simplify both small- and large-scale explorations of protein interactions as well as the study of biochemical activities and structure-function relationships.

  5. Developing an integrated proteo-genomic approach for the characterisation of biomarkers for the identification of Bacillus anthracis.

    Science.gov (United States)

    Misra, Raju V; Ahmod, Nadia Z; Parker, Robert; Fang, Min; Shah, Haroun; Gharbia, Saheer

    2012-02-01

    Bacillus anthracis is the causative agent of anthrax, an acute and often fatal disease in humans. Due to the high genomic relatedness within the Bacillus cereus group of species it is a challenge to identify B. anthracis consistently. Alternative strategies such as proteomics coupled with mass spectrometry (MS) provide a powerful approach for biomarker discovery. However, validating and evaluating these markers, particularly for genetically homogeneous species such as B. anthracis are challenging. The objective of this study is to develop a robust biomarker discovery and validation pipeline, using proteomic methodology combined with in silico and molecular approaches, to determine a biomarker list, using B. anthracis as a model. In this exploratory study we profiled the proteome of B. anthracis and genetically related species using GeLC-Liquid Chromatography MS/MS (GeLC-LC MS/MS), identifying peptides that could be used to detect B. anthracis. Peptides were filtered to remove low quality identifications. Using comparative bioinformatic approaches, matching and searching against genomic sequence data a shortlist of peptide biomarkers was determined and validated using DNA sequencing, against a panel of closely related strains, to determine marker specificity. Further validation was performed using MS quantitation methods to assess sensitivity and specificity. A biomarker discovery pipeline was successfully developed in this study, comprising four distinct stages: proteome profiling, comparative bioinformatic validation, DNA sequencing and MS validation. Using the pipeline, 5379 peptides specific for Bacillus species and 36 peptides specific for B. anthracis were identified and validated. The 36 peptides, representing 30 proteins were derived from over 15 different clusters of orthologous group categories, including proteins involved in transcription, energy production/conservation as well as multifunctional proteins. We demonstrated that the peptide biomarkers

  6. Gene discovery in the hamster: a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs

    Directory of Open Access Journals (Sweden)

    Khan Shafiq A

    2003-06-01

    Full Text Available Abstract Background Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish genomes. Results 735 distinct sequences were analyzed for their relatedness to known sequences in public databases. Eight of these sequences were derived from previously unidentified genes and expression of these genes in testis was confirmed by Northern blotting. The genomic locations of each sequence were mapped in human, mouse, rat and pufferfish, where applicable, and the structure of their cognate genes was derived using computer-based predictions, genomic comparisons and analysis of uncharacterized cDNA sequences from human and macaque. Conclusion The use of a comparative genomics approach resulted in the identification of eight cDNAs that correspond to previously uncharacterized genes in the human genome. The proteins encoded by these genes included a new member of the kinesin superfamily, a SET/MYND-domain protein, and six proteins for which no specific function could be predicted. Each gene was expressed primarily in testis, suggesting that they may play roles in the development and/or function of testicular cells.

  7. Genomic approach to studying nutritional requirements of Clostridium tyrobutyricum and other Clostridia causing late blowing defects.

    Science.gov (United States)

    Storari, Michelangelo; Kulli, Sandra; Wüthrich, Daniel; Bruggmann, Rémy; Berthoud, Hélène; Arias-Roth, Emmanuelle

    2016-10-01

    Clostridium tyrobutyricum is the main microorganism responsible for the late blowing defect in hard and semi-hard cheeses, causing considerable economic losses to the cheese industry. Deeper knowledge of the metabolic requirements of this microorganism can lead to the development of more effective control approaches. In this work, the amino acids and B vitamins essential for sustaining the growth of C. tyrobutyricum were investigated using a genomic approach. As the first step, the genomes of four C. tyrobutyricum strains were analyzed for the presence of genes putatively involved in the biosynthesis of amino acids and B vitamins. Metabolic pathways could be reconstructed for all amino acids and B vitamins with the exception of biotin (vitamin B7) and folate (vitamin B9). The biotin pathway was missing the enzyme amino-7-oxononanoate synthase that catalyzes the condensation of pimeloyl-ACP and l-alanine to 8-amino-7-oxononanoate. In the folate pathway, the missing genes were those coding for para-aminobenzoate synthase and aminodeoxychorismate lyase enzymes. These enzymes are responsible for the conversion of chorismate into para-aminobenzoate (PABA). Two C. tyrobutyircum strains whose genome was analyzed in silico as well as other 10 strains isolated from cheese were tested in liquid media to confirm these observations. 11 strains showed growth in a defined liquid medium containing biotin and PABA after 6-8 days of incubation. No strain showed growth when only one or none of these compounds were added, confirming the observations obtained in silico. Furthermore, the genome analysis was extended to genomes of single strains of other Clostridium species potentially causing late blowing, namely Clostridium beijerinckii, Clostridium sporogenes and Clostridium butyricum. Only the biotin biosynthesis pathway was incomplete for C. butyricum and C. beijerincki. In contrast, C. sporogenes showed missing enzymes in biosynthesis pathways of several amino acids as well

  8. Genome trees constructed using five different approaches suggest new major bacterial clades

    Directory of Open Access Journals (Sweden)

    Tatusov Roman L

    2001-10-01

    Full Text Available Abstract Background The availability of multiple complete genome sequences from diverse taxa prompts the development of new phylogenetic approaches, which attempt to incorporate information derived from comparative analysis of complete gene sets or large subsets thereof. Such attempts are particularly relevant because of the major role of horizontal gene transfer and lineage-specific gene loss, at least in the evolution of prokaryotes. Results Five largely independent approaches were employed to construct trees for completely sequenced bacterial and archaeal genomes: i presence-absence of genomes in clusters of orthologous genes; ii conservation of local gene order (gene pairs among prokaryotic genomes; iii parameters of identity distribution for probable orthologs; iv analysis of concatenated alignments of ribosomal proteins; v comparison of trees constructed for multiple protein families. All constructed trees support the separation of the two primary prokaryotic domains, bacteria and archaea, as well as some terminal bifurcations within the bacterial and archaeal domains. Beyond these obvious groupings, the trees made with different methods appeared to differ substantially in terms of the relative contributions of phylogenetic relationships and similarities in gene repertoires caused by similar life styles and horizontal gene transfer to the tree topology. The trees based on presence-absence of genomes in orthologous clusters and the trees based on conserved gene pairs appear to be strongly affected by gene loss and horizontal gene transfer. The trees based on identity distributions for orthologs and particularly the tree made of concatenated ribosomal protein sequences seemed to carry a stronger phylogenetic signal. The latter tree supported three potential high-level bacterial clades,: i Chlamydia-Spirochetes, ii Thermotogales-Aquificales (bacterial hyperthermophiles, and ii Actinomycetes-Deinococcales-Cyanobacteria. The latter group also

  9. From the double-helix to novel approaches to the sequencing of large genomes.

    Science.gov (United States)

    Szybalski, W

    1993-12-15

    Elucidation of the structure of DNA by Watson and Crick [Nature 171 (1953) 737-738] has led to many crucial molecular experiments, including studies on DNA replication, transcription, physical mapping, and most recently to serious attempts directed toward the sequencing of large genomes [Watson, Science 248 (1990) 44-49]. I am totally convinced of the great importance of the Human Genome Project, and toward achieving this goal I strongly favor 'top-down' approaches consisting of the physical mapping and preparation of contiguous 50-100-kb fragments directly from the genome, followed by their automated sequencing based on the rapid assembly of primers by hexamer ligation together with primer walking. Our 'top-down' procedures totally avoids conventional cloning, subcloning and random sequencing, which are the elements of the present 'bottom-up' procedures. Fragments of 50-100 kb are prepared in sufficient quantities either by in vitro excision with rare-cutting restriction systems (including Achilles' heel cleavage [AC] or the RecA-AC procedures of Koob et al. [Nucleic Acids Res. 20 (1992) 5831-5836]) or by in vivo excision and amplification using the yeast FRT/Flp system or the phage lambda att/Int system. Such fragments, when derived directly from the Escherichia coli genome, are arranged in consecutive order, so that 50 specially constructed strains of E. coli would supply 50 end-to-end arranged approx. 100-kb fragments, which will cover the entire approx. 5-Mb E. coli genome. For the 150-Mb Drosophila melanogaster genome, 1500 of such consecutive 100-kb fragments (supplied by 1500 strains) are required to cover the entire genome. The fragments will be sequenced by the SPEL-6 method involving hexamer ligation [Szybalski, Gene 90 (1990) 177-178; Fresenius J. Anal. Chem. 4 (1992) 343] and primer walking. The 18-mer primers are synthesized in only a few minutes from three contiguous hexamers annealed to the DNA strand to be sequenced when using an over 100-fold

  10. A mixed-integer linear programming approach to the reduction of genome-scale metabolic networks.

    Science.gov (United States)

    Röhl, Annika; Bockmayr, Alexander

    2017-01-03

    Constraint-based analysis has become a widely used method to study metabolic networks. While some of the associated algorithms can be applied to genome-scale network reconstructions with several thousands of reactions, others are limited to small or medium-sized models. In 2015, Erdrich et al. introduced a method called NetworkReducer, which reduces large metabolic networks to smaller subnetworks, while preserving a set of biological requirements that can be specified by the user. Already in 2001, Burgard et al. developed a mixed-integer linear programming (MILP) approach for computing minimal reaction sets under a given growth requirement. Here we present an MILP approach for computing minimum subnetworks with the given properties. The minimality (with respect to the number of active reactions) is not guaranteed by NetworkReducer, while the method by Burgard et al. does not allow specifying the different biological requirements. Our procedure is about 5-10 times faster than NetworkReducer and can enumerate all minimum subnetworks in case there exist several ones. This allows identifying common reactions that are present in all subnetworks, and reactions appearing in alternative pathways. Applying complex analysis methods to genome-scale metabolic networks is often not possible in practice. Thus it may become necessary to reduce the size of the network while keeping important functionalities. We propose a MILP solution to this problem. Compared to previous work, our approach is more efficient and allows computing not only one, but even all minimum subnetworks satisfying the required properties.

  11. The Preference for Anterior Approach Major Hepatectomy: Experience Over 3 Decades and a Propensity Score-Matching Analysis in Right Hepatectomy for Hepatocellular Carcinoma.

    Science.gov (United States)

    Chan, Kun-Ming; Wang, Yu-Chao; Wu, Tsung-Han; Lee, Chen-Fang; Wu, Ting-Jung; Chou, Hong-Shiue; Yu, Ming-Chin; Lee, Wei-Chen

    2015-08-01

    Surgical treatment for primary hepatocellular carcinoma (HCC) has progressed enormously over time. The aim of this study was to analyze the evolution of surgical techniques and outcomes of patients undergoing major right hepatectomy (RH) over the last few decades.A retrospective review of 557 consecutive patients who had undergone RH for HCC between January 1982 and December 2011 was performed. Patients were categorized into subgroups and analyzed according to period and surgical approach to hepatectomy. Based on a propensity score-matching model, the surgical approach in patients in the second period was also analyzed in terms of anterior approach (AA) and conventional approach (CA)-RH.Tumor factors remained the most important prognostic factors related to postoperative HCC recurrence throughout the 2 periods examined in this study. Comparison of patients selected by a propensity score-matching model showed that AA-RH led to significantly better outcomes including recurrence-free survival (RFS) (P = 0.011) and overall survival (OS) (P = 0.012) in patients with HCC as compared with CA-RH. The 5-year RFS and OS were 33.4% and 52.2% after AA-RH, and 21.0% and 36.5% after CA-RH.Major hepatectomy has evolved into a safe procedure that can be performed with confidence. RH by an AA has shown several advantages over CA-RH, and can thus be recommended as the standard procedure for liver resection in patients who require right hepatectomy.

  12. A hidden Markov model approach for determining expression from genomic tiling micro arrays

    DEFF Research Database (Denmark)

    Terkelsen, Kasper Munch; Gardner, P. P.; Arctander, Peter;

    2006-01-01

    HMM, that adaptively models tiling data prior to predicting expression on genomic sequence. A hidden Markov model (HMM) is used to model the distributions of tiling array probe scores in expressed and non-expressed regions. The HMM is trained on sets of probes mapped to regions of annotated expression and non......]. Results can be downloaded and viewed from our web site [2]. Conclusion The value of adaptive modelling of fluorescence scores prior to categorisation into expressed and non-expressed probes is demonstrated. Our results indicate that our adaptive approach is superior to the previous analysis in terms...

  13. Genome-first approach diagnosed Cabezas syndrome via novel CUL4B mutation detection.

    Science.gov (United States)

    Okamoto, Nobuhiko; Watanabe, Miki; Naruto, Takuya; Matsuda, Keiko; Kohmoto, Tomohiro; Saito, Masako; Masuda, Kiyoshi; Imoto, Issei

    2017-01-01

    Cabezas syndrome is a syndromic form of X-linked intellectual disability primarily characterized by a short stature, hypogonadism and abnormal gait, with other variable features resulting from mutations in the CUL4B gene. Here, we report a clinically undiagnosed 5-year-old male with severe intellectual disability. A genome-first approach using targeted exome sequencing identified a novel nonsense mutation [NM_003588.3:c.2698G>T, p.(Glu900*)] in the last coding exon of CUL4B, thus diagnosing this patient with Cabezas syndrome.

  14. A Hybrid Model Approach for Achieving the Highest Level of Matching Between the “Print and Original” in the Sheet-Fed Offset Process

    Directory of Open Access Journals (Sweden)

    Rajendrakumar Anayath

    2015-09-01

    Full Text Available This study was designed to explore an optimized hybrid system through dual measurement approach of contrast measurement method and CIE L*a*b* measurement method to arrive a logical interface for achieving the highest matching between the print and original in sheet fed offset printing process. The work was started with the detailed study of contrast measurement, CIE L*a*b* measurement method, visual and computational intelligence based assessment and control. The master was designed in such a way that value of density and other values can be measured easily in effective way. X-Rite eXact instrument is used for capturing the readings, the 10 standard observers were selected initially by manual method and finally by using on line Fransworth-Munsell 100 Hue Colour Vision Test and visual assessments and judgement were done on X-Rite’s Macbeth Lighting Booth and their visual assessments are recorded, analyzed and represented. All the parameters like Paper, Ink, Measuring Conditions and Measuring Instruments etc. were maintained in compliance with ISO specifications. After the analysis of data, the study reached to the following main conclusion i.e. It is observed that almost in all the cases, preference of the Standard Observers are for Delta E Value leading to the higher contrast value instead of Minimum Delta E with less contrast value. Therefore the dual approach of “Hybrid System” will help us to arrive at the most convincing, measurable and perceptually acceptable print result which will be the highest (closest match to the original by clubbing the best of “CIE Lab, Contrast Method and Standard Observers Visual Perception” as there are positives and negatives in all these three when one approaches individually. The objective of this paper was to develop a dual measurement approach of CIE Lab & Contrast to arrive at a logical inference for the highest level of matching between the original and print in the sheet fed offset printing.

  15. Polytrauma Defined by the New Berlin Definition: A Validation Test Based on Propensity-Score Matching Approach.

    Science.gov (United States)

    Rau, Cheng-Shyuan; Wu, Shao-Chun; Kuo, Pao-Jen; Chen, Yi-Chun; Chien, Peng-Chen; Hsieh, Hsiao-Yun; Hsieh, Ching-Hua

    2017-09-11

    Background: Polytrauma patients are expected to have a higher risk of mortality than that obtained by the summation of expected mortality owing to their individual injuries. This study was designed to investigate the outcome of patients with polytrauma, which was defined using the new Berlin definition, as cases with an Abbreviated Injury Scale (AIS) ≥ 3 for two or more different body regions and one or more additional variables from five physiologic parameters (hypotension [systolic blood pressure ≤ 90 mmHg], unconsciousness [Glasgow Coma Scale score ≤ 8], acidosis [base excess ≤ -6.0], coagulopathy [partial thromboplastin time ≥ 40 s or international normalized ratio ≥ 1.4], and age [≥70 years]). Methods: We retrieved detailed data on 369 polytrauma patients and 1260 non-polytrauma patients with an overall Injury Severity Score (ISS) ≥ 18 who were hospitalized between 1 January 2009 and 31 December 2015 for the treatment of all traumatic injuries, from the Trauma Registry System at a level I trauma center. Patients with burn injury or incomplete registered data were excluded. Categorical data were compared with two-sided Fisher exact or Pearson chi-square tests. The unpaired Student t-test and the Mann-Whitney U-test was used to analyze normally distributed continuous data and non-normally distributed data, respectively. Propensity-score matched cohort in a 1:1 ratio was allocated using the NCSS software with logistic regression to evaluate the effect of polytrauma on patient outcomes. Results: The polytrauma patients had a significantly higher ISS than non-polytrauma patients (median (interquartile range Q1-Q3), 29 (22-36) vs. 24 (20-25), respectively; p propensity score-matched pairs of polytrauma and non-polytrauma patients who showed no significant difference in sex, age, co-morbidity, AIS ≥ 3, and Injury Severity Score (ISS), the polytrauma patients had a significantly higher mortality rate (OR 17.5, 95% CI 4.21-72.76; p propensity

  16. Comparison of endoscopic endonasal and bifrontal craniotomy approaches for olfactory groove meningiomas: A matched pair analysis of outcomes and frontal lobe changes on MRI.

    Science.gov (United States)

    de Almeida, John R; Carvalho, Felipe; Vaz Guimaraes Filho, Francisco; Kiehl, Tim-Rasmus; Koutourousiou, Maria; Su, Shirley; Vescan, Allan D; Witterick, Ian J; Zadeh, Gelareh; Wang, Eric W; Fernandez-Miranda, Juan C; Gardner, Paul A; Gentili, Fred; Snyderman, Carl H

    2015-11-01

    We compare the outcomes and postoperative MRI changes of endoscopic endonasal (EEA) and bifrontal craniotomy (BFC) approaches for olfactory groove meningiomas (OGM). All patients who underwent either BFC or EEA for OGM were eligible. Matched pairs were created by matching tumor volumes of an EEA patient with a BFC patient, and matching the timing of the postoperative scans. The tumor dimensions, peritumoral edema, resectability issues, and frontal lobe changes were recorded based on preoperative and postoperative MRI. Postoperative fluid-attenuated inversion recovery (FLAIR) hyperintensity and residual cystic cavity (porencephalic cave) volume were compared using univariable and multivariable analyses. From a total of 70 patients (46 EEA, 24 BFC), 10 matched pairs (20 patients) were created. Three patients (30%) in the EEA group and two (20%) in the BFC had postoperative cerebrospinal fluid leaks (p=0.61). Gross total resections were achieved in seven (70%) of the EEA group and nine (90%) of the BFC group (p=0.26), and one patient from each group developed a recurrence. On postoperative MRI, there was no significant difference in FLAIR signal volumes between EEA and BFC approaches (6.9 versus 13.3 cm(3); p=0.17) or in porencephalic cave volumes (1.7 versus 5.0 cm(3); p=0.11) in univariable analysis. However, in a multivariable analysis, EEA was associated with less postoperative FLAIR change (p=0.02) after adjusting for the volume of preoperative edema. This study provides preliminary evidence that EEA is associated with quantifiable improvements in postoperative frontal lobe imaging.

  17. Towards 3D Face Recognition in the Real: A Registration-Free Approach Using Fine-Grained Matching of 3D Keypoint Descriptors

    KAUST Repository

    Li, Huibin

    2014-11-12

    Registration algorithms performed on point clouds or range images of face scans have been successfully used for automatic 3D face recognition under expression variations, but have rarely been investigated to solve pose changes and occlusions mainly since that the basic landmarks to initialize coarse alignment are not always available. Recently, local feature-based SIFT-like matching proves competent to handle all such variations without registration. In this paper, towards 3D face recognition for real-life biometric applications, we significantly extend the SIFT-like matching framework to mesh data and propose a novel approach using fine-grained matching of 3D keypoint descriptors. First, two principal curvature-based 3D keypoint detectors are provided, which can repeatedly identify complementary locations on a face scan where local curvatures are high. Then, a robust 3D local coordinate system is built at each keypoint, which allows extraction of pose-invariant features. Three keypoint descriptors, corresponding to three surface differential quantities, are designed, and their feature-level fusion is employed to comprehensively describe local shapes of detected keypoints. Finally, we propose a multi-task sparse representation based fine-grained matching algorithm, which accounts for the average reconstruction error of probe face descriptors sparsely represented by a large dictionary of gallery descriptors in identification. Our approach is evaluated on the Bosphorus database and achieves rank-one recognition rates of 96.56, 98.82, 91.14, and 99.21 % on the entire database, and the expression, pose, and occlusion subsets, respectively. To the best of our knowledge, these are the best results reported so far on this database. Additionally, good generalization ability is also exhibited by the experiments on the FRGC v2.0 database.

  18. Integrating experimental and analytic approaches to improve data quality in genome-wide RNAi screens.

    Science.gov (United States)

    Zhang, Xiaohua Douglas; Espeseth, Amy S; Johnson, Eric N; Chin, Jayne; Gates, Adam; Mitnaul, Lyndon J; Marine, Shane D; Tian, Jenny; Stec, Eric M; Kunapuli, Priya; Holder, Dan J; Heyse, Joseph F; Strulovici, Berta; Ferrer, Marc

    2008-06-01

    RNA interference (RNAi) not only plays an important role in drug discovery but can also be developed directly into drugs. RNAi high-throughput screening (HTS) biotechnology allows us to conduct genome-wide RNAi research. A central challenge in genome-wide RNAi research is to integrate both experimental and computational approaches to obtain high quality RNAi HTS assays. Based on our daily practice in RNAi HTS experiments, we propose the implementation of 3 experimental and analytic processes to improve the quality of data from RNAi HTS biotechnology: (1) select effective biological controls; (2) adopt appropriate plate designs to display and/or adjust for systematic errors of measurement; and (3) use effective analytic metrics to assess data quality. The applications in 5 real RNAi HTS experiments demonstrate the effectiveness of integrating these processes to improve data quality. Due to the effectiveness in improving data quality in RNAi HTS experiments, the methods and guidelines contained in the 3 experimental and analytic processes are likely to have broad utility in genome-wide RNAi research.

  19. Genome-scale modeling of human metabolism - a systems biology approach.

    Science.gov (United States)

    Mardinoglu, Adil; Gatto, Francesco; Nielsen, Jens

    2013-09-01

    Altered metabolism is linked to the appearance of various human diseases and a better understanding of disease-associated metabolic changes may lead to the identification of novel prognostic biomarkers and the development of new therapies. Genome-scale metabolic models (GEMs) have been employed for studying human metabolism in a systematic manner, as well as for understanding complex human diseases. In the past decade, such metabolic models - one of the fundamental aspects of systems biology - have started contributing to the understanding of the mechanistic relationship between genotype and phenotype. In this review, we focus on the construction of the Human Metabolic Reaction database, the generation of healthy cell type- and cancer-specific GEMs using different procedures, and the potential applications of these developments in the study of human metabolism and in the identification of metabolic changes associated with various disorders. We further examine how in silico genome-scale reconstructions can be employed to simulate metabolic flux distributions and how high-throughput omics data can be analyzed in a context-dependent fashion. Insights yielded from this mechanistic modeling approach can be used for identifying new therapeutic agents and drug targets as well as for the discovery of novel biomarkers. Finally, recent advancements in genome-scale modeling and the future challenge of developing a model of whole-body metabolism are presented. The emergent contribution of GEMs to personalized and translational medicine is also discussed. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. Novel approach for deriving genome wide SNP analysis data from archived blood spots

    Directory of Open Access Journals (Sweden)

    Fowler Katie E

    2012-09-01

    Full Text Available Abstract Background The ability to transport and store DNA at room temperature in low volumes has the advantage of optimising cost, time and storage space. Blood spots on adapted filter papers are popular for this, with FTA (Flinders Technology Associates Whatman™TM technology being one of the most recent. Plant material, plasmids, viral particles, bacteria and animal blood have been stored and transported successfully using this technology, however the method of porcine DNA extraction from FTA Whatman™TM cards is a relatively new approach, allowing nucleic acids to be ready for downstream applications such as PCR, whole genome amplification, sequencing and subsequent application to single nucleotide polymorphism microarrays has hitherto been under-explored. Findings DNA was extracted from FTA Whatman™TM cards (following adaptations of the manufacturer’s instructions, whole genome amplified and subsequently analysed to validate the integrity of the DNA for downstream SNP analysis. DNA was successfully extracted from 288/288 samples and amplified by WGA. Allele dropout post WGA, was observed in less than 2% of samples and there was no clear evidence of amplification bias nor contamination. Acceptable call rates on porcine SNP chips were also achieved using DNA extracted and amplified in this way. Conclusions DNA extracted from FTA Whatman cards is of a high enough quality and quantity following whole genomic amplification to perform meaningful SNP chip studies.

  1. A novel scan statistics approach for clustering identification and comparison in binary genomic data.

    Science.gov (United States)

    Pellin, Danilo; Di Serio, Clelia

    2016-09-22

    In biomedical research a relevant issue is to identify time intervals or portions of a n-dimensional support where a particular event of interest is more likely to occur than expected. Algorithms that require to specify a-priori number/dimension/length of clusters assumed for the data suffer from a high degree of arbitrariness whenever no precise information are available, and this may strongly affect final estimation on parameters. Within this framework, spatial scan-statistics have been proposed in the literature, representing a valid non-parametric alternative. We adapt the so called Bernoulli-model scan statistic to the genomic field and we propose a multivariate extension, named Relative Scan Statistics, for the comparison of two series of Bernoulli r.v. defined over a common support, with the final goal of highlighting unshared event rate variations. Using a probabilistic approach based on success probability estimates and comparison (likelihood based), we can exploit an hypothesis testing procedure to identify clusters and relative clusters. Both the univariate and the novel multivariate extension of the scan statistic confirm previously published findings. The method described in the paper represents a challenging application of scan statistics framework to problem related to genomic data. From a biological perspective, these tools offer the possibility to clinicians and researcher to improve their knowledge on viral vectors integrations process, allowing to focus their attention to restricted over-targeted portion of the genome.

  2. A hidden Markov model approach for determining expression from genomic tiling micro arrays

    Directory of Open Access Journals (Sweden)

    Krogh Anders

    2006-05-01

    Full Text Available Abstract Background Genomic tiling micro arrays have great potential for identifying previously undiscovered coding as well as non-coding transcription. To-date, however, analyses of these data have been performed in an ad hoc fashion. Results We present a probabilistic procedure, ExpressHMM, that adaptively models tiling data prior to predicting expression on genomic sequence. A hidden Markov model (HMM is used to model the distributions of tiling array probe scores in expressed and non-expressed regions. The HMM is trained on sets of probes mapped to regions of annotated expression and non-expression. Subsequently, prediction of transcribed fragments is made on tiled genomic sequence. The prediction is accompanied by an expression probability curve for visual inspection of the supporting evidence. We test ExpressHMM on data from the Cheng et al. (2005 tiling array experiments on ten Human chromosomes 1. Results can be downloaded and viewed from our web site 2. Conclusion The value of adaptive modelling of fluorescence scores prior to categorisation into expressed and non-expressed probes is demonstrated. Our results indicate that our adaptive approach is superior to the previous analysis in terms of nucleotide sensitivity and transfrag specificity.

  3. Genome-wide identification of Schistosoma japonicum microRNAs using a deep-sequencing approach.

    Directory of Open Access Journals (Sweden)

    Jian Huang

    Full Text Available BACKGROUND: Human schistosomiasis is one of the most prevalent and serious parasitic diseases worldwide. Schistosoma japonicum is one of important pathogens of this disease. MicroRNAs (miRNAs are a large group of non-coding RNAs that play important roles in regulating gene expression and protein translation in animals. Genome-wide identification of miRNAs in a given organism is a critical step to facilitating our understanding of genome organization, genome biology, evolution, and posttranscriptional regulation. METHODOLOGY/PRINCIPAL FINDINGS: We sequenced two small RNA libraries prepared from different stages of the life cycle of S. japonicum, immature schistosomula and mature pairing adults, through a deep DNA sequencing approach, which yielded approximately 12 million high-quality short sequence reads containing a total of approximately 2 million non-redundant tags. Based on a bioinformatics pipeline, we identified 176 new S. japonicum miRNAs, of which some exhibited a differential pattern of expression between the two stages. Although 21 S. japonicum miRNAs are orthologs of known miRNAs within the metazoans, some nucleotides at many positions of Schistosoma miRNAs, such as miR-8, let-7, miR-10, miR-31, miR-92, miR-124, and miR-125, are indeed significantly distinct from other bilaterian orthologs. In addition, both miR-71 and some miR-2 family members in tandem are found to be clustered in a reversal direction model on two genomic loci, and two pairs of novel S. japonicum miRNAs were derived from sense and antisense DNA strands at the same genomic loci. CONCLUSIONS/SIGNIFICANCE: The collection of S. japonicum miRNAs could be used as a new platform to study the genomic structure, gene regulation and networks, evolutionary processes, development, and host-parasite interactions. Some S. japonicum miRNAs and their clusters could represent the ancestral forms of the conserved orthologues and a model for the genesis of novel miRNAs.

  4. The direction of causality between exports and firm performance: microeconomic evidence from Croatia using the matching approach

    Directory of Open Access Journals (Sweden)

    Miljana Valdec

    2015-03-01

    Full Text Available This paper contributes to the literature by using propensity score matching to test for causal effects of starting to export on firm performance in Croatian manufacturing firm-level data. The results confirm that exporters have characteristics superior to those of non-exporters. In the main sample specification there is pervasive evidence of self-selection into export markets, meaning that firms are successful years before they become exporters. Using multiple firm performance indicators, panel and cross section data models together with various sample specifications there is scant evidence on learning-by-exporting which holds true only in a few cases. On the other hand, higher sales growth is found to be a more conclusive distinguishing characteristic of new exporters. As in similar studies, we find that a part of the results depends on the number of export starters in the estimation sample.

  5. Identification and validation of specific markers of Bacillus anthracis spores by proteomics and genomics approaches.

    Science.gov (United States)

    Chenau, Jérôme; Fenaille, François; Caro, Valérie; Haustant, Michel; Diancourt, Laure; Klee, Silke R; Junot, Christophe; Ezan, Eric; Goossens, Pierre L; Becher, François

    2014-03-01

    Bacillus anthracis is the causative bacteria of anthrax, an acute and often fatal disease in humans. The infectious agent, the spore, represents a real bioterrorism threat and its specific identification is crucial. However, because of the high genomic relatedness within the Bacillus cereus group, it is still a real challenge to identify B. anthracis spores confidently. Mass spectrometry-based tools represent a powerful approach to the efficient discovery and identification of such protein markers. Here we undertook comparative proteomics analyses of Bacillus anthracis, cereus and thuringiensis spores to identify proteoforms unique to B. anthracis. The marker discovery pipeline developed combined peptide- and protein-centric approaches using liquid chromatography coupled to tandem mass spectrometry experiments using a high resolution/high mass accuracy LTQ-Orbitrap instrument. By combining these data with those from complementary bioinformatics approaches, we were able to highlight a dozen novel proteins consistently observed across all the investigated B. anthracis spores while being absent in B. cereus/thuringiensis spores. To further demonstrate the relevance of these markers and their strict specificity to B. anthracis, the number of strains studied was extended to 55, by including closely related strains such as B. thuringiensis 9727, and above all the B. cereus biovar anthracis CI, CA strains that possess pXO1- and pXO2-like plasmids. Under these conditions, the combination of proteomics and genomics approaches confirms the pertinence of 11 markers. Genes encoding these 11 markers are located on the chromosome, which provides additional targets complementary to the commonly used plasmid-encoded markers. Last but not least, we also report the development of a targeted liquid chromatography coupled to tandem mass spectrometry method involving the selection reaction monitoring mode for the monitoring of the 4 most suitable protein markers. Within a proof

  6. Genome-scale Metabolic Reaction Modeling: a New Approach to Geomicrobial Kinetics

    Science.gov (United States)

    McKernan, S. E.; Shapiro, B.; Jin, Q.

    2014-12-01

    Geomicrobial rates, rates of microbial metabolism in natural environments, are a key parameter of theoretical and practical problems in geobiology and biogeochemistry. Both laboratory- and field-based approaches have been applied to study rates of geomicrobial processes. Laboratory-based approaches analyze geomicrobial kinetics by incubating environmental samples under controlled laboratory conditions. Field methods quantify geomicrobial rates by observing the progress of geomicrobial processes. To take advantage of recent development in biogeochemical modeling and genome-scale metabolic modeling, we suggest that geomicrobial rates can also be predicted by simulating metabolic reaction networks of microbes. To predict geomicrobial rates, we developed a genome-scale metabolic model that describes enzyme reaction networks of microbial metabolism, and simulated the network model by accounting for the kinetics and thermodynamics of enzyme reactions. The model is simulated numerically to solve cellular enzyme abundance and hence metabolic rates under the constraints of cellular physiology. The new modeling approach differs from flux balance analysis of system biology in that it accounts for the thermodynamics and kinetics of enzymatic reactions. It builds on subcellular metabolic reaction networks, and hence also differs from classical biogeochemical reaction modeling. We applied the new approach to Methanosarcina acetivorans, an anaerobic, marine methanogen capable of disproportionating acetate to carbon dioxide and methane. The input of the new model includes (1) enzyme reaction network of acetoclastic methanogenesis, and (2) representative geochemical conditions of freshwater sedimentary environments. The output of the simulation includes the proteomics, metabolomics, and energy and matter fluxes of M. acetivorans. Our simulation results demonstrate the predictive power of the new modeling approach. Specifically, the results illustrate how methanogenesis rates vary

  7. One-Match and All-Match Categories for Keywords Matching in Chatbot

    Directory of Open Access Journals (Sweden)

    Abbas S. Lokman

    2010-01-01

    Full Text Available Problem statement: Artificial intelligence chatbot is a technology that makes interactions between men and machines using natural language possible. From literature of chatbots keywords/pattern matching techniques, potential issues for improvement had been discovered. The discovered issues are in the context of keywords arrangement for matching precedence and keywords variety for matching flexibility. Approach: Combining previous techniques/mechanisms with some additional adjustment, new technique to be used for keywords matching process is proposed. Using newly developed chatbot named ViDi (abbreviation for Virtual Diabetes physician which is a chatbot for diabetes education activity as a testing medium, the proposed technique named One-Match and All-Match Categories (OMAMC is being used to test the creation of possible keywords surrounding one sample input sentence. The result for possible keywords created by this technique then being compared to possible keywords created by previous chatbots techniques surrounding the same sample sentence in matching precedence and matching flexibility context. Results: OMAMC technique is found to be improving previous matching techniques in matching precedence and flexibility context. This improvement is seen to be useful for shortening matching time and widening matching flexibility within the chatbots keywords matching process. Conclusion: OMAMC for keywords matching in chatbot is shown to be an improvement over previous techniques in the context of keywords arrangement for matching precedence and keywords variety for matching flexibility.

  8. Estimating variable effective population sizes from multiple genomes: a sequentially markov conditional sampling distribution approach.

    Science.gov (United States)

    Sheehan, Sara; Harris, Kelley; Song, Yun S

    2013-07-01

    Throughout history, the population size of modern humans has varied considerably due to changes in environment, culture, and technology. More accurate estimates of population size changes, and when they occurred, should provide a clearer picture of human colonization history and help remove confounding effects from natural selection inference. Demography influences the pattern of genetic variation in a population, and thus genomic data of multiple individuals sampled from one or more present-day populations contain valuable information about the past demographic history. Recently, Li and Durbin developed a coalescent-based hidden Markov model, called the pairwise sequentially Markovian coalescent (PSMC), for a pair of chromosomes (or one diploid individual) to estimate past population sizes. This is an efficient, useful approach, but its accuracy in the very recent past is hampered by the fact that, because of the small sample size, only few coalescence events occur in that period. Multiple genomes from the same population contain more information about the recent past, but are also more computationally challenging to study jointly in a coalescent framework. Here, we present a new coalescent-based method that can efficiently infer population size changes from multiple genomes, providing access to a new store of information about the recent past. Our work generalizes the recently developed sequentially Markov conditional sampling distribution framework, which provides an accurate approximation of the probability of observing a newly sampled haplotype given a set of previously sampled haplotypes. Simulation results demonstrate that we can accurately reconstruct the true population histories, with a significant improvement over the PSMC in the recent past. We apply our method, called diCal, to the genomes of multiple human individuals of European and African ancestry to obtain a detailed population size change history during recent times.

  9. A genomic pathway approach to a complex disease: axon guidance and Parkinson disease.

    Science.gov (United States)

    Lesnick, Timothy G; Papapetropoulos, Spiridon; Mash, Deborah C; Ffrench-Mullen, Jarlath; Shehadeh, Lina; de Andrade, Mariza; Henley, John R; Rocca, Walter A; Ahlskog, J Eric; Maraganore, Demetrius M

    2007-06-01

    While major inroads have been made in identifying the genetic causes of rare Mendelian disorders, little progress has been made in the discovery of common gene variations that predispose to complex diseases. The single gene variants that have been shown to associate reproducibly with complex diseases typically have small effect sizes or attributable risks. However, the joint actions of common gene variants within pathways may play a major role in predisposing to complex diseases (the paradigm of complex genetics). The goal of this study was to determine whether polymorphism in a candidate pathway (axon guidance) predisposed to a complex disease (Parkinson disease [PD]). We mined a whole-genome association dataset and identified single nucleotide polymorphisms (SNPs) that were within axon-guidance pathway genes. We then constructed models of axon-guidance pathway SNPs that predicted three outcomes: PD susceptibility (odds ratio = 90.8, p = 4.64 x 10(-38)), survival free of PD (hazards ratio = 19.0, p = 5.43 x 10(-48)), and PD age at onset (R(2) = 0.68, p = 1.68 x 10(-51)). By contrast, models constructed from thousands of random selections of genomic SNPs predicted the three PD outcomes poorly. Mining of a second whole-genome association dataset and mining of an expression profiling dataset also supported a role for many axon-guidance pathway genes in PD. These findings could have important implications regarding the pathogenesis of PD. This genomic pathway approach may also offer insights into other complex diseases such as Alzheimer disease, diabetes mellitus, nicotine and alcohol dependence, and several cancers.

  10. A genomic pathway approach to a complex disease: axon guidance and Parkinson disease.

    Directory of Open Access Journals (Sweden)

    Timothy G Lesnick

    2007-06-01

    Full Text Available While major inroads have been made in identifying the genetic causes of rare Mendelian disorders, little progress has been made in the discovery of common gene variations that predispose to complex diseases. The single gene variants that have been shown to associate reproducibly with complex diseases typically have small effect sizes or attributable risks. However, the joint actions of common gene variants within pathways may play a major role in predisposing to complex diseases (the paradigm of complex genetics. The goal of this study was to determine whether polymorphism in a candidate pathway (axon guidance predisposed to a complex disease (Parkinson disease [PD]. We mined a whole-genome association dataset and identified single nucleotide polymorphisms (SNPs that were within axon-guidance pathway genes. We then constructed models of axon-guidance pathway SNPs that predicted three outcomes: PD susceptibility (odds ratio = 90.8, p = 4.64 x 10(-38, survival free of PD (hazards ratio = 19.0, p = 5.43 x 10(-48, and PD age at onset (R(2 = 0.68, p = 1.68 x 10(-51. By contrast, models constructed from thousands of random selections of genomic SNPs predicted the three PD outcomes poorly. Mining of a second whole-genome association dataset and mining of an expression profiling dataset also supported a role for many axon-guidance pathway genes in PD. These findings could have important implications regarding the pathogenesis of PD. This genomic pathway approach may also offer insights into other complex diseases such as Alzheimer disease, diabetes mellitus, nicotine and alcohol dependence, and several cancers.

  11. Discovery and annotation of small proteins using genomics, proteomics and computational approaches

    Energy Technology Data Exchange (ETDEWEB)

    Yang, Xiaohan; Tschaplinski, Timothy J.; Hurst, Gregory B.; Jawdy, Sara; Abraham, Paul E.; Lankford, Patricia K.; Adams, Rachel M.; Shah, Manesh B.; Hettich, Robert L.; Lindquist, Erika; Kalluri, Udaya C.; Gunter, Lee E.; Pennacchio, Christa; Tuskan, Gerald A.

    2011-03-02

    Small proteins (10 200 amino acids aa in length) encoded by short open reading frames (sORF) play important regulatory roles in various biological processes, including tumor progression, stress response, flowering, and hormone signaling. However, ab initio discovery of small proteins has been relatively overlooked. Recent advances in deep transcriptome sequencing make it possible to efficiently identify sORFs at the genome level. In this study, we obtained 2.6 million expressed sequence tag (EST) reads from Populus deltoides leaf transcriptome and reconstructed full-length transcripts from the EST sequences. We identified an initial set of 12,852 sORFs encoding proteins of 10 200 aa in length. Three computational approaches were then used to enrich for bona fide protein-coding sORFs from the initial sORF set: (1) codingpotential prediction, (2) evolutionary conservation between P. deltoides and other plant species, and (3) gene family clustering within P. deltoides. As a result, a high-confidence sORF candidate set containing 1469 genes was obtained. Analysis of the protein domains, non-protein-coding RNA motifs, sequence length distribution, and protein mass spectrometry data supported this high-confidence sORF set. In the high-confidence sORF candidate set, known protein domains were identified in 1282 genes (higher-confidence sORF candidate set), out of which 611 genes, designated as highest-confidence candidate sORF set, were supported by proteomics data. Of the 611 highest-confidence candidate sORF genes, 56 were new to the current Populus genome annotation. This study not only demonstrates that there are potential sORF candidates to be annotated in sequenced genomes, but also presents an efficient strategy for discovery of sORFs in species with no genome annotation yet available.

  12. Analysis of the Complete Mitochondrial Genome Sequence of the Diploid Cotton Gossypium raimondii by Comparative Genomics Approaches

    Directory of Open Access Journals (Sweden)

    Changwei Bi

    2016-01-01

    Full Text Available Cotton is one of the most important economic crops and the primary source of natural fiber and is an important protein source for animal feed. The complete nuclear and chloroplast (cp genome sequences of G. raimondii are already available but not mitochondria. Here, we assembled the complete mitochondrial (mt DNA sequence of G. raimondii into a circular genome of length of 676,078 bp and performed comparative analyses with other higher plants. The genome contains 39 protein-coding genes, 6 rRNA genes, and 25 tRNA genes. We also identified four larger repeats (63.9 kb, 10.6 kb, 9.1 kb, and 2.5 kb in this mt genome, which may be active in intramolecular recombination in the evolution of cotton. Strikingly, nearly all of the G. raimondii mt genome has been transferred to nucleus on Chr1, and the transfer event must be very recent. Phylogenetic analysis reveals that G. raimondii, as a member of Malvaceae, is much closer to another cotton (G. barbadense than other rosids, and the clade formed by two Gossypium species is sister to Brassicales. The G. raimondii mt genome may provide a crucial foundation for evolutionary analysis, molecular biology, and cytoplasmic male sterility in cotton and other higher plants.

  13. A surrogate approach to study the evolution of noncoding DNA elements that organize eukaryotic genomes.

    Science.gov (United States)

    Vermaak, Danielle; Bayes, Joshua J; Malik, Harmit S

    2009-01-01

    Comparative genomics provides a facile way to address issues of evolutionary constraint acting on different elements of the genome. However, several important DNA elements have not reaped the benefits of this new approach. Some have proved intractable to current day sequencing technology. These include centromeric and heterochromatic DNA, which are essential for chromosome segregation as well as gene regulation, but the highly repetitive nature of the DNA sequences in these regions make them difficult to assemble into longer contigs. Other sequences, like dosage compensation X chromosomal sites, origins of DNA replication, or heterochromatic sequences that encode piwi-associated RNAs, have proved difficult to study because they do not have recognizable DNA features that allow them to be described functionally or computationally. We have employed an alternate approach to the direct study of these DNA elements. By using proteins that specifically bind these noncoding DNAs as surrogates, we can indirectly assay the evolutionary constraints acting on these important DNA elements. We review the impact that such "surrogate strategies" have had on our understanding of the evolutionary constraints shaping centromeres, origins of DNA replication, and dosage compensation X chromosomal sites. These have begun to reveal that in contrast to the view that such structural DNA elements are either highly constrained (under purifying selection) or free to drift (under neutral evolution), some of them may instead be shaped by adaptive evolution and genetic conflicts (these are not mutually exclusive). These insights also help to explain why the same elements (e.g., centromeres and replication origins), which are so complex in some eukaryotic genomes, can be simple and well defined in other where similar conflicts do not exist.

  14. Synthetic biology approaches in cancer immunotherapy, genetic network engineering, and genome editing.

    Science.gov (United States)

    Chakravarti, Deboki; Cho, Jang Hwan; Weinberg, Benjamin H; Wong, Nicole M; Wong, Wilson W

    2016-04-18

    Investigations into cells and their contents have provided evolving insight into the emergence of complex biological behaviors. Capitalizing on this knowledge, synthetic biology seeks to manipulate the cellular machinery towards novel purposes, extending discoveries from basic science to new applications. While these developments have demonstrated the potential of building with biological parts, the complexity of cells can pose numerous challenges. In this review, we will highlight the broad and vital role that the synthetic biology approach has played in applying fundamental biological discoveries in receptors, genetic circuits, and genome-editing systems towards translation in the fields of immunotherapy, biosensors, disease models and gene therapy. These examples are evidence of the strength of synthetic approaches, while also illustrating considerations that must be addressed when developing systems around living cells.

  15. Integrating Environmental Genomics and Biogeochemical Models: a Gene-centric Approach

    Science.gov (United States)

    Reed, D. C.; Algar, C. K.; Huber, J. A.; Dick, G.

    2013-12-01

    Rapid advances in molecular microbial ecology have yielded an unprecedented amount of data about the evolutionary relationships and functional traits of microbial communities that regulate global geochemical cycles. Biogeochemical models, however, are trailing in the wake of the environmental genomics revolution and such models rarely incorporate explicit representations of bacteria and archaea, nor are they compatible with nucleic acid or protein sequence data. Here, we present a functional gene-based framework for describing microbial communities in biogeochemical models that uses genomics data and provides predictions that are readily testable using cutting-edge molecular tools. To demonstrate the approach in practice, nitrogen cycling in the Arabian Sea oxygen minimum zone (OMZ) was modelled to examine key questions about cryptic sulphur cycling and dinitrogen production pathways in OMZs. By directly linking geochemical dynamics to the genetic composition of microbial communities, the method provides mechanistic insights into patterns and biogeochemical consequences of marine microbes. Such an approach is critical for informing our understanding of the key role microbes play in modulating Earth's biogeochemistry.

  16. In silico prediction and screening of modular crystal structures via a high-throughput genomic approach

    Science.gov (United States)

    Li, Yi; Li, Xu; Liu, Jiancong; Duan, Fangzheng; Yu, Jihong

    2015-09-01

    High-throughput computational methods capable of predicting, evaluating and identifying promising synthetic candidates with desired properties are highly appealing to today's scientists. Despite some successes, in silico design of crystalline materials with complex three-dimensionally extended structures remains challenging. Here we demonstrate the application of a new genomic approach to ABC-6 zeolites, a family of industrially important catalysts whose structures are built from the stacking of modular six-ring layers. The sequences of layer stacking, which we deem the genes of this family, determine the structures and the properties of ABC-6 zeolites. By enumerating these gene-like stacking sequences, we have identified 1,127 most realizable new ABC-6 structures out of 78 groups of 84,292 theoretical ones, and experimentally realized 2 of them. Our genomic approach can extract crucial structural information directly from these gene-like stacking sequences, enabling high-throughput identification of synthetic targets with desired properties among a large number of candidate structures.

  17. Integrative computational approach for genome-based study of microbial lipid-degrading enzymes.

    Science.gov (United States)

    Vorapreeda, Tayvich; Thammarongtham, Chinae; Laoteng, Kobkul

    2016-07-01

    Lipid-degrading or lipolytic enzymes have gained enormous attention in academic and industrial sectors. Several efforts are underway to discover new lipase enzymes from a variety of microorganisms with particular catalytic properties to be used for extensive applications. In addition, various tools and strategies have been implemented to unravel the functional relevance of the versatile lipid-degrading enzymes for special purposes. This review highlights the study of microbial lipid-degrading enzymes through an integrative computational approach. The identification of putative lipase genes from microbial genomes and metagenomic libraries using homology-based mining is discussed, with an emphasis on sequence analysis of conserved motifs and enzyme topology. Molecular modelling of three-dimensional structure on the basis of sequence similarity is shown to be a potential approach for exploring the structural and functional relationships of candidate lipase enzymes. The perspectives on a discriminative framework of cutting-edge tools and technologies, including bioinformatics, computational biology, functional genomics and functional proteomics, intended to facilitate rapid progress in understanding lipolysis mechanism and to discover novel lipid-degrading enzymes of microorganisms are discussed.

  18. Elastic-net regularization approaches for genome-wide association studies of rheumatoid arthritis.

    Science.gov (United States)

    Cho, Seoae; Kim, Haseong; Oh, Sohee; Kim, Kyunga; Park, Taesung

    2009-12-15

    The current trend in genome-wide association studies is to identify regions where the true disease-causing genes may lie by evaluating thousands of single-nucleotide polymorphisms (SNPs) across the whole genome. However, many challenges exist in detecting disease-causing genes among the thousands of SNPs. Examples include multicollinearity and multiple testing issues, especially when a large number of correlated SNPs are simultaneously tested. Multicollinearity can often occur when predictor variables in a multiple regression model are highly correlated, and can cause imprecise estimation of association. In this study, we propose a simple stepwise procedure that identifies disease-causing SNPs simultaneously by employing elastic-net regularization, a variable selection method that allows one to address multicollinearity. At Step 1, the single-marker association analysis was conducted to screen SNPs. At Step 2, the multiple-marker association was scanned based on the elastic-net regularization. The proposed approach was applied to the rheumatoid arthritis (RA) case-control data set of Genetic Analysis Workshop 16. While the selected SNPs at the screening step are located mostly on chromosome 6, the elastic-net approach identified putative RA-related SNPs on other chromosomes in an increased proportion. For some of those putative RA-related SNPs, we identified the interactions with sex, a well known factor affecting RA susceptibility.

  19. Design and Simulation for Producing Two Amplitude Matched Anti-phase Sine Waveforms Using ±2.5 V CMOS Current-Mode Approach

    Directory of Open Access Journals (Sweden)

    Anil Kumar Sharma,

    2010-08-01

    Full Text Available In this paper the current mode approach called “Current Conveyor (CCII+” has been incorporated to design and simulate the circuit for producing two amplitude matched anti-phase sine waveforms which are frequently used in various communication and instrumentation systems. PSpice simulation has been used to depict the output waveforms. The power supply used is ±2.5 V which can be easily incorporated with CMOS IC technology. The designed circuit has been simulated at variousfrequency ranges and the waveforms are obtained after the circuit is optimized.

  20. Automatic Tuning Matching Cycler (ATMC) In Situ NMR Spectroscopy as a Novel Approach for Real-Time Investigations of Li- and Na-Ion Batteries

    OpenAIRE

    2016-01-01

    This is the author accepted manuscript. The final version is available from Elsevier via http://dx.doi.org/10.1016/j.jmr.2016.02.008 We have developed and explored the use of a new Automatic Tuning Matching Cycler (ATMC) in situ NMR probe system to track the formation of intermediate phases and investigate electrolyte decomposition during electrochemical cycling of Li- and Na-ion batteries (LIBs and NIBs). The new approach addresses many of the issues arising during in situ NMR, e.g., sign...

  1. Short intervals induce superior training adaptations compared with long intervals in cyclists - an effort-matched approach.

    Science.gov (United States)

    Rønnestad, B R; Hansen, J; Vegge, G; Tønnessen, E; Slettaløkken, G

    2015-04-01

    The purpose of this study was to compare the effects of 10 weeks of effort-matched short intervals (SI; n = 9) or long intervals (LI; n = 7) in cyclists. The high-intensity interval sessions (HIT) were performed twice a week interspersed with low-intensity training. There were no differences between groups at pretest. There were no differences between groups in total volume of both HIT and low-intensity training. The SI group achieved a larger relative improvement in VO(2max) than the LI group (8.7% ± 5.0% vs 2.6% ± 5.2%), respectively, P ≤ 0.05). Mean effect size (ES) of the relative improvement in all measured parameters, including performance measured as mean power output during 30-s all-out, 5-min all-out, and 40-min all-out tests revealed a moderate-to-large effect of SI training vs LI training (ES range was 0.86-1.54). These results suggest that the present SI protocol induces superior training adaptations on both the high-power region and lower power region of cyclists' power profile compared with the present LI protocol. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  2. Effect of Prophylactic Antifungal Protocols on the Prognosis of Liver Transplantation: A Propensity Score Matching and Multistate Model Approach

    Science.gov (United States)

    Chen, Yi-Chan; Wang, Yu-Chao; Lee, Chen-Fang; Wu, Ting-Jun; Chou, Hong-Shiue; Chan, Kun-Ming; Lee, Wei-Chen

    2016-01-01

    Background. Whether routine antifungal prophylaxis decreases posttransplantation fungal infections in patients receiving orthotopic liver transplantation (OLT) remains unclear. This study aimed to determine the effectiveness of antifungal prophylaxis for patients receiving OLT. Patients and Methods. This is a retrospective analysis of a database at Chang Gung Memorial Hospital. We have been administering routine antibiotic and prophylactic antifungal regimens to recipients with high model for end-stage liver disease scores (>20) since 2009. After propensity score matching, 402 patients were enrolled. We conducted a multistate model to analyze the cumulative hazards, probability of fungal infections, and risk factors. Results. The cumulative hazards and transition probability of “transplantation to fungal infection” were lower in the prophylaxis group. The incidence rate of fungal infection after OLT decreased from 18.9% to 11.4% (p = 0.052); overall mortality improved from 40.8% to 23.4% (p < 0.001). In the “transplantation to fungal infection” transition, prophylaxis was significantly associated with reduced hazards for fungal infection (hazard ratio: 0.57, 95% confidence interval: 0.34–0.96, p = 0.033). Massive ascites, cadaver transplantation, and older age were significantly associated with higher risks for mortality. Conclusion. Prophylactic antifungal regimens in high-risk recipients might decrease the incidence of posttransplant fungal infections.

  3. Effect of Prophylactic Antifungal Protocols on the Prognosis of Liver Transplantation: A Propensity Score Matching and Multistate Model Approach

    Directory of Open Access Journals (Sweden)

    Yi-Chan Chen

    2016-01-01

    Full Text Available Background. Whether routine antifungal prophylaxis decreases posttransplantation fungal infections in patients receiving orthotopic liver transplantation (OLT remains unclear. This study aimed to determine the effectiveness of antifungal prophylaxis for patients receiving OLT. Patients and Methods. This is a retrospective analysis of a database at Chang Gung Memorial Hospital. We have been administering routine antibiotic and prophylactic antifungal regimens to recipients with high model for end-stage liver disease scores (>20 since 2009. After propensity score matching, 402 patients were enrolled. We conducted a multistate model to analyze the cumulative hazards, probability of fungal infections, and risk factors. Results. The cumulative hazards and transition probability of “transplantation to fungal infection” were lower in the prophylaxis group. The incidence rate of fungal infection after OLT decreased from 18.9% to 11.4% (p=0.052; overall mortality improved from 40.8% to 23.4% (p<0.001. In the “transplantation to fungal infection” transition, prophylaxis was significantly associated with reduced hazards for fungal infection (hazard ratio: 0.57, 95% confidence interval: 0.34–0.96, p=0.033. Massive ascites, cadaver transplantation, and older age were significantly associated with higher risks for mortality. Conclusion. Prophylactic antifungal regimens in high-risk recipients might decrease the incidence of posttransplant fungal infections.

  4. New Markov Model Approaches to Deciphering Microbial Genome Function and Evolution: Comparative Genomics of Laterally Transferred Genes

    Energy Technology Data Exchange (ETDEWEB)

    Borodovsky, M.

    2013-04-11

    Algorithmic methods for gene prediction have been developed and successfully applied to many different prokaryotic genome sequences. As the set of genes in a particular genome is not homogeneous with respect to DNA sequence composition features, the GeneMark.hmm program utilizes two Markov models representing distinct classes of protein coding genes denoted "typical" and "atypical". Atypical genes are those whose DNA features deviate significantly from those classified as typical and they represent approximately 10% of any given genome. In addition to the inherent interest of more accurately predicting genes, the atypical status of these genes may also reflect their separate evolutionary ancestry from other genes in that genome. We hypothesize that atypical genes are largely comprised of those genes that have been relatively recently acquired through lateral gene transfer (LGT). If so, what fraction of atypical genes are such bona fide LGTs? We have made atypical gene predictions for all fully completed prokaryotic genomes; we have been able to compare these results to other "surrogate" methods of LGT prediction.

  5. The long-term differential achievement effects of school socioeconomic composition in primary education: A propensity score matching approach.

    Science.gov (United States)

    Belfi, Barbara; Haelermans, Carla; De Fraine, Bieke

    2016-12-01

    The effects of school socio-economic composition on student achievement growth trajectories have been a hot topic of discussion among politicians around the world for many years. However, the bulk of research investigating school socio-economic composition effects has been limited in important ways. In an attempt to overcome the flaws in earlier research on school socio-economic composition effects, this study used data from a large sample, followed students throughout primary education, addressed selection bias problems, identified the grade(s) in which school socio-economic composition mattered the most, and studied the differential effects of school socio-economic composition by individual socio-economic status (SES). In a longitudinal design with seven occasions of data collection, the authors drew on a sample of N = 3,619 students (age at T1 about 5 years, age at T7 about 12 years) from 151 primary schools in Flanders (the northern part of Belgium). Students in low-, medium-, high-, and mixed-SES schools were matched using propensity scores. To compare students' achievement growth trajectories in the different school compositions, multilevel regression modelling with repeated measurements was applied. The results showed that students had more positive achievement growth in high-SES as compared to low-SES and mixed-SES schools. In two of the three comparisons, students in mixed-SES schools showed the lowest math development. The negative effects of mixed-SES schools on math achievement growth were the strongest for high-SES students. Our findings contribute to the ongoing discussion on the effects of school socio-economic composition on student achievement growth. © 2016 The British Psychological Society.

  6. Clusters versus affinity-based approaches in F. tularensis whole genome search of CTL epitopes.

    Directory of Open Access Journals (Sweden)

    Anat Zvi

    Full Text Available Deciphering the cellular immunome of a bacterial pathogen is challenging due to the enormous number of putative peptidic determinants. State-of-the-art prediction methods developed in recent years enable to significantly reduce the number of peptides to be screened, yet the number of remaining candidates for experimental evaluation is still in the range of ten-thousands, even for a limited coverage of MHC alleles. We have recently established a resource-efficient approach for down selection of candidates and enrichment of true positives, based on selection of predicted MHC binders located in high density "hotspots" of putative epitopes. This cluster-based approach was applied to an unbiased, whole genome search of Francisella tularensis CTL epitopes and was shown to yield a 17-25 fold higher level of responders as compared to randomly selected predicted epitopes tested in Kb/Db C57BL/6 mice. In the present study, we further evaluate the cluster-based approach (down to a lower density range and compare this approach to the classical affinity-based approach by testing putative CTL epitopes with predicted IC(50 values of <10 nM. We demonstrate that while the percent of responders achieved by both approaches is similar, the profile of responders is different, and the predicted binding affinity of most responders in the cluster-based approach is relatively low (geometric mean of 170 nM, rendering the two approaches complimentary. The cluster-based approach is further validated in BALB/c F. tularensis immunized mice belonging to another allelic restriction (Kd/Dd group. To date, the cluster-based approach yielded over 200 novel F. tularensis peptides eliciting a cellular response, all were verified as MHC class I binders, thereby substantially increasing the F. tularensis dataset of known CTL epitopes. The generality and power of the high density cluster-based approach suggest that it can be a valuable tool for identification of novel CTLs in

  7. Dry and wet approaches for genome-wide functional annotation of conventional and unconventional transcriptional activators

    Directory of Open Access Journals (Sweden)

    Elisabetta Levati

    2016-01-01

    Full Text Available Transcription factors (TFs are master gene products that regulate gene expression in response to a variety of stimuli. They interact with DNA in a sequence-specific manner using a variety of DNA-binding domain (DBD modules. This allows to properly position their second domain, called “effector domain”, to directly or indirectly recruit positively or negatively acting co-regulators including chromatin modifiers, thus modulating preinitiation complex formation as well as transcription elongation. At variance with the DBDs, which are comprised of well-defined and easily recognizable DNA binding motifs, effector domains are usually much less conserved and thus considerably more difficult to predict. Also not so easy to identify are the DNA-binding sites of TFs, especially on a genome-wide basis and in the case of overlapping binding regions. Another emerging issue, with many potential regulatory implications, is that of so-called “moonlighting” transcription factors, i.e., proteins with an annotated function unrelated to transcription and lacking any recognizable DBD or effector domain, that play a role in gene regulation as their second job. Starting from bioinformatic and experimental high-throughput tools for an unbiased, genome-wide identification and functional characterization of TFs (especially transcriptional activators, we describe both established (and usually well affordable as well as newly developed platforms for DNA-binding site identification. Selected combinations of these search tools, some of which rely on next-generation sequencing approaches, allow delineating the entire repertoire of TFs and unconventional regulators encoded by the any sequenced genome.

  8. An algorithmic approach for breakage-fusion-bridge detection in tumor genomes.

    Science.gov (United States)

    Zakov, Shay; Kinsella, Marcus; Bafna, Vineet

    2013-04-01

    Breakage-fusion-bridge (BFB) is a mechanism of genomic instability characterized by the joining and subsequent tearing apart of sister chromatids. When this process is repeated during multiple rounds of cell division, it leads to patterns of copy number increases of chromosomal segments as well as fold-back inversions where duplicated segments are arranged head-to-head. These structural variations can then drive tumorigenesis. BFB can be observed in progress using cytogenetic techniques, but generally BFB must be inferred from data such as microarrays or sequencing collected after BFB has ceased. Making correct inferences from this data is not straightforward, particularly given the complexity of some cancer genomes and BFB's ability to generate a wide range of rearrangement patterns. Here we present algorithms to aid the interpretation of evidence for BFB. We first pose the BFB count-vector problem: given a chromosome segmentation and segment copy numbers, decide whether BFB can yield a chromosome with the given segment counts. We present a linear time algorithm for the problem, in contrast to a previous exponential time algorithm. We then combine this algorithm with fold-back inversions to develop tests for BFB. We show that, contingent on assumptions about cancer genome evolution, count vectors and fold-back inversions are sufficient evidence for detecting BFB. We apply the presented techniques to paired-end sequencing data from pancreatic tumors and confirm a previous finding of BFB as well as identify a chromosomal region likely rearranged by BFB cycles, demonstrating the practicality of our approach.

  9. Revealing the biotechnological potential of Delftia sp. JD2 by a genomic approach

    Directory of Open Access Journals (Sweden)

    María A. Morel

    2016-04-01

    Full Text Available Delftia sp. JD2 is a chromium-resistant bacterium that reduces Cr(VI to Cr(III, accumulates Pb(II, produces the phytohormone indole-3-acetic acid and siderophores, and increases the plant growth performance of rhizobia in co-inoculation experiments. We aimed to analyze the biotechnological potential of JD2 using a genomic approach. JD2 has a genome of 6.76Mb, with 6,051 predicted protein coding sequences and 93 RNA genes (tRNA and rRNA. The indole-acetamide pathway was identified as responsible for the synthesis of indole-3-acetic acid. The genetic information involved in chromium resistance (the gene cluster, chrBACF, was found. At least 40 putative genes encoding for TonB-dependent receptors, probably involved in the utilization of siderophores and biopolymers, and genes for the synthesis, maturation, exportation and uptake of pyoverdine, and acquisition of Fe-pyochelin and Fe-enterobactin were also identified. The information also suggests that JD2 produce polyhydroxybutyrate, a carbon reserve polymer commonly used for manufacturing petrochemical free bioplastics. In addition, JD2 may degrade lignin-derived aromatic compounds to 2-pyrone-4,6-dicarboxylate, a molecule used in the bio-based polymer industry. Finally, a comparative genomic analysis of JD2, Delftia sp. Cs1-4 and Delftia acidovorans SPH-1 is also discussed. The present work provides insights into the physiology and genetics of a microorganism with many potential uses in biotechnology.

  10. Genome-scale identification of Legionella pneumophila effectors using a machine learning approach.

    Directory of Open Access Journals (Sweden)

    David Burstein

    2009-07-01

    Full Text Available A large number of highly pathogenic bacteria utilize secretion systems to translocate effector proteins into host cells. Using these effectors, the bacteria subvert host cell processes during infection. Legionella pneumophila translocates effectors via the Icm/Dot type-IV secretion system and to date, approximately 100 effectors have been identified by various experimental and computational techniques. Effector identification is a critical first step towards the understanding of the pathogenesis system in L. pneumophila as well as in other bacterial pathogens. Here, we formulate the task of effector identification as a classification problem: each L. pneumophila open reading frame (ORF was classified as either effector or not. We computationally defined a set of features that best distinguish effectors from non-effectors. These features cover a wide range of characteristics including taxonomical dispersion, regulatory data, genomic organization, similarity to eukaryotic proteomes and more. Machine learning algorithms utilizing these features were then applied to classify all the ORFs within the L. pneumophila genome. Using this approach we were able to predict and experimentally validate 40 new effectors, reaching a success rate of above 90%. Increasing the number of validated effectors to around 140, we were able to gain novel insights into their characteristics. Effectors were found to have low G+C content, supporting the hypothesis that a large number of effectors originate via horizontal gene transfer, probably from their protozoan host. In addition, effectors were found to cluster in specific genomic regions. Finally, we were able to provide a novel description of the C-terminal translocation signal required for effector translocation by the Icm/Dot secretion system. To conclude, we have discovered 40 novel L. pneumophila effectors, predicted over a hundred additional highly probable effectors, and shown the applicability of machine

  11. The molecular underpinning of lobular histological growth pattern: a genome-wide transcriptomic analysis of invasive lobular carcinomas and grade- and molecular subtype-matched invasive ductal carcinomas of no special type.

    Science.gov (United States)

    Weigelt, Britta; Geyer, Felipe C; Natrajan, Rachael; Lopez-Garcia, Maria A; Ahmad, Amar S; Savage, Kay; Kreike, Bas; Reis-Filho, Jorge S

    2010-01-01

    Invasive lobular carcinoma (ILC) is the most frequent special type of breast cancer. The majority of these tumours are of low histological grade, express hormone receptors, and lack HER2 expression. The pleomorphic variant of ILCs (PLCs) is characterized by atypical cells with pleomorphic nuclei and is reported to have an aggressive clinical behaviour. Expression profiling studies have demonstrated that classic ILCs preferentially display a luminal phenotype, whereas PLCs may be of luminal, HER2 or molecular apocrine subtypes. The aims of this study were two-fold: to determine the transcriptomic characteristics of lobular carcinomas and to define the genome-wide transcriptomic differences between classic ILCs and PLCs. To define the transcriptomic characteristics of ILCs, minimizing the impact of histological grade and molecular subtype on the analysis, we subjected a series of grade- and molecular subtype-matched ILCs and invasive ductal carcinomas (IDCs) to genome-wide gene expression profiling using oligonucleotide microarrays. Hierarchical clustering analysis demonstrated that ILCs formed a separate cluster and a supervised analysis revealed that 5.8% of the transcriptionally regulated genes were significantly differentially expressed in ILCs compared to grade- and molecular subtype-matched IDCs. ILCs displayed down-regulation of E-cadherin and of genes related to actin cytoskeleton remodelling, protein ubiquitin, DNA repair, cell adhesion, TGF-beta signalling; and up-regulation of transcription factors/immediate early genes, lipid/prostaglandin biosynthesis genes, and cell migration-associated genes. Supervised analysis of classic ILCs and PLCs demonstrated that less than 0.1% of genes were significantly differentially expressed between these tumour subtypes. Our results demonstrate that ILCs differ from grade- and molecular subtype-matched IDCs in the expression of genes related to cell adhesion, cell-to-cell signalling, and actin cytoskeleton signalling

  12. BAL31-NGS approach for identification of telomeres de novo in large genomes.

    Science.gov (United States)

    Peška, Vratislav; Sitová, Zdeňka; Fajkus, Petr; Fajkus, Jiří

    2017-02-01

    This article describes a novel method to identify as yet undiscovered telomere sequences, which combines next generation sequencing (NGS) with BAL31 digestion of high molecular weight DNA. The method was applied to two groups of plants: i) dicots, genus Cestrum, and ii) monocots, Allium species (e.g. A. ursinum and A. cepa). Both groups consist of species with large genomes (tens of Gb) and a low number of chromosomes (2n=14-16), full of repeat elements. Both genera lack typical telomeric repeats and multiple studies have attempted to characterize alternative telomeric sequences. However, despite interesting hypotheses and suggestions of alternative candidate telomeres (retrotransposons, rDNA, satellite repeats) these studies have not resolved the question. In a novel approach based on the two most general features of eukaryotic telomeres, their repetitive character and sensitivity to BAL31 nuclease digestion, we have taken advantage of the capacity and current affordability of NGS in combination with the robustness of classical BAL31 nuclease digestion of chromosomal termini. While representative samples of most repeat elements were ensured by low-coverage (less than 5%) genomic shot-gun NGS, candidate telomeres were identified as under-represented sequences in BAL31-treated samples.

  13. Post-genomic approaches to understanding interactions between fungi and their environment

    Energy Technology Data Exchange (ETDEWEB)

    de Vries, Ronald P.; Benoit, Isabelle; Doehlemann, Gunther; Kobayashi, Tetsuo; Magnuson, Jon K.; Panisko, Ellen A.; Baker, Scott E.; Lebrun, Marc-Henri

    2011-05-24

    Fungi inhabit every natural and anthropogenic environment on Earth. They have highly varied life-styles including saprobes (using only dead biomass as a nutrient source), pathogens (feeding on living biomass), and symbionts (co-existing with other organisms). These distinctions are not absolute as many species employ several life styles (e.g. saprobe and opportunistic pathogen, saprobe and mycorrhiza). To efficiently survive in these different and often changing environments, fungi need to be able to modify their physiology and in some cases will even modify their local environment. Understanding the interaction between fungi and their environments has been a topic of study for many decades. However, recently these studies have reached a new dimension. The availability of fungal genomes and development of postgenomic technologies for fungi, such as transcriptomics, proteomics and metabolomics, have enabled more detailed studies into this topic resulting in new insights. Based on a Special Interest Group session held during IMC9, this paper provides examples of the recent advances in using (post-)genomic approaches to better understand fungal interactions with their environments.

  14. Genomic and Global Approaches to Unravelling How Hypermutable Sequences Influence Bacterial Pathogenesis

    Directory of Open Access Journals (Sweden)

    Fadil A. Bidmos

    2014-02-01

    Full Text Available Rapid adaptation to fluctuations in the host milieu contributes to the host persistence and virulence of bacterial pathogens. Adaptation is frequently mediated by hypermutable sequences in bacterial pathogens. Early bacterial genomic studies identified the multiplicity and virulence-associated functions of these hypermutable sequences. Thus, simple sequence repeat tracts (SSRs and site-specific recombination were found to control capsular type, lipopolysaccharide structure, pilin diversity and the expression of outer membrane proteins. We review how the population diversity inherent in the SSR-mediated mechanism of localised hypermutation is being unlocked by the investigation of whole genome sequences of disease isolates, analysis of clinical samples and use of model systems. A contrast is presented between the problematical nature of analysing simple sequence repeats in next generation sequencing data and in simpler, pragmatic PCR-based approaches. Specific examples are presented of the potential relevance of this localized hypermutation to meningococcal pathogenesis. This leads us to speculate on the future prospects for unravelling how hypermutable mechanisms may contribute to the transmission, spread and persistence of bacterial pathogens.

  15. Affinity Density: a novel genomic approach to the identification of transcription factor regulatory targets.

    Science.gov (United States)

    Hazelett, Dennis J; Lakeland, Daniel L; Weiss, Joseph B

    2009-07-01

    A new method was developed for identifying novel transcription factor regulatory targets based on calculating Local Affinity Density. Techniques from the signal-processing field were used, in particular the Hann digital filter, to calculate the relative binding affinity of different regions based on previously published in vitro binding data. To illustrate this approach, the complete genomes of Drosophila melanogaster and D.pseudoobscura were analyzed for binding sites of the homeodomain proteinc Tinman, an essential heart development gene in both Drosophila and Mouse. The significant binding regions were identified relative to genomic background and assigned to putative target genes. Valid candidates common to both species of Drosophila were selected as a test of conservation. The new method was more sensitive than cluster searches for conserved binding motifs with respect to positive identification of known Tinman targets. Our Local Affinity Density method also identified a significantly greater proportion of Tinman-coexpressed genes than equivalent, optimized cluster searching. In addition, this new method predicted a significantly greater than expected number of genes with previously published RNAi phenotypes in the heart. Algorithms were implemented in Python, LISP, R and maxima, using MySQL to access locally mirrored sequence data from Ensembl (D.melanogaster release 4.3) and flybase (D.pseudoobscura). All code is licensed under GPL and freely available at http://www.ohsu.edu/cellbio/dev_biol_prog/affinitydensity/.

  16. [MATCHE: Management Approach to Teaching Consumer and Homemaking Education.] Consumer Approach Strand: Core. Module I-A-3: Consumer Rights and Responsibilities.

    Science.gov (United States)

    Smith, Sharman

    This competency-based preservice home economics teacher education module on consumer rights and responsibilities is the third in a set of four core curriculum modules on consumer approach to homemaking education. (This set is part of a larger series of sixty-seven on the Management Approach to Teaching Consumer and Homemaking Education…

  17. Comparative genomic analysis of single-molecule sequencing and hybrid approaches for finishing the Clostridium autoethanogenum JA1-1 strain DSM 10061 genome

    Energy Technology Data Exchange (ETDEWEB)

    Brown, Steven D [ORNL; Nagaraju, Shilpa [LanzaTech; Utturkar, Sagar M [ORNL; De Tissera, Sashini [LanzaTech; Segovia, Simón [LanzaTech; Mitchell, Wayne [LanzaTech; Land, Miriam L [ORNL; Dassanayake, Asela [LanzaTech; Köpke, Michael [LanzaTech

    2014-01-01

    Background Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published. Results A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, nine copies of the rRNA gene operons. It has a low G + C content of 31.1%. Illumina, 454, Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, CGAL, QUAST and REAPR bioinformatics tools and comparative genomic approaches. Assemblies based upon shorter read DNA technologies were confounded by the large number repeats and their size, which in the case of the rRNA gene operons were ~5 kb. CRISPR (Clustered Regularly Interspaced Short Paloindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (as an additional hydrogenase and the absence of a

  18. Data Mining Approaches for Genome-Wide Association of Mood Disorders

    Science.gov (United States)

    Pirooznia, Mehdi; Seifuddin, Fayaz; Judy, Jennifer; Mahon, Pamela B.; Potash, James B.; Zandi, Peter P.

    2012-01-01

    Mood disorders are highly heritable forms of major mental illness. A major breakthrough in elucidating the genetic architecture of mood disorders was anticipated with the advent of genome-wide association studies (GWAS). However, to date few susceptibility loci have been conclusively identified. The genetic etiology of mood disorders appears to be quite complex, and as a result, alternative approaches for analyzing GWAS data are needed. Recently, a polygenic scoring approach that captures the effects of alleles across multiple loci was successfully applied to the analysis of GWAS data in schizophrenia and bipolar disorder (BP). However, this method may be overly simplistic in its approach to the complexity of genetic effects. Data mining methods are available that may be applied to analyze the high dimensional data generated by GWAS of complex psychiatric disorders. We sought to compare the performance of five data mining methods, namely, Bayesian Networks (BN), Support Vector Machine (SVM), Random Forest (RF), Radial Basis Function network (RBF), and Logistic Regression (LR), against the polygenic scoring approach in the analysis of GWAS data on BP. The different classification methods were trained on GWAS datasets from the Bipolar Genome Study (2,191 cases with BP and 1,434 controls) and their ability to accurately classify case/control status was tested on a GWAS dataset from the Wellcome Trust Case Control Consortium. The performance of the classifiers in the test dataset was evaluated by comparing area under the receiver operating characteristic curves (AUC). BN performed the best of all the data mining classifiers, but none of these did significantly better than the polygenic score approach. We further examined a subset of SNPs in genes that are expressed in the brain, under the hypothesis that these might be most relevant to BP susceptibility, but all the classifiers performed worse with this reduced set of SNPs. The discriminative accuracy of all of these

  19. 'Wiggle matching' radiocarbon dates

    NARCIS (Netherlands)

    Ramsey, CB; van der Plicht, J; Weninger, B

    2001-01-01

    This paper covers three different methods of matching radiocarbon dates to the 'wiggles' of the calibration curve in those situations where the age difference between the C-14 dates is known. These methods are most often applied to tree-ring sequences. The simplest approach is to use a classical Chi

  20. Multi-criteria decision making approaches for quality control of genome-wide association studies.

    Science.gov (United States)

    Malovini, Alberto; Rognoni, Carla; Puca, Annibale; Bellazzi, Riccardo

    2009-03-01

    Experimental errors in the genotyping phases of a Genome-Wide Association Study (GWAS) can lead to false positive findings and to spurious associations. An appropriate quality control phase could minimize the effects of this kind of errors. Several filtering criteria can be used to perform quality control. Currently, no formal methods have been proposed for taking into account at the same time these criteria and the experimenter's preferences. In this paper we propose two strategies for setting appropriate genotyping rate thresholds for GWAS quality control. These two approaches are based on the Multi-Criteria Decision Making theory. We have applied our method on a real dataset composed by 734 individuals affected by Arterial Hypertension (AH) and 486 nonagenarians without history of AH. The proposed strategies appear to deal with GWAS quality control in a sound way, as they lead to rationalize and make explicit the experimenter's choices thus providing more reproducible results.

  1. A functional genomics approach using metabolomics and in silico pathway analysis

    DEFF Research Database (Denmark)

    Förster, Jochen; Gombert, Andreas Karoly; Nielsen, Jens

    2002-01-01

    In the field of functional genomics increasing effort is being undertaken to analyze the function of orphan genes using metabolome data. Improved analytical equipment allows screening simultaneously for a high number of metabolites. Such metabolite profiles are analyzed using multivariate data...... analysis techniques and changes in the genotype will in many cases lead to different metabolite profiles. Here, a theoretical framework that may be applied to identify the function of orphan genes is presented. The approach is based on a combination of metabolome analysis combined with in silico pathway...... analysis. Pathway analysis may be carried out using convex analysis and a change in the active pathway structure of deletion mutants expressed in a different metabolite profile may disclose the function or the functional class of an orphan gene. The concept is illustrated using a simplified model...

  2. Multiple Pattern Matching Algorithm using Pair-count

    Directory of Open Access Journals (Sweden)

    Raju Bhukya

    2011-07-01

    Full Text Available Pattern matching occurs in various applications, ranging from simple text searching in word processors to identification of common motifs in DNA sequences in computational biology. The problem of exact pattern matching has been well studied and a number of efficient algorithms already exist. However these exact pattern matching algorithms are of little help when they are applied to finding patterns in DNA sequences. Pattern matching in a DNA sequence or pattern searching from a large data base is a major research area in computational biology. To extract pattern from a large sequence it takes more time, in order to reduce searching time we have proposed an approach that reduces the search time with accurate retrieval of the matched pattern from the given sequence of any size of a file. Executing patterns from a large DNA or protein data is a computationally intensive task. As performance plays a major role in extracting patterns from a given DNA sequence or from a large database independent of the size of the sequence. More efficient approaches related to multiple pattern matching techniques are becoming more important for finding the functional as well as the structural properties of the proteins and genes. One of the major problems in genomic field is to perform pattern comparison on DNA and protein sequences. In the current approach we explore a new technique which avoids unnecessary comparisons in the DNA sequence and gives the accurate retrieval of the pattern called a multiple pattern matching algorithm using pair count. The proposed technique gives very good performance related to DNA sequence analysis for querying of publicly available genome sequence data. By using this method the number of comparisons gradually decreases and comparison per character ratio of the proposed algorithm reduces accordingly when compared to the some of the existing popular methods. The experimental results show that there is considerable amount of performance

  3. Biosurveillance enterprise for operational awareness, a genomic-based approach for tracking pathogen virulence.

    Science.gov (United States)

    Valdivia-Granda, Willy A

    2013-11-15

    To protect our civilians and warfighters against both known and unknown pathogens, biodefense stakeholders must be able to foresee possible technological trends that could affect their threat risk assessment. However, significant flaws in how we prioritize our countermeasure-needs continue to limit their development. As recombinant biotechnology becomes increasingly simplified and inexpensive, small groups, and even individuals, can now achieve the design, synthesis, and production of pathogenic organisms for offensive purposes. Under these daunting circumstances, a reliable biosurveillance approach that supports a diversity of users could better provide early warnings about the emergence of new pathogens (both natural and manmade), reverse engineer pathogens carrying traits to avoid available countermeasures, and suggest the most appropriate detection, prophylactic, and therapeutic solutions. While impressive in data mining capabilities, real-time content analysis of social media data misses much of the complexity in the factual reality. Quality issues within freeform user-provided hashtags and biased referencing can significantly undermine our confidence in the information obtained to make critical decisions about the natural vs. intentional emergence of a pathogen. At the same time, errors in pathogen genomic records, the narrow scope of most databases, and the lack of standards and interoperability across different detection and diagnostic devices, continue to restrict the multidimensional biothreat assessment. The fragmentation of our biosurveillance efforts into different approaches has stultified attempts to implement any new foundational enterprise that is more reliable, more realistic and that avoids the scenario of the warning that comes too late. This discussion focus on the development of genomic-based decentralized medical intelligence and laboratory system to track emerging and novel microbial health threats in both military and civilian settings and

  4. Approaching confidentiality at a familial level in genomic medicine: a focus group study with healthcare professionals

    Science.gov (United States)

    Dheensa, Sandi; Fenwick, Angela; Lucassen, Anneke

    2017-01-01

    Objectives Clinical genetics guidelines from 2011 conceptualise genetic information as confidential to families, not individuals. The normative consequence of this is that the family's interest is the primary consideration and genetic information is shared unless there are good reasons not to do so. We investigated healthcare professionals' (HCPs') views about, and reasoning around, individual and familial approaches to confidentiality and how such views influenced their practice. Method 16 focus groups with 80 HCPs working in/with clinical genetics services were analysed, drawing on grounded theory. Results Participants raised seven problems with, and arguments against, going beyond the individual approach to confidentiality. These problems fell into two overlapping categories: ‘relationships’ and ‘structures’. Most participants had never considered ways to—or thought it was impossible to—treat familial genetic information and personal information differently. They worried that putting the familial approach into practice could disrupt family dynamics and erode patient trust in the health service. They also thought they had insufficient resources to share information and feared that sharing might change the standard of care and make them more vulnerable to liability. Conclusions A familial approach to confidentiality has not been accepted or adopted as a standard, but wider research suggests that some of the problems HCPs perceived are surmountable and sharing in the interest of the family can be achieved. However, further research is needed to explore how personal and familial genetic information can be separated in practice. Our findings are relevant to HCPs across health services who are starting to use genome tests as part of their routine investigations. PMID:28159847

  5. A new experimental approach for studying bacterial genomic island evolution identifies island genes with bacterial host-specific expression patterns

    Directory of Open Access Journals (Sweden)

    Nickerson Cheryl A

    2006-01-01

    Full Text Available Abstract Background Genomic islands are regions of bacterial genomes that have been acquired by horizontal transfer and often contain blocks of genes that function together for specific processes. Recently, it has become clear that the impact of genomic islands on the evolution of different bacterial species is significant and represents a major force in establishing bacterial genomic variation. However, the study of genomic island evolution has been mostly performed at the sequence level using computer software or hybridization analysis to compare different bacterial genomic sequences. We describe here a novel experimental approach to study the evolution of species-specific bacterial genomic islands that identifies island genes that have evolved in such a way that they are differentially-expressed depending on the bacterial host background into which they are transferred. Results We demonstrate this approach by using a "test" genomic island that we have cloned from the Salmonella typhimurium genome (island 4305 and transferred to a range of Gram negative bacterial hosts of differing evolutionary relationships to S. typhimurium. Systematic analysis of the expression of the island genes in the different hosts compared to proper controls allowed identification of genes with genera-specific expression patterns. The data from the analysis can be arranged in a matrix to give an expression "array" of the island genes in the different bacterial backgrounds. A conserved 19-bp DNA site was found upstream of at least two of the differentially-expressed island genes. To our knowledge, this is the first systematic analysis of horizontally-transferred genomic island gene expression in a broad range of Gram negative hosts. We also present evidence in this study that the IS200 element found in island 4305 in S. typhimurium strain LT2 was inserted after the island had already been acquired by the S. typhimurium lineage and that this element is likely not

  6. Toward a Taxonomy for Multi-Omics Science? Terminology Development for Whole Genome Study Approaches by Omics Technology and Hierarchy.

    Science.gov (United States)

    Pirih, Nina; Kunej, Tanja

    2017-01-01

    Omics is a form of high-throughput systems science. However, taxonomies for omics studies are limited, inviting us to rethink new ways in which we classify, prioritize, and rank various omics systems science studies. In this overarching context, the genome-wide study approaches have proliferated in number and popularity over the past decade. However, their hierarchy is not well organized and the development of attendant terminology is not controlled. In the present study, we searched the literature in PubMed and the Web of Science databases published from March 1999 to September 2016 using the keywords, including genome-wide, association, whole genome, transcriptome-wide, metabolome, epigenome, and phenome. We identified the whole genome study approaches and sorted them according to the omics technology types (genomics, proteomics, and so on) and hierarchy. Thirty-four studies from over 90 publications were sorted into 10 omics groups: DNA level, transcriptomics, proteomics, interactomics, metabolomics, epigenomics, miRNomics/ncRNomics, phenomics, environmental omics, and pharmacogenomics. We suggest here modifications of terminology for study approaches, which share the same acronyms such as EWAS for epigenome-wide association and environment-wide association studies, and MWAS for methylome-wide association and metabolome-wide association studies. Taken together, our study presented here provides the first systematic review and analyses of whole genome approaches and presents a baseline for further controlled terminology development, with a view to a new taxonomy for omics and multi-omics studies in the future. Finally, we call for greater dialogue and collaboration across diverse omics knowledge domains and applications, for example, across plants, animals, clinical medicine, and ecology.

  7. Novel phenotypes and loci identified through clinical genomics approaches to pediatric cataract.

    Science.gov (United States)

    Patel, Nisha; Anand, Deepti; Monies, Dorota; Maddirevula, Sateesh; Khan, Arif O; Algoufi, Talal; Alowain, Mohammed; Faqeih, Eissa; Alshammari, Muneera; Qudair, Ahmed; Alsharif, Hadeel; Aljubran, Fatimah; Alsaif, Hessa S; Ibrahim, Niema; Abdulwahab, Firdous M; Hashem, Mais; Alsedairy, Haifa; Aldahmesh, Mohammed A; Lachke, Salil A; Alkuraya, Fowzan S

    2017-02-01

    Pediatric cataract is highly heterogeneous clinically and etiologically. While mostly isolated, cataract can be part of many multisystem disorders, further complicating the diagnostic process. In this study, we applied genomic tools in the form of a multi-gene panel as well as whole-exome sequencing on unselected cohort of pediatric cataract (166 patients from 74 families). Mutations in previously reported cataract genes were identified in 58% for a total of 43 mutations, including 15 that are novel. GEMIN4 was independently mutated in families with a syndrome of cataract, global developmental delay with or without renal involvement. We also highlight a recognizable syndrome that resembles galactosemia (a fulminant infantile liver disease with cataract) caused by biallelic mutations in CYP51A1. A founder mutation in RIC1 (KIAA1432) was identified in patients with cataract, brain atrophy, microcephaly with or without cleft lip and palate. For non-syndromic pediatric cataract, we map a novel locus in a multiplex consanguineous family on 4p15.32 where exome sequencing revealed a homozygous truncating mutation in TAPT1. We report two further candidates that are biallelically inactivated each in a single cataract family: TAF1A (cataract with global developmental delay) and WDR87 (non-syndromic cataract). In addition to positional mapping data, we use iSyTE developmental lens expression and gene-network analysis to corroborate the proposed link between the novel candidate genes and cataract. Our study expands the phenotypic, allelic and locus heterogeneity of pediatric cataract. The high diagnostic yield of clinical genomics supports the adoption of this approach in this patient group.

  8. Distinguishing bacterial pathogens of potato using a genome-wide microarray approach.

    Science.gov (United States)

    Aittamaa, M; Somervuo, P; Pirhonen, M; Mattinen, L; Nissinen, R; Auvinen, P; Valkonen, J P T

    2008-09-01

    A set of 9676 probes was designed for the most harmful bacterial pathogens of potato and tested in a microarray format. Gene-specific probes could be designed for all genes of Pectobacterium atrosepticum, c. 50% of the genes of Streptomyces scabies and c. 30% of the genes of Clavibacter michiganensis ssp. sepedonicus utilizing the whole-genome sequence information available. For Streptomyces turgidiscabies, 226 probes were designed according to the sequences of a pathogenicity island containing important virulence genes. In addition, probes were designed for the virulence-associated nip (necrosis-inducing protein) genes of P. atrosepticum, P. carotovorum and Dickeya dadantii and for the intergenic spacer (IGS) sequences of the 16S-23S rRNA gene region. Ralstonia solanacearum was not included in the study, because it is a quarantine organism and is not presently found in Finland, but a few probes were also designed for this species. The probes contained on average 40 target-specific nucleotides and were synthesized on the array in situ, organized as eight sub-arrays with an identical set of probes which could be used for hybridization with different samples. All bacteria were readily distinguished using a single channel system for signal detection. Nearly all of the c. 1000 probes designed for C. michiganensis ssp. sepedonicus, c. 50% and 40% of the c. 4000 probes designed for the genes of S. scabies and P. atrosepticum, respectively, and over 100 probes for S. turgidiscabies showed significant signals only with the respective species. P. atrosepticum, P. carotovorum and Dickeya strains were all detected with 110 common probes. By contrast, the strains of these species were found to differ in their signal profiles. Probes targeting the IGS region and nip genes could be used to place strains of Dickeya to two groups, which correlated with differences in virulence. Taken together, the approach of using a custom-designed, genome-wide microarray provided a robust means

  9. Mechanisms of colorectal and lung cancer prevention by vegetables: a genomic approach.

    Science.gov (United States)

    van Breda, Simone G J; de Kok, Theo M C M; van Delft, Joost H M

    2008-03-01

    Colorectal cancer (CRC) and lung cancer (LC) occur at high incidence, and both can be effectively prevented by dietary vegetable consumption. This makes these two types of cancer highly suitable for elucidating the underlying molecular mechanisms of cancer chemoprevention. Numerous studies have shown that vegetables exert their beneficial effects through various different mechanisms, but effects on the genome level remain mostly unclear. This review evaluates current knowledge on the mechanisms of CRC and LC prevention by vegetables, thereby focusing on the modulation of gene and protein expressions. The majority of the effects found in the colon are changes in the expression of genes and proteins involved in apoptosis, cell cycle, cell proliferation and intracellular defense, in favor of reduced CRC risk. Furthermore, vegetables and vegetable components changed the expression of many more genes and proteins involved in other pathways for which biologic meaning is less clear. The number of studies investigating gene and protein expression changes in the lungs is limited to only a few in vitro and animal studies. Data from these studies show that mostly genes involved in biotransformation, apoptosis and cell cycle regulation are affected. In both colon and lungs, genomewide analyses of gene and protein expression changes by new genomics and proteomics technologies, as well as the investigation of whole vegetables, are few in number. Further studies applying these 'omics' approaches are needed to provide more insights on affected genetic/biologic pathways and, thus, in molecular mechanisms by which different chemopreventive compounds can protect against carcinogenesis. Particularly studies with combinations of phytochemicals and whole vegetables are needed to establish gene expression changes in the colon, but especially in the lungs.

  10. PCR-based ordered genomic libraries: a new approach to drug target identification for Streptococcus pneumoniae.

    Science.gov (United States)

    Belanger, Aimee E; Lai, Angel; Brackman, Marcia A; LeBlanc, Donald J

    2002-08-01

    Described here are the development and validation of a novel approach to identify genes encoding drug targets in Streptococcus pneumoniae. The method relies on the use of an ordered genomic library composed of PCR amplicons that were generated under error-prone conditions so as to introduce random mutations into the DNA. Since some of the mutations occur in drug target-encoding genes and subsequently affect the binding of the drug to its respective cellular target, amplicons containing drug targets can be identified as those producing drug-resistant colonies when transformed into S. pneumoniae. Examination of the genetic content of the amplicon giving resistance coupled with bioinformatics and additional genetic approaches could be used to rapidly identify candidate drug target genes. The utility of this approach was verified by using a number of known antibiotics. For drugs with single protein targets, amplicons were identified that rendered S. pneumoniae drug resistant. Assessment of amplicon composition revealed that each of the relevant amplicons contained the gene encoding the known target for the particular drug tested. Fusidic acid-resistant mutants that resulted from the transformation of S. pneumoniae with amplicons containing fusA were further characterized by sequence analysis. A single mutation was found to occur in a region of the S. pneumoniae elongation factor G protein that is analogous to that already implicated in other bacteria as being associated with fusidic acid resistance. Thus, in addition to facilitating the identification of genes encoding drug targets, this method could provide strains that aid future mechanistic studies.

  11. Identification of putative drug targets of Listeria monocytogenes F2365 by subtractive genomics approach

    Directory of Open Access Journals (Sweden)

    Md. Musharaf Hossain

    2013-01-01

    Full Text Available The prolonged and uncontrolled use of antibiotics in treatment against many pathogens causes the multiple drug resistance. The drug resistance of Listeria monocytogenes F2365 has been evolved, which cause a major disease listeriosis. The drug dose limit against that pathogen was also increased for currently prescribed antibiotics and more often combinational therapy was preferred. Therefore, identification of an extensive novel drug target, unique and essential to the microorganism and subjected to its validation and drug development is imperative. Availability of the total proteome of L. monocytogenes F2365 enabled in silico identification of putative common drug targets and their subcellular localization by subtractive genomics approach. In the present work subtractive genomics approach is used to identify vaccine and drug targets of L. monocytogenes F2365 to speed up the rational drug and vaccine design. It has revealed that out of 2821 reference sequences of the pathogen, 744 represent essential proteins and among them 274 are human non-homolog proteins. Besides, all predicted human non-homologs were then analyzed by subcellular localization servers, in which 46 proteins were identified as surface exposed proteins and can be considered as potential drug and vaccine targets for the pathogen. The 3D structure of two human non-homolog putative drug targets, pantothenate kinase (LmPK and holliday junction resolvase-like protein (LmHJR of L. monocytogenes F2365 were generated by homology modeling program Easymodeller 4.0; a GUI version of modeller. Generated structures were also validated by several online servers. The overall stereochemical quality of the model was assessed by Ramachandran plot analysis that was provided by PROCHECK. ProQ, ERRAT, Pro-SA web and VERIFY 3D of SAVES programs were also used to compute several validation parameters during the evaluation of the model. This protein structure information is important in structure

  12. Incremental pattern matching for regular expressions

    NARCIS (Netherlands)

    Jalali, Arash; Ghamarian, Amir Hossein; Rensink, Arend; Fish, Andrew; Lambers, Leen

    2012-01-01

    Graph pattern matching lies at the heart of any graph transformation-based system. Incremental pattern matching is one approach proposed for reducingthe overall cost of pattern matching over successive transformations by preserving the matches that stay relevant after a rule application. An importan

  13. Population genomics in Sardinia: a novel approach to hunt for genomic combinations underlying complex traits and diseases.

    Science.gov (United States)

    Siniscalco, M; Robledo, R; Bender, P K; Carcassi, C; Contu, L; Beck, J C

    1999-01-01

    The availability of highly polymorphic markers permits testing whether complex traits and diseases result from genomic interactions between nonallelic normal variants at separate loci. Such variants may be identified by deviations from the expected distributions of alleles at a high number of polymorphic loci, when individuals with the phenotype of interest are compared to normal controls of the same breeding unit, provided that both groups share the same remote ancestry and had no ancestors in common for the last three to four generations. The circumstances needed for such studies are ideally met on the island of Sardinia. The recurrent finding of the same type of association in separate breeding units between the phenotype of interest and a given genotype should allow a distinction between true genetic identity by descent and randomly occurring identities, as these will be obviously different in separate breeding units. The availability of several breeding units located in sharply different ecological environments will permit assessment of the role of nature/nurture factors in the degree of manifestation of each newly discovered genotype/phenotype association. A pilot study to evaluate the proposed strategy has been carried out in the Sardinian village of Carloforte, a community of about 8,000 individuals who have remained genetically homogeneous. Fifty-five control samples have been genotyped with six tetranucleotide microsatellites and with a subset of the 400 markers contained in the ABI PRISM linkage mapping panel, version 2. The allele frequencies for these microsatellite markers have been determined for these 55 individuals and compared to those from a random sampling of subsets of these 55 persons. For the six tetranucleotide microsatellites, a subset of as few as 20 people displayed the same allele frequency distributions as observed with the original 55 unrelated individuals. In conclusion, when samples are chosen from the same breeding unit, the number

  14. Relaxation matching algorithm for moving photogrammetry

    Science.gov (United States)

    Guo, Lei; Liu, Ke; Miao, Yinxiao; Zhu, Jigui

    2015-02-01

    Moving photogrammetry is an application of close range photogrammetry in industrial measurement to realize threedimensional coordinate measurement within large-scale volume. This paper describes an approach of relaxation matching algorithm applicable to moving photogrammetry according to the characteristics of accurate matching result of different measuring images. This method uses neighborhood matching support to improve the matching rate after coarse matching based on epipolar geometry constraint and precise matching using three images. It reflects the overall matching effect of all points, that means when a point is matched correctly, the matching results of those points round it must be correct. So for one point considered, the matching results of points round it are calculated to judge whether its result is correct. Analysis indicates that relaxation matching can eliminate the mismatching effectively and acquire 100% rate of correct matching. It will play a very important role in moving photogrammetry to ensure the following implement of ray bundle adjustment.

  15. Implementation of an Electronic Circuit for SSSA Control Approach of a Plate Type Element and Experimental Match with a Feed-Forward Approach

    Directory of Open Access Journals (Sweden)

    Viscardi Massimo

    2016-12-01

    Full Text Available Successful implementation of an active vibration control system is strictly correlated to the exact knowledge of the dynamic behavior of the system, of the excitation level and spectra and of the sensor and actuator’s specification. Only the correct management of these aspects may guarantee the correct choice of the control strategy and the relative performance. Within this paper, some preliminary activities aimed at the creation of a structurally simple, cheap and easily replaceable active control systems for metal panels are discussed. The final future aim is to control and to reduce noise, produced by vibrations of metal panels of the body of a car. The paper is focused on two points. The first one is the realization of an electronic circuit for Synchronized Shunted Switch Architecture (SSSA with the right dimensioning of the components to control the proposed test article, represented by a rectangular aluminum plate. The second one is a preliminary experimental study on the test article, in controlled laboratory conditions, to compare performances of two possible control approach: SSSA and a feed-forward control approach. This comparison would contribute to the future choice of the most suitable control architecture for the specific attenuation of structure-born noise related to an automotive floor structure under deterministic (engine and road-tyre interaction and stochastic (road-tyre interaction and aerodynamic forcing actions.

  16. Whole genome semiconductor based sequencing of farmed European sea bass (Dicentrarchus labrax) Mediterranean genetic stocks using a DNA pooling approach.

    Science.gov (United States)

    Bertolini, Francesca; Geraci, Claudia; Schiavo, Giuseppina; Sardina, Maria Teresa; Chiofalo, Vincenzo; Fontanesi, Luca

    2016-08-01

    European sea bass (Dicentrarchus labrax) is an important marine species for commercial and sport fisheries and aquaculture production. Recently, the European sea bass genome has been sequenced and assembled. This resource can open new opportunities to evaluate and monitor variability and identify variants that could contribute to the adaptation to farming conditions. In this work, two DNA pools constructed from cultivated European sea bass were sequenced using a next generation semiconductor sequencing approach based on Ion Proton sequencer. Using the first draft version of the D. labrax genome as reference, sequenced reads obtained a total of about 1.6 million of single nucleotide polymorphisms (SNPs), spread all over the chromosomes. Transition/transversion (Ti/Tv) was equal to 1.28, comparable to what was already reported in Salmon species. A pilot homozygosity analysis across the D. labrax genome using DNA pool sequence datasets indicated that this approach can identify chromosome regions with putative signatures of selection, including genes involved in ion transport and chloride channel functions, amino acid metabolism and circadian clock and related neurological systems. This is the first study that reported genome wide polymorphisms in a fish species obtained with the Ion Proton sequencer. Moreover, this study provided a methodological approach for selective sweep analysis in this species.

  17. Is flood risk capitalized into real estate market values? : a Mahalanobis-metric matching approach to housing market in Busan, South Korea

    Science.gov (United States)

    Jung, E.; Yoon, H.

    2016-12-01

    Natural disasters are substantial source of social and economic damage around the globe. The amount of damage is larger when such catastrophe events happen in urbanized areas where the wealth is concentrated. Disasters cause losses in real estate assets, incurring additional cost of repair and maintenance of the properties. For this reason, natural hazard risk such as flooding and landslide is regarded as one of the important determinants of homebuyers' choice and preference. In this research, we aim to reveal whether the past records of flood affect real estate market values in Busan, Korea in 2014, under a hypothesis that homebuyers' perception of natural hazard is reflected on housing values, using the Mahalanobis-metric matching method. Unlike conventionally used hedonic pricing model to estimate capitalization of flood risk into the sales price of properties, the analytical method we adopt here enables inferring causal effects by efficiently controlling for observed/unobserved omitted variable bias. This matching approach pairs each inundated property (treatment variable) with a non-inundated property (control variable) with the closest Mahalanobis distance between them, and comparing their effects on residential property sales price (outcome variable). As a result, we expect price discounts for inundated properties larger than the one for comparable non-inundated properties. This research will be valuable in establishing the mitigation policies of future climate change to relieve the possible negative economic consequences from the disaster by estimating how people perceive and respond to natural hazard. This work was supported by the Korea Environmental Industry and Technology Institute (KEITI) under Grant (No. 2014-001-310007).

  18. Biofilm formation in Candida glabrata: What have we learnt from functional genomics approaches?

    Science.gov (United States)

    d'Enfert, Christophe; Janbon, Guilhem

    2016-02-01

    Biofilms are a source of therapeutic failures because of their intrinsic tolerance to antimicrobials. Candida glabrata is one of the pathogenic yeasts that is responsible for life-threatening disseminated infections and able to form biofilms on medical devices such as vascular and urinary catheters. Recent progresses in the functional genomics of C. glabrata have been applied to the study of biofilm formation, revealing the contribution of an array of genes to this process. In particular, the Yak1 kinase and the Swi/Snf chromatin remodeling complex have been shown to relieve the repression exerted by subtelomeric silencing on the expression of the EPA6 and EPA7 genes, thus allowing the encoded adhesins to exert their key roles in biofilm formation. This provides a framework to evaluate the contribution of other genes that have been genetically linked to biofilm development and, based on the function of their orthologs in Saccharomyces cerevisiae, appear to have roles in adaptation to nutrient deprivation, calcium signaling, cell wall remodeling and adherence. Future studies combining the use of in vitro and animal models of biofilm formation, omics approaches and forward or reverse genetics are needed to expand the current knowledge of C. glabrata biofilm formation and reveal the mechanisms underlying their antifungal tolerance.

  19. Genomic approaches uncover increasing complexities in the regulatory landscape at the human SCL (TAL1 locus.

    Directory of Open Access Journals (Sweden)

    Pawandeep Dhami

    Full Text Available The SCL (TAL1 transcription factor is a critical regulator of haematopoiesis and its expression is tightly controlled by multiple cis-acting regulatory elements. To elaborate further the DNA elements which control its regulation, we used genomic tiling microarrays covering 256 kb of the human SCL locus to perform a concerted analysis of chromatin structure and binding of regulatory proteins in human haematopoietic cell lines. This approach allowed us to characterise further or redefine known human SCL regulatory elements and led to the identification of six novel elements with putative regulatory function both up and downstream of the SCL gene. They bind a number of haematopoietic transcription factors (GATA1, E2A LMO2, SCL, LDB1, CTCF or components of the transcriptional machinery and are associated with relevant histone modifications, accessible chromatin and low nucleosomal density. Functional characterisation shows that these novel elements are able to enhance or repress SCL promoter activity, have endogenous promoter function or enhancer-blocking insulator function. Our analysis opens up several areas for further investigation and adds new layers of complexity to our understanding of the regulation of SCL expression.

  20. A Comparative Genomics Approach to Prediction of New Members of Regulons

    Science.gov (United States)

    Tan, Kai; Moreno-Hagelsieb, Gabriel; Collado-Vides, Julio; Stormo, Gary D.

    2001-01-01

    Identifying the complete transcriptional regulatory network for an organism is a major challenge. For each regulatory protein, we want to know all the genes it regulates, that is, its regulon. Examples of known binding sites can be used to estimate the binding specificity of the protein and to predict other binding sites. However, binding site predictions can be unreliable because determining the true specificity of the protein is difficult because of the considerable variability of binding sites. Because regulatory systems tend to be conserved through evolution, we can use comparisons between species to increase the reliability of binding site predictions. In this article, an approach is presented to evaluate the computational predicitions of regulatory sites. We combine the prediction of transcription units having orthologous genes with the prediction of transcription factor binding sites based on probabilistic models. We augment the sets of genes in Escherichia coli that are expected to be regulated by two transcription factors, the cAMP receptor protein and the fumarate and nitrate reduction regulatory protein, through a comparison with the Haemophilus influenzae genome. At the same time, we learned more about the regulatory networks of H. influenzae, a species with much less experimental knowledge than E. coli. By studying orthologous genes subject to regulation by the same transcription factor, we also gained understanding of the evolution of the entire regulatory systems. PMID:11282972

  1. A systems biological approach to identify key transcription factors and their genomic neighborhoods in human sarcomas

    Institute of Scientific and Technical Information of China (English)

    Antti Ylip(a)(a); Olli Yli-Harja; Wei Zhang; Matti Nykter

    2011-01-01

    Identification of genetic signatures is the main objective for many computational oncology studies. The signature usually consists of numerous genes that are differentially expressed between two clinically distinct groups of samples, such as tumor subtypes. Prospectively, many signatures have been found to generalize poorly to other datasets and, thus, have rarely been accepted into clinical use. Recognizing the limited success of traditionally generated signatures, we developed a systems biology-based framework for robust identification of key transcription factors and their genomic regulatory neighborhoods. Application of the framework to study the differences between gastrointestinal stromal tumor (GIST) and leiomyosarcoma (LMS) resulted in the identification of nine transcription factors (SRF, NKX2-5, CCDC6, LEF1, VDR, ZNF250, TRIM63, MAF, and MYC). Functional annotations of the obtained neighborhoods identified the biological processes which the key transcription factors regulate differently between the tumor types. Analyzing the differences in the expression patterns using our approach resulted in a more robust genetic signature and more biological insight into the diseases compared to a traditional genetic signature.

  2. Genetic approaches for studying myiasis-causing flies: molecular markers and mitochondrial genomics.

    Science.gov (United States)

    de Azeredo-Espin, Ana Maria Lima; Lessinger, Ana Cláudia

    2006-01-01

    "Myiasis-causing flies" is a generic term that includes species from numerous dipteran families, mainly Calliphoridae and Oestridae, of which blowflies, screwworm flies and botflies are among the most important. This group of flies is characterized by the ability of their larvae to develop in animal flesh. When the host is a live vertebrate, such parasitism by dipterous larvae is known as primary myiasis. Myiasis-causing flies can be classified as saprophagous (free-living species), facultative or obligate parasites. Many of these flies are of great medical and veterinary importance in Brazil because of their role as key livestock insect-pests and vectors of pathogens, in addition to being considered important legal evidence in forensic entomology. The characterization of myiasis-causing flies using molecular markers to study mtDNA (by RFLP) and nuclear DNA (by RAPD and microsatellite) has been used to identify the evolutionary mechanisms responsible for specific patterns of genetic variability. These approaches have been successfully used to analyze the population structures of the New World screwworm fly Cochliomyia hominivorax and the botfly Dermatobia hominis. In this review, various aspects of the organization, evolution and potential applications of the mitochondrial genome of myiasis-causing flies in Brazil, and the analysis of nuclear markers in genetic studies of populations, are discussed.

  3. A network-based approach to prioritize results from genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Nirmala Akula

    Full Text Available Genome-wide association studies (GWAS are a valuable approach to understanding the genetic basis of complex traits. One of the challenges of GWAS is the translation of genetic association results into biological hypotheses suitable for further investigation in the laboratory. To address this challenge, we introduce Network Interface Miner for Multigenic Interactions (NIMMI, a network-based method that combines GWAS data with human protein-protein interaction data (PPI. NIMMI builds biological networks weighted by connectivity, which is estimated by use of a modification of the Google PageRank algorithm. These weights are then combined with genetic association p-values derived from GWAS, producing what we call 'trait prioritized sub-networks.' As a proof of principle, NIMMI was tested on three GWAS datasets previously analyzed for height, a classical polygenic trait. Despite differences in sample size and ancestry, NIMMI captured 95% of the known height associated genes within the top 20% of ranked sub-networks, far better than what could be achieved by a single-locus approach. The top 2% of NIMMI height-prioritized sub-networks were significantly enriched for genes involved in transcription, signal transduction, transport, and gene expression, as well as nucleic acid, phosphate, protein, and zinc metabolism. All of these sub-networks were ranked near the top across all three height GWAS datasets we tested. We also tested NIMMI on a categorical phenotype, Crohn's disease. NIMMI prioritized sub-networks involved in B- and T-cell receptor, chemokine, interleukin, and other pathways consistent with the known autoimmune nature of Crohn's disease. NIMMI is a simple, user-friendly, open-source software tool that efficiently combines genetic association data with biological networks, translating GWAS findings into biological hypotheses.

  4. Complete genome-wide screening and subtractive genomic approach revealed new virulence factors, potential drug targets against bio-war pathogen Brucella melitensis 16M

    Directory of Open Access Journals (Sweden)

    Pradeepkiran JA

    2015-03-01

    Full Text Available Jangampalli Adi Pradeepkiran,1* Sri Bhashyam Sainath,2,3* Konidala Kranthi Kumar,1 Matcha Bhaskar1 1Division of Animal Biotechnology, Department of Zoology, Sri Venkateswara University, Tirupati, India; 2CIMAR/CIIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, Porto, Portugal, 3Department of Biotechnology, Vikrama Simhapuri University, Nellore, Andhra Pradesh, India *These authors contributed equally to this work Abstract: Brucella melitensis 16M is a Gram-negative coccobacillus that infects both animals and humans. It causes a disease known as brucellosis, which is characterized by acute febrile illness in humans and causes abortions in livestock. To prevent and control brucellosis, identification of putative drug targets is crucial. The present study aimed to identify drug targets in B. melitensis 16M by using a subtractive genomic approach. We used available database repositories (Database of Essential Genes, Kyoto Encyclopedia of Genes and Genomes Automatic Annotation Server, and Kyoto Encyclopedia of Genes and Genomes to identify putative genes that are nonhomologous to humans and essential for pathogen B. melitensis 16M. The results revealed that among 3 Mb genome size of pathogen, 53 putative characterized and 13 uncharacterized hypothetical genes were identified; further, from Basic Local Alignment Search Tool protein analysis, one hypothetical protein showed a close resemblance (50% to Silicibacter pomeroyi DUF1285 family protein (2RE3. A further homology model of the target was constructed using MODELLER 9.12 and optimized through variable target function method by molecular dynamics optimization with simulating annealing. The stereochemical quality of the restrained model was evaluated by PROCHECK, VERIFY-3D, ERRAT, and WHATIF servers. Furthermore, structure-based virtual screening was carried out against the predicted active site of the respective protein using the

  5. PARP1 genomics: chromatin immunoprecipitation approach using anti-PARP1 antibody (ChIP and ChIP-seq).

    Science.gov (United States)

    Lodhi, Niraj; Tulin, Alexei V

    2011-01-01

    Poly(ADP-ribose) polymerase1 (PARP1) is a global regulator of different cellular mechanisms, ranging from DNA damage repair to control of gene expression. Since PARP1 protein and pADPr have been shown to persist in chromatin through cell cycle, they may both act as epigenetic markers. However, it is not known how many loci are occupied by PARP1 protein during mitosis genome-wide. To reveal the genome-wide PARP1 binding sites, we used the ChIP-seq approach, an emerging technique to study genome-wide PARP1 protein interaction with chromatin. Here, we describe how to perform ChIP-seq in the context of PARP1 binding sites identification in chromatin, using human embryonic kidney cell lines.

  6. Complete genome-wide screening and subtractive genomic approach revealed new virulence factors, potential drug targets against bio-war pathogen Brucella melitensis 16M.

    Science.gov (United States)

    Pradeepkiran, Jangampalli Adi; Sainath, Sri Bhashyam; Kumar, Konidala Kranthi; Bhaskar, Matcha

    2015-01-01

    Brucella melitensis 16M is a Gram-negative coccobacillus that infects both animals and humans. It causes a disease known as brucellosis, which is characterized by acute febrile illness in humans and causes abortions in livestock. To prevent and control brucellosis, identification of putative drug targets is crucial. The present study aimed to identify drug targets in B. melitensis 16M by using a subtractive genomic approach. We used available database repositories (Database of Essential Genes, Kyoto Encyclopedia of Genes and Genomes Automatic Annotation Server, and Kyoto Encyclopedia of Genes and Genomes) to identify putative genes that are nonhomologous to humans and essential for pathogen B. melitensis 16M. The results revealed that among 3 Mb genome size of pathogen, 53 putative characterized and 13 uncharacterized hypothetical genes were identified; further, from Basic Local Alignment Search Tool protein analysis, one hypothetical protein showed a close resemblance (50%) to Silicibacter pomeroyi DUF1285 family protein (2RE3). A further homology model of the target was constructed using MODELLER 9.12 and optimized through variable target function method by molecular dynamics optimization with simulating annealing. The stereochemical quality of the restrained model was evaluated by PROCHECK, VERIFY-3D, ERRAT, and WHATIF servers. Furthermore, structure-based virtual screening was carried out against the predicted active site of the respective protein using the glycerol structural analogs from the PubChem database. We identified five best inhibitors with strong affinities, stable interactions, and also with reliable drug-like properties. Hence, these leads might be used as the most effective inhibitors of modeled protein. The outcome of the present work of virtual screening of putative gene targets might facilitate design of potential drugs for better treatment against brucellosis.

  7. Complete genome-wide screening and subtractive genomic approach revealed new virulence factors, potential drug targets against bio-war pathogen Brucella melitensis 16M

    Science.gov (United States)

    Pradeepkiran, Jangampalli Adi; Sainath, Sri Bhashyam; Kumar, Konidala Kranthi; Bhaskar, Matcha

    2015-01-01

    Brucella melitensis 16M is a Gram-negative coccobacillus that infects both animals and humans. It causes a disease known as brucellosis, which is characterized by acute febrile illness in humans and causes abortions in livestock. To prevent and control brucellosis, identification of putative drug targets is crucial. The present study aimed to identify drug targets in B. melitensis 16M by using a subtractive genomic approach. We used available database repositories (Database of Essential Genes, Kyoto Encyclopedia of Genes and Genomes Automatic Annotation Server, and Kyoto Encyclopedia of Genes and Genomes) to identify putative genes that are nonhomologous to humans and essential for pathogen B. melitensis 16M. The results revealed that among 3 Mb genome size of pathogen, 53 putative characterized and 13 uncharacterized hypothetical genes were identified; further, from Basic Local Alignment Search Tool protein analysis, one hypothetical protein showed a close resemblance (50%) to Silicibacter pomeroyi DUF1285 family protein (2RE3). A further homology model of the target was constructed using MODELLER 9.12 and optimized through variable target function method by molecular dynamics optimization with simulating annealing. The stereochemical quality of the restrained model was evaluated by PROCHECK, VERIFY-3D, ERRAT, and WHATIF servers. Furthermore, structure-based virtual screening was carried out against the predicted active site of the respective protein using the glycerol structural analogs from the PubChem database. We identified five best inhibitors with strong affinities, stable interactions, and also with reliable drug-like properties. Hence, these leads might be used as the most effective inhibitors of modeled protein. The outcome of the present work of virtual screening of putative gene targets might facilitate design of potential drugs for better treatment against brucellosis. PMID:25834405

  8. Genome-enabled Modeling of Microbial Biogeochemistry using a Trait-based Approach. Does Increasing Metabolic Complexity Increase Predictive Capabilities?

    Science.gov (United States)

    King, E.; Karaoz, U.; Molins, S.; Bouskill, N.; Anantharaman, K.; Beller, H. R.; Banfield, J. F.; Steefel, C. I.; Brodie, E.

    2015-12-01

    The biogeochemical functioning of ecosystems is shaped in part by genomic information stored in the subsurface microbiome. Cultivation-independent approaches allow us to extract this information through reconstruction of thousands of genomes from a microbial community. Analysis of these genomes, in turn, gives an indication of the organisms present and their functional roles. However, metagenomic analyses can currently deliver thousands of different genomes that range in abundance/importance, requiring the identification and assimilation of key physiologies and metabolisms to be represented as traits for successful simulation of subsurface processes. Here we focus on incorporating -omics information into BioCrunch, a genome-informed trait-based model that represents the diversity of microbial functional processes within a reactive transport framework. This approach models the rate of nutrient uptake and the thermodynamics of coupled electron donors and acceptors for a range of microbial metabolisms including heterotrophs and chemolithotrophs. Metabolism of exogenous substrates fuels catabolic and anabolic processes, with the proportion of energy used for cellular maintenance, respiration, biomass development, and enzyme production based upon dynamic intracellular and environmental conditions. This internal resource partitioning represents a trade-off against biomass formation and results in microbial community emergence across a fitness landscape. Biocrunch was used here in simulations that included organisms and metabolic pathways derived from a dataset of ~1200 non-redundant genomes reflecting a microbial community in a floodplain aquifer. Metagenomic data was directly used to parameterize trait values related to growth and to identify trait linkages associated with respiration, fermentation, and key enzymatic functions such as plant polymer degradation. Simulations spanned a range of metabolic complexities and highlight benefits originating from simulations

  9. Genomic DNA enrichment using sequence capture microarrays: a novel approach to discover sequence nucleotide polymorphisms (SNP in Brassica napus L.

    Directory of Open Access Journals (Sweden)

    Wayne E Clarke

    Full Text Available Targeted genomic selection methodologies, or sequence capture, allow for DNA enrichment and large-scale resequencing and characterization of natural genetic variation in species with complex genomes, such as rapeseed canola (Brassica napus L., AACC, 2n=38. The main goal of this project was to combine sequence capture with next generation sequencing (NGS to discover single nucleotide polymorphisms (SNPs in specific areas of the B. napus genome historically associated (via quantitative trait loci -QTL- analysis to traits of agronomical and nutritional importance. A 2.1 million feature sequence capture platform was designed to interrogate DNA sequence variation across 47 specific genomic regions, representing 51.2 Mb of the Brassica A and C genomes, in ten diverse rapeseed genotypes. All ten genotypes were sequenced using the 454 Life Sciences chemistry and to assess the effect of increased sequence depth, two genotypes were also sequenced using Illumina HiSeq chemistry. As a result, 589,367 potentially useful SNPs were identified. Analysis of sequence coverage indicated a four-fold increased representation of target regions, with 57% of the filtered SNPs falling within these regions. Sixty percent of discovered SNPs corresponded to transitions while 40% were transversions. Interestingly, fifty eight percent of the SNPs were found in genic regions while 42% were found in intergenic regions. Further, a high percentage of genic SNPs was found in exons (65% and 64% for the A and C genomes, respectively. Two different genotyping assays were used to validate the discovered SNPs. Validation rates ranged from 61.5% to 84% of tested SNPs, underpinning the effectiveness of this SNP discovery approach. Most importantly, the discovered SNPs were associated with agronomically important regions of the B. napus genome generating a novel data resource for research and breeding this crop species.

  10. Approaching the Three-Dimensional Organization and Dynamics of the Human Genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2003-01-01

    textabstractGenomes are one of the major foundations of life due to their role in information storage, process regulation and evolution. However, the sequential and three-dimensional structure of the human genome in the cell nucleus as well as its interplay with and embedding into the cell and organ

  11. Approaching the Three-Dimensional Organization and Dynamics of the Human Genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2006-01-01

    textabstractGenomes are one of the major foundations of life due to their role in information storage, process regulation and evolution. However, the sequential and three-dimensional structure of the human genome in the cell nucleus as well as its interplay with and embedding into the cell and organ

  12. Approaching the three-dimensional organization and dynamics of the human genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2004-01-01

    textabstractGenomes are one of the major foundations of life due to their role in information storage, process regulation and evolution. However, the sequential and three-dimensional structure of the human genome in the cell nucleus as well as its interplay with and embedding into the cell and organ

  13. Re-annotation of the Saccharopolyspora erythraea genome using a systems biology approach.

    Science.gov (United States)

    Marcellin, Esteban; Licona-Cassani, Cuauhtemoc; Mercer, Tim R; Palfreyman, Robin W; Nielsen, Lars K

    2013-10-11

    Accurate bacterial genome annotations provide a framework to understanding cellular functions, behavior and pathogenicity and are essential for metabolic engineering. Annotations based only on in silico predictions are inaccurate, particularly for large, high G + C content genomes due to the lack of similarities in gene length and gene organization to model organisms. Here we describe a 2D systems biology driven re-annotation of the Saccharopolyspora erythraea genome using proteogenomics, a genome-scale metabolic reconstruction, RNA-sequencing and small-RNA-sequencing. We observed transcription of more than 300 intergenic regions, detected 59 peptides in intergenic regions, confirmed 164 open reading frames previously annotated as hypothetical proteins and reassigned function to open reading frames using the genome-scale metabolic reconstruction. Finally, we present a novel way of mapping ribosomal binding sites across the genome by sequencing small RNAs. The work presented here describes a novel framework for annotation of the Saccharopolyspora erythraea genome. Based on experimental observations, the 2D annotation framework greatly reduces errors that are commonly made when annotating large-high G + C content genomes using computational prediction algorithms.

  14. Optimization of genome engineering approaches with the CRISPR/Cas9 system

    DEFF Research Database (Denmark)

    Li, Kai; Wang, Gang; Andersen, Troels

    2014-01-01

    Designer nucleases such as TALENS and Cas9 have opened new opportunities to scarlessly edit the mammalian genome. Here we explored several parameters that influence Cas9-mediated scarless genome editing efficiency in murine embryonic stem cells. Optimization of transfection conditions and enrichi...

  15. Reference set of regulons in Desulfovibrionales inferred by comparative genomics approach

    Energy Technology Data Exchange (ETDEWEB)

    Kazakov, A.E.; Rodionov, D.A.; Price, M.N.; Arkin, A.P.; Dubchak, I.; Novichkov, P.S.

    2010-11-15

    in this study, we carried out large-scale comparative genomics analysis of regulatory interactions in Desulfovibrio vulgaris and 12 related genomes from Desulfovibrionales order using our recently developed web server RegPredict (http://regpredict.lbl.gov). An overall reference collection of 26 Desulfovibrionales regulogs can be accessed through RegPrecise database (http://regpredict.lbl.gov).

  16. Novel genomic approaches unravel genetic architecture of complex traits in apple.

    NARCIS (Netherlands)

    Kumar, S.; Garrick, D.J.; Bink, M.C.A.M.; Whitworth, C.; Chagné, D.

    2013-01-01

    BACKGROUND: Understanding the genetic architecture of quantitative traits is important for developing genome-based crop improvement methods. Genome-wide association study (GWAS) is a powerful technique for mining novel functional variants. Using a family-based design involving 1,200 apple (Malus × d

  17. A physical approach to segregation and folding of the Caulobacter crescentus genome

    NARCIS (Netherlands)

    Dame, R.T.; Tark-Dame, M.; Schiessel, H

    2011-01-01

    Bacterial genomes are functionally organized. This organization is dynamic and globally changing throughout the cell cycle. Upon initiation of replication of the chromosome, the two origins segregate and move towards their new location taking along the newly replicated genome. Caulobacter crescentus

  18. Approaching the Three-Dimensional Organization and Dynamics of the Human Genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2003-01-01

    textabstractGenomes are one of the major foundations of life due to their role in information storage, process regulation and evolution. However, the sequential and three-dimensional structure of the human genome in the cell nucleus as well as ist interplay with and embedding into the cell and o

  19. Approaching the Three-Dimensional Organization and Dynamics of the Human Genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2003-01-01

    textabstractGenomes are one of the major foundations of life due to their role in information storage, process regulation and evolution. However, the sequential and three-dimensional structure of the human genome in the cell nucleus as well as its interplay with and embedding into the cell and or

  20. Approaching the three-dimensional organization and dynamics of the human genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2003-01-01

    textabstractGenomes are one of the major foundations of life due to their role in information storage, process regulation and evolution. However, the sequential and three-dimensional structure of the human genome in the cell nucleus as well as its interplay with and embedding into the cell and o

  1. Use of comparative genomics approaches to characterize interspecies differences in response to environmental chemicals: Challenges, opportunities, and research needs

    Energy Technology Data Exchange (ETDEWEB)

    Burgess-Herbert, Sarah L., E-mail: sarah.burgess@alum.mit.edu [American Association for the Advancement of Science (AAAS) Science and Technology Policy Fellow at the US Environmental Protection Agency (EPA), 2009–10 (United States); Euling, Susan Y. [National Center for Environmental Assessment, Office of Research and Development, US Environmental Protection Agency, Washington, DC 20460 (United States)

    2013-09-15

    A critical challenge for environmental chemical risk assessment is the characterization and reduction of uncertainties introduced when extrapolating inferences from one species to another. The purpose of this article is to explore the challenges, opportunities, and research needs surrounding the issue of how genomics data and computational and systems level approaches can be applied to inform differences in response to environmental chemical exposure across species. We propose that the data, tools, and evolutionary framework of comparative genomics be adapted to inform interspecies differences in chemical mechanisms of action. We compare and contrast existing approaches, from disciplines as varied as evolutionary biology, systems biology, mathematics, and computer science, that can be used, modified, and combined in new ways to discover and characterize interspecies differences in chemical mechanism of action which, in turn, can be explored for application to risk assessment. We consider how genetic, protein, pathway, and network information can be interrogated from an evolutionary biology perspective to effectively characterize variations in biological processes of toxicological relevance among organisms. We conclude that comparative genomics approaches show promise for characterizing interspecies differences in mechanisms of action, and further, for improving our understanding of the uncertainties inherent in extrapolating inferences across species in both ecological and human health risk assessment. To achieve long-term relevance and consistent use in environmental chemical risk assessment, improved bioinformatics tools, computational methods robust to data gaps, and quantitative approaches for conducting extrapolations across species are critically needed. Specific areas ripe for research to address these needs are recommended.

  2. Genome wide association analysis of the 16th QTL- MAS Workshop dataset using the Random Forest machine learning approach

    Science.gov (United States)

    2014-01-01

    Background Genome wide association studies are now widely used in the livestock sector to estimate the association among single nucleotide polymorphisms (SNPs) distributed across the whole genome and one or more trait. As computational power increases, the use of machine learning techniques to analyze large genome wide datasets becomes possible. Methods The objective of this study was to identify SNPs associated with the three traits simulated in the 16th MAS-QTL workshop dataset using the Random Forest (RF) approach. The approach was applied to single and multiple trait estimated breeding values, and on yield deviations and to compare them with the results of the GRAMMAR-CG method. Results The two QTL mapping methods used, GRAMMAR-CG and RF, were successful in identifying the main QTLs for trait 1 on chromosomes 1 and 4, for trait 2 on chromosomes 1, 4 and 5 and for trait 3 on chromosomes 1, 2 and 3. Conclusions The results of the RF approach were confirmed by the GRAMMAR-CG method and validated by the effective QTL position, even if their approach to unravel cryptic genetic structure is different. Furthermore, both methods showed complementary findings. However, when the variance explained by the QTL is low, they both failed to detect significant associations. PMID:25519518

  3. Social and behavioral research in genomic sequencing: approaches from the Clinical Sequencing Exploratory Research Consortium Outcomes and Measures Working Group.

    Science.gov (United States)

    Gray, Stacy W; Martins, Yolanda; Feuerman, Lindsay Z; Bernhardt, Barbara A; Biesecker, Barbara B; Christensen, Kurt D; Joffe, Steven; Rini, Christine; Veenstra, David; McGuire, Amy L

    2014-10-01

    The routine use of genomic sequencing in clinical medicine has the potential to dramatically alter patient care and medical outcomes. To fully understand the psychosocial and behavioral impact of sequencing integration into clinical practice, it is imperative that we identify the factors that influence sequencing-related decision making and patient outcomes. In an effort to develop a collaborative and conceptually grounded approach to studying sequencing adoption, members of the National Human Genome Research Institute's Clinical Sequencing Exploratory Research Consortium formed the Outcomes and Measures Working Group. Here we highlight the priority areas of investigation and psychosocial and behavioral outcomes identified by the Working Group. We also review some of the anticipated challenges to measurement in social and behavioral research related to genomic sequencing; opportunities for instrument development; and the importance of qualitative, quantitative, and mixed-method approaches. This work represents the early, shared efforts of multiple research teams as we strive to understand individuals' experiences with genomic sequencing. The resulting body of knowledge will guide recommendations for the optimal use of sequencing in clinical practice.

  4. Phylogenetic approaches for the analysis of mitochondrial genome sequence data in the Hymenoptera--a lineage with both rapidly and slowly evolving mitochondrial genomes.

    Science.gov (United States)

    Dowton, Mark; Cameron, Stephen L; Austin, Andy D; Whiting, Michael F

    2009-08-01

    We entirely sequenced two new hymenopteran mitochondrial genomes (Cephus cinctus and Orussus occidentalis), and a substantial portion of another three hymenopterans (Schlettererius cinctipes, Venturia canescens, and Enicospilus). We analyze them together with nine others reported in the literature. We establish that the rate of genetic divergence is two to three times higher among some Hymenoptera when compared with others, making this a group with both long and short phylogenetic branches. We then assessed the ability of a range of phylogenetic approaches to recover seven uncontroversial relationships, when lineages show markedly different rates of molecular evolution. This range encompassed maximum parsimony and Bayesian analysis of (i) amino acid data, (ii) nucleotide data, and (iii) nucleotide data excluding third codon positions. Unpartitioned analyses were compared with partitioned analyses, with the data partitioned by codon position (ribosomal genes were placed in a separate partition). These analyses indicated that partitioned, Bayesian analysis of nucleotide data, excluding 3rd codon positions, recovered more of the uncontroversial relationships than any other approach. These results suggest that the analysis of complete mitochondrial genome sequences holds promise for the resolution of hymenopteran superfamily relationships.

  5. Matching Two-dimensional Gel Electrophoresis' Spots

    DEFF Research Database (Denmark)

    Dos Anjos, António; AL-Tam, Faroq; Shahbazkia, Hamid Reza

    2012-01-01

    This paper describes an approach for matching Two-Dimensional Electrophoresis (2-DE) gels' spots, involving the use of image registration. The number of false positive matches produced by the proposed approach is small, when compared to academic and commercial state-of-the-art approaches. This ar......This paper describes an approach for matching Two-Dimensional Electrophoresis (2-DE) gels' spots, involving the use of image registration. The number of false positive matches produced by the proposed approach is small, when compared to academic and commercial state-of-the-art approaches...

  6. Microsatellite analysis in the genome of Acanthaceae: An in silico approach

    Directory of Open Access Journals (Sweden)

    Priyadharsini Kaliswamy

    2015-01-01

    Full Text Available Background: Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. Objective: The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. Materials and Methods: The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Results: Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. Conclusion: The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future.

  7. A Genome-Scale Modeling Approach to Quantify Biofilm Component Growth of Salmonella Typhimurium.

    Science.gov (United States)

    Ribaudo, Nicholas; Li, Xianhua; Davis, Brett; Wood, Thomas K; Huang, Zuyi Jacky

    2017-01-01

    Salmonella typhimurium (S. typhimurium) is an extremely dangerous foodborne bacterium that infects both animal and human subjects, causing fatal diseases around the world. Salmonella's robust virulence, antibiotic-resistant nature, and capacity to survive under harsh conditions are largely due to its ability to form resilient biofilms. Multiple genome-scale metabolic models have been developed to study the complex and diverse nature of this organism's metabolism; however, none of these models fully integrated the reactions and mechanisms required to study the influence of biofilm formation. This work developed a systems-level approach to study the adjustment of intracellular metabolism of S. typhimurium during biofilm formation. The most advanced metabolic reconstruction currently available, STM_v1.0, was 1st extended to include the formation of the extracellular biofilm matrix. Flux balance analysis was then employed to study the influence of biofilm formation on cellular growth rate and the production rates of biofilm components. With biofilm formation present, biomass growth was examined under nutrient rich and nutrient deficient conditions, resulting in overall growth rates of 0.8675 and 0.6238 h(-1) respectively. Investigation of intracellular flux variation during biofilm formation resulted in the elucidation of 32 crucial reactions, and associated genes, whose fluxes most significantly adapt during the physiological response. Experimental data were found in the literature to validate the importance of these genes for the biofilm formation of S. typhimurium. This preliminary investigation on the adjustment of intracellular metabolism of S. typhimurium during biofilm formation will serve as a platform to generate hypotheses for further experimental study on the biofilm formation of this virulent bacterium.

  8. [DNA barcoding is a new approach in comparative genomics of plants].

    Science.gov (United States)

    Shneer, V S

    2009-11-01

    DNA barcoding was proposed as a method for recognition and identification of eukaryotic species through comparison of sequences of a standard short DNA fragment--DNA barcode--from an unknown specimen to a library of reference sequences from known species. This allows identifying an organism at any stage of development from a very small tissue sample, fresh or conserved many years ago. Molecular identification of plant samples can be used in various scientific and applied fields. It would also help to find new species, which is particularly important for cryptogamic plants. An optimal DNA barcode region is a small fragment present in all species of a major taxonomic group, having invariable nucleotide sequence in all members of the same species, but with sufficient variation to discriminate among the species. This fragment should be flanked by low-variable regions for use of universal primers in PCR for amplification and sequencing. The DNA barcode that is well established in animals is a sequence of a fragment of the mitochondrial cytochrome c oxidase gene CO1. However, searching for DNA barcode in plants proved to be a more challenging task. No DNA region universally suitable for all plants and meeting all of the necessary criteria has been found. Apparently, a multilocus or two-stage approach should be applied for this purpose. Several fragments of the chloroplast genome (trnH-psbA, matK, rpoC, rpoB, rbcL) in combinations of two or three regions were suggested as candidate regions with highest potential, but more representative samples should be examined to choose the best candidate. The possibility is discussed to use as DNA barcode internal transcribed spacers (ITS) of nuclear rRNA genes, which are highly variable, widely employed in molecular phylogenetic studies at the species level, but also have some limitations.

  9. A functional genomics approach identifies candidate effectors from the aphid species Myzus persicae (green peach aphid).

    Science.gov (United States)

    Bos, Jorunn I B; Prince, David; Pitino, Marco; Maffei, Massimo E; Win, Joe; Hogenhout, Saskia A

    2010-11-18

    Aphids are amongst the most devastating sap-feeding insects of plants. Like most plant parasites, aphids require intimate associations with their host plants to gain access to nutrients. Aphid feeding induces responses such as clogging of phloem sieve elements and callose formation, which are suppressed by unknown molecules, probably proteins, in aphid saliva. Therefore, it is likely that aphids, like plant pathogens, deliver proteins (effectors) inside their hosts to modulate host cell processes, suppress plant defenses, and promote infestation. We exploited publicly available aphid salivary gland expressed sequence tags (ESTs) to apply a functional genomics approach for identification of candidate effectors from Myzus persicae (green peach aphid), based on common features of plant pathogen effectors. A total of 48 effector candidates were identified, cloned, and subjected to transient overexpression in Nicotiana benthamiana to assay for elicitation of a phenotype, suppression of the Pathogen-Associated Molecular Pattern (PAMP)-mediated oxidative burst, and effects on aphid reproductive performance. We identified one candidate effector, Mp10, which specifically induced chlorosis and local cell death in N. benthamiana and conferred avirulence to recombinant Potato virus X (PVX) expressing Mp10, PVX-Mp10, in N. tabacum, indicating that this protein may trigger plant defenses. The ubiquitin-ligase associated protein SGT1 was required for the Mp10-mediated chlorosis response in N. benthamiana. Mp10 also suppressed the oxidative burst induced by flg22, but not by chitin. Aphid fecundity assays revealed that in planta overexpression of Mp10 and Mp42 reduced aphid fecundity, whereas another effector candidate, MpC002, enhanced aphid fecundity. Thus, these results suggest that, although Mp10 suppresses flg22-triggered immunity, it triggers a defense response, resulting in an overall decrease in aphid performance in the fecundity assays. Overall, we identified aphid

  10. CRISPR/Cas9-Mediated Genome Editing as a Therapeutic Approach for Leber Congenital Amaurosis 10.

    Science.gov (United States)

    Ruan, Guo-Xiang; Barry, Elizabeth; Yu, Dan; Lukason, Michael; Cheng, Seng H; Scaria, Abraham

    2017-02-01

    As the most common subtype of Leber congenital amaurosis (LCA), LCA10 is a severe retinal dystrophy caused by mutations in the CEP290 gene. The most frequent mutation found in patients with LCA10 is a deep intronic mutation in CEP290 that generates a cryptic splice donor site. The large size of the CEP290 gene prevents its use in adeno-associated virus (AAV)-mediated gene augmentation therapy. Here, we show that targeted genomic deletion using the clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 system represents a promising therapeutic approach for the treatment of patients with LCA10 bearing the CEP290 splice mutation. We generated a cellular model of LCA10 by introducing the CEP290 splice mutation into 293FT cells and we showed that guide RNA pairs coupled with SpCas9 were highly efficient at removing the intronic splice mutation and restoring the expression of wild-type CEP290. In addition, we demonstrated that a dual AAV system could effectively delete an intronic fragment of the Cep290 gene in the mouse retina. To minimize the immune response to prolonged expression of SpCas9, we developed a self-limiting CRISPR/Cas9 system that minimizes the duration of SpCas9 expression. These results support further studies to determine the therapeutic potential of CRISPR/Cas9-based strategies for the treatment of patients with LCA10. Copyright © 2017 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.

  11. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance

    OpenAIRE

    Manning, Alisa K; Hivert, Marie-France; Scott, Robert A.; Grimsby, Jonna L; Bouatia-Naji, Nabila; Chen, Han; Rybin, Denis; Liu, Ching-Ti; Bielak, Lawrence F; Prokopenko, Inga; Amin, Najaf; Barnes, Daniel; Cadby, Gemma; Hottenga, Jouke-Jan; Ingelsson, Erik

    2012-01-01

    Recent genome-wide association studies have described many loci implicated in type 2 diabetes (T2D) pathophysiology and beta-cell dysfunction but have contributed little to the understanding of the genetic basis of insulin resistance. We hypothesized that genes implicated in insulin resistance pathways might be uncovered by accounting for differences in body mass index (BMI) and potential interactions between BMI and genetic variants. We applied a joint meta-analysis approach to test associat...

  12. Pan-Genome Analysis of Human Gastric Pathogen H. pylori: Comparative Genomics and Pathogenomics Approaches to Identify Regions Associated with Pathogenicity and Prediction of Potential Core Therapeutic Targets

    DEFF Research Database (Denmark)

    Ali, Amjad; Naz, Anam; Soares, Siomar C.

    2015-01-01

    . Pan-genome analyses of the global representative H. pylori isolates consisting of 39 complete genomes are presented in this paper. Phylogenetic analyses have revealed close relationships among geographically diverse strains of H. pylori. The conservation among these genomes was further analyzed by pan-genome...

  13. PEMapper and PECaller provide a simplified approach to whole-genome sequencing.

    Science.gov (United States)

    Johnston, H Richard; Chopra, Pankaj; Wingo, Thomas S; Patel, Viren; Epstein, Michael P; Mulle, Jennifer G; Warren, Stephen T; Zwick, Michael E; Cutler, David J

    2017-03-07

    The analysis of human whole-genome sequencing data presents significant computational challenges. The sheer size of datasets places an enormous burden on computational, disk array, and network resources. Here, we present an integrated computational package, PEMapper/PECaller, that was designed specifically to minimize the burden on networks and disk arrays, create output files that are minimal in size, and run in a highly computationally efficient way, with the single goal of enabling whole-genome sequencing at scale. In addition to improved computational efficiency, we implement a statistical framework that allows for a base by base error model, allowing this package to perform as well or better than the widely used Genome Analysis Toolkit (GATK) in all key measures of performance on human whole-genome sequences.

  14. Tapping into Salmonella typhimurium LT2 genome in a quest to explore its therapeutic arsenal: A metabolic network modeling approach.

    Science.gov (United States)

    Mehla, Kusum; Ramana, Jayashree

    2017-02-01

    S. typhimurium, the classical broad-host-range serovar is a widely distributed cause of food-borne illness. Escalating antibiotic resistance and potential of conjugal transmission to other pathogens attributable to its broad spectrum host specificities have aided S. typhimurium to emerge as a global health threat. To keep pace with ever evolving bacterial defenses, there is dire need to restock the antibiotic pipeline. Genome scale metabolic reconstructions present immense possibilities to decipher physiological properties of an organism using constraint-based methods The systems-level approaches of genome scale metabolic networks interrogation open up new avenues of drug target identification against deadly infectious diseases. We performed flux balance analysis and minimization of metabolic adjustment studies of genome scale reconstruction model of S. typhimurium targeted at identifying large number of metabolites with a potential to be utilized as therapeutic drug targets. These constraint based approaches initially predict a set of genes indispensable to bacterial survival by performing gene knockout studies which are then prioritized through a multistep process. Metabolites involved in l-rhamnose biosynthesis, peptidoglycan biosynthesis, fatty acid biosynthesis, and folate biosynthesis pathways were prioritized as candidate drug targets. This study provides a general therapeutic approach which can be effectively applied to other pathogens as well.

  15. Random Tagging Genotyping by Sequencing (rtGBS, an Unbiased Approach to Locate Restriction Enzyme Sites across the Target Genome.

    Directory of Open Access Journals (Sweden)

    Elena Hilario

    Full Text Available Genotyping by sequencing (GBS is a restriction enzyme based targeted approach developed to reduce the genome complexity and discover genetic markers when a priori sequence information is unavailable. Sufficient coverage at each locus is essential to distinguish heterozygous from homozygous sites accurately. The number of GBS samples able to be pooled in one sequencing lane is limited by the number of restriction sites present in the genome and the read depth required at each site per sample for accurate calling of single-nucleotide polymorphisms. Loci bias was observed using a slight modification of the Elshire et al.some restriction enzyme sites were represented in higher proportions while others were poorly represented or absent. This bias could be due to the quality of genomic DNA, the endonuclease and ligase reaction efficiency, the distance between restriction sites, the preferential amplification of small library restriction fragments, or bias towards cluster formation of small amplicons during the sequencing process. To overcome these issues, we have developed a GBS method based on randomly tagging genomic DNA (rtGBS. By randomly landing on the genome, we can, with less bias, find restriction sites that are far apart, and undetected by the standard GBS (stdGBS method. The study comprises two types of biological replicates: six different kiwifruit plants and two independent DNA extractions per plant; and three types of technical replicates: four samples of each DNA extraction, stdGBS vs. rtGBS methods, and two independent library amplifications, each sequenced in separate lanes. A statistically significant unbiased distribution of restriction fragment size by rtGBS showed that this method targeted 49% (39,145 of BamH I sites shared with the reference genome, compared to only 14% (11,513 by stdGBS.

  16. Unraveling CRISPR-Cas9 genome engineering parameters via a library-on-library approach

    Science.gov (United States)

    Chari, Raj; Mali, Prashant; Moosburner, Mark; Church, George M.

    2017-01-01

    We develop an in vivo library-on-library methodology to simultaneously assess single guide RNA (sgRNA) activity across ~1,400 genomic loci. Assaying across multiple human cell types, end-processing enzymes, and two Cas9 orthologs, we unravel underlying nucleotide sequence and epigenetic parameters. Our results enable improved design of reagents, shed light on mechanisms of genome targeting, and provide a generalizable framework to study nucleic acid-nucleic acid interactions and biochemistry in high throughput. PMID:26167643

  17. Germ line genome editing in clinics: the approaches, objectives and global society

    OpenAIRE

    Ishii, Tetsuya

    2015-01-01

    Genome editing allows for the versatile genetic modification of somatic cells, germ cells and embryos. In particular, CRISPR/Cas9 is worldwide used in biomedical research. Although the first report on Cas9-mediated gene modification in human embryos focused on the prevention of a genetic disease in offspring, it raised profound ethical and social concerns over the safety of subsequent generations and the potential misuse of genome editing for human enhancement. The present article considers g...

  18. Regulatory hurdles for genome editing: process- vs. product-based approaches in different regulatory contexts

    OpenAIRE

    Sprink, Thorben; Eriksson, Dennis; Schiemann, Joachim; Hartung, Frank

    2016-01-01

    Novel plant genome editing techniques call for an updated legislation regulating the use of plants produced by genetic engineering or genome editing, especially in the European Union. Established more than 25?years ago and based on a clear distinction between transgenic and conventionally bred plants, the current EU Directives fail to accommodate the new continuum between genetic engineering and conventional breeding. Despite the fact that the Directive 2001/18/EC contains both process- and p...

  19. Nutritional genomics: a practical approach by early life conditioning with dietary phosphorus

    OpenAIRE

    Ashwell,Christopher M.; Angel, Roselina

    2010-01-01

    The recent technologies that have led to the new field of functional genomics (how the genome of an organism regulates homeostasis and responds to stimuli) are providing a clearer understanding of how organisms interact with their environment and in particular their diet. We are beginning to learn how the diet may have long-term influence on performance and health. A form of epigenetic regulation has been recently described called fetal "programming". Fueled by epidemiological data the "fetal...

  20. Germ line genome editing in clinics: the approaches, objectives and global society

    OpenAIRE

    Ishii, Tetsuya

    2017-01-01

    Genome editing allows for the versatile genetic modification of somatic cells, germ cells and embryos. In particular, CRISPR/ Cas9 is worldwide used in biomedical research. Although the first report on Cas9-mediated gene modification in human embryos focused on the prevention of a genetic disease in offspring, it raised profound ethical and social concerns over the safety of subsequent generations and the potential misuse of genome editing for human enhancement. The present article considers ...

  1. Multi-omic data integration and analysis using systems genomics approaches: methods and applications in animal production, health and welfare.

    Science.gov (United States)

    Suravajhala, Prashanth; Kogelman, Lisette J A; Kadarmideen, Haja N

    2016-04-29

    In the past years, there has been a remarkable development of high-throughput omics (HTO) technologies such as genomics, epigenomics, transcriptomics, proteomics and metabolomics across all facets of biology. This has spearheaded the progress of the systems biology era, including applications on animal production and health traits. However, notwithstanding these new HTO technologies, there remains an emerging challenge in data analysis. On the one hand, different HTO technologies judged on their own merit are appropriate for the identification of disease-causing genes, biomarkers for prevention and drug targets for the treatment of diseases and for individualized genomic predictions of performance or disease risks. On the other hand, integration of multi-omic data and joint modelling and analyses are very powerful and accurate to understand the systems biology of healthy and sustainable production of animals. We present an overview of current and emerging HTO technologies each with a focus on their applications in animal and veterinary sciences before introducing an integrative systems genomics framework for analysing and integrating multi-omic data towards improved animal production, health and welfare. We conclude that there are big challenges in multi-omic data integration, modelling and systems-level analyses, particularly with the fast emerging HTO technologies. We highlight existing and emerging systems genomics approaches and discuss how they contribute to our understanding of the biology of complex traits or diseases and holistic improvement of production performance, disease resistance and welfare.

  2. Combining Mass Spectrometric Metabolic Profiling with Genomic Analysis: A Powerful Approach for Discovering Natural Products from Cyanobacteria.

    Science.gov (United States)

    Kleigrewe, Karin; Almaliti, Jehad; Tian, Isaac Yuheng; Kinnel, Robin B; Korobeynikov, Anton; Monroe, Emily A; Duggan, Brendan M; Di Marzo, Vincenzo; Sherman, David H; Dorrestein, Pieter C; Gerwick, Lena; Gerwick, William H

    2015-07-24

    An innovative approach was developed for the discovery of new natural products by combining mass spectrometric metabolic profiling with genomic analysis and resulted in the discovery of the columbamides, a new class of di- and trichlorinated acyl amides with cannabinomimetic activity. Three species of cultured marine cyanobacteria, Moorea producens 3L, Moorea producens JHB, and Moorea bouillonii PNG, were subjected to genome sequencing and analysis for their recognizable biosynthetic pathways, and this information was then compared with their respective metabolomes as detected by MS profiling. By genome analysis, a presumed regulatory domain was identified upstream of several previously described biosynthetic gene clusters in two of these cyanobacteria, M. producens 3L and M. producens JHB. A similar regulatory domain was identified in the M. bouillonii PNG genome, and a corresponding downstream biosynthetic gene cluster was located and carefully analyzed. Subsequently, MS-based molecular networking identified a series of candidate products, and these were isolated and their structures rigorously established. On the basis of their distinctive acyl amide structure, the most prevalent metabolite was evaluated for cannabinomimetic properties and found to be moderate affinity ligands for CB1.

  3. Inferring Population Size History from Large Samples of Genome-Wide Molecular Data - An Approximate Bayesian Computation Approach.

    Directory of Open Access Journals (Sweden)

    Simon Boitard

    2016-03-01

    Full Text Available Inferring the ancestral dynamics of effective population size is a long-standing question in population genetics, which can now be tackled much more accurately thanks to the massive genomic data available in many species. Several promising methods that take advantage of whole-genome sequences have been recently developed in this context. However, they can only be applied to rather small samples, which limits their ability to estimate recent population size history. Besides, they can be very sensitive to sequencing or phasing errors. Here we introduce a new approximate Bayesian computation approach named PopSizeABC that allows estimating the evolution of the effective population size through time, using a large sample of complete genomes. This sample is summarized using the folded allele frequency spectrum and the average zygotic linkage disequilibrium at different bins of physical distance, two classes of statistics that are widely used in population genetics and can be easily computed from unphased and unpolarized SNP data. Our approach provides accurate estimations of past population sizes, from the very first generations before present back to the expected time to the most recent common ancestor of the sample, as shown by simulations under a wide range of demographic scenarios. When applied to samples of 15 or 25 complete genomes in four cattle breeds (Angus, Fleckvieh, Holstein and Jersey, PopSizeABC revealed a series of population declines, related to historical events such as domestication or modern breed creation. We further highlight that our approach is robust to sequencing errors, provided summary statistics are computed from SNPs with common alleles.

  4. Pattern recognition and string matching

    CERN Document Server

    Cheng, Xiuzhen

    2002-01-01

    The research and development of pattern recognition have proven to be of importance in science, technology, and human activity. Many useful concepts and tools from different disciplines have been employed in pattern recognition. Among them is string matching, which receives much theoretical and practical attention. String matching is also an important topic in combinatorial optimization. This book is devoted to recent advances in pattern recognition and string matching. It consists of twenty eight chapters written by different authors, addressing a broad range of topics such as those from classifica­ tion, matching, mining, feature selection, and applications. Each chapter is self-contained, and presents either novel methodological approaches or applications of existing theories and techniques. The aim, intent, and motivation for publishing this book is to pro­ vide a reference tool for the increasing number of readers who depend upon pattern recognition or string matching in some way. This includes student...

  5. On the Approximability of Comparing Genomes with Duplicates

    CERN Document Server

    Angibaud, Sébastien; Rusu, Irena; Thevenin, Annelyse; Vialette, Stéphane

    2008-01-01

    A central problem in comparative genomics consists in computing a (dis-)similarity measure between two genomes, e.g. in order to construct a phylogeny. All the existing measures are defined on genomes without duplicates. However, we know that genes can be duplicated within the same genome. One possible approach to overcome this difficulty is to establish a one-to-one correspondence (i.e. a matching) between genes of both genomes, where the correspondence is chosen in order to optimize the studied measure. In this paper, we are interested in three measures (number of breakpoints, number of common intervals and number of conserved intervals) and three models of matching (exemplar, intermediate and maximum matching models). We prove that, for each model and each measure M, computing a matching between two genomes that optimizes M is APX-hard. We also study the complexity of the following problem: is there an exemplarization (resp. an intermediate/maximum matching) that induces no breakpoint? We prove the problem...

  6. Final Report: Transport and its regulation in Marine Microorganisms: A Genomic Based Approach

    Energy Technology Data Exchange (ETDEWEB)

    Brian Palenik; Bianca Brahamsha; Ian Paulsen

    2009-09-03

    This grant funded the analysis and annotation of the genomes of Synechococcus and Ostreococcus, major marine primary producers. Particular attention was paid to the analysis of transporters using state of the art bioinformatics analyses. During the analysis of the Synechococcus genome, some of the components of the unique bacterial swimming apparatus of one species of Synechococcus (Clade III, strain WH8102) were determined and these included transporters, novel giant proteins and glycosyltransferases. This grant funded the analysis of gene expression in Synechococcus using whole genome microarrays. These analyses revealed the strategies by which marine cyanobacteria respond to environmental conditions such as the absence of phosphorus, a common limiting nutrient, and the interaction of Synechococcus with other microbes. These analyses will help develop models of gene regulation in cyanobacteria and thus help predict their responses to changes in environmental conditions.

  7. Genomics approaches to unlock the high yield potential of cassava, a tropical model plant

    Directory of Open Access Journals (Sweden)

    Shengkui ZHANG,Ping'an MA,Haiyan WANG,Cheng LU,Xin CHEN,Zhiqiang XIA,Meiling ZOU,Xinchen ZHOU,Wenquan WANG

    2014-12-01

    Full Text Available Cassava, a tropical food, feed and biofuel crop, has great capacity for biomass accumulation and an extraordinary efficiency in water use and mineral nutrition, which makes it highly suitable as a model plant for tropical crops. However, the understanding of the metabolism and genomics of this important crop is limited. The recent breakthroughs in the genomics of cassava, including whole-genome sequencing and transcriptome analysis, as well as advances in the biology of photosynthesis, starch biosynthesis, adaptation to drought and high temperature, and resistance to virus and bacterial diseases, are reviewed here. Many of the new developments have come from comparative analyses between a wild ancestor and existing cultivars. Finally, the current challenges and future potential of cassava as a model plant are discussed.

  8. An approach to incorporate linkage disequilibrium structure into genomic association analysis

    Institute of Scientific and Technical Information of China (English)

    Fengyu Zhang; Diane Wagener

    2008-01-01

    In this study, we propose to use the principal component analysis (PCA) and regression model to incorporate linkage disequilibrium (LD) in genomic association data analysis. To accommodate LD in genomic data and reduce multiple testing, we suggest performing PCA and extracting the PCA score to capture the variation of genomic data, after which regression analysis is used to assess the association of the disease with the principal component score. An empirical analysis result shows that both genotype-basod correlation matrix and haplotype-based LD matrix can produce similar results for PCA. Principal component score seems to be more powerful in detecting genetic association because the principal component score is quantitatively measured and may be able to capture the effect of multiple loci.

  9. New approach for phylogenetic tree recovery based on genome-scale metabolic networks.

    Science.gov (United States)

    Gamermann, Daniel; Montagud, Arnaud; Conejero, J Alberto; Urchueguía, Javier F; de Córdoba, Pedro Fernández

    2014-07-01

    A wide range of applications and research has been done with genome-scale metabolic models. In this work, we describe an innovative methodology for comparing metabolic networks constructed from genome-scale metabolic models and how to apply this comparison in order to infer evolutionary distances between different organisms. Our methodology allows a quantification of the metabolic differences between different species from a broad range of families and even kingdoms. This quantification is then applied in order to reconstruct phylogenetic trees for sets of various organisms.

  10. Landscape genomics of Sphaeralcea ambigua in the Mojave Desert: a multivariate, spatially-explicit approach to guide ecological restoration

    Science.gov (United States)

    Shryock, Daniel F.; Havrilla, Caroline A.; DeFalco, Lesley; Esque, Todd C.; Custer, Nathan; Wood, Troy E.

    2015-01-01

    Local adaptation influences plant species’ responses to climate change and their performance in ecological restoration. Fine-scale physiological or phenological adaptations that direct demographic processes may drive intraspecific variability when baseline environmental conditions change. Landscape genomics characterize adaptive differentiation by identifying environmental drivers of adaptive genetic variability and mapping the associated landscape patterns. We applied such an approach to Sphaeralcea ambigua, an important restoration plant in the arid southwestern United States, by analyzing variation at 153 amplified fragment length polymorphism loci in the context of environmental gradients separating 47 Mojave Desert populations. We identified 37 potentially adaptive loci through a combination of genome scan approaches. We then used a generalized dissimilarity model (GDM) to relate variability in potentially adaptive loci with spatial gradients in temperature, precipitation, and topography. We identified non-linear thresholds in loci frequencies driven by summer maximum temperature and water stress, along with continuous variation corresponding to temperature seasonality. Two GDM-based approaches for mapping predicted patterns of local adaptation are compared. Additionally, we assess uncertainty in spatial interpolations through a novel spatial bootstrapping approach. Our study presents robust, accessible methods for deriving spatially-explicit models of adaptive genetic variability in non-model species that will inform climate change modelling and ecological restoration.

  11. Multi-omic data integration and analysis using systems genomics approaches

    DEFF Research Database (Denmark)

    Suravajhala, Prashanth; Kogelman, Lisette; Kadarmideen, Haja

    2016-01-01

    In the past years, there has been a remarkable development of high-throughput omics (HTO) technologies such as genomics, epigenomics, transcriptomics, proteomics and metabolomics across all facets of biology. This has spearheaded the progress of the systems biology era, including applications on ...

  12. Candidate fire blight resistance genes in Malus identified with the use of genomic tools and approaches

    Science.gov (United States)

    The goal of this research is to utilize current advances in Rosaceae genomics to identify DNA markers for use in marker-assisted selection of durable resistance to fire blight. Candidate fire blight resistance genes were selected and ranked based upon differential expression after inoculation with ...

  13. DNA Slippage Occurs at Microsatellite Loci without Minimal Threshold Length in Humans: A Comparative Genomic Approach

    Science.gov (United States)

    Leclercq, Sébastien; Rivals, Eric; Jarne, Philippe

    2010-01-01

    The dynamics of microsatellite, or short tandem repeats (STRs), is well documented for long, polymorphic loci, but much less is known for shorter ones. For example, the issue of a minimum threshold length for DNA slippage remains contentious. Model-fitting methods have generally concluded that slippage only occurs over a threshold length of about eight nucleotides, in contradiction with some direct observations of tandem duplications at shorter repeated sites. Using a comparative analysis of the human and chimpanzee genomes, we examined the mutation patterns at microsatellite loci with lengths as short as one period plus one nucleotide. We found that the rates of tandem insertions and deletions at microsatellite loci strongly deviated from background rates in other parts of the human genome and followed an exponential increase with STR size. More importantly, we detected no lower threshold length for slippage. The rate of tandem duplications at unrepeated sites was higher than expected from random insertions, providing evidence for genome-wide action of indel slippage (an alternative mechanism generating tandem repeats). The rate of point mutations adjacent to STRs did not differ from that estimated elsewhere in the genome, except around dinucleotide loci. Our results suggest that the emergence of STR depends on DNA slippage, indel slippage, and point mutations. We also found that the dynamics of tandem insertions and deletions differed in both rates and size at which these mutations take place. We discuss these results in both evolutionary and mechanistic terms. PMID:20624737

  14. A genomic approach to examine the complex evolution of laurasiatherian mammals.

    Directory of Open Access Journals (Sweden)

    Björn M Hallström

    Full Text Available Recent phylogenomic studies have failed to conclusively resolve certain branches of the placental mammalian tree, despite the evolutionary analysis of genomic data from 32 species. Previous analyses of single genes and retroposon insertion data yielded support for different phylogenetic scenarios for the most basal divergences. The results indicated that some mammalian divergences were best interpreted not as a single bifurcating tree, but as an evolutionary network. In these studies the relationships among some orders of the super-clade Laurasiatheria were poorly supported, albeit not studied in detail. Therefore, 4775 protein-coding genes (6,196,263 nucleotides were collected and aligned in order to analyze the evolution of this clade. Additionally, over 200,000 introns were screened in silico, resulting in 32 phylogenetically informative long interspersed nuclear elements (LINE insertion events. The present study shows that the genome evolution of Laurasiatheria may best be understood as an evolutionary network. Thus, contrary to the common expectation to resolve major evolutionary events as a bifurcating tree, genome analyses unveil complex speciation processes even in deep mammalian divergences. We exemplify this on a subset of 1159 suitable genes that have individual histories, most likely due to incomplete lineage sorting or introgression, processes that can make the genealogy of mammalian genomes complex. These unexpected results have major implications for the understanding of evolution in general, because the evolution of even some higher level taxa such as mammalian orders may sometimes not be interpreted as a simple bifurcating pattern.

  15. A combined approach for genome wide protein function annotation/prediction

    DEFF Research Database (Denmark)

    Benso, Alfredo; Di Carlo, Stefano; Ur Rehman, Hafeez

    2013-01-01

    proteins in functional genomics and biology in general motivates the use of computational techniques well orchestrated to accurately predict their functions. METHODS: We propose a computational flow for the functional annotation of a protein able to assign the most probable functions to a protein...

  16. Identification of a strawberry flavor gene candidate using an integrated genetic-genomic-analytical chemistry approach

    Science.gov (United States)

    Background: There is interest in improving the flavor of commercial strawberry (Fragaria × ananassa) varieties. Fruit flavor is shaped by combinations of sugars, acids and volatile compounds. Many efforts seek to use genomics-based strategies to identify genes controlling flavor, and then designing ...

  17. GenoMetric Query Language: a novel approach to large-scale genomic data management.

    Science.gov (United States)

    Masseroli, Marco; Pinoli, Pietro; Venco, Francesco; Kaitoua, Abdulrahman; Jalili, Vahid; Palluzzi, Fernando; Muller, Heiko; Ceri, Stefano

    2015-06-15

    Improvement of sequencing technologies and data processing pipelines is rapidly providing sequencing data, with associated high-level features, of many individual genomes in multiple biological and clinical conditions. They allow for data-driven genomic, transcriptomic and epigenomic characterizations, but require state-of-the-art 'big data' computing strategies, with abstraction levels beyond available tool capabilities. We propose a high-level, declarative GenoMetric Query Language (GMQL) and a toolkit for its use. GMQL operates downstream of raw data preprocessing pipelines and supports queries over thousands of heterogeneous datasets and samples; as such it is key to genomic 'big data' analysis. GMQL leverages a simple data model that provides both abstractions of genomic region data and associated experimental, biological and clinical metadata and interoperability between many data formats. Based on Hadoop framework and Apache Pig platform, GMQL ensures high scalability, expressivity, flexibility and simplicity of use, as demonstrated by several biological query examples on ENCODE and TCGA datasets. The GMQL toolkit is freely available for non-commercial use at http://www.bioinformatics.deib.polimi.it/GMQL/. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  18. A genomic approach to examine the complex evolution of laurasiatherian mammals.

    Science.gov (United States)

    Hallström, Björn M; Schneider, Adrian; Zoller, Stefan; Janke, Axel

    2011-01-01

    Recent phylogenomic studies have failed to conclusively resolve certain branches of the placental mammalian tree, despite the evolutionary analysis of genomic data from 32 species. Previous analyses of single genes and retroposon insertion data yielded support for different phylogenetic scenarios for the most basal divergences. The results indicated that some mammalian divergences were best interpreted not as a single bifurcating tree, but as an evolutionary network. In these studies the relationships among some orders of the super-clade Laurasiatheria were poorly supported, albeit not studied in detail. Therefore, 4775 protein-coding genes (6,196,263 nucleotides) were collected and aligned in order to analyze the evolution of this clade. Additionally, over 200,000 introns were screened in silico, resulting in 32 phylogenetically informative long interspersed nuclear elements (LINE) insertion events. The present study shows that the genome evolution of Laurasiatheria may best be understood as an evolutionary network. Thus, contrary to the common expectation to resolve major evolutionary events as a bifurcating tree, genome analyses unveil complex speciation processes even in deep mammalian divergences. We exemplify this on a subset of 1159 suitable genes that have individual histories, most likely due to incomplete lineage sorting or introgression, processes that can make the genealogy of mammalian genomes complex. These unexpected results have major implications for the understanding of evolution in general, because the evolution of even some higher level taxa such as mammalian orders may sometimes not be interpreted as a simple bifurcating pattern.

  19. Crop systems biology : an approach to connect functional genomics with crop modelling

    NARCIS (Netherlands)

    Yin, X.; Struik, P.C.

    2007-01-01

    The response of the whole crop to environmental conditions is a critical factor in agriculture. It can only be understood if the organization of the crop system is taken into account. A popular view in modern science is that genomics (and other `omics¿) will provide knowledge and tools to allow the

  20. Approaches to advancing quantitative human health risk assessment of environmental chemicals in the post-genomic era

    Energy Technology Data Exchange (ETDEWEB)

    Chiu, Weihsueh A., E-mail: chiu.weihsueh@epa.gov [National Center for Environmental Assessment, U.S. Environmental Protection Agency, Washington DC, 20460 (United States); Euling, Susan Y.; Scott, Cheryl Siegel; Subramaniam, Ravi P. [National Center for Environmental Assessment, U.S. Environmental Protection Agency, Washington DC, 20460 (United States)

    2013-09-15

    The contribution of genomics and associated technologies to human health risk assessment for environmental chemicals has focused largely on elucidating mechanisms of toxicity, as discussed in other articles in this issue. However, there is interest in moving beyond hazard characterization to making more direct impacts on quantitative risk assessment (QRA) — i.e., the determination of toxicity values for setting exposure standards and cleanup values. We propose that the evolution of QRA of environmental chemicals in the post-genomic era will involve three, somewhat overlapping phases in which different types of approaches begin to mature. The initial focus (in Phase I) has been and continues to be on “augmentation” of weight of evidence — using genomic and related technologies qualitatively to increase the confidence in and scientific basis of the results of QRA. Efforts aimed towards “integration” of these data with traditional animal-based approaches, in particular quantitative predictors, or surrogates, for the in vivo toxicity data to which they have been anchored are just beginning to be explored now (in Phase II). In parallel, there is a recognized need for “expansion” of the use of established biomarkers of susceptibility or risk of human diseases and disorders for QRA, particularly for addressing the issues of cumulative assessment and population risk. Ultimately (in Phase III), substantial further advances could be realized by the development of novel molecular and pathway-based biomarkers and statistical and in silico models that build on anticipated progress in understanding the pathways of human diseases and disorders. Such efforts would facilitate a gradual “reorientation” of QRA towards approaches that more directly link environmental exposures to human outcomes.

  1. Genomic confirmation of vancomycin-resistant Enterococcus transmission from deceased donor to liver transplant recipient.

    Science.gov (United States)

    Bashir, Ali; Attie, Oliver; Sullivan, Mitchell; Sebra, Robert; Singh, Kavindra V; Altman, Deena; Pak, Theodore; Dutta, Jayeeta; Chacko, Kieran; Webster, Elizabeth; Lewis, Martha; Hamula, Camille; Delli Carpini, Kristin W; Murray, Barbara E; Kasarskis, Andrew; van Bakel, Harm; Huprikar, Shirish

    2017-01-01

    In a liver transplant recipient with vancomycin-resistant Enterococcus (VRE) surgical site and bloodstream infection, a combination of pulsed-field gel electrophoresis, multilocus sequence typing, and whole genome sequencing identified that donor and recipient VRE isolates were highly similar when compared to time-matched hospital isolates. Comparison of de novo assembled isolate genomes was highly suggestive of transplant transmission rather than hospital-acquired transmission and also identified subtle internal rearrangements between donor and recipient missed by other genomic approaches. Given the improved resolution, whole-genome assembly of pathogen genomes is likely to become an essential tool for investigation of potential organ transplant transmissions.

  2. The genomic architecture and association genetics of adaptive characters using a candidate SNP approach in boreal black spruce.

    Science.gov (United States)

    Prunier, Julien; Pelgas, Betty; Gagnon, France; Desponts, Mireille; Isabel, Nathalie; Beaulieu, Jean; Bousquet, Jean

    2013-06-01

    The genomic architecture of adaptive traits remains poorly understood in non-model plants. Various approaches can be used to bridge this gap, including the mapping of quantitative trait loci (QTL) in pedigrees, and genetic association studies in non-structured populations. Here we present results on the genomic architecture of adaptive traits in black spruce, which is a widely distributed conifer of the North American boreal forest. As an alternative to the usual candidate gene approach, a candidate SNP approach was developed for association testing. A genetic map containing 231 gene loci was used to identify QTL that were related to budset timing and to tree height assessed over multiple years and sites. Twenty-two unique genomic regions were identified, including 20 that were related to budset timing and 6 that were related to tree height. From results of outlier detection and bulk segregant analysis for adaptive traits using DNA pool sequencing of 434 genes, 52 candidate SNPs were identified and subsequently tested in genetic association studies for budset timing and tree height assessed over multiple years and sites. A total of 34 (65%) SNPs were significantly associated with budset timing, or tree height, or both. Although the percentages of explained variance (PVE) by individual SNPs were small, several significant SNPs were shared between sites and among years. The sharing of genomic regions and significant SNPs between budset timing and tree height indicates pleiotropic effects. Significant QTLs and SNPs differed quite greatly among years, suggesting that different sets of genes for the same characters are involved at different stages in the tree's life history. The functional diversity of genes carrying significant SNPs and low observed PVE further indicated that a large number of polymorphisms are involved in adaptive genetic variation. Accordingly, for undomesticated species such as black spruce with natural populations of large effective size and low

  3. A new approach to in silico SNP detection and some new SNPs in the Bacillus anthracis genome

    Directory of Open Access Journals (Sweden)

    Francoeur Joe

    2011-04-01

    Full Text Available Abstract Background Bacillus anthracis is one of the most monomorphic pathogens known. Identification of polymorphisms in its genome is essential for taxonomic classification, for determination of recent evolutionary changes, and for evaluation of pathogenic potency. Findings In this work three strains of the Bacillus anthracis genome are compared and previously unpublished single nucleotide polymorphisms (SNPs are revealed. Moreover, it is shown that, despite the highly monomorphic nature of Bacillus anthracis, the SNPs are (1 abundant in the genome and (2 distributed relatively uniformly across the sequence. Conclusions The findings support the proposition that SNPs, together with indels and variable number tandem repeats (VNTRs, can be used effectively not only for the differentiation of perfect strain data, but also for the comparison of moderately incomplete, noisy and, in some cases, unknown Bacillus anthracis strains. In the case when the data is of still lower quality, a new DNA sequence fingerprinting approach based on recently introduced markers, based on combinatorial-analytic concepts and called cyclic difference sets, can be used.

  4. Evaluation of a Two-Stage Approach in Trans-Ethnic Meta-Analysis in Genome-Wide Association Studies.

    Science.gov (United States)

    Hong, Jaeyoung; Lunetta, Kathryn L; Cupples, L Adrienne; Dupuis, Josée; Liu, Ching-Ti

    2016-05-01

    Meta-analysis of genome-wide association studies (GWAS) has achieved great success in detecting loci underlying human diseases. Incorporating GWAS results from diverse ethnic populations for meta-analysis, however, remains challenging because of the possible heterogeneity across studies. Conventional fixed-effects (FE) or random-effects (RE) methods may not be most suitable to aggregate multiethnic GWAS results because of violation of the homogeneous effect assumption across studies (FE) or low power to detect signals (RE). Three recently proposed methods, modified RE (RE-HE) model, binary-effects (BE) model and a Bayesian approach (Meta-analysis of Transethnic Association [MANTRA]), show increased power over FE and RE methods while incorporating heterogeneity of effects when meta-analyzing trans-ethnic GWAS results. We propose a two-stage approach to account for heterogeneity in trans-ethnic meta-analysis in which we clustered studies with cohort-specific ancestry information prior to meta-analysis. We compare this to a no-prior-clustering (crude) approach, evaluating type I error and power of these two strategies, in an extensive simulation study to investigate whether the two-stage approach offers any improvements over the crude approach. We find that the two-stage approach and the crude approach for all five methods (FE, RE, RE-HE, BE, MANTRA) provide well-controlled type I error. However, the two-stage approach shows increased power for BE and RE-HE, and similar power for MANTRA and FE compared to their corresponding crude approach, especially when there is heterogeneity across the multiethnic GWAS results. These results suggest that prior clustering in the two-stage approach can be an effective and efficient intermediate step in meta-analysis to account for the multiethnic heterogeneity.

  5. A systems approach to predict oncometabolites via context-specific genome-scale metabolic networks.

    Directory of Open Access Journals (Sweden)

    Hojung Nam

    2014-09-01

    Full Text Available Altered metabolism in cancer cells has been viewed as a passive response required for a malignant transformation. However, this view has changed through the recently described metabolic oncogenic factors: mutated isocitrate dehydrogenases (IDH, succinate dehydrogenase (SDH, and fumarate hydratase (FH that produce oncometabolites that competitively inhibit epigenetic regulation. In this study, we demonstrate in silico predictions of oncometabolites that have the potential to dysregulate epigenetic controls in nine types of cancer by incorporating massive scale genetic mutation information (collected from more than 1,700 cancer genomes, expression profiling data, and deploying Recon 2 to reconstruct context-specific genome-scale metabolic models. Our analysis predicted 15 compounds and 24 substructures of potential oncometabolites that could result from the loss-of-function and gain-of-function mutations of metabolic enzymes, respectively. These results suggest a substantial potential for discovering unidentified oncometabolites in various forms of cancers.

  6. Survey of Genomics Approaches to Improve Bioenergy Traits in Maize, Sorghum and Sugarcane

    Institute of Scientific and Technical Information of China (English)

    Wilfred Vermerris

    2011-01-01

    Bioenergy crops currently provide the only source of alternative energy with the potential to reduce the use of fossil transportation fuels in a way that is compatible with existing engine technology, including in developing countries. Even though bioenergy research is currently receiving considerable attention, many of the concepts are not new,but rather build on intense research efforts from 30 years ago. A major difference with that era is the availability of genomics tools that have the potential to accelerate crop improvement significantly. This review is focused on maize, sorghum and sugarcane as representatives of bioenergy grasses that produce sugar and/or lignocellulosic biomass.Examples of how genetic mapping, forward and reverse genetics, high-throughput expression profiling and comparative genomics can be used to unravel and improve bioenergy traits will be presented.

  7. A bioinformatic approach to understanding antibiotic resistance in intracellular bacteria through whole genome analysis

    OpenAIRE

    Biswas, S.(National Institute of Science Education and Research, Bhubaneswar, India); Raoult, Didier; Rolain, J. M.

    2008-01-01

    Intracellular bacteria survive within eukaryotic host cells and are difficult to kill with certain antibiotics. As a result, antibiotic resistance in intracellular bacteria is becoming commonplace in healthcare institutions. Owing to the lack of methods available for transforming these bacteria, we evaluated the mechanisms of resistance using molecular methods and in silico genome analysis. The objective of this review was to understand the molecular mechanisms of antibiotic resistance throug...

  8. Precursor-centric genome-mining approach for lasso peptide discovery

    OpenAIRE

    Maksimov, Mikhail O.; Pelczer, István; Link, A. James

    2012-01-01

    Lasso peptides are a class of ribosomally synthesized posttranslationally modified natural products found in bacteria. Currently known lasso peptides have a diverse set of pharmacologically relevant activities, including inhibition of bacterial growth, receptor antagonism, and enzyme inhibition. The biosynthesis of lasso peptides is specified by a cluster of three genes encoding a precursor protein and two enzymes. Here we develop a unique genome-mining algorithm to identify lasso peptide gen...

  9. Cancer driver gene discovery through an integrative genomics approach in a non-parametric Bayesian framework.

    Science.gov (United States)

    Yang, Hai; Wei, Qiang; Zhong, Xue; Yang, Hushan; Li, Bingshan

    2017-02-15

    Comprehensive catalogue of genes that drive tumor initiation and progression in cancer is key to advancing diagnostics, therapeutics and treatment. Given the complexity of cancer, the catalogue is far from complete yet. Increasing evidence shows that driver genes exhibit consistent aberration patterns across multiple-omics in tumors. In this study, we aim to leverage complementary information encoded in each of the omics data to identify novel driver genes through an integrative framework. Specifically, we integrated mutations, gene expression, DNA copy numbers, DNA methylation and protein abundance, all available in The Cancer Genome Atlas (TCGA) and developed iDriver, a non-parametric Bayesian framework based on multivariate statistical modeling to identify driver genes in an unsupervised fashion. iDriver captures the inherent clusters of gene aberrations and constructs the background distribution that is used to assess and calibrate the confidence of driver genes identified through multi-dimensional genomic data. We applied the method to 4 cancer types in TCGA and identified candidate driver genes that are highly enriched with known drivers. (e.g.: P < 3.40 × 10 -36 for breast cancer). We are particularly interested in novel genes and observed multiple lines of supporting evidence. Using systematic evaluation from multiple independent aspects, we identified 45 candidate driver genes that were not previously known across these 4 cancer types. The finding has important implications that integrating additional genomic data with multivariate statistics can help identify cancer drivers and guide the next stage of cancer genomics research. The C ++ source code is freely available at https://medschool.vanderbilt.edu/cgg/ . hai.yang@vanderbilt.edu or bingshan.li@Vanderbilt.Edu. Supplementary data are available at Bioinformatics online.

  10. Molecular characterisation of the poorly differentiated and undifferentiated thyroid carcinomas using genome-wide approaches

    OpenAIRE

    Pita, Jaime Miguel Gomes, 1985-

    2013-01-01

    Tese de doutoramento, Bioquímica (Genética Molecular), Universidade de Lisboa, Faculdade de Ciências, 2013 Poorly differentiated (PDTC) and anaplastic thyroid carcinomas (ATC) are highly malignant tumours composed by dedifferentiated cells, for which current therapeutic options have been ineffective. In the present project, the molecular signatures and genetic alterations associated with these tumours were elucidated, by using genome-wide expression analysis as first assessment. The role ...

  11. Carbohydrate-active enzymes from pigmented Bacilli: a genomic approach to assess carbohydrate utilization and degradation

    Directory of Open Access Journals (Sweden)

    Henrissat Bernard

    2011-09-01

    Full Text Available Abstract Background Spore-forming Bacilli are Gram-positive bacteria commonly found in a variety of natural habitats, including soil, water and the gastro-intestinal (GI-tract of animals. Isolates of various Bacillus species produce pigments, mostly carotenoids, with a putative protective role against UV irradiation and oxygen-reactive forms. Results We report the annotation of carbohydrate active enzymes (CAZymes of two pigmented Bacilli isolated from the human GI-tract and belonging to the Bacillus indicus and B. firmus species. A high number of glycoside hydrolases (GHs and carbohydrate binding modules (CBMs were found in both isolates. A detailed analysis of CAZyme families, was performed and supported by growth data. Carbohydrates able to support growth as the sole carbon source negatively effected carotenoid formation in rich medium, suggesting that a catabolite repression-like mechanism controls carotenoid biosynthesis in both Bacilli. Experimental results on biofilm formation confirmed genomic data on the potentials of B. indicus HU36 to produce a levan-based biofilm, while mucin-binding and -degradation experiments supported genomic data suggesting the ability of both Bacilli to degrade mammalian glycans. Conclusions CAZy analyses of the genomes of the two pigmented Bacilli, compared to other Bacillus species and validated by experimental data on carbohydrate utilization, biofilm formation and mucin degradation, suggests that the two pigmented Bacilli are adapted to the intestinal environment and are suited to grow in and colonize the human gut.

  12. Carbohydrate-active enzymes from pigmented Bacilli: a genomic approach to assess carbohydrate utilization and degradation

    Science.gov (United States)

    2011-01-01

    Background Spore-forming Bacilli are Gram-positive bacteria commonly found in a variety of natural habitats, including soil, water and the gastro-intestinal (GI)-tract of animals. Isolates of various Bacillus species produce pigments, mostly carotenoids, with a putative protective role against UV irradiation and oxygen-reactive forms. Results We report the annotation of carbohydrate active enzymes (CAZymes) of two pigmented Bacilli isolated from the human GI-tract and belonging to the Bacillus indicus and B. firmus species. A high number of glycoside hydrolases (GHs) and carbohydrate binding modules (CBMs) were found in both isolates. A detailed analysis of CAZyme families, was performed and supported by growth data. Carbohydrates able to support growth as the sole carbon source negatively effected carotenoid formation in rich medium, suggesting that a catabolite repression-like mechanism controls carotenoid biosynthesis in both Bacilli. Experimental results on biofilm formation confirmed genomic data on the potentials of B. indicus HU36 to produce a levan-based biofilm, while mucin-binding and -degradation experiments supported genomic data suggesting the ability of both Bacilli to degrade mammalian glycans. Conclusions CAZy analyses of the genomes of the two pigmented Bacilli, compared to other Bacillus species and validated by experimental data on carbohydrate utilization, biofilm formation and mucin degradation, suggests that the two pigmented Bacilli are adapted to the intestinal environment and are suited to grow in and colonize the human gut. PMID:21892951

  13. Carbohydrate-active enzymes from pigmented Bacilli: a genomic approach to assess carbohydrate utilization and degradation.

    Science.gov (United States)

    Manzo, Nicola; D'Apuzzo, Enrica; Coutinho, Pedro M; Cutting, Simon M; Henrissat, Bernard; Ricca, Ezio

    2011-09-05

    Spore-forming Bacilli are gram-positive bacteria commonly found in a variety of natural habitats, including soil, water and the gastro-intestinal (GI)-tract of animals. Isolates of various Bacillus species produce pigments, mostly carotenoids, with a putative protective role against UV irradiation and oxygen-reactive forms. We report the annotation of carbohydrate active enzymes (CAZymes) of two pigmented Bacilli isolated from the human GI-tract and belonging to the Bacillus indicus and B. firmus species. A high number of glycoside hydrolases (GHs) and carbohydrate binding modules (CBMs) were found in both isolates. A detailed analysis of CAZyme families, was performed and supported by growth data. Carbohydrates able to support growth as the sole carbon source negatively effected carotenoid formation in rich medium, suggesting that a catabolite repression-like mechanism controls carotenoid biosynthesis in both Bacilli. Experimental results on biofilm formation confirmed genomic data on the potentials of B. indicus HU36 to produce a levan-based biofilm, while mucin-binding and -degradation experiments supported genomic data suggesting the ability of both Bacilli to degrade mammalian glycans. CAZy analyses of the genomes of the two pigmented Bacilli, compared to other Bacillus species and validated by experimental data on carbohydrate utilization, biofilm formation and mucin degradation, suggests that the two pigmented Bacilli are adapted to the intestinal environment and are suited to grow in and colonize the human gut.

  14. Genome privacy: challenges, technical approaches to mitigate risk, and ethical considerations in the United States.

    Science.gov (United States)

    Wang, Shuang; Jiang, Xiaoqian; Singh, Siddharth; Marmor, Rebecca; Bonomi, Luca; Fox, Dov; Dow, Michelle; Ohno-Machado, Lucila

    2017-01-01

    Accessing and integrating human genomic data with phenotypes are important for biomedical research. Making genomic data accessible for research purposes, however, must be handled carefully to avoid leakage of sensitive individual information to unauthorized parties and improper use of data. In this article, we focus on data sharing within the scope of data accessibility for research. Current common practices to gain biomedical data access are strictly rule based, without a clear and quantitative measurement of the risk of privacy breaches. In addition, several types of studies require privacy-preserving linkage of genotype and phenotype information across different locations (e.g., genotypes stored in a sequencing facility and phenotypes stored in an electronic health record) to accelerate discoveries. The computer science community has developed a spectrum of techniques for data privacy and confidentiality protection, many of which have yet to be tested on real-world problems. In this article, we discuss clinical, technical, and ethical aspects of genome data privacy and confidentiality in the United States, as well as potential solutions for privacy-preserving genotype-phenotype linkage in biomedical research. © 2016 New York Academy of Sciences.

  15. A comparative genomics approach to the evolution of eukaryotes and their mitochondria.

    Science.gov (United States)

    Lang, B F; Seif, E; Gray, M W; O'Kelly, C J; Burger, G

    1999-01-01

    The Organelle Genome Megasequencing Program (OGMP) investigates mitochondrial genome diversity and evolution by systematically determining the complete mitochondrial DNA (mtDNA) sequences of a phylogenetically broad selection of protists. The mtDNAs of lower fungi and choanoflagellates are being analyzed by the Fungal Mitochondrial Genome Project (FMGP), a sister project to the OGMP. Some of the most interesting protists include the jakobid flagellates Reclinomonas americana, Malawimonas jakobiformis, and Jakoba libera, which share ultrastructural similarities with amitochondriate retortamonads, and harbor mitochondrial genes not seen before in mtDNAs of other organisms. In R. americana and J. libera, gene clusters are found that resemble, to an unprecedented degree, the contiguous ribosomal protein operons str, S10, spc, and alpha of eubacteria. In addition, their mtDNAs code for an RNase P RNA that displays all the elements of a bacterial minimum consensus structure. This structure has been instrumental in detecting the rnpB gene in additional protists. Gene repertoire and gene order comparisons as well as multiple-gene phylogenies support the view of a single endosymbiotic origin of mitochondria, whose closest extant relatives are Rickettsia-type alpha-Proteobacteria.

  16. Chemical Biology Approaches to Genome Editing: Understanding, Controlling, and Delivering Programmable Nucleases.

    Science.gov (United States)

    Hu, Johnny H; Davis, Kevin M; Liu, David R

    2016-01-21

    Programmable DNA nucleases have provided scientists with the unprecedented ability to probe, regulate, and manipulate the human genome. Zinc-finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and the clustered regularly interspaced short palindromic repeat-Cas9 system (CRISPR-Cas9) represent a powerful array of tools that can bind to and cleave a specified DNA sequence. In their canonical forms, these nucleases induce double-strand breaks at a DNA locus of interest that can trigger cellular DNA repair processes that disrupt or replace genes. The fusion of these programmable nucleases with a variety of other protein domains has led to a rapidly growing suite of tools for activating, repressing, visualizing, and modifying loci of interest. Maximizing the usefulness and therapeutic relevance of these tools, however, requires precisely controlling their activity and specificity to minimize potentially toxic side effects arising from off-target activities. This need has motivated the application of chemical biology principles and methods to genome-editing proteins, including the engineering of variants of these proteins with improved or altered specificities, and the development of genetic, chemical, optical, and protein delivery methods that control the activity of these agents in cells. Advancing the capabilities, safety, effectiveness, and therapeutic relevance of genome-engineering proteins will continue to rely on chemical biology strategies that manipulate their activity, specificity, and localization. Copyright © 2016 Elsevier Ltd. All rights reserved.

  17. Challenges of metagenomics and single-cell genomics approaches for exploring cyanobacterial diversity.

    Science.gov (United States)

    Davison, Michelle; Hall, Eric; Zare, Richard; Bhaya, Devaki

    2015-10-01

    Cyanobacteria have played a crucial role in the history of early earth and continue to be instrumental in shaping our planet, yet applications of cutting edge technology have not yet been widely used to explore cyanobacterial diversity. To provide adequate background, we briefly review current sequencing technologies and their innovative uses in genomics and metagenomics. Next, we focus on current cell capture technologies and the challenges of using them with cyanobacteria. We illustrate the utility in coupling breakthroughs in DNA amplification with cell capture platforms, with an example of microfluidic isolation and subsequent targeted amplicon sequencing from individual terrestrial thermophilic cyanobacteria. Single cells of thermophilic, unicellular Synechococcus sp. JA-2-3-B'a(2-13) (Syn OS-B') were sorted in a microfluidic device, lysed, and subjected to whole genome amplification by multiple displacement amplification. We amplified regions from specific CRISPR spacer arrays, which are known to be highly diverse, contain semi-palindromic repeats which form secondary structure, and can be difficult to amplify. Cell capture, lysis, and genome amplification on a microfluidic device have been optimized, setting a stage for further investigations of individual cyanobacterial cells isolated directly from natural populations.

  18. Optimization of genome engineering approaches with the CRISPR/Cas9 system.

    Science.gov (United States)

    Li, Kai; Wang, Gang; Andersen, Troels; Zhou, Pingzhu; Pu, William T

    2014-01-01

    Designer nucleases such as TALENS and Cas9 have opened new opportunities to scarlessly edit the mammalian genome. Here we explored several parameters that influence Cas9-mediated scarless genome editing efficiency in murine embryonic stem cells. Optimization of transfection conditions and enriching for transfected cells are critical for efficiently recovering modified clones. Paired gRNAs and wild-type Cas9 efficiently create programmed deletions, which facilitate identification of targeted clones, while paired gRNAs and the Cas9D10A nickase generated smaller targeted indels with lower chance of off-target mutagenesis. Genome editing is also useful for programmed introduction of exogenous DNA sequences at a target locus. Increasing the length of the homology arms of the homology-directed repair template strongly enhanced targeting efficiency, while increasing the length of the DNA insert reduced it. Together our data provide guidance on optimal design of scarless gene knockout, modification, or knock-in experiments using Cas9 nuclease.

  19. Characteristics and Behavioral Outcomes for Youth in Group Care and Family-Based Care: A Propensity Score Matching Approach Using National Data

    Science.gov (United States)

    James, Sigrid; Roesch, Scott; Zhang, Jin Jin

    2012-01-01

    This study aimed to answer two questions: (a) Given expected differences in children who are placed in group care compared to those in family-based settings, is it possible to match children on baseline characteristics? (b) Are there differences in behavioral outcomes for youth with episodes in group care versus those in family-based care? Using…

  20. In silico and microarray-based genomic approaches to identifying potential vaccine candidates against Leptospira interrogans

    Directory of Open Access Journals (Sweden)

    Jiang Xu-Cheng

    2006-11-01

    Full Text Available Abstract Background Currently available vaccines against leptospirosis are of low efficacy, have an unacceptable side-effect profile, do not induce long-term protection, and provide no cross-protection against the different serovars of pathogenic leptospira. The current major focus in leptospirosis research is to discover conserved protective antigens that may elicit longer-term protection against a broad range of Leptospira. There is a need to screen vaccine candidate genes in the genome of Leptospira interrogans. Results Bioinformatics, comparative genomic hybridization (CGH analysis and transcriptional analysis were used to identify vaccine candidates in the genome of L. interrogans serovar Lai strain #56601. Of a total of 4727 open reading frames (ORFs, 616 genes were predicted to encode surface-exposed proteins by P-CLASSIFIER combined with signal peptide prediction, α-helix transmembrane topology prediction, integral β-barrel outer membrane protein and lipoprotein prediction, as well as by retaining the genes shared by the two sequenced L. interrogans genomes and by subtracting genes with human homologues. A DNA microarray of L. interrogans strain #56601 was constructed for CGH analysis and transcriptome analysis in vitro. Three hundred and seven differential genes were identified in ten pathogenic serovars by CGH; 1427 genes had high transcriptional levels (Cy3 signal ≥ 342 and Cy5 signal ≥ 363.5, respectively. There were 565 genes in the intersection between the set encoding surface-exposed proteins and the set of 307 differential genes. The number of genes in the intersection between this set of 565 and the set of 1427 highly transcriptionally active genes was 226. These 226 genes were thus identified as putative vaccine candidates. The proteins encoded by these genes are not only potentially surface-exposed in the bacterium, but also conserved in two sequenced L. interrogans. Moreover, these genes are conserved among ten epidemic

  1. Beyond barcoding: a mitochondrial genomics approach to molecular phylogenetics and diagnostics of blowflies (Diptera: Calliphoridae).

    Science.gov (United States)

    Nelson, Leigh A; Lambkin, Christine L; Batterham, Philip; Wallman, James F; Dowton, Mark; Whiting, Michael F; Yeates, David K; Cameron, Stephen L

    2012-12-15

    Members of the Calliphoridae (blowflies) are significant for medical and veterinary management, due to the ability of some species to consume living flesh as larvae, and for forensic investigations due to the ability of others to develop in corpses. Due to the difficulty of accurately identifying larval blowflies to species there is a need for DNA-based diagnostics for this family, however the widely used DNA-barcoding marker, cox1, has been shown to fail for several groups within this family. Additionally, many phylogenetic relationships within the Calliphoridae are still unresolved, particularly deeper level relationships. Sequencing whole mt genomes has been demonstrated both as an effective method for identifying the most informative diagnostic markers and for resolving phylogenetic relationships. Twenty-seven complete, or nearly so, mt genomes were sequenced representing 13 species, seven genera and four calliphorid subfamilies and a member of the related family Tachinidae. PCR and sequencing primers developed for sequencing one calliphorid species could be reused to sequence related species within the same superfamily with success rates ranging from 61% to 100%, demonstrating the speed and efficiency with which an mt genome dataset can be assembled. Comparison of molecular divergences for each of the 13 protein-coding genes and 2 ribosomal RNA genes, at a range of taxonomic scales identified novel targets for developing as diagnostic markers which were 117-200% more variable than the markers which have been used previously in calliphorids. Phylogenetic analysis of whole mt genome sequences resulted in much stronger support for family and subfamily-level relationships. The Calliphoridae are polyphyletic, with the Polleninae more closely related to the Tachinidae, and the Sarcophagidae are the sister group of the remaining calliphorids. Within the Calliphoridae, there was strong support for the monophyly of the Chrysomyinae and Luciliinae and for the sister

  2. Improved bounds for stochastic matching

    CERN Document Server

    Li, Jian

    2010-01-01

    In this paper we study stochastic matching problems that are motivated by applications in online dating and kidney exchange programs. We consider two probing models: edge probing and matching probing. Our main result is an algorithm that finds a matching-probing strategy attaining a small constant approximation ratio. An interesting aspect of our approach is that we compare the cost our solution to the best edge-probing strategy. Thus, we indirectly show that the best matching-probing strategy is only a constant factor away from the best edge-probing strategy. Even though our algorithm has a slightly worse approximation ratio than a greedy algorithm for edge-probing strategies, we show that the two algorithms can be combined to get improved approximations.

  3. Identification of cancer predisposition variants in apparently healthy individuals using a next-generation sequencing-based family genomics approach.

    Science.gov (United States)

    Karageorgos, Ioannis; Mizzi, Clint; Giannopoulou, Efstathia; Pavlidis, Cristiana; Peters, Brock A; Zagoriti, Zoi; Stenson, Peter D; Mitropoulos, Konstantinos; Borg, Joseph; Kalofonos, Haralabos P; Drmanac, Radoje; Stubbs, Andrew; van der Spek, Peter; Cooper, David N; Katsila, Theodora; Patrinos, George P

    2015-06-20

    Cancer, like many common disorders, has a complex etiology, often with a strong genetic component and with multiple environmental factors contributing to susceptibility. A considerable number of genomic variants have been previously reported to be causative of, or associated with, an increased risk for various types of cancer. Here, we adopted a next-generation sequencing approach in 11 members of two families of Greek descent to identify all genomic variants with the potential to predispose family members to cancer. Cross-comparison with data from the Human Gene Mutation Database identified a total of 571 variants, from which 47 % were disease-associated polymorphisms, 26 % disease-associated polymorphisms with additional supporting functional evidence, 19 % functional polymorphisms with in vitro/laboratory or in vivo supporting evidence but no known disease association, 4 % putative disease-causing mutations but with some residual doubt as to their pathological significance, and 3 % disease-causing mutations. Subsequent analysis, focused on the latter variant class most likely to be involved in cancer predisposition, revealed two variants of prime interest, namely MSH2 c.2732T>A (p.L911R) and BRCA1 c.2955delC, the first of which is novel. KMT2D c.13895delC and c.1940C>A variants are additionally reported as incidental findings. The next-generation sequencing-based family genomics approach described herein has the potential to be applied to other types of complex genetic disorder in order to identify variants of potential pathological significance.

  4. A Bac Library and Paired-PCR Approach to Mapping and Completing the Genome Sequence of Sulfolobus Solfataricus P2

    DEFF Research Database (Denmark)

    She, Qunxin; Confalonieri, F.; Zivanovic, Y.;

    2000-01-01

    -productive because there was a high sequence bias in the cosmid and lambda libraries. Therefore, a new approach was devised for linking the sequenced regions which may be generally applicable. BAC libraries were constructed and terminal sequences of the clones were determined and used for both end mapping and PCR...... screening. The PCR approaches included a novel chromosome walking method termed “paired-PCR”. 21 gaps were filled by BAC end sequence analyses and 6 gaps were filled by PCR including three large ones by paired-PCR. The complete map revealed that 0.9 Mb remained to be sequenced and 34 BAC clones were...... selected for walking over small gaps and preparing template libraries for larger ones. It is concluded that an optimal strategy for sequencing microorganism genomes involves construction of a high-resolution physical map by BAC end analyses, PCR screening and paired-PCR chromosome walking after about half...

  5. AN EVEN ODD MULTIPLE PATTERN MATCHING ALGORITHM

    OpenAIRE

    Raju Bhukya; DVLN Somayajulu

    2011-01-01

    Pattern matching plays an important role in various applications ranging from text searching in word processors to identification of functional and structural behavior in proteins and genes. Pattern matching is one of the fundamental areas in the field of computational biology. Currently research in life science area is producing large amount of genetic data. Due to this large and use full information can be gained by finding valuable information available from the genomic sequences. Many alg...

  6. Generalized Orthogonal Matching Pursuit

    CERN Document Server

    Wang, Jian; Shim, Byonghyo

    2011-01-01

    As a greedy algorithm to recover sparse signals from compressed measurements, the orthogonal matching pursuit (OMP) algorithm has received much attention in recent years. In this paper, we introduce an extension of the orthogonal matching pursuit (gOMP) for pursuing efficiency in reconstructing sparse signals. Our approach, henceforth referred to as generalized OMP (gOMP), is literally a generalization of the OMP in the sense that multiple indices are identified per iteration. Owing to the selection of multiple "correct" indices, the gOMP algorithm is finished with much smaller number of iterations compared to the OMP. We show that the gOMP can perfectly reconstruct any $K$-sparse signals ($K > 1$), provided that the sensing matrix satisfies the RIP with $\\delta_{NK} < \\frac{\\sqrt{N}}{\\sqrt{K} + 2 \\sqrt{N}}$. We also demonstrate by empirical simulations that the gOMP has excellent recovery performance comparable to $\\ell_1$-minimization technique with fast processing speed and competitive computational com...

  7. MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study

    Directory of Open Access Journals (Sweden)

    Xue Hong

    2009-01-01

    Full Text Available Abstract Background The interactions of multiple single nucleotide polymorphisms (SNPs are highly hypothesized to affect an individual's susceptibility to complex diseases. Although many works have been done to identify and quantify the importance of multi-SNP interactions, few of them could handle the genome wide data due to the combinatorial explosive search space and the difficulty to statistically evaluate the high-order interactions given limited samples. Results Three comparative experiments are designed to evaluate the performance of MegaSNPHunter. The first experiment uses synthetic data generated on the basis of epistasis models. The second one uses a genome wide study on Parkinson disease (data acquired by using Illumina HumanHap300 SNP chips. The third one chooses the rheumatoid arthritis study from Wellcome Trust Case Control Consortium (WTCCC using Affymetrix GeneChip 500K Mapping Array Set. MegaSNPHunter outperforms the best solution in this area and reports many potential interactions for the two real studies. Conclusion The experimental results on both synthetic data and two real data sets demonstrate that our proposed approach outperforms the best solution that is currently available in handling large-scale SNP data both in terms of speed and in terms of detection of potential interactions that were not identified before. To our knowledge, MegaSNPHunter is the first approach that is capable of identifying the disease-associated SNP interactions from WTCCC studies and is promising for practical disease prognosis.

  8. Sequence Matching Analysis for Curriculum Development

    National Research Council Canada - National Science Library

    Liem Yenny Bendatu; Bernardo Nugroho Yahya

    2015-01-01

    .... This study attempts to develop a sequence matching analysis. Considering conformance checking as the basis of this approach, this proposed approach utilizes the current control flow technique in process mining domain...

  9. A Two-Stage Penalized Logistic Regression Approach to Case-Control Genome-Wide Association Studies

    Directory of Open Access Journals (Sweden)

    Jingyuan Zhao

    2012-01-01

    Full Text Available We propose a two-stage penalized logistic regression approach to case-control genome-wide association studies. This approach consists of a screening stage and a selection stage. In the screening stage, main-effect and interaction-effect features are screened by using L1-penalized logistic like-lihoods. In the selection stage, the retained features are ranked by the logistic likelihood with the smoothly clipped absolute deviation (SCAD penalty (Fan and Li, 2001 and Jeffrey’s Prior penalty (Firth, 1993, a sequence of nested candidate models are formed, and the models are assessed by a family of extended Bayesian information criteria (J. Chen and Z. Chen, 2008. The proposed approach is applied to the analysis of the prostate cancer data of the Cancer Genetic Markers of Susceptibility (CGEMS project in the National Cancer Institute, USA. Simulation studies are carried out to compare the approach with the pair-wise multiple testing approach (Marchini et al. 2005 and the LASSO-patternsearch algorithm (Shi et al. 2007.

  10. Investigation of Yersinia pestis Laboratory Adaptation through a Combined Genomics and Proteomics Approach.

    Directory of Open Access Journals (Sweden)

    Owen P Leiser

    Full Text Available The bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a parallel serial passage experiment (PSPE to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing, LC-MS/MS proteomic analysis, and GC/MS metabolomics. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS/MS proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism and cell envelope biogenesis. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement.

  11. SBH and the integration of complementary approaches in the mapping, sequencing, and understanding of complex genomes

    Energy Technology Data Exchange (ETDEWEB)

    Drmanac, R.; Drmanac, S.; Labat, I.; Vicentic, A.; Gemmell, A.; Stavropoulos, N.; Jarvis, J.

    1992-12-01

    A variant of sequencing by hybridization (SBH) is being developed with a potential to inexpensively determine up to 100 million base pairs per year. The method comprises (1) arraying short clones in 864-well plates; (2) growth of the M13 clones or PCR of the inserts; (3) automated spotting of DNAs by corresponding pin-arrays; (4) hybridization of dotted samples with 200-3000 {sup 32}P- or {sup 33}P-labeled 6- to 8-mer probes; and (5) scoring hybridization signals using storage phosphor plates. Some 200 7- to 8-mers can provide an inventory of the genes if CDNA clones are hybridized, or can define the order of 2-kb genomic clones, creating physical and structural maps with 100-bp resolution; the distribution of G+C, LINEs, SINEs, and gene families would be revealed. cDNAs that represent new genes and genomic clones in regions of interest selected by SBH can be sequenced by a gel method. Uniformly distributed clones from the previous step will be hybridized with 2000--3000 6- to 8-mers. As a result, approximately 50--60% of the genomic regions containing members of large repetitive and gene families and those families represented in GenBank would be completely sequenced. In the less redundant regions, every base pair is expected to be read with 3-4 probes, but the complete sequence can not be reconstructed. Such partial sequences allow the inference of similarity and the recognition of coding, regulatory, and repetitive sequences, as well as study of the evolutionary processes all the way up to the species delineation.

  12. SBH and the integration of complementary approaches in the mapping, sequencing, and understanding of complex genomes

    Energy Technology Data Exchange (ETDEWEB)

    Drmanac, R.; Drmanac, S.; Labat, I.; Vicentic, A.; Gemmell, A.; Stavropoulos, N.; Jarvis, J.

    1992-01-01

    A variant of sequencing by hybridization (SBH) is being developed with a potential to inexpensively determine up to 100 million base pairs per year. The method comprises (1) arraying short clones in 864-well plates; (2) growth of the M13 clones or PCR of the inserts; (3) automated spotting of DNAs by corresponding pin-arrays; (4) hybridization of dotted samples with 200-3000 [sup 32]P- or [sup 33]P-labeled 6- to 8-mer probes; and (5) scoring hybridization signals using storage phosphor plates. Some 200 7- to 8-mers can provide an inventory of the genes if CDNA clones are hybridized, or can define the order of 2-kb genomic clones, creating physical and structural maps with 100-bp resolution; the distribution of G+C, LINEs, SINEs, and gene families would be revealed. cDNAs that represent new genes and genomic clones in regions of interest selected by SBH can be sequenced by a gel method. Uniformly distributed clones from the previous step will be hybridized with 2000--3000 6- to 8-mers. As a result, approximately 50--60% of the genomic regions containing members of large repetitive and gene families and those families represented in GenBank would be completely sequenced. In the less redundant regions, every base pair is expected to be read with 3-4 probes, but the complete sequence can not be reconstructed. Such partial sequences allow the inference of similarity and the recognition of coding, regulatory, and repetitive sequences, as well as study of the evolutionary processes all the way up to the species delineation.

  13. Estimating quantitative genetic parameters in wild populations: a comparison of pedigree and genomic approaches.

    Science.gov (United States)

    Bérénos, Camillo; Ellis, Philip A; Pilkington, Jill G; Pemberton, Josephine M

    2014-07-01

    The estimation of quantitative genetic parameters in wild populations is generally limited by the accuracy and completeness of the available pedigree information. Using relatedness at genomewide markers can potentially remove this limitation and lead to less biased and more precise estimates. We estimated heritability, maternal genetic effects and genetic correlations for body size traits in an unmanaged long-term study population of Soay sheep on St Kilda using three increasingly complete and accurate estimates of relatedness: (i) Pedigree 1, using observation-derived maternal links and microsatellite-derived paternal links; (ii) Pedigree 2, using SNP-derived assignment of both maternity and paternity; and (iii) whole-genome relatedness at 37 037 autosomal SNPs. In initial analyses, heritability estimates were strikingly similar for all three methods, while standard errors were systematically lower in analyses based on Pedigree 2 and genomic relatedness. Genetic correlations were generally strong, differed little between the three estimates of relatedness and the standard errors declined only very slightly with improved relatedness information. When partitioning maternal effects into separate genetic and environmental components, maternal genetic effects found in juvenile traits increased substantially across the three relatedness estimates. Heritability declined compared to parallel models where only a maternal environment effect was fitted, suggesting that maternal genetic effects are confounded with direct genetic effects and that more accurate estimates of relatedness were better able to separate maternal genetic effects from direct genetic effects. We found that the heritability captured by SNP markers asymptoted at about half the SNPs available, suggesting that denser marker panels are not necessarily required for precise and unbiased heritability estimates. Finally, we present guidelines for the use of genomic relatedness in future quantitative genetics

  14. 一种基于证据理论和任务分配的Deep Web查询接口匹配方法%A Deep Web Query Interface Matching Approach Based on Evidence Theory and Task Assignment

    Institute of Scientific and Technical Information of China (English)

    董永权; 李庆忠; 丁艳辉; 张永新

    2011-01-01

    针对已有查询接口匹配方法匹配器权重设置困难、匹配决策缺乏有效处理的局限性,提出一种基于证据理论和任务分配的Deep Web查询接口匹配方法.该方法通过引人改进的D-S证据理论自动融合多个匹配器结果,避免手工设定匹配器权重,有效减少人工干预.通过对任务分配问题进行扩展,将查询接口的一对一匹配决策问题转化为扩展的任务分配问题,为源查询接口中的每一个属性选择合适的匹配,并在此基础上,采用树结构启发式规则进行一对多匹配决策.实验结果表明ETTA-IM方法具有较高的查准率和查全率.%To solve the limitations of existing query interface matching which have the difficulties of weight setting of the matcher and the absence of the efficient processing of matching decision, a deep web query interface matching approach based on evidence theory and task assignment is proposed called evidence theory and task assignment based query interface matching approach (ETTA-IM).Firstly, an improved D-S evidence theory is used to automatically combine multiple matchers.Thus, the weight of each matcher is not required to be set by hand and human involvement is reduced.Then, a method is used to select a proper attribute correspondence of each source attribute from target query interface, which converts one-to-one matching decision to the extended task assignment problem.Finally, based on one-to-one matching results, some heuristic rules of tree structure are used to perform one-to-many matching decision.Experimental results show that ETTA-IM approach has high precision and recall measure.

  15. A two step Bayesian approach for genomic prediction of breeding values

    DEFF Research Database (Denmark)

    Mahdi Shariati, Mohammad; Sørensen, Peter; Janss, Luc

    2012-01-01

    Background: In genomic models that assign an individual variance to each marker, the contribution of one marker to the posterior distribution of the marker variance is only one degree of freedom (df), which introduces many variance parameters with only little information per variance parameter...... of predicted breeding values. However, the accuracies of predicted breeding values were lower than Bayesian methods with marker specific variances. Conclusions: Grouping markers is less flexible than allowing each marker to have a specific marker variance but, by grouping, the power to estimate marker...

  16. A comparative genomics approach to understanding the biosynthesis of the sunscreen scytonemin in cyanobacteria

    Directory of Open Access Journals (Sweden)

    Potrafka Ruth M

    2009-07-01

    Full Text Available Abstract Background The extracellular sunscreen scytonemin is the most common and widespread indole-alkaloid among cyanobacteria. Previous research using the cyanobacterium Nostoc punctiforme ATCC 29133 revealed a unique 18-gene cluster (NpR1276 to NpR1259 in the N. punctiforme genome involved in the biosynthesis of scytonemin. We provide further genomic characterization of these genes in N. punctiforme and extend it to homologous regions in other cyanobacteria. Results Six putative genes in the scytonemin gene cluster (NpR1276 to NpR1271 in the N. punctiforme genome, with no previously known protein function and annotated in this study as scyA to scyF, are likely involved in the assembly of scytonemin from central metabolites, based on genetic, biochemical, and sequence similarity evidence. Also in this cluster are redundant copies of genes encoding for aromatic amino acid biosynthetic enzymes. These can theoretically lead to tryptophan and the tyrosine precursor, p-hydroxyphenylpyruvate, (expected biosynthetic precursors of scytonemin from end products of the shikimic acid pathway. Redundant copies of the genes coding for the key regulatory and rate-limiting enzymes of the shikimic acid pathway are found there as well. We identified four other cyanobacterial strains containing orthologues of all of these genes, three of them by database searches (Lyngbya PCC 8106, Anabaena PCC 7120, and Nodularia CCY 9414 and one by targeted sequencing (Chlorogloeopsis sp. strain Cgs-089; CCMEE 5094. Genomic comparisons revealed that most scytonemin-related genes were highly conserved among strains and that two additional conserved clusters, NpF5232 to NpF5236 and a putative two-component regulatory system (NpF1278 and NpF1277, are likely involved in scytonemin biosynthesis and regulation, respectively, on the basis of conservation and location. Since many of the protein product sequences for the newly described genes, including ScyD, ScyE, and ScyF, have

  17. Approximating Graphic TSP by Matchings

    CERN Document Server

    Mömke, Tobias

    2011-01-01

    We present a framework for approximating the metric TSP based on a novel use of matchings. Traditionally, matchings have been used to add edges in order to make a given graph Eulerian, whereas our approach also allows for the removal of certain edges leading to a decreased cost. For the TSP on graphic metrics (graph-TSP), the approach yields a 1.461-approximation algorithm with respect to the Held-Karp lower bound. For graph-TSP restricted to a class of graphs that contains degree three bounded and claw-free graphs, we show that the integrality gap of the Held-Karp relaxation matches the conjectured ratio 4/3. The framework allows for generalizations in a natural way and also leads to a 1.586-approximation algorithm for the traveling salesman path problem on graphic metrics where the start and end vertices are prespecified.

  18. On the analysis of genome-wide association studies in family-based designs: a universal, robust analysis approach and an application to four genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Sungho Won

    2009-11-01

    Full Text Available For genome-wide association studies in family-based designs, we propose a new, universally applicable approach. The new test statistic exploits all available information about the association, while, by virtue of its design, it maintains the same robustness against population admixture as traditional family-based approaches that are based exclusively on the within-family information. The approach is suitable for the analysis of almost any trait type, e.g. binary, continuous, time-to-onset, multivariate, etc., and combinations of those. We use simulation studies to verify all theoretically derived properties of the approach, estimate its power, and compare it with other standard approaches. We illustrate the practical implications of the new analysis method by an application to a lung-function phenotype, forced expiratory volume in one second (FEV1 in 4 genome-wide association studies.

  19. Genome-Wide Locations of Potential Epimutations Associated with Environmentally Induced Epigenetic Transgenerational Inheritance of Disease Using a Sequential Machine Learning Prediction Approach.

    Science.gov (United States)

    Haque, M Muksitul; Holder, Lawrence B; Skinner, Michael K

    2015-01-01

    Environmentally induced epigenetic transgenerational inheritance of disease and phenotypic variation involves germline transmitted epimutations. The primary epimutations identified involve altered differential DNA methylation regions (DMRs). Different environmental toxicants have been shown to promote exposure (i.e., toxicant) specific signatures of germline epimutations. Analysis of genomic features associated with these epimutations identified low-density CpG regions (learning computational approach to predict all potential epimutations in the genome. A number of previously identified sperm epimutations were used as training sets. A novel machine learning approach using a sequential combination of Active Learning and Imbalance Class Learner analysis was developed. The transgenerational sperm epimutation analysis identified approximately 50K individual sites with a 1 kb mean size and 3,233 regions that had a minimum of three adjacent sites with a mean size of 3.5 kb. A select number of the most relevant genomic features were identified with the low density CpG deserts being a critical genomic feature of the features selected. A similar independent analysis with transgenerational somatic cell epimutation training sets identified a smaller number of 1,503 regions of genome-wide predicted sites and differences in genomic feature contributions. The predicted genome-wide germline (sperm) epimutations were found to be distinct from the predicted somatic cell epimutations. Validation of the genome-wide germline predicted sites used two recently identified transgenerational sperm epimutation signature sets from the pesticides dichlorodiphenyltrichloroethane (DDT) and methoxychlor (MXC) exposure lineage F3 generation. Analysis of this positive validation data set showed a 100% prediction accuracy for all the DDT-MXC sperm epimutations. Observations further elucidate the genomic features associated with transgenerational germline epimutations and identify a genome-wide set

  20. A new approach for sequencing virion genome of Chinese HIV-1 strains subtype B and BC from plasma

    Institute of Scientific and Technical Information of China (English)

    MENG Zhe-feng; ZHANG Xiao-yan; XIN Ruo-lei; XING Hui; HE Xiang; XU Jian-qing; SHAO Yi-ming

    2011-01-01

    Background Although it was widely accepted that full-length HIV genome sequences is important in studying virus genetic evolution and variation as well as developing vaccine candidate,to directly sequencing HIV-1 genome of virion RNA remains as a challenge worldwide.Up to date,no published genomic sequences from virion RNA are available for Chinese prevalent HIV-1 strains due to the absence of specialized protocol and appropriate lab equipments.In this study we developed a straightforward approach for amplifying and sequencing HIV virion RNA from plasma by modifying published protocols and further confirmed it is suitable to process Chinese samples.Methods The methods for viral RNA extraction and gene amplification was modified and optimized as could be widely used in most Chinese labs.Gene alignment of Chinese HIV-1 strains was employed for designing specialized primer sets for Thai-B and BC recombinant strains.Based on comprehensively consideration of high variable gene region and recombinant breakpoints in BC recombinant strains,a three-amplicon strategy (including 4.3-kb gag-pol,2.9-kb pol-env and 2.7-kb env-ne) was developed.In addition,one amplicon (9 kb near full-length genome) was also used in 32 samples with varied viral loads.All amplicons were directly sequenced by DNA automated sequencer.Results Twenty-five percent(8/32) amplification efficiency was achieved by the one-amplicon strategy and 65.6%(21/32) by three-amplicon strategy.For one amplicon strategy,none of complete near full-length genome sequences was obtained by DNA sequencing.For three-amplicon strategy,75% sequences were achieved in DNA sequencing.Amplification efficiency but not sequencing efficiency was closely associated with viral loads.Conclusion Three-amplicon strategy covering all encoding regions of HIV-1 is suitable for Thai-B and BC recombinant strains and could be potentially employed in less-well equipped Chinese labs.

  1. De novo discovery of neuropeptides in the genomes of parasitic flatworms using a novel comparative approach.

    Science.gov (United States)

    Koziol, Uriel; Koziol, Miguel; Preza, Matías; Costábile, Alicia; Brehm, Klaus; Castillo, Estela

    2016-10-01

    Neuropeptide mediated signalling is an ancient mechanism found in almost all animals and has been proposed as a promising target for the development of novel drugs against helminths. However, identification of neuropeptides from genomic data is challenging, and knowledge of the neuropeptide complement of parasitic flatworms is still fragmentary. In this work, we have developed an evolution-based strategy for the de novo discovery of neuropeptide precursors, based on the detection of localised sequence conservation between possible prohormone convertase cleavage sites. The method detected known neuropeptide precursors with good precision and specificity in the models Drosophila melanogaster and Caenorhabditis elegans. Furthermore, it identified novel putative neuropeptide precursors in nematodes, including the first description of allatotropin homologues in this phylum. Our search for neuropeptide precursors in the genomes of parasitic flatworms resulted in the description of 34 conserved neuropeptide precursor families, including 13 new ones, and of hundreds of new homologues of known neuropeptide precursor families. Most neuropeptide precursor families show a wide phylogenetic distribution among parasitic flatworms and show little similarity to neuropeptide precursors of other bilaterian animals. However, we could also find orthologs of some conserved bilaterian neuropeptides including pyrokinin, crustacean cardioactive peptide, myomodulin, neuropeptide-Y, neuropeptide KY and SIF-amide. Finally, we determined the expression patterns of seven putative neuropeptide precursor genes in the protoscolex of Echinococcus multilocularis. All genes were expressed in the nervous system with different patterns, indicating a hidden complexity of peptidergic signalling in cestodes.

  2. Predicting Hybrid Performances for Quality Traits through Genomic-Assisted Approaches in Central European Wheat

    KAUST Repository

    Liu, Guozheng

    2016-07-06

    Bread-making quality traits are central targets for wheat breeding. The objectives of our study were to (1) examine the presence of major effect QTLs for quality traits in a Central European elite wheat population, (2) explore the optimal strategy for predicting the hybrid performance for wheat quality traits, and (3) investigate the effects of marker density and the composition and size of the training population on the accuracy of prediction of hybrid performance. In total 135 inbred lines of Central European bread wheat (Triticum aestivum L.) and 1,604 hybrids derived from them were evaluated for seven quality traits in up to six environments. The 135 parental lines were genotyped using a 90k single-nucleotide polymorphism array. Genome-wide association mapping initially suggested presence of several quantitative trait loci (QTLs), but cross-validation rather indicated the absence of major effect QTLs for all quality traits except of 1000-kernel weight. Genomic selection substantially outperformed marker-assisted selection in predicting hybrid performance. A resampling study revealed that increasing the effective population size in the estimation set of hybrids is relevant to boost the accuracy of prediction for an unrelated test population.

  3. Predicting Hybrid Performances for Quality Traits through Genomic-Assisted Approaches in Central European Wheat.

    Directory of Open Access Journals (Sweden)

    Guozheng Liu

    Full Text Available Bread-making quality traits are central targets for wheat breeding. The objectives of our study were to (1 examine the presence of major effect QTLs for quality traits in a Central European elite wheat population, (2 explore the optimal strategy for predicting the hybrid performance for wheat quality traits, and (3 investigate the effects of marker density and the composition and size of the training population on the accuracy of prediction of hybrid performance. In total 135 inbred lines of Central European bread wheat (Triticum aestivum L. and 1,604 hybrids derived from them were evaluated for seven quality traits in up to six environments. The 135 parental lines were genotyped using a 90k single-nucleotide polymorphism array. Genome-wide association mapping initially suggested presence of several quantitative trait loci (QTLs, but cross-validation rather indicated the absence of major effect QTLs for all quality traits except of 1000-kernel weight. Genomic selection substantially outperformed marker-assisted selection in predicting hybrid performance. A resampling study revealed that increasing the effective population size in the estimation set of hybrids is relevant to boost the accuracy of prediction for an unrelated test population.

  4. VaProS: a database-integration approach for protein/genome information retrieval

    KAUST Repository

    Gojobori, Takashi

    2016-12-24

    Life science research now heavily relies on all sorts of databases for genome sequences, transcription, protein three-dimensional (3D) structures, protein–protein interactions, phenotypes and so forth. The knowledge accumulated by all the omics research is so vast that a computer-aided search of data is now a prerequisite for starting a new study. In addition, a combinatory search throughout these databases has a chance to extract new ideas and new hypotheses that can be examined by wet-lab experiments. By virtually integrating the related databases on the Internet, we have built a new web application that facilitates life science researchers for retrieving experts’ knowledge stored in the databases and for building a new hypothesis of the research target. This web application, named VaProS, puts stress on the interconnection between the functional information of genome sequences and protein 3D structures, such as structural effect of the gene mutation. In this manuscript, we present the notion of VaProS, the databases and tools that can be accessed without any knowledge of database locations and data formats, and the power of search exemplified in quest of the molecular mechanisms of lysosomal storage disease. VaProS can be freely accessed at http://p4d-info.nig.ac.jp/vapros/.

  5. A Genomic Approach to Resolving Relapse versus Reinfection among Four Cases of Buruli Ulcer.

    Directory of Open Access Journals (Sweden)

    Miriam Eddyani

    2015-11-01

    Full Text Available Increased availability of Next Generation Sequencing (NGS techniques allows, for the first time, to distinguish relapses from reinfections in patients with multiple Buruli ulcer (BU episodes.We compared the number and location of single nucleotide polymorphisms (SNPs identified by genomic screening between four pairs of Mycobacterium ulcerans isolates collected at the time of first diagnosis and at recurrence, derived from a collection of almost 5000 well characterized clinical samples from one BU treatment center in Benin.The findings suggest that after surgical treatment-without antibiotics-the second episodes were due to relapse rather than reinfection. Since specific antibiotics were introduced for the treatment of BU, the one patient with a culture available from both disease episodes had M. ulcerans isolates with a genomic distance of 20 SNPs, suggesting the patient was most likely reinfected rather than having a relapse.To our knowledge, this study is the first to study recurrences in M. ulcerans using NGS, and to identify exogenous reinfection as causing a recurrence of BU. The occurrence of reinfection highlights the contribution of ongoing exposure to M. ulcerans to disease recurrence, and has implications for vaccine development.

  6. Acclimation of microorganisms to harsh soil crust conditions: Experimental and genomic approaches

    Science.gov (United States)

    Raanan, Hagai; Kaplan, Aaron

    2015-04-01

    Biological soil crusts (BSC) are formed by the adhesion of sand particles to cyanobacterial exo- polysaccharides and play an important role in stabilizing sandy desert. Its destruction promotes desertification. These organisms cope with extreme temperatures, excess light and frequent hydration/dehydration cycles; the mechanisms involved are largely unknown. With the genome of newly sequenced Leptolyngbya, isolated from Nizzana BSC, we conduct comparative genomics of three desiccation tolerant cyanobacteria. This yield 46 unique genes, some of them similar to genes involve in sporulation of the gram positive bacteria Bacillus. In order to understand the molecular mechanisms taking place during desiccation we built an environmental chamber capable of simulating dynamic changes of environmental conditions in the crust. This chamber allows us to perform repetitive and accurate desiccation/rehydration experiments and follow cyanobacterial physiological and molecular response to such environmental changes. When we compared fast desiccation (less than 5 min) of isolated cyanobacteria to simulation of natural desiccation, we observed a 60% lower fluorescence recovery rate. The extent of damage from desiccation depended on the stress conditions during the dry period. These results suggest that cyanobacteria activated protection mechanisms in response to desiccation stress but which were not activated in 5 min desiccation tests. Gene expression patterns during desiccation are being analyzed in order to provide a better understanding of desiccation stress protection mechanisms.

  7. A surrogate-based approach for post-genomic partner identification

    Directory of Open Access Journals (Sweden)

    Giordano Tony

    2001-09-01

    Full Text Available Abstract Background Modern drug discovery is concerned with identification and validation of novel protein targets from among the 30,000 genes or more postulated to be present in the human genome. While protein-protein interactions may be central to many disease indications, it has been difficult to identify new chemical entities capable of regulating these interactions as either agonists or antagonists. Results In this paper, we show that peptide complements (or surrogates derived from highly diverse random phage display libraries can be used for the identification of the expected natural biological partners for protein and non-protein targets. Our examples include surrogates isolated against both an extracellular secreted protein (TNFβ and intracellular disease related mRNAs. In each case, surrogates binding to these targets were obtained and found to contain partner information embedded in their amino acid sequences. Furthermore, this information was able to identify the correct biological partners from large human genome databases by rapid and integrated computer based searches. Conclusions Modified versions of these surrogates should provide agents capable of modifying the activity of these targets and enable one to study their involvement in specific biological processes as a means of target validation for downstream drug discovery.

  8. The complex genetics of gait speed: genome-wide meta-analysis approach

    Science.gov (United States)

    Lunetta, Kathryn L.; Smith, Jennifer A.; Eicher, John D.; Vered, Rotem; Deelen, Joris; Arnold, Alice M.; Buchman, Aron S.; Tanaka, Toshiko; Faul, Jessica D.; Nethander, Maria; Fornage, Myriam; Adams, Hieab H.; Matteini, Amy M.; Callisaya, Michele L.; Smith, Albert V.; Yu, Lei; De Jager, Philip L.; Evans, Denis A.; Gudnason, Vilmundur; Hofman, Albert; Pattie, Alison; Corley, Janie; Launer, Lenore J.; Knopman, Davis S.; Parimi, Neeta; Turner, Stephen T.; Bandinelli, Stefania; Beekman, Marian; Gutman, Danielle; Sharvit, Lital; Mooijaart, Simon P.; Liewald, David C.; Houwing-Duistermaat, Jeanine J.; Ohlsson, Claes; Moed, Matthijs; Verlinden, Vincent J.; Mellström, Dan; van der Geest, Jos N.; Karlsson, Magnus; Hernandez, Dena; McWhirter, Rebekah; Liu, Yongmei; Thomson, Russell; Tranah, Gregory J.; Uitterlinden, Andre G.; Weir, David R.; Zhao, Wei; Starr, John M.; Johnson, Andrew D.; Ikram, M. Arfan; Bennett, David A.; Cummings, Steven R.; Deary, Ian J.; Harris, Tamara B.; Kardia, Sharon L. R.; Mosley, Thomas H.; Srikanth, Velandai K.; Windham, Beverly G.; Newman, Ann B.; Walston, Jeremy D.; Davies, Gail; Evans, Daniel S.; Slagboom, Eline P.; Ferrucci, Luigi; Kiel, Douglas P.; Murabito, Joanne M.; Atzmon, Gil

    2017-01-01

    Emerging evidence suggests that the basis for variation in late-life mobility is attributable, in part, to genetic factors, which may become increasingly important with age. Our objective was to systematically assess the contribution of genetic variation to gait speed in older individuals. We conducted a meta-analysis of gait speed GWASs in 31,478 older adults from 17 cohorts of the CHARGE consortium, and validated our results in 2,588 older adults from 4 independent studies. We followed our initial discoveries with network and eQTL analysis of candidate signals in tissues. The meta-analysis resulted in a list of 536 suggestive genome wide significant SNPs in or near 69 genes. Further interrogation with Pathway Analysis placed gait speed as a polygenic complex trait in five major networks. Subsequent eQTL analysis revealed several SNPs significantly associated with the expression of PRSS16, WDSUB1 and PTPRT, which in addition to the meta-analysis and pathway suggested that genetic effects on gait speed may occur through synaptic function and neuronal development pathways. No genome-wide significant signals for gait speed were identified from this moderately large sample of older adults, suggesting that more refined physical function phenotypes will be needed to identify the genetic basis of gait speed in aging. PMID:28077804

  9. Kinetic theory approach to modeling of cellular repair mechanisms under genome stress.

    Directory of Open Access Journals (Sweden)

    Jinpeng Qi

    Full Text Available Under acute perturbations from outer environment, a normal cell can trigger cellular self-defense mechanism in response to genome stress. To investigate the kinetics of cellular self-repair process at single cell level further, a model of DNA damage generating and repair is proposed under acute Ion Radiation (IR by using mathematical framework of kinetic theory of active particles (KTAP. Firstly, we focus on illustrating the profile of Cellular Repair System (CRS instituted by two sub-populations, each of which is made up of the active particles with different discrete states. Then, we implement the mathematical framework of cellular self-repair mechanism, and illustrate the dynamic processes of Double Strand Breaks (DSBs and Repair Protein (RP generating, DSB-protein complexes (DSBCs synthesizing, and toxins accumulating. Finally, we roughly analyze the capability of cellular self-repair mechanism, cellular activity of transferring DNA damage, and genome stability, especially the different fates of a certain cell before and after the time thresholds of IR perturbations that a cell can tolerate maximally under different IR perturbation circumstances.

  10. 一种用于辅助导航的快速图像匹配方法%A Rapid Image Matching Approach for Auxiliary Navigation

    Institute of Scientific and Technical Information of China (English)

    吴政; 冯燕; 陈武

    2009-01-01

    为了提高辅助导航中多传感器图像匹配的精确性和实时性,首先提取图像的边缘特征,并用3-4距离变换(3-4DT)方法对边缘二值图像进行变换,以变换后的边缘距离图像为匹配特征;针对传统Hausdorff距离的局限性提出了一种融合点集重合数的Hausdorff距离,并以之为相似性度量;搜索策略根据人眼视觉系统的机制采用一种由远到近的分层匹配方法,同时使用一种改进的实数编码遗传算法来加快底层图像匹配的速度.实验结果为平均匹配时间为1283ms,平均误差值为1.036,表明匹配方法能满足导航要求.%To improve the velocity and accuracy of multi - sensor image matching in auxiliary navigation, after the edge feature of image is extracted, a 3 -4DT method is applied to transform the edge binary image ,then the transformed edge distance image is taken as the matching feature. A Hausdorff distance integrating points set coincidence numbers (I -HD) is proposed to overcome the limitation of traditional Hauedorff distance, and I -HD can be used as the similarity measure. On the basis of the human visual system, a far - near stratification search strategy is adopt-ed. Meanwhile a genetic algorithm using real - coding is applied to accelerate the speed of matching at the bottom of image matching. Experimental results show that average matching time is 1283ms and average error value is 1.036 , which shows that the method can meet the navigation requirement.

  11. Genome-wide association analysis of bacterial cold water disease resistance in rainbow trout reveals the potential of a hybrid approach between genomic selection and marker assisted selection

    Science.gov (United States)

    Genomic selection (GS) simultaneously incorporates dense SNP marker genotypes with phenotypic data from related animals to predict animal-specific genomic breeding value (GEBV), which circumvents the need to measure the disease phenotype in potential breeders. Marker assisted selection (MAS) involv...

  12. Statistics of polarisation matching

    NARCIS (Netherlands)

    Naus, H.W.L.; Zwamborn, A.P.M.

    2014-01-01

    The reception of electromagnetic signals depends on the polarisation matching of the transmitting and receiving antenna. The practical matching differs from the theoretical one because of the noise deterioration of the transmitted and eventually received electromagnetic field. In other applications,

  13. The 'morbid anatomy' of the human genome: tracing the observational and representational approaches of postwar genetics and biomedicine the William Bynum Prize Essay.

    Science.gov (United States)

    Hogan, Andrew J

    2014-07-01

    This paper explores evolving conceptions and depictions of the human genome among human and medical geneticists during the postwar period. Historians of science and medicine have shown significant interest in the use of informational approaches in postwar genetics, which treat the genome as an expansive digital data set composed of three billion DNA nucleotides. Since the 1950s, however, geneticists have largely interacted with the human genome at the microscopically visible level of chromosomes. Mindful of this, I examine the observational and representational approaches of postwar human and medical genetics. During the 1970s and 1980s, the genome increasingly came to be understood as, at once, a discrete part of the human anatomy and a standardised scientific object. This paper explores the role of influential medical geneticists in recasting the human genome as being a visible, tangible, and legible entity, which was highly relevant to traditional medical thinking and practice. I demonstrate how the human genome was established as an object amenable to laboratory and clinical research, and argue that the observational and representational approaches of postwar medical genetics reflect, more broadly, the interdisciplinary efforts underlying the development of contemporary biomedicine.

  14. Integrated genomic approaches implicate osteoglycin (Ogn) in the regulation of left ventricular mass

    Science.gov (United States)

    Petretto, Enrico; Sarwar, Rizwan; Grieve, Ian; Lu, Han; Kumaran, Mande K; Muckett, Phillip J; Mangion, Jonathan; Schroen, Blanche; Benson, Matthew; Punjabi, Prakash P; Prasad, Sanjay K; Pennell, Dudley J; Kiesewetter, Chris; Tasheva, Elena S; Corpuz, Lolita M; Webb, Megan D; Conrad, Gary W; Kurtz, Theodore W; Kren, Vladimir; Fischer, Judith; Hubner, Norbert; Pinto, Yigal M; Pravenec, Michal; Aitman, Timothy J; Cook, Stuart A

    2009-01-01

    Left ventricular mass (LVM) and cardiac gene expression are complex traits regulated by factors both intrinsic and extrinsic to the heart. To dissect the major determinants of LVM, we combined expression quantitative trait locus1 and quantitative trait transcript2 (QTT) analyses of the cardiac transcriptome in the rat. Using these methods and in vitro functional assays, we identified osteoglycin (Ogn) as a major candidate regulator of rat LVM, with increased Ogn protein expression associated with elevated LVM. We also applied genome-wide QTT analysis to the human heart and observed that, out of ~22,000 transcripts, OGN transcript abundance had the highest correlation with LVM. We further confirmed a role for Ogn in the in vivo regulation of LVM in Ogn knockout mice. Taken together, these data implicate Ogn as a key regulator of LVM in rats, mice and humans, and suggest that Ogn modifies the hypertrophic response to extrinsic factors such as hypertension and aortic stenosis. PMID:18443592

  15. Perspective: NanoMine: A material genome approach for polymer nanocomposites analysis and design

    Science.gov (United States)

    Zhao, He; Li, Xiaolin; Zhang, Yichi; Schadler, Linda S.; Chen, Wei; Brinson, L. Catherine

    2016-05-01

    Polymer nanocomposites are a designer class of materials where nanoscale particles, functional chemistry, and polymer resin combine to provide materials with unprecedented combinations of physical properties. In this paper, we introduce NanoMine, a data-driven web-based platform for analysis and design of polymer nanocomposite systems under the material genome concept. This open data resource strives to curate experimental and computational data on nanocomposite processing, structure, and properties, as well as to provide analysis and modeling tools that leverage curated data for material property prediction and design. With a continuously expanding dataset and toolkit, NanoMine encourages community feedback and input to construct a sustainable infrastructure that benefits nanocomposite material research and development.

  16. A genome-based identification approach for members of the genus Bifidobacterium.

    Science.gov (United States)

    Ferrario, Chiara; Milani, Christian; Mancabelli, Leonardo; Lugli, Gabriele Andrea; Turroni, Francesca; Duranti, Sabrina; Mangifesta, Marta; Viappiani, Alice; Sinderen, Douwe van; Ventura, Marco

    2015-03-01

    During recent years, the significant and increasing interest in novel bifidobacterial strains with health-promoting characteristics has catalyzed the development of methods for efficient and reliable identification of Bifidobacterium strains at (sub) species level. We developed an assay based on recently acquired bifidobacterial genomic data and involving 98 primer pairs, called the Bifidobacterium-ampliseq panel. This panel includes multiplex PCR primers that target both core and variable genes of the pangenome of this genus. Our results demonstrate that the employment of the Bifidobacterium-ampliseq panel allows rapid and specific identification of the so far recognized 48 (sub)species harboring the Bifidobacterium genus, and thus represents a cost- and time-effective bifidobacterial screening methodology.

  17. Development of bioleaching: proteomics and genomics approach in metals extraction process

    Directory of Open Access Journals (Sweden)

    M. Azizur Rahman

    2016-08-01

    Full Text Available Microbes are key components of the structure and function of bioleaching process. Increasing consciousness of the role of microbes has led to a quick growth of descriptive and investigational studies of their abundance and activities. However, the detail information of complex functional molecules contain in promising microbes which are very important for understanding microbial processes in bioleaching, are lacking. Therefore, molecular functions of microbes in the bioleaching process are very essential to understand about the microbial activities, especially in the process of the extraction of metals in mineral industries. In this review, the current state of proteomics and genomics of bioleaching in metals extraction processes and the major developments of these analytical methods at industrial scales are highlighted.

  18. A strategic stakeholder approach for addressing further analysis requests in whole genome sequencing research.

    Science.gov (United States)

    Thornock, Bradley Steven O

    2016-01-01

    Whole genome sequencing (WGS) can be a cost-effective and efficient means of diagnosis for some children, but it also raises a number of ethical concerns. One such concern is how researchers derive and communicate results from WGS, including future requests for further analysis of stored sequences. The purpose of this paper is to think about what is at stake, and for whom, in any solution that is developed to deal with such requests. To accomplish this task, this paper will utilize stakeholder theory, a common method used in business ethics. Several scenarios that connect stakeholder concerns and WGS will also posited and analyzed. This paper concludes by developing criteria composed of a series of questions that researchers can answer in order to more effectively address requests for further analysis of stored sequences.

  19. Identification of a strawberry flavor gene candidate using an integrated genetic-genomic-analytical chemistry approach.

    Science.gov (United States)

    Chambers, Alan H; Pillet, Jeremy; Plotto, Anne; Bai, Jinhe; Whitaker, Vance M; Folta, Kevin M

    2014-04-17

    There is interest in improving the flavor of commercial strawberry (Fragaria × ananassa) varieties. Fruit flavor is shaped by combinations of sugars, acids and volatile compounds. Many efforts seek to use genomics-based strategies to identify genes controlling flavor, and then designing durable molecular markers to follow these genes in breeding populations. In this report, fruit from two cultivars, varying for presence-absence of volatile compounds, along with segregating progeny, were analyzed using GC/MS and RNAseq. Expression data were bulked in silico according to presence/absence of a given volatile compound, in this case γ-decalactone, a compound conferring a peach flavor note to fruits. Computationally sorting reads in segregating progeny based on γ-decalactone presence eliminated transcripts not directly relevant to the volatile, revealing transcripts possibly imparting quantitative contributions. One candidate encodes an omega-6 fatty acid desaturase, an enzyme known to participate in lactone production in fungi, noted here as FaFAD1. This candidate was induced by ripening, was detected in certain harvests, and correlated with γ-decalactone presence. The FaFAD1 gene is present in every genotype where γ-decalactone has been detected, and it was invariably missing in non-producers. A functional, PCR-based molecular marker was developed that cosegregates with the phenotype in F1 and BC1 populations, as well as in many other cultivars and wild Fragaria accessions. Genetic, genomic and analytical chemistry techniques were combined to identify FaFAD1, a gene likely controlling a key flavor volatile in strawberry. The same data may now be re-sorted based on presence/absence of any other volatile to identify other flavor-affecting candidates, leading to rapid generation of gene-specific markers.

  20. A New Approach to Predict Microbial Community Assembly and Function Using a Stochastic, Genome-Enabled Modeling Framework

    Science.gov (United States)

    King, E.; Brodie, E.; Anantharaman, K.; Karaoz, U.; Bouskill, N.; Banfield, J. F.; Steefel, C. I.; Molins, S.

    2016-12-01

    Characterizing and predicting the microbial and chemical compositions of subsurface aquatic systems necessitates an understanding of the metabolism and physiology of organisms that are often uncultured or studied under conditions not relevant for one's environment of interest. Cultivation-independent approaches are therefore important and have greatly enhanced our ability to characterize functional microbial diversity. The capability to reconstruct genomes representing thousands of populations from microbial communities using metagenomic techniques provides a foundation for development of predictive models for community structure and function. Here, we discuss a genome-informed stochastic trait-based model incorporated into a reactive transport framework to represent the activities of coupled guilds of hypothetical microorganisms. Metabolic pathways for each microbe within a functional guild are parameterized from metagenomic data with a unique combination of traits governing organism fitness under dynamic environmental conditions. We simulate the thermodynamics of coupled electron donor and acceptor reactions to predict the energy available for cellular maintenance, respiration, biomass development, and enzyme production. While `omics analyses can now characterize the metabolic potential of microbial communities, it is functionally redundant as well as computationally prohibitive to explicitly include the thousands of recovered organisms into biogeochemical models. However, one can derive potential metabolic pathways from genomes along with trait-linkages to build probability distributions of traits. These distributions are used to assemble groups of microbes that couple one or more of these pathways. From the initial ensemble of microbes, only a subset will persist based on the interaction of their physiological and metabolic traits with environmental conditions, competing organisms, etc. Here, we analyze the predicted niches of these hypothetical microbes and

  1. Retroviral integration process in the human genome: is it really non-random? A new statistical approach.

    Directory of Open Access Journals (Sweden)

    Alessandro Ambrosi

    Full Text Available Retroviral vectors are widely used in gene therapy to introduce therapeutic genes into patients' cells, since, once delivered to the nucleus, the genes of interest are stably inserted (integrated into the target cell genome. There is now compelling evidence that integration of retroviral vectors follows non-random patterns in mammalian genome, with a preference for active genes and regulatory regions. In particular, Moloney Leukemia Virus (MLV-derived vectors show a tendency to integrate in the proximity of the transcription start site (TSS of genes, occasionally resulting in the deregulation of gene expression and, where proto-oncogenes are targeted, in tumor initiation. This has drawn the attention of the scientific community to the molecular determinants of the retroviral integration process as well as to statistical methods to evaluate the genome-wide distribution of integration sites. In recent approaches, the observed distribution of MLV integration distances (IDs from the TSS of the nearest gene is assumed to be non-random by empirical comparison with a random distribution generated by computational simulation procedures. To provide a statistical procedure to test the randomness of the retroviral insertion pattern, we propose a probability model (Beta distribution based on IDs between two consecutive genes. We apply the procedure to a set of 595 unique MLV insertion sites retrieved from human hematopoietic stem/progenitor cells. The statistical goodness of fit test shows the suitability of this distribution to the observed data. Our statistical analysis confirms the preference of MLV-based vectors to integrate in promoter-proximal regions.

  2. Rethinking the Match: A Proposal for Modern Match-Making.

    Science.gov (United States)

    Ray, Chris; Bishop, Steven E; Dow, Alan W

    2017-06-27

    Since the 1950s, the National Resident Matching Program, or "the Match," has governed the placement of medical students into residencies. The Match was created to protect students in an era when residency positions outnumbered applicants and hospitals pressured students early in their academic careers to commit to a residency position. Now, however, applicants outnumber positions, applicants are applying to increasing numbers of programs, and the costs of the Match for applicants and programs are high. Meanwhile, medical education is evolving toward a competency-based approach, a U.S. physician shortage is predicted, and some researchers describe a "July effect"-worse clinical outcomes correlated with the mass entry of new residents.Against this background, the authors argue for adopting a more modern, free-market approach to residency match-making that might better suit the needs of applicants, programs, and the public. They propose allowing students who have been identified by their medical schools as having achieved graduation-level competency to apply to residency programs at any point during the year. Residency programs would set their own application timetables and extend offers in an ongoing fashion. Students, counseled by their schools, would accept or decline offers as desired. The authors argue this approach would better support competency-based education while allowing applicants and programs more choice regarding how they engage and adapt within the selection process. The approach's staggered start times for new residents might attenuate the July effect and improve outcomes for patients. Medical students might also enter and thereby complete residency earlier, increasing the physician workforce.

  3. A Boyer-Moore Approach to Degenerate Pattern Matching%基于BM方法的退化模式匹配算法

    Institute of Scientific and Technical Information of China (English)

    林劼; 林舒晔

    2012-01-01

    退化模式匹配问题在生物信息学中具有重要应用意义,但由于该问题的计算复杂度高,现有的算法均难以在实际中应用.在分析退化模式的特点以及经典的Boyer-Moore (BM)算法的基础上,提出基于BM算法框架解决退化模式匹配问题的方法.在计算偏移数组的预处理过程中,定义兼容规则并计算偏移数组,并将其应用在查找阶段,提高退化模式的匹配速度.在平均情况下,该算法提供了线性的模式匹配速度,在实际应用中得到良好的效果.%Degenerated pattern matching problem has important applications in biology sequences, however, due to the computational complexity of the problem, no existing algorithms can be used in practice. After analyzing the characteristics of degenerated pattern matching problem and classical Boyer-Moore (BM) algorithm, it proposes a practical BM based algorithm to tackle the problem. In the pre-process of computing shift arrays, the algorithm defines comparable rules and arrays, and uses them in the searching phase to improve the matching speed. In average case, the algorithm provides a linear time complexity which can be efficiently used in practice.

  4. Megathrust Earthquake Swarms Contemporaneous to Slow Slip and Non-Volcanic Tremor in Southern Mexico, Detected and Analyzed through a Template Matching Approach

    Science.gov (United States)

    Holtkamp, S.; Brudzinski, M. R.; Cabral-Cano, E.; Arciniega-Ceballos, A.

    2012-12-01

    An outstanding question in geophysics is the degree to which the newly discovered types of slow fault slip are related to their destructive cousin - the earthquake. Here, we utilize a local network along the Oaxacan segment of the Middle American subduction zone to investigate the potential relationship between slow slip, non-volcanic tremor (NVT), and earthquakes along the subduction megathrust. We have developed a multi-station "template matching" waveform cross correlation technique which is able to detect and locate events several orders of magnitude smaller than would be possible using more traditional techniques. Also, our template matching procedure is capable of consistently locate events which occur during periods of increased background activity (e.g., during productive NVT, loud cultural noise, or after larger earthquakes) because the multi-station detector is finely tuned to events with similar hypocentral location and focal mechanism. The local network in the Oaxaca region allows us to focus on documented megathrust earthquake swarms, which we focus on because slow slip is hypothesized to be the cause for earthquake swarms in some tectonic environments. We identify a productive earthquake swarm in July 2006 (~600 similar earthquakes detected), which occurred during a week-long episode of productive tremor and slow slip. Families of events in this sequence were also active during larger and longer slow slip events, which provides a potential link between slow slip in the transition zone and earthquakes at the downdip end of the seismogenic portion of the megathrust. Because template matching techniques only detect similar signals, detected waveforms can be stacked together to produce higher signal to noise ratios or cross correlated against each other to produce precise relative phase arrival times. We are using the refined signals to look for evidence of expansion or propagation of hypocenters during these earthquake swarms, which could be used as a

  5. Improved genome-scale multi-target virtual screening via a novel collaborative filtering approach to cold-start problem

    Science.gov (United States)

    Lim, Hansaim; Gray, Paul; Xie, Lei; Poleksic, Aleksandar

    2016-12-01

    Conventional one-drug-one-gene approach has been of limited success in modern drug discovery. Polypharmacology, which focuses on searching for multi-targeted drugs to perturb disease-causing networks instead of designing selective ligands to target individual proteins, has emerged as a new drug discovery paradigm. Although many methods for single-target virtual screening have been developed to improve the efficiency of drug discovery, few of these algorithms are designed for polypharmacology. Here, we present a novel theoretical framework and a corresponding algorithm for genome-scale multi-target virtual screening based on the one-class collaborative filtering technique. Our method overcomes the sparseness of the protein-chemical interaction data by means of interaction matrix weighting and dual regularization from both chemicals and proteins. While the statistical foundation behind our method is general enough to encompass genome-wide drug off-target prediction, the program is specifically tailored to find protein targets for new chemicals with little to no available interaction data. We extensively evaluate our method using a number of the most widely accepted gene-specific and cross-gene family benchmarks and demonstrate that our method outperforms other state-of-the-art algorithms for predicting the interaction of new chemicals with multiple proteins. Thus, the proposed algorithm may provide a powerful tool for multi-target drug design.

  6. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance.

    Science.gov (United States)

    Manning, Alisa K; Hivert, Marie-France; Scott, Robert A; Grimsby, Jonna L; Bouatia-Naji, Nabila; Chen, Han; Rybin, Denis; Liu, Ching-Ti; Bielak, Lawrence F; Prokopenko, Inga; Amin, Najaf; Barnes, Daniel; Cadby, Gemma; Hottenga, Jouke-Jan; Ingelsson, Erik; Jackson, Anne U; Johnson, Toby; Kanoni, Stavroula; Ladenvall, Claes; Lagou, Vasiliki; Lahti, Jari; Lecoeur, Cecile; Liu, Yongmei; Martinez-Larrad, Maria Teresa; Montasser, May E; Navarro, Pau; Perry, John R B; Rasmussen-Torvik, Laura J; Salo, Perttu; Sattar, Naveed; Shungin, Dmitry; Strawbridge, Rona J; Tanaka, Toshiko; van Duijn, Cornelia M; An, Ping; de Andrade, Mariza; Andrews, Jeanette S; Aspelund, Thor; Atalay, Mustafa; Aulchenko, Yurii; Balkau, Beverley; Bandinelli, Stefania; Beckmann, Jacques S; Beilby, John P; Bellis, Claire; Bergman, Richard N; Blangero, John; Boban, Mladen; Boehnke, Michael; Boerwinkle, Eric; Bonnycastle, Lori L; Boomsma, Dorret I; Borecki, Ingrid B; Böttcher, Yvonne; Bouchard, Claude; Brunner, Eric; Budimir, Danijela; Campbell, Harry; Carlson, Olga; Chines, Peter S; Clarke, Robert; Collins, Francis S; Corbatón-Anchuelo, Arturo; Couper, David; de Faire, Ulf; Dedoussis, George V; Deloukas, Panos; Dimitriou, Maria; Egan, Josephine M; Eiriksdottir, Gudny; Erdos, Michael R; Eriksson, Johan G; Eury, Elodie; Ferrucci, Luigi; Ford, Ian; Forouhi, Nita G; Fox, Caroline S; Franzosi, Maria Grazia; Franks, Paul W; Frayling, Timothy M; Froguel, Philippe; Galan, Pilar; de Geus, Eco; Gigante, Bruna; Glazer, Nicole L; Goel, Anuj; Groop, Leif; Gudnason, Vilmundur; Hallmans, Göran; Hamsten, Anders; Hansson, Ola; Harris, Tamara B; Hayward, Caroline; Heath, Simon; Hercberg, Serge; Hicks, Andrew A; Hingorani, Aroon; Hofman, Albert; Hui, Jennie; Hung, Joseph; Jarvelin, Marjo-Riitta; Jhun, Min A; Johnson, Paul C D; Jukema, J Wouter; Jula, Antti; Kao, W H; Kaprio, Jaakko; Kardia, Sharon L R; Keinanen-Kiukaanniemi, Sirkka; Kivimaki, Mika; Kolcic, Ivana; Kovacs, Peter; Kumari, Meena; Kuusisto, Johanna; Kyvik, Kirsten Ohm; Laakso, Markku; Lakka, Timo; Lannfelt, Lars; Lathrop, G Mark; Launer, Lenore J; Leander, Karin; Li, Guo; Lind, Lars; Lindstrom, Jaana; Lobbens, Stéphane; Loos, Ruth J F; Luan, Jian'an; Lyssenko, Valeriya; Mägi, Reedik; Magnusson, Patrik K E; Marmot, Michael; Meneton, Pierre; Mohlke, Karen L; Mooser, Vincent; Morken, Mario A; Miljkovic, Iva; Narisu, Narisu; O'Connell, Jeff; Ong, Ken K; Oostra, Ben A; Palmer, Lyle J; Palotie, Aarno; Pankow, James S; Peden, John F; Pedersen, Nancy L; Pehlic, Marina; Peltonen, Leena; Penninx, Brenda; Pericic, Marijana; Perola, Markus; Perusse, Louis; Peyser, Patricia A; Polasek, Ozren; Pramstaller, Peter P; Province, Michael A; Räikkönen, Katri; Rauramaa, Rainer; Rehnberg, Emil; Rice, Ken; Rotter, Jerome I; Rudan, Igor; Ruokonen, Aimo; Saaristo, Timo; Sabater-Lleal, Maria; Salomaa, Veikko; Savage, David B; Saxena, Richa; Schwarz, Peter; Seedorf, Udo; Sennblad, Bengt; Serrano-Rios, Manuel; Shuldiner, Alan R; Sijbrands, Eric J G; Siscovick, David S; Smit, Johannes H; Small, Kerrin S; Smith, Nicholas L; Smith, Albert Vernon; Stančáková, Alena; Stirrups, Kathleen; Stumvoll, Michael; Sun, Yan V; Swift, Amy J; Tönjes, Anke; Tuomilehto, Jaakko; Trompet, Stella; Uitterlinden, Andre G; Uusitupa, Matti; Vikström, Max; Vitart, Veronique; Vohl, Marie-Claude; Voight, Benjamin F; Vollenweider, Peter; Waeber, Gerard; Waterworth, Dawn M; Watkins, Hugh; Wheeler, Eleanor; Widen, Elisabeth; Wild, Sarah H; Willems, Sara M; Willemsen, Gonneke; Wilson, James F; Witteman, Jacqueline C M; Wright, Alan F; Yaghootkar, Hanieh; Zelenika, Diana; Zemunik, Tatijana; Zgaga, Lina; Wareham, Nicholas J; McCarthy, Mark I; Barroso, Ines; Watanabe, Richard M; Florez, Jose C; Dupuis, Josée; Meigs, James B; Langenberg, Claudia

    2012-05-13

    Recent genome-wide association studies have described many loci implicated in type 2 diabetes (T2D) pathophysiology and β-cell dysfunction but have contributed little to the understanding of the genetic basis of insulin resistance. We hypothesized that genes implicated in insulin resistance pathways might be uncovered by accounting for differences in body mass index (BMI) and potential interactions between BMI and genetic variants. We applied a joint meta-analysis approach to test associations with fasting insulin and glucose on a genome-wide scale. We present six previously unknown loci associated with fasting insulin at P < 5 × 10(-8) in combined discovery and follow-up analyses of 52 studies comprising up to 96,496 non-diabetic individuals. Risk variants were associated with higher triglyceride and lower high-density lipoprotein (HDL) cholesterol levels, suggesting a role for these loci in insulin resistance pathways. The discovery of these loci will aid further characterization of the role of insulin resistance in T2D pathophysiology.

  7. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance

    Science.gov (United States)

    Manning, Alisa K.; Hivert, Marie-France; Scott, Robert A.; Grimsby, Jonna L.; Bouatia-Naji, Nabila; Chen, Han; Rybin, Denis; Liu, Ching-Ti; Bielak, Lawrence F.; Prokopenko, Inga; Amin, Najaf; Barnes, Daniel; Cadby, Gemma; Hottenga, Jouke-Jan; Ingelsson, Erik; Jackson, Anne U.; Johnson, Toby; Kanoni, Stavroula; Ladenvall, Claes; Lagou, Vasiliki; Lahti, Jari; Lecoeur, Cecile; Liu, Yongmei; Martinez-Larrad, Maria Teresa; Montasser, May E.; Navarro, Pau; Perry, John R. B.; Rasmussen-Torvik, Laura J.; Salo, Perttu; Sattar, Naveed; Shungin, Dmitry; Strawbridge, Rona J.; Tanaka, Toshiko; van Duijn, Cornelia M.; An, Ping; de Andrade, Mariza; Andrews, Jeanette S.; Aspelund, Thor; Atalay, Mustafa; Aulchenko, Yurii; Balkau, Beverley; Bandinelli, Stefania; Beckmann, Jacques S.; Beilby, John P.; Bellis, Claire; Bergman, Richard N.; Blangero, John; Boban, Mladen; Boehnke, Michael; Boerwinkle, Eric; Bonnycastle, Lori L.; Boomsma, Dorret I.; Borecki, Ingrid B.; Böttcher, Yvonne; Bouchard, Claude; Brunner, Eric; Budimir, Danijela; Campbell, Harry; Carlson, Olga; Chines, Peter S.; Clarke, Robert; Collins, Francis S.; Corbatón-Anchuelo, Arturo; Couper, David; de Faire, Ulf; Dedoussis, George V; Deloukas, Panos; Dimitriou, Maria; Egan, Josephine M; Eiriksdottir, Gudny; Erdos, Michael R.; Eriksson, Johan G.; Eury, Elodie; Ferrucci, Luigi; Ford, Ian; Forouhi, Nita G.; Fox, Caroline S; Franzosi, Maria Grazia; Franks, Paul W; Frayling, Timothy M; Froguel, Philippe; Galan, Pilar; de Geus, Eco; Gigante, Bruna; Glazer, Nicole L.; Goel, Anuj; Groop, Leif; Gudnason, Vilmundur; Hallmans, Göran; Hamsten, Anders; Hansson, Ola; Harris, Tamara B.; Hayward, Caroline; Heath, Simon; Hercberg, Serge; Hicks, Andrew A.; Hingorani, Aroon; Hofman, Albert; Hui, Jennie; Hung, Joseph; Jarvelin, Marjo Riitta; Jhun, Min A.; Johnson, Paul C.D.; Jukema, J Wouter; Jula, Antti; Kao, W.H.; Kaprio, Jaakko; Kardia, Sharon L. R.; Keinanen-Kiukaanniemi, Sirkka; Kivimaki, Mika; Kolcic, Ivana; Kovacs, Peter; Kumari, Meena; Kuusisto, Johanna; Kyvik, Kirsten Ohm; Laakso, Markku; Lakka, Timo; Lannfelt, Lars; Lathrop, G Mark; Launer, Lenore J.; Leander, Karin; Li, Guo; Lind, Lars; Lindstrom, Jaana; Lobbens, Stéphane; Loos, Ruth J. F.; Luan, Jian’an; Lyssenko, Valeriya; Mägi, Reedik; Magnusson, Patrik K. E.; Marmot, Michael; Meneton, Pierre; Mohlke, Karen L.; Mooser, Vincent; Morken, Mario A.; Miljkovic, Iva; Narisu, Narisu; O’Connell, Jeff; Ong, Ken K.; Oostra, Ben A.; Palmer, Lyle J.; Palotie, Aarno; Pankow, James S.; Peden, John F.; Pedersen, Nancy L.; Pehlic, Marina; Peltonen, Leena; Penninx, Brenda; Pericic, Marijana; Perola, Markus; Perusse, Louis; Peyser, Patricia A; Polasek, Ozren; Pramstaller, Peter P.; Province, Michael A.; Räikkönen, Katri; Rauramaa, Rainer; Rehnberg, Emil; Rice, Ken; Rotter, Jerome I.; Rudan, Igor; Ruokonen, Aimo; Saaristo, Timo; Sabater-Lleal, Maria; Salomaa, Veikko; Savage, David B.; Saxena, Richa; Schwarz, Peter; Seedorf, Udo; Sennblad, Bengt; Serrano-Rios, Manuel; Shuldiner, Alan R.; Sijbrands, Eric J.G.; Siscovick, David S.; Smit, Johannes H.; Small, Kerrin S.; Smith, Nicholas L.; Smith, Albert Vernon; Stančáková, Alena; Stirrups, Kathleen; Stumvoll, Michael; Sun, Yan V.; Swift, Amy J.; Tönjes, Anke; Tuomilehto, Jaakko; Trompet, Stella; Uitterlinden, Andre G.; Uusitupa, Matti; Vikström, Max; Vitart, Veronique; Vohl, Marie-Claude; Voight, Benjamin F.; Vollenweider, Peter; Waeber, Gerard; Waterworth, Dawn M; Watkins, Hugh; Wheeler, Eleanor; Widen, Elisabeth; Wild, Sarah H.; Willems, Sara M.; Willemsen, Gonneke; Wilson, James F.; Witteman, Jacqueline C.M.; Wright, Alan F.; Yaghootkar, Hanieh; Zelenika, Diana; Zemunik, Tatijana; Zgaga, Lina; Wareham, Nicholas J.; McCarthy, Mark I.; Barroso, Ines; Watanabe, Richard M.; Florez, Jose C.; Dupuis, Josée; Meigs, James B.; Langenberg, Claudia

    2013-01-01

    Recent genome-wide association studies have described many loci implicated in type 2 diabetes (T2D) pathophysiology and beta-cell dysfunction, but contributed little to our understanding of the genetic basis of insulin resistance. We hypothesized that genes implicated in insulin resistance pathways may be uncovered by accounting for differences in body mass index (BMI) and potential interaction between BMI and genetic variants. We applied a novel joint meta-analytical approach to test associations with fasting insulin (FI) and glucose (FG) on a genome-wide scale. We present six previously unknown FI loci at Pdiscovery and follow-up analyses of 52 studies comprising up to 96,496non-diabetic individuals. Risk variants were associated with higher triglyceride and lower HDL cholesterol levels, suggestive of a role for these FI loci in insulin resistance pathways. The localization of these additional loci will aid further characterization of the role of insulin resistance in T2D pathophysiology. PMID:22581228

  8. swDMR: A Sliding Window Approach to Identify Differentially Methylated Regions Based on Whole Genome Bisulfite Sequencing.

    Directory of Open Access Journals (Sweden)

    Zhen Wang

    Full Text Available DNA methylation is a widespread epigenetic modification that plays an essential role in gene expression through transcriptional regulation and chromatin remodeling. The emergence of whole genome bisulfite sequencing (WGBS represents an important milestone in the detection of DNA methylation. Characterization of differential methylated regions (DMRs is fundamental as well for further functional analysis. In this study, we present swDMR (http://sourceforge.net/projects/swDMR/ for the comprehensive analysis of DMRs from whole genome methylation profiles by a sliding window approach. It is an integrated tool designed for WGBS data, which not only implements accessible statistical methods to perform hypothesis test adapted to two or more samples without replicates, but false discovery rate was also controlled by multiple test correction. Downstream analysis tools were also provided, including cluster, annotation and visualization modules. In summary, based on WGBS data, swDMR can produce abundant information of differential methylated regions. As a convenient and flexible tool, we believe swDMR will bring us closer to unveil the potential functional regions involved in epigenetic regulation.

  9. Hierarchical model of matching

    Science.gov (United States)

    Pedrycz, Witold; Roventa, Eugene

    1992-01-01

    The issue of matching two fuzzy sets becomes an essential design aspect of many algorithms including fuzzy controllers, pattern classifiers, knowledge-based systems, etc. This paper introduces a new model of matching. Its principal features involve the following: (1) matching carried out with respect to the grades of membership of fuzzy sets as well as some functionals defined on them (like energy, entropy,transom); (2) concepts of hierarchies in the matching model leading to a straightforward distinction between 'local' and 'global' levels of matching; and (3) a distributed character of the model realized as a logic-based neural network.

  10. Best matching theory & applications

    CERN Document Server

    Moghaddam, Mohsen

    2017-01-01

    Mismatch or best match? This book demonstrates that best matching of individual entities to each other is essential to ensure smooth conduct and successful competitiveness in any distributed system, natural and artificial. Interactions must be optimized through best matching in planning and scheduling, enterprise network design, transportation and construction planning, recruitment, problem solving, selective assembly, team formation, sensor network design, and more. Fundamentals of best matching in distributed and collaborative systems are explained by providing: § Methodical analysis of various multidimensional best matching processes § Comprehensive taxonomy, comparing different best matching problems and processes § Systematic identification of systems’ hierarchy, nature of interactions, and distribution of decision-making and control functions § Practical formulation of solutions based on a library of best matching algorithms and protocols, ready for direct applications and apps development. Design...

  11. A new approach for cloning hLIF cDNA from genomic DNA isolated from the oral mucous membrane.

    Science.gov (United States)

    Cui, Y H; Zhu, G Q; Chen, Q J; Wang, Y F; Yang, M M; Song, Y X; Wang, J G; Cao, B Y

    2011-11-25

    Complementary DNA (cDNA) is valuable for investigating protein structure and function in the study of life science, but it is difficult to obtain by traditional reverse transcription. We employed a novel strategy to clone human leukemia inhibitory factor (hLIF) gene cDNA from genomic DNA, which was directly isolated from the mucous membrane of mouth. The hLIF sequence, which is 609 bp long and is composed of three exons, can be acquired within a few hours by amplifying each exon and splicing all of them using overlap-PCR. This new approach developed is simple, time- and cost-effective, without RNA preparation or cDNA synthesis, and is not limited to the specific tissues for a particular gene and the expression level of the gene.

  12. Integrated pathway-based approach identifies association between genomic regions at CTCF and CACNB2 and schizophrenia.

    Directory of Open Access Journals (Sweden)

    Dilafruz Juraeva

    2014-06-01

    Full Text Available In the present study, an integrated hierarchical approach was applied to: (1 identify pathways associated with susceptibility to schizophrenia; (2 detect genes that may be potentially affected in these pathways since they contain an associated polymorphism; and (3 annotate the functional consequences of such single-nucleotide polymorphisms (SNPs in the affected genes or their regulatory regions. The Global Test was applied to detect schizophrenia-associated pathways using discovery and replication datasets comprising 5,040 and 5,082 individuals of European ancestry, respectively. Information concerning functional gene-sets was retrieved from the Kyoto Encyclopedia of Genes and Genomes, Gene Ontology, and the Molecular Signatures Database. Fourteen of the gene-sets or pathways identified in the discovery dataset were confirmed in the replication dataset. These include functional processes involved in transcriptional regulation and gene expression, synapse organization, cell adhesion, and apoptosis. For two genes, i.e. CTCF and CACNB2, evidence for association with schizophrenia was available (at the gene-level in both the discovery study and published data from the Psychiatric Genomics Consortium schizophrenia study. Furthermore, these genes mapped to four of the 14 presently identified pathways. Several of the SNPs assigned to CTCF and CACNB2 have potential functional consequences, and a gene in close proximity to CACNB2, i.e. ARL5B, was identified as a potential gene of interest. Application of the present hierarchical approach thus allowed: (1 identification of novel biological gene-sets or pathways with potential involvement in the etiology of schizophrenia, as well as replication of these findings in an independent cohort; (2 detection of genes of interest for future follow-up studies; and (3 the highlighting of novel genes in previously reported candidate regions for schizophrenia.

  13. A Bayesian Approach for Analysis of Whole-Genome Bisulphite Sequencing Data Identifies Disease-Associated Changes in DNA Methylation.

    Science.gov (United States)

    Rackham, Owen J L; Langley, Sarah R; Oates, Thomas; Vradi, Eleni; Harmston, Nathan; Srivastava, Prashant K; Behmoaras, Jacques; Dellaportas, Petros; Bottolo, Leonardo; Petretto, Enrico

    2017-02-17

    DNA methylation is a key epigenetic modification involved in gene regulation whose contribution to disease susceptibility remains to be fully understood. Here, we present a novel Bayesian smoothing approach (called ABBA) to detect differentially methylated regions (DMRs) from whole-genome bisulphite sequencing (WGBS). We also show how this approach can be leveraged to identify disease-associated changes in DNA methylation, suggesting mechanisms through which these alterations might affect disease. From a data modeling perspective, ABBA has the distinctive feature of automatically adapting to different correlation structures in CpG methylation levels across the genome whilst taking into account the distance between CpG sites as a covariate. Our simulation study shows that ABBA has greater power to detect DMRs than existing methods, providing an accurate identification of DMRs in the large majority of simulated cases. To empirically demonstrate the method's efficacy in generating biological hypotheses, we performed WGBS of primary macrophages derived from an experimental rat system of glomerulonephritis and used ABBA to identify >1,000 disease-associated DMRs. Investigation of these DMRs revealed differential DNA methylation localized to a 600bp region in the promoter of the Ifitm3 gene. This was confirmed by ChIP-seq and RNA-seq analyses, showing differential transcription factor binding at the Ifitm3 promoter by JunD (an established determinant of glomerulonephritis) and a consistent change in Ifitm3 expression. Our ABBA analysis allowed us to propose a new role for Ifitm3 in the pathogenesis of glomerulonephritis via a mechanism involving promoter hypermethylation that is associated with Ifitm3 repression in the rat strain susceptible to glomerulonephritis.

  14. Genomic research with human samples. Points of view from scientists and research subjects about disclosure of results and risks of genomic research. Ethical and empirical approach.

    Science.gov (United States)

    Valle Mansilla, José Ignacio

    2011-01-01

    Biomedical researchers often now ask subjects to donate samples to be deposited in biobanks. This is not only of interest to researchers, patients and society as a whole can benefit from the improvements in diagnosis, treatment, and prevention that the advent of genomic medicine portends. However, there is a growing debate regarding the social and ethical implications of creating biobanks and using stored human tissue samples for genomic research. Our aim was to identify factors related to both scientists and patients' preferences regarding the sort of information to convey to subjects about the results of the study and the risks related to genomic research. The method used was a survey addressed to 204 scientists and 279 donors from the U.S. and Spain. In this sample, researchers had already published genomic epidemiology studies; and research subjects had actually volunteered to donate a human sample for genomic research. Concerning the results, patients supported more frequently than scientists their right to know individual results from future genomic research. These differences were statistically significant after adjusting by the opportunity to receive genetic research results from the research they had previously participated and their perception of risks regarding genetic information compared to other clinical data. A slight majority of researchers supported informing participants about individual genomic results only if the reliability and clinical validity of the information had been established. Men were more likely than women to believe that patients should be informed of research results even if these conditions were not met. Also among patients, almost half of them would always prefer to be informed about individual results from future genomic research. The three main factors associated to a higher support of a non-limited access to individual results were: being from the US, having previously been offered individual information and considering

  15. Hepatitis B virus infection in Latin America: A genomic medicine approach

    Science.gov (United States)

    Roman, Sonia; Jose-Abrego, Alexis; Fierro, Nora Alma; Escobedo-Melendez, Griselda; Ojeda-Granados, Claudia; Martinez-Lopez, Erika; Panduro, Arturo

    2014-01-01

    Hepatitis B virus (HBV) infection is the leading cause of severe chronic liver disease. This article provides a critical view of the importance of genomic medicine for the study of HBV infection and its clinical outcomes in Latin America. Three levels of evolutionary adaptation may correlate with the clinical outcomes of HBV infection. Infections in Latin America are predominantly of genotype H in Mexico and genotype F in Central and South America; these strains have historically circulated among the indigenous population. Both genotypes appear to be linked to a benign course of disease among the native and mestizo Mexicans and native South Americans. In contrast, genotypes F, A and D are common in acute and chronic infections among mestizos with Caucasian ancestry. Hepatocellular carcinoma is rare in Mexicans, but it has been associated with genotype F1b among Argentineans. This observation illustrates the significance of ascertaining the genetic and environmental factors involved in the development of HBV-related liver disease in Latin America, which contrast with those reported in other regions of the world. PMID:24966588

  16. Functional genomics of the brain: uncovering networks in the CNS using a systems approach.

    Science.gov (United States)

    Konopka, Genevieve

    2011-01-01

    The central nervous system (CNS) is undoubtedly the most complex human organ system in terms of its diverse functions, cellular composition, and connections. Attempts to capture this diversity experimentally were the foundation on which the field of neurobiology was built. Until now though, techniques were either painstakingly slow or insufficient in capturing this heterogeneity. In addition, the combination of multiple layers of information needed for a complete picture of neuronal diversity from the epigenome to the proteome requires an even more complex compilation of data. In this era of high-throughput genomics though, the ability to isolate and profile neurons and brain tissue has increased tremendously and now requires less effort. Both microarrays and next-generation sequencing have identified neuronal transcriptomes and signaling networks involved in normal brain development, as well as in disease. However, the expertise needed to organize and prioritize the resultant data remains substantial. A combination of supervised organization and unsupervised analyses are needed to fully appreciate the underlying structure in these datasets. When utilized effectively, these analyses have yielded striking insights into a number of fundamental questions in neuroscience on topics ranging from the evolution of the human brain to neuropsychiatric and neurodegenerative disorders. Future studies will incorporate these analyses with behavioral and physiological data from patients to more efficiently move toward personalized therapeutics.

  17. α-amanitin resistance in Drosophila melanogaster: A genome-wide association approach

    Science.gov (United States)

    Mitchell, Chelsea L.; Latuszek, Catrina E.; Vogel, Kara R.; Greenlund, Ian M.; Hobmeier, Rebecca E.; Ingram, Olivia K.; Dufek, Shannon R.; Pecore, Jared L.; Nip, Felicia R.; Johnson, Zachary J.; Ji, Xiaohui; Wei, Hairong; Gailing, Oliver

    2017-01-01

    We investigated the mechanisms of mushroom toxin resistance in the Drosophila Genetic Reference Panel (DGRP) fly lines, using genome-wide association studies (GWAS). While Drosophila melanogaster avoids mushrooms in nature, some lines are surprisingly resistant to α-amanitin—a toxin found solely in mushrooms. This resistance may represent a pre-adaptation, which might enable this species to invade the mushroom niche in the future. Although our previous microarray study had strongly suggested that pesticide-metabolizing detoxification genes confer α-amanitin resistance in a Taiwanese D. melanogaster line Ama-KTT, none of the traditional detoxification genes were among the top candidate genes resulting from the GWAS in the current study. Instead, we identified Megalin, Tequila, and widerborst as candidate genes underlying the α-amanitin resistance phenotype in the North American DGRP lines, all three of which are connected to the Target of Rapamycin (TOR) pathway. Both widerborst and Tequila are upstream regulators of TOR, and TOR is a key regulator of autophagy and Megalin-mediated endocytosis. We suggest that endocytosis and autophagy of α-amanitin, followed by lysosomal degradation of the toxin, is one of the mechanisms that confer α-amanitin resistance in the DGRP lines. PMID:28241077

  18. Hepatitis B virus infection in Latin America: a genomic medicine approach.

    Science.gov (United States)

    Roman, Sonia; Jose-Abrego, Alexis; Fierro, Nora Alma; Escobedo-Melendez, Griselda; Ojeda-Granados, Claudia; Martinez-Lopez, Erika; Panduro, Arturo

    2014-06-21

    Hepatitis B virus (HBV) infection is the leading cause of severe chronic liver disease. This article provides a critical view of the importance of genomic medicine for the study of HBV infection and its clinical outcomes in Latin America. Three levels of evolutionary adaptation may correlate with the clinical outcomes of HBV infection. Infections in Latin America are predominantly of genotype H in Mexico and genotype F in Central and South America; these strains have historically circulated among the indigenous population. Both genotypes appear to be linked to a benign course of disease among the native and mestizo Mexicans and native South Americans. In contrast, genotypes F, A and D are common in acute and chronic infections among mestizos with Caucasian ancestry. Hepatocellular carcinoma is rare in Mexicans, but it has been associated with genotype F1b among Argentineans. This observation illustrates the significance of ascertaining the genetic and environmental factors involved in the development of HBV-related liver disease in Latin America, which contrast with those reported in other regions of the world.

  19. Phenotypic consequences of polyploidy and genome size at the microevolutionary scale: a multivariate morphological approach.

    Science.gov (United States)

    Balao, Francisco; Herrera, Javier; Talavera, Salvador

    2011-10-01

    • Chromosomal duplications and increases in DNA amount have the potential to alter quantitative plant traits like flower number, plant stature or stomata size. This has been documented often across species, but information on whether such effects also occur within species (i.e. at the microevolutionary or population scale) is scarce. • We studied trait covariation associated with polyploidy and genome size (both monoploid and total) in 22 populations of Dianthus broteri s.l., a perennial herb with several cytotypes (2x, 4x, 6x and 12x) that do not coexist spatially. Principal component scores of organ size/number variations were assessed as correlates of polyploidy, and phylogenetic relatedness among populations was controlled using phylogenetic generalized least squares. • Polyploidy covaried with organ dimensions, causing multivariate characters to increase, remain unchanged, or decrease with DNA amount. Variations in monoploid DNA amount had detectable consequences on some phenotypic traits. According to the analyses, some traits would experience phenotypic selection, while others would not. • We show that polyploidy contributes to decouple variation among traits in D. broteri, and hypothesize that polyploids may experience an evolutionary advantage in this plant lineage, for example, if it helps to overcome the constraints imposed by trait integration.

  20. Gametic phase estimation over large genomic regions using an adaptive window approach

    Directory of Open Access Journals (Sweden)

    Excoffier Laurent

    2003-11-01

    Full Text Available Abstract The authors present ELB, an easy to programme and computationally fast algorithm for inferring gametic phase in population samples of multilocus genotypes. Phase updates are made on the basis of a window of neighbouring loci, and the window size varies according to the local level of linkage disequilibrium. Thus, ELB is particularly well suited to problems involving many loci and/or relatively large genomic regions, including those with variable recombination rate. The authors have simulated population samples of single nucleotide polymorphism genotypes with varying levels of recombination and marker density, and find that ELB provides better local estimation of gametic phase than the PHASE or HTYPER programs, while its global accuracy is broadly similar. The relative improvement in local accuracy increases both with increasing recombination and with increasing marker density. Short tandem repeat (STR, or microsatellite simulation studies demonstrate ELB's superiority over PHASE both globally and locally. Missing data are handled by ELB; simulations show that phase recovery is virtually unaffected by up to 2 per cent of missing data, but that phase estimation is noticeably impaired beyond this amount. The authors also applied ELB to datasets obtained from random pairings of 42 human X chromosomes typed at 97 diallelic markers in a 200 kb low-recombination region. Once again, they found ELB to have consistently better local accuracy than PHASE or HTYPER, while its global accuracy was close to the best.

  1. Host influence in the genomic composition of flaviviruses: A multivariate approach.

    Science.gov (United States)

    Simón, Diego; Fajardo, Alvaro; Sóñora, Martín; Delfraro, Adriana; Musto, Héctor

    2017-10-28

    Flaviviruses present substantial differences in their host range and transmissibility. We studied the evolution of base composition, dinucleotide biases, codon usage and amino acid frequencies in the genus Flavivirus within a phylogenetic framework by principal components analysis. There is a mutual interplay between the evolutionary history of flaviviruses and their respective vectors and/or hosts. Hosts associated to distinct phylogenetic groups may be driving flaviviruses at different pace and through various sequence landscapes, as can be seen for viruses associated with Aedes or Culex spp., although phylogenetic inertia cannot be ruled out. In some cases, viruses face even opposite forces. For instance, in tick-borne flaviviruses, while vertebrate hosts exert pressure to deplete their CpG, tick vectors drive them to exhibit GC-rich codons. Within a vertebrate environment, natural selection appears to be acting on the viral genome to overcome the immune system. On the other side, within an arthropod environment, mutational biases seem to be the dominant forces. Copyright © 2017 Elsevier Inc. All rights reserved.

  2. A genomic approach to study Down syndrome and cancer inverse comorbidity: Untangling the Chromosome 21

    Directory of Open Access Journals (Sweden)

    Jaume eForés-Martos

    2015-02-01

    Full Text Available Down syndrome (DS, one of the most common birth defects and the most widespread genetic cause of intellectual disabilities, is caused by extra genetic material on chromosome 21 (HSA21. The increased genomic dosage of trisomy 21 is thought to be responsible for the distinct DS phenotypes, including an increased risk of developing some types of childhood leukemia and germ cell tumors. Patients with DS, however, have a strikingly lower incidence of many other solid tumors. We hypothesized that the third copy of genes located in HSA21 may have an important role on the protective effect that DS patients show against most types of solid tumors. Focusing on Copy Number Variation (CNV array data, we have generated frequencies of deleted regions in HSA21 in four different tumor types from which DS patients have been reported to be protected. We describe three different regions of deletion pointing to a set of candidate genes that could explain the inverse comorbidity phenomenon between DS and solid tumors. In particular we found RCAN1 gene in Wilms tumors and the miR-99A, miR-125B2 and miR-LET7C in lung, breast and melanoma tumors as the main candidates for explaining the inverse comorbidity observed between solid tumors and DS.

  3. α-amanitin resistance in Drosophila melanogaster: A genome-wide association approach.

    Science.gov (United States)

    Mitchell, Chelsea L; Latuszek, Catrina E; Vogel, Kara R; Greenlund, Ian M; Hobmeier, Rebecca E; Ingram, Olivia K; Dufek, Shannon R; Pecore, Jared L; Nip, Felicia R; Johnson, Zachary J; Ji, Xiaohui; Wei, Hairong; Gailing, Oliver; Werner, Thomas

    2017-01-01

    We investigated the mechanisms of mushroom toxin resistance in the Drosophila Genetic Reference Panel (DGRP) fly lines, using genome-wide association studies (GWAS). While Drosophila melanogaster avoids mushrooms in nature, some lines are surprisingly resistant to α-amanitin-a toxin found solely in mushrooms. This resistance may represent a pre-adaptation, which might enable this species to invade the mushroom niche in the future. Although our previous microarray study had strongly suggested that pesticide-metabolizing detoxification genes confer α-amanitin resistance in a Taiwanese D. melanogaster line Ama-KTT, none of the traditional detoxification genes were among the top candidate genes resulting from the GWAS in the current study. Instead, we identified Megalin, Tequila, and widerborst as candidate genes underlying the α-amanitin resistance phenotype in the North American DGRP lines, all three of which are connected to the Target of Rapamycin (TOR) pathway. Both widerborst and Tequila are upstream regulators of TOR, and TOR is a key regulator of autophagy and Megalin-mediated endocytosis. We suggest that endocytosis and autophagy of α-amanitin, followed by lysosomal degradation of the toxin, is one of the mechanisms that confer α-amanitin resistance in the DGRP lines.

  4. An evolutionary and genomic approach to challenges and opportunities for eliminating aging.

    Science.gov (United States)

    Rose, Michael R; Rutledge, Grant A; Phung, Kevin H; Phillips, Mark A; Greer, Lee F; Mueller, Laurence D

    2014-01-01

    While solutions to major scientific and medical problems are never perfect or complete, it is still reasonable to delineate cases where both have been essentially solved. For example, Darwin's theory of natural selection provides a successful solution to the problem of biological adaptation, while the germ theory of infection solved the scientific problem of contagious disease. Likewise in the context of medicine, we have effectively solved the problem of contagious disease, reducing it to a minor cause of death and disability for almost everyone in countries with advanced medicine and adequate resources. Evolutionary biologists claim to have solved the scientific problem of aging: we explain it theoretically using Hamilton's forces of natural selection; in experimental evolution we readily manipulate the onset, rate, and eventual cessation of aging by manipulating these forces. In this article, we turn to the technological challenge of solving the medical problem of aging. While we feel that the broad outlines of such a solution are clear enough starting from the evolutionary solution to the scientific problem of aging, we do not claim that we can give a complete or exhaustive plan for medically solving the problem of aging. But we are confident that biology and medicine will effectively solve the problem of aging within the next 50 years, providing Hamiltonian lifestyle changes, tissue repair, and genomic technological opportunities are fully exploited in public health practices, in medical practice, and in medical research, respectively.

  5. An integrative genomic and epigenomic approach for the study of transcriptional regulation.

    Directory of Open Access Journals (Sweden)

    Maria E Figueroa

    Full Text Available The molecular heterogeneity of acute leukemias and other tumors constitutes a major obstacle towards understanding disease pathogenesis and developing new targeted-therapies. Aberrant gene regulation is a hallmark of cancer and plays a central role in determining tumor phenotype. We predicted that integration of different genome-wide epigenetic regulatory marks along with gene expression levels would provide greater power in capturing biological differences between leukemia subtypes. Gene expression, cytosine methylation and histone H3 lysine 9 (H3K9 acetylation were measured using high-density oligonucleotide microarrays in primary human acute myeloid leukemia (AML and acute lymphocytic leukemia (ALL specimens. We found that DNA methylation and H3K9 acetylation distinguished these leukemias of distinct cell lineage, as expected, but that an integrative analysis combining the information from each platform revealed hundreds of additional differentially expressed genes that were missed by gene expression arrays alone. This integrated analysis also enhanced the detection and statistical significance of biological pathways dysregulated in AML and ALL. Integrative epigenomic studies are thus feasible using clinical samples and provide superior detection of aberrant transcriptional programming than single-platform microarray studies.

  6. Breeding approaches and genomics technologies to increase crop yield under low-temperature stress.

    Science.gov (United States)

    Jha, Uday Chand; Bohra, Abhishek; Jha, Rintu

    2017-01-01

    Improved knowledge about plant cold stress tolerance offered by modern omics technologies will greatly inform future crop improvement strategies that aim to breed cultivars yielding substantially high under low-temperature conditions. Alarmingly rising temperature extremities present a substantial impediment to the projected target of 70% more food production by 2050. Low-temperature (LT) stress severely constrains crop production worldwide, thereby demanding an urgent yet sustainable solution. Considerable research progress has been achieved on this front. Here, we review the crucial cellular and metabolic alterations in plants that follow LT stress along with the signal transduction and the regulatory network describing the plant cold tolerance. The significance of plant genetic resources to expand the genetic base of breeding programmes with regard to cold tolerance is highlighted. Also, the genetic architecture of cold tolerance trait as elucidated by conventional QTL mapping and genome-wide association mapping is described. Further, global expression profiling techniques including RNA-Seq along with diverse omics platforms are briefly discussed to better understand the underlying mechanism and prioritize the candidate gene (s) for downstream applications. These latest additions to breeders' toolbox hold immense potential to support plant breeding schemes that seek development of LT-tolerant cultivars. High-yielding cultivars endowed with greater cold tolerance are urgently required to sustain the crop yield under conditions severely challenged by low-temperature.

  7. Bioinformatical approaches to RNA structure prediction & Sequencing of an ancient human genome

    DEFF Research Database (Denmark)

    Lindgreen, Stinus

    Stinus Lindgreen has been working in two different fields during his Ph.D. The first part has been focused on computational approaches to predict the structure of non-coding RNA molecules at the base pairing level. This has resulted in the analysis of various measures of the base pairing potentia...

  8. Approaching the Three-Dimensional Organization and Dynamics of the Human Genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2002-01-01

    textabstractTo approach the still largely unknown sequential and three-dimensional organization of the human cell nucleus, the structural-, scaling- and dynamic properties of interphase chromosomes and cell nuclei were simulated on the 30nm chromatin fiber level with Monte Carlo,

  9. Approaching the Three-Dimensional Organization and Dynamics of the Human Genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2002-01-01

    textabstractTo approach the still largely unknown sequential and three-dimensional organization of the human cell nucleus, the structural-, scaling- and dynamic properties of interphase chromosomes and cell nuclei were simulated on the 30nm chromatin fiber level with Monte Carlo, Brownian Dyna

  10. Approaching the Three-Dimensional Organization and Dynamics of the Human Genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2003-01-01

    textabstractTo approach the three-dimensional organization of the human cell nucleus, the structural-, scaling- and dynamic properties of interphase chromosomes and cell nuclei were simulated with Monte Carlo and Brownian Dynamics methods. The 30 nm chromatin fiber was folded according to the M

  11. Approaching the Three-Dimensional Organization and Dynamics of the Human Genome

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2002-01-01

    textabstractTo approach by virtual microscopy the three-dimensional organization of the human cell nucleus, the structural-, scaling- and dynamic properties of interphase chromosomes and cell nuclei were simulated with Monte Carlo and Brownian Dynamics methods. The 30 nm chromatin fiber was fold

  12. A semantic web approach applied to integrative bioinformatics experimentation: a biological use case with genomics data.

    NARCIS (Netherlands)

    Post, L.J.G.; Roos, M.; Marshall, M.S.; van Driel, R.; Breit, T.M.

    2007-01-01

    The numerous public data resources make integrative bioinformatics experimentation increasingly important in life sciences research. However, it is severely hampered by the way the data and information are made available. The semantic web approach enhances data exchange and integration by providing

  13. A genomic and transcriptomic approach to investigate the blue pigment phenotype in Pseudomonas fluorescens.

    Science.gov (United States)

    Andreani, Nadia Andrea; Carraro, Lisa; Martino, Maria Elena; Fondi, Marco; Fasolato, Luca; Miotto, Giovanni; Magro, Massimiliano; Vianello, Fabio; Cardazzo, Barbara

    2015-11-20

    Pseudomonas fluorescens is a well-known food spoiler, able to cause serious economic losses in the food industry due to its ability to produce many extracellular, and often thermostable, compounds. The most outstanding spoilage events involving P. fluorescens were blue discoloration of several food stuffs, mainly dairy products. The bacteria involved in such high-profile cases have been identified as belonging to a clearly distinct phylogenetic cluster of the P. fluorescens group. Although the blue pigment has recently been investigated in several studies, the biosynthetic pathway leading to the pigment formation, as well as its chemical nature, remain challenging and unsolved points. In the present paper, genomic and transcriptomic data of 4 P. fluorescens strains (2 blue-pigmenting strains and 2 non-pigmenting strains) were analyzed to evaluate the presence and the expression of blue strain-specific genes. In particular, the pangenome analysis showed the presence in the blue-pigmenting strains of two copies of genes involved in the tryptophan biosynthesis pathway (including trpABCDF). The global expression profiling of blue-pigmenting strains versus non-pigmenting strains showed a general up-regulation of genes involved in iron uptake and a down-regulation of genes involved in primary metabolism. Chromogenic reaction of the blue-pigmenting bacterial cells with Kovac's reagent indicated an indole-derivative as the precursor of the blue pigment. Finally, solubility tests and MALDI-TOF mass spectrometry analysis of the isolated pigment suggested that its molecular structure is very probably a hydrophobic indigo analog.

  14. Transcriptional control in embryonic Drosophila midline guidance assessed through a whole genome approach

    Directory of Open Access Journals (Sweden)

    Tomancak Pavel

    2007-07-01

    Full Text Available Abstract Background During the development of the Drosophila central nervous system the process of midline crossing is orchestrated by a number of guidance receptors and ligands. Many key axon guidance molecules have been identified in both invertebrates and vertebrates, but the transcriptional regulation of growth cone guidance remains largely unknown. It is established that translational regulation plays a role in midline crossing, and there are indications that transcriptional regulation is also involved. To investigate this issue, we conducted a genome-wide study of transcription in Drosophila embryos using wild type and a number of well-characterized Drosophila guidance mutants and transgenics. We also analyzed a previously published microarray time course of Drosophila embryonic development with an axon guidance focus. Results Using hopach, a novel clustering method which is well suited to microarray data analysis, we identified groups of genes with similar expression patterns across guidance mutants and transgenics. We then systematically characterized the resulting clusters with respect to their relevance to axon guidance using two complementary controlled vocabularies: the Gene Ontology (GO and anatomical annotations of the Atlas of Pattern of Gene Expression (APoGE in situ hybridization database. The analysis indicates that regulation of gene expression does play a role in the process of axon guidance in Drosophila. We also find a strong link between axon guidance and hemocyte migration, a result that agrees with mounting evidence that axon guidance molecules are co-opted in vertebrate vascularization. Cell cyclin activity in the context of axon guidance is also suggested from our array data. RNA and protein expression patterns of cell cyclins in axon guidance mutants and transgenics support this possible link. Conclusion This study provides important insights into the regulation of axon guidance in vivo.

  15. Is gene activity in plant cells affected by UMTS-irradiation? A whole genome approach

    Directory of Open Access Journals (Sweden)

    Julia C Engelmann

    2008-10-01

    Full Text Available Julia C Engelmann3,* Rosalia Deeken1,* Tobias Müller3, Günter Nimtz2, M Rob G Roelfsema1, Rainer Hedrich11Molecular Plant Physiology and Biophysics, Julius-von-Sachs Institute for Biosciences; 2Institute of Physics II, University of Cologne, Cologne, Germany; 3Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany; *These authors contributed equally to this workAbstract: Mobile phone technology makes use of radio frequency (RF electromagnetic fields transmitted through a dense network of base stations in Europe. Possible harmful effects of RF fields on humans and animals are discussed, but their effect on plants has received little attention. In search for physiological processes of plant cells sensitive to RF fields, cell suspension cultures of Arabidopsis thaliana were exposed for 24 h to a RF field protocol representing typical microwave exposition in an urban environment. mRNA of exposed cultures and controls was used to hybridize Affymetrix-ATH1 whole genome microarrays. Differential expression analysis revealed significant changes in transcription of 10 genes, but they did not exceed a fold change of 2.5. Besides that 3 of them are dark-inducible, their functions do not point to any known responses of plants to environmental stimuli. The changes in transcription of these genes were compared with published microarray datasets and revealed a weak similarity of the microwave to light treatment experiments. Considering the large changes described in published experiments, it is questionable if the small alterations caused by a 24 h continuous microwave exposure would have any impact on the growth and reproduction of whole plants.Keywords: suspension cultured plant cells, radio frequency electromagnetic fields, microarrays, Arabidopsis thaliana

  16. A Novel Framework for Short Tandem Repeats (STRs Using Parallel String Matching

    Directory of Open Access Journals (Sweden)

    D. Bala MuraliKrishna,

    2015-09-01

    Full Text Available Short tandem repeats (STRs have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both Plants and bacteria. Most of the computer programs that find STRs failed to report its number of occurrences of the repeated pattern, exact position and it is difficult task to obtain accurate results from the larger datasets. So we need high performance computing models to extract certain repeats. One of the solution is STRs using parallel string matching, it gives number of occurrences with corresponding line number and exact location or position of each STR in the genome of any length. In this, we implemented parallel string matching using JAVA Multithreading with multi core processing, for this we implemented a basic algorithm and made a comparison with previous algorithms like Knuth Morris Pratt, Boyer Moore and Brute force string matching algorithms and from the results our new basic algorithm gives better results than the previous algorithms. We apply this algorithm in parallel string matching using multi-threading concept to reduce the time by running on multicore processors. From the test results it is shown that the multicore processing is a remarkably efficient and powerful compared to lower versions and finally this proposed STR using parallel string matching algorithm is better than the sequential approaches.

  17. DNA Microarray as Part of a Genomic-Assisted Breeding Approach

    DEFF Research Database (Denmark)

    Vincze, Éva; Bowra, Steve

    2010-01-01

    . tissue/pathway specific approaches using an example of focused microarray and how it follows predicted changes during grain development. We describe of an extension of the study to field grown material and conclude that such an approach is able to interpret differences in the gene expression profiles......In the struggle to achieve global food security, crop breeding retains an important role in crop production. A current trend is the diversification of the aims of crop production, to include an increased awareness of aspects and consequences of food quality. The added emphasis on food and feed...... and practical significances, fold changes, validation and possible additional regulatory mechanisms in gene expression. The subject of the fourth section is the applications of DNA microarrays to study of global gene expression during grain filling in monocot crops, especially barley. We compare large arrays vs...

  18. 基于曲率特征的自主车辆地图匹配定位方法%A Novel Localization Approach for Autonomous Vehicles Based on Map Matching with Curvature Features

    Institute of Scientific and Technical Information of China (English)

    苏奎峰; 邓志东; 黄振

    2012-01-01

    提出了一种新的基于曲率特征的自主车辆地图匹配定位方法,该方法通过计算自主车辆行驶轨迹和参考轨迹的尺度不变曲率积分特征及其相关性进行匹配,可以有效地消除因航迹推算(DR)传感器标定参数偏差和航向角估计偏差而引起的错误匹配问题.文中首先采用扩展卡尔曼滤波器融合惯性测量单元输出、方向盘转角和4个ABS(防抱死刹车系统)传感器测量的轮速,估计自主车辆的位姿状态,并据此从数字地图中选择匹配的候选路段.然后利用本文提出的曲率空间特征地图匹配算法实现路段匹配,并根据曲率和航向角变化确定匹配点,最后将其作为无迹卡尔曼滤波器的观测值更新滤波器,从而实现高精度的位姿估计.现场道路实验结果表明,该法能够有效地实现地图匹配,降低自主车辆DR产生的累积误差,从而能够在GPS(全球定位系统)信号失效情况下实现长距离精确定位.%Using the curvature features, a novel map-matching based localization approach for autonomous vehicles is proposed. By computing the scale-invariant curvature integral and its correlation of autonomous vehicle's historical and reference trajectories for matching, the proposed approach can effectively eliminate the mismatch problem caused by odometer calibration parameters bias and azimuth estimation errors in dead-reckoning (DR). Firstly, we integrate the inertial measurement unit output, steering angles, and wheel speed measurements from four ABS (anti-lock braking system) sensors by using the extended Kalman filter in order to estimate the autonomous vehicle's position and orientation, which are then used to select the candidate matching segments from digital maps. Then, a map matching algorithm based on spatial curvature features is proposed to accomplish segment matching, and matching points are determined according to the changes in curvature and yaw. Finally, these matching points

  19. A metadata approach for clinical data management in translational genomics studies in breast cancer

    Directory of Open Access Journals (Sweden)

    Davies Jim

    2009-11-01

    Full Text Available Abstract Background In molecular profiling studies of cancer patients, experimental and clinical data are combined in order to understand the clinical heterogeneity of the disease: clinical information for each subject needs to be linked to tumour samples, macromolecules extracted, and experimental results. This may involve the integration of clinical data sets from several different sources: these data sets may employ different data definitions and some may be incomplete. Methods In this work we employ semantic web techniques developed within the CancerGrid project, in particular the use of metadata elements and logic-based inference to annotate heterogeneous clinical information, integrate and query it. Results We show how this integration can be achieved automatically, following the declaration of appropriate metadata elements for each clinical data set; we demonstrate the practicality of this approach through application to experimental results and clinical data from five hospitals in the UK and Canada, undertaken as part of the MET