global sequence information: Topics by WorldWideScience.org

Sample records for global sequence information

Accessible surface area of proteins from purely sequence information and the importance of global features

Science.gov (United States)

Faraggi, Eshel; Zhou, Yaoqi; Kloczkowski, Andrzej

2014-03-01

We present a new approach for predicting the accessible surface area of proteins. The novelty of this approach lies in not using residue mutation profiles generated by multiple sequence alignments as descriptive inputs. Rather, sequential window information and the global monomer and dimer compositions of the chain are used. We find that much of the lost accuracy due to the elimination of evolutionary information is recouped by the use of global features. Furthermore, this new predictor produces similar results for proteins with or without sequence homologs deposited in the Protein Data Bank, and hence shows generalizability. Finally, these predictions are obtained in a small fraction (1/1000) of the time required to run mutation profile based prediction. All these factors indicate the possible usability of this work in de-novo protein structure prediction and in de-novo protein design using iterative searches. Funded in part by the financial support of the National Institutes of Health through Grants R01GM072014 and R01GM073095, and the National Science Foundation through Grant NSF MCB 1071785.
Inter-laboratory evaluation of the EUROFORGEN Global ancestry-informative SNP panel by massively parallel sequencing using the Ion PGM™.

Science.gov (United States)

Eduardoff, M; Gross, T E; Santos, C; de la Puente, M; Ballard, D; Strobl, C; Børsting, C; Morling, N; Fusco, L; Hussing, C; Egyed, B; Souto, L; Uacyisrael, J; Syndercombe Court, D; Carracedo, Á; Lareu, M V; Schneider, P M; Parson, W; Phillips, C; Parson, W; Phillips, C

2016-07-01

The EUROFORGEN Global ancestry-informative SNP (AIM-SNPs) panel is a forensic multiplex of 128 markers designed to differentiate an individual's ancestry from amongst the five continental population groups of Africa, Europe, East Asia, Native America, and Oceania. A custom multiplex of AmpliSeq™ PCR primers was designed for the Global AIM-SNPs to perform massively parallel sequencing using the Ion PGM™ system. This study assessed individual SNP genotyping precision using the Ion PGM™, the forensic sensitivity of the multiplex using dilution series, degraded DNA plus simple mixtures, and the ancestry differentiation power of the final panel design, which required substitution of three original ancestry-informative SNPs with alternatives. Fourteen populations that had not been previously analyzed were genotyped using the custom multiplex and these studies allowed assessment of genotyping performance by comparison of data across five laboratories. Results indicate a low level of genotyping error can still occur from sequence misalignment caused by homopolymeric tracts close to the target SNP, despite careful scrutiny of candidate SNPs at the design stage. Such sequence misalignment required the exclusion of component SNP rs2080161 from the Global AIM-SNPs panel. However, the overall genotyping precision and sensitivity of this custom multiplex indicates the Ion PGM™ assay for the Global AIM-SNPs is highly suitable for forensic ancestry analysis with massively parallel sequencing. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments.

Science.gov (United States)

Daily, Jeff

2016-02-10

Sequence alignment algorithms are a key component of many bioinformatics applications. Though various fast Smith-Waterman local sequence alignment implementations have been developed for x86 CPUs, most are embedded into larger database search tools. In addition, fast implementations of Needleman-Wunsch global sequence alignment and its semi-global variants are not as widespread. This article presents the first software library for local, global, and semi-global pairwise intra-sequence alignments and improves the performance of previous intra-sequence implementations. A faster intra-sequence local pairwise alignment implementation is described and benchmarked, including new global and semi-global variants. Using a 375 residue query sequence a speed of 136 billion cell updates per second (GCUPS) was achieved on a dual Intel Xeon E5-2670 24-core processor system, the highest reported for an implementation based on Farrar's 'striped' approach. Rognes's SWIPE optimal database search application is still generally the fastest available at 1.2 to at best 2.4 times faster than Parasail for sequences shorter than 500 amino acids. However, Parasail was faster for longer sequences. For global alignments, Parasail's prefix scan implementation is generally the fastest, faster even than Farrar's 'striped' approach, however the opal library is faster for single-threaded applications. The software library is designed for 64 bit Linux, OS X, or Windows on processors with SSE2, SSE41, or AVX2. Source code is available from https://github.com/jeffdaily/parasail under the Battelle BSD-style license. Applications that require optimal alignment scores could benefit from the improved performance. For the first time, SIMD global, semi-global, and local alignments are available in a stand-alone C library.
Protein Function Prediction Based on Sequence and Structure Information

KAUST Repository

Smaili, Fatima Z.

2016-05-25

The number of available protein sequences in public databases is increasing exponentially. However, a significant fraction of these sequences lack functional annotation which is essential to our understanding of how biological systems and processes operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching these predicted models, using global and local similarities, through three independent enzyme commission (EC) and gene ontology (GO) function libraries. The method was tested on 250 “hard” proteins, which lack homologous templates in both structure and function libraries. The results show that this method outperforms the conventional prediction methods based on sequence similarity or threading. Additionally, our method could be improved even further by incorporating protein-protein interaction information. Overall, the method we use provides an efficient approach for automated functional annotation of non-homologous proteins, starting from their sequence.
Integrating genome-based informatics to modernize global disease monitoring, information sharing, and response

DEFF Research Database (Denmark)

Aarestrup, Frank Møller; Brown, Eric W; Detter, Chris

2012-01-01

The rapid advancement of genome technologies holds great promise for improving the quality and speed of clinical and public health laboratory investigations and for decreasing their cost. The latest generation of genome DNA sequencers can provide highly detailed and robust information on disease...... typing methods to provide point-of-care clinical diagnosis and other essential information for quicker and better treatment of patients. Provided there is free-sharing of information by all clinical and public health laboratories, these genomic tools could spawn a global system of linked databases......-causing microbes, and in the near future these technologies will be suitable for routine use in national, regional, and global public health laboratories. With additional improvements in instrumentation, these next- or third-generation sequencers are likely to replace conventional culture-based and molecular...
Statistical distributions of optimal global alignment scores of random protein sequences

Directory of Open Access Journals (Sweden)

Tang Jiaowei

2005-10-01

Full Text Available Abstract Background The inference of homology from statistically significant sequence similarity is a central issue in sequence alignments. So far the statistical distribution function underlying the optimal global alignments has not been completely determined. Results In this study, random and real but unrelated sequences prepared in six different ways were selected as reference datasets to obtain their respective statistical distributions of global alignment scores. All alignments were carried out with the Needleman-Wunsch algorithm and optimal scores were fitted to the Gumbel, normal and gamma distributions respectively. The three-parameter gamma distribution performs the best as the theoretical distribution function of global alignment scores, as it agrees perfectly well with the distribution of alignment scores. The normal distribution also agrees well with the score distribution frequencies when the shape parameter of the gamma distribution is sufficiently large, for this is the scenario when the normal distribution can be viewed as an approximation of the gamma distribution. Conclusion We have shown that the optimal global alignment scores of random protein sequences fit the three-parameter gamma distribution function. This would be useful for the inference of homology between sequences whose relationship is unknown, through the evaluation of gamma distribution significance between sequences.
Inter-laboratory evaluation of the EUROFORGEN Global ancestry-informative SNP panel by massively parallel sequencing using the Ion PGM™

DEFF Research Database (Denmark)

Eduardoff, M; Gross, T E; Santos, C

2016-01-01

Seq™ PCR primers was designed for the Global AIM-SNPs to perform massively parallel sequencing using the Ion PGM™ system. This study assessed individual SNP genotyping precision using the Ion PGM™, the forensic sensitivity of the multiplex using dilution series, degraded DNA plus simple mixtures...
Sequencing Information Management System (SIMS). Final report

Energy Technology Data Exchange (ETDEWEB)

Fields, C.

1996-02-15

A feasibility study to develop a requirements analysis and functional specification for a data management system for large-scale DNA sequencing laboratories resulted in a functional specification for a Sequencing Information Management System (SIMS). This document reports the results of this feasibility study, and includes a functional specification for a SIMS relational schema. The SIMS is an integrated information management system that supports data acquisition, management, analysis, and distribution for DNA sequencing laboratories. The SIMS provides ad hoc query access to information on the sequencing process and its results, and partially automates the transfer of data between laboratory instruments, analysis programs, technical personnel, and managers. The SIMS user interfaces are designed for use by laboratory technicians, laboratory managers, and scientists. The SIMS is designed to run in a heterogeneous, multiplatform environment in a client/server mode. The SIMS communicates with external computational and data resources via the internet.
INFORMATION AND KNOWLEDGE IN A GLOBAL CONTEXT

Directory of Open Access Journals (Sweden)

Florina BRAN

2015-12-01

Full Text Available Information and knowledge are two important entities, which make up present stage of globalization, based mostly on their dynamics. This paper is providing an overview of information and knowledge in global context, highlighting the importance of information society that turned into knowledge society in the beginning of the 21 century, being driven by Internet – the latter, as part of globalization process. Modern economic theories recognise the importance of information in economic process because its impact on globalization process in economy was essential, and change the way how markets and companies work and represent the key factor of new era of economic development. This paper presents main results from available literature about the relationship between information, knowledge and economic theory in a global conterxt and finally explained the benefits of the knowledge economy to all countries.
Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications

DEFF Research Database (Denmark)

Yilmaz, Pelin; Kottmann, Renzo; Field, Dawn

2011-01-01

Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences--the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The 'environment...
Information Security Management in Context of Globalization

OpenAIRE

Wawak, Slawomir

2012-01-01

Modern information technologies are the engine of globalization. At the same time, the global market influences the way of looking at information security. Information security thus becomes an increasingly important field. The article discuses the results of research on information security management systems in public administration in Poland.
PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities

Directory of Open Access Journals (Sweden)

Baldwin Stephen A

2011-03-01

Full Text Available Abstract Background Facilities that provide a service for DNA sequencing typically support large numbers of users and experiment types. The cost of services is often reduced by the use of liquid handling robots but the efficiency of such facilities is hampered because the software for such robots does not usually integrate well with the systems that run the sequencing machines. Accordingly, there is a need for software systems capable of integrating different robotic systems and managing sample information for DNA sequencing services. In this paper, we describe an extension to the Protein Information Management System (PIMS that is designed for DNA sequencing facilities. The new version of PIMS has a user-friendly web interface and integrates all aspects of the sequencing process, including sample submission, handling and tracking, together with capture and management of the data. Results The PIMS sequencing extension has been in production since July 2009 at the University of Leeds DNA Sequencing Facility. It has completely replaced manual data handling and simplified the tasks of data management and user communication. Samples from 45 groups have been processed with an average throughput of 10000 samples per month. The current version of the PIMS sequencing extension works with Applied Biosystems 3130XL 96-well plate sequencer and MWG 4204 or Aviso Theonyx liquid handling robots, but is readily adaptable for use with other combinations of robots. Conclusions PIMS has been extended to provide a user-friendly and integrated data management solution for DNA sequencing facilities that is accessed through a normal web browser and allows simultaneous access by multiple users as well as facility managers. The system integrates sequencing and liquid handling robots, manages the data flow, and provides remote access to the sequencing results. The software is freely available, for academic users, from http://www.pims-lims.org/.
PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities.

Science.gov (United States)

Troshin, Peter V; Postis, Vincent Lg; Ashworth, Denise; Baldwin, Stephen A; McPherson, Michael J; Barton, Geoffrey J

2011-03-07

Facilities that provide a service for DNA sequencing typically support large numbers of users and experiment types. The cost of services is often reduced by the use of liquid handling robots but the efficiency of such facilities is hampered because the software for such robots does not usually integrate well with the systems that run the sequencing machines. Accordingly, there is a need for software systems capable of integrating different robotic systems and managing sample information for DNA sequencing services. In this paper, we describe an extension to the Protein Information Management System (PIMS) that is designed for DNA sequencing facilities. The new version of PIMS has a user-friendly web interface and integrates all aspects of the sequencing process, including sample submission, handling and tracking, together with capture and management of the data. The PIMS sequencing extension has been in production since July 2009 at the University of Leeds DNA Sequencing Facility. It has completely replaced manual data handling and simplified the tasks of data management and user communication. Samples from 45 groups have been processed with an average throughput of 10000 samples per month. The current version of the PIMS sequencing extension works with Applied Biosystems 3130XL 96-well plate sequencer and MWG 4204 or Aviso Theonyx liquid handling robots, but is readily adaptable for use with other combinations of robots. PIMS has been extended to provide a user-friendly and integrated data management solution for DNA sequencing facilities that is accessed through a normal web browser and allows simultaneous access by multiple users as well as facility managers. The system integrates sequencing and liquid handling robots, manages the data flow, and provides remote access to the sequencing results. The software is freely available, for academic users, from http://www.pims-lims.org/.
The Global Drought Information System - A Decision Support Tool with Global Applications

Science.gov (United States)

Arndt, D. S.; Brewer, M.; Heim, R. R., Jr.

2014-12-01

Drought is a natural hazard which can cause famine in developing countries and severe economic hardship in developed countries. Given current concerns with the increasing frequency and magnitude of droughts in many regions of the world, especially in the light of expected climate change, drought monitoring and dissemination of early warning information in a timely fashion on a global scale is a critical concern as an important adaptation and mitigation strategy. While a number of nations, and a few continental-scale activities have developed drought information system activities, a global drought early warning system (GDEWS) remains elusive, despite the benefits highlighted by ministers to the Global Earth Observation System of System in 2008. In an effort to begin a process of drought monitoring with international collaboration, the National Integrated Drought Information System's (NIDIS) U.S. Drought Portal, a web-based information system created to address drought services and early warning in the United States, including drought monitoring, forecasting, impacts, mitigation, research, and education, volunteered to develop a prototype Global Drought Monitoring Portal (GDMP). Through integration of data and information at the global level, and with four continental-level partners, the GDMP has proven successful as a tool to monitor drought around the globe. At a past meeting between NIDIS, the World Meteorological Organization, and the Global Earth Observation System of Systems, it was recommended that the GDMP form the basis for a Global Drought Information System (GDIS). Currently, GDIS activities are focused around providing operational global drought monitoring products and assessments, incorporating additional drought monitoring information, especially from those areas without regional or continental-scale input, and incorporating drought-specific climate forecast information from the World Climate Research Programme. Additional GDIS pilot activities are
Information decomposition method to analyze symbolical sequences

International Nuclear Information System (INIS)

Korotkov, E.V.; Korotkova, M.A.; Kudryashov, N.A.

2003-01-01

The information decomposition (ID) method to analyze symbolical sequences is presented. This method allows us to reveal a latent periodicity of any symbolical sequence. The ID method is shown to have advantages in comparison with application of the Fourier transformation, the wavelet transform and the dynamic programming method to look for latent periodicity. Examples of the latent periods for poetic texts, DNA sequences and amino acids are presented. Possible origin of a latent periodicity for different symbolical sequences is discussed
Information technology and global change science

Energy Technology Data Exchange (ETDEWEB)

Baxter, F.P.

1990-01-01

The goal of this paper is to identify and briefly describe major existing and near term information technologies that cold have a positive impact on the topics being discussed at this conference by helping to manage the data of global change science and helping global change scientists conduct their research. Desktop computer systems have changed dramatically during the past seven years. Faster data processing can be expected in the future through full development of traditional serial computer architectures. Some other proven information technologies may be currently underutilized by global change scientists. Relational database management systems and good organization of data through the use of thoughtful database design would enable the scientific community to better share and maintain quality research data. Custodians of the data should use rigorous data administration to ensure integrity and long term value of the data resource. Still other emerging information technologies that involve the use of artificial intelligence, parallel computer architectures, and new sensors for data collection will be in relatively common use in the near term and should become part of the global science community's technical toolkit. Consideration should also be given to the establishment of Information Analysis Centers to facilitate effective organization and management of interdisciplinary data and the prototype testing and use of advanced information technology to facilitate rapid and cost-effective integration of these tools into global change science. 8 refs.
Inferences about the global scenario of human T-cell lymphotropic virus type 1 infection using data mining of viral sequences

Directory of Open Access Journals (Sweden)

Thessika Hialla Almeida Araujo

2014-07-01

Full Text Available Human T-cell lymphotropic virus type 1 (HTLV-1 is mainly associated with two diseases: tropical spastic paraparesis/HTLV-1-associated myelopathy (TSP/HAM and adult T-cell leukaemia/lymphoma. This retrovirus infects five-10 million individuals throughout the world. Previously, we developed a database that annotates sequence data from GenBank and the present study aimed to describe the clinical, molecular and epidemiological scenarios of HTLV-1 infection through the stored sequences in this database. A total of 2,545 registered complete and partial sequences of HTLV-1 were collected and 1,967 (77.3% of those sequences represented unique isolates. Among these isolates, 93% contained geographic origin information and only 39% were related to any clinical status. A total of 1,091 sequences contained information about the geographic origin and viral subtype and 93% of these sequences were identified as subtype “a”. Ethnicity data are very scarce. Regarding clinical status data, 29% of the sequences were generated from TSP/HAM and 67.8% from healthy carrier individuals. Although the data mining enabled some inferences about specific aspects of HTLV-1 infection to be made, due to the relative scarcity of data of available sequences, it was not possible to delineate a global scenario of HTLV-1 infection.
Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

Science.gov (United States)

Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

2017-07-01

DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.
Global repeat discovery and estimation of genomic copy number in a large, complex genome using a high-throughput 454 sequence survey

Directory of Open Access Journals (Sweden)

Varala Kranthi

2007-05-01

Full Text Available Abstract Background Extensive computational and database tools are available to mine genomic and genetic databases for model organisms, but little genomic data is available for many species of ecological or agricultural significance, especially those with large genomes. Genome surveys using conventional sequencing techniques are powerful, particularly for detecting sequences present in many copies per genome. However these methods are time-consuming and have potential drawbacks. High throughput 454 sequencing provides an alternative method by which much information can be gained quickly and cheaply from high-coverage surveys of genomic DNA. Results We sequenced 78 million base-pairs of randomly sheared soybean DNA which passed our quality criteria. Computational analysis of the survey sequences provided global information on the abundant repetitive sequences in soybean. The sequence was used to determine the copy number across regions of large genomic clones or contigs and discover higher-order structures within satellite repeats. We have created an annotated, online database of sequences present in multiple copies in the soybean genome. The low bias of pyrosequencing against repeat sequences is demonstrated by the overall composition of the survey data, which matches well with past estimates of repetitive DNA content obtained by DNA re-association kinetics (Cot analysis. Conclusion This approach provides a potential aid to conventional or shotgun genome assembly, by allowing rapid assessment of copy number in any clone or clone-end sequence. In addition, we show that partial sequencing can provide access to partial protein-coding sequences.
Effects of informed consent for individual genome sequencing on relevant knowledge.

Science.gov (United States)

Kaphingst, K A; Facio, F M; Cheng, M-R; Brooks, S; Eidem, H; Linn, A; Biesecker, B B; Biesecker, L G

2012-11-01

Increasing availability of individual genomic information suggests that patients will need knowledge about genome sequencing to make informed decisions, but prior research is limited. In this study, we examined genome sequencing knowledge before and after informed consent among 311 participants enrolled in the ClinSeq™ sequencing study. An exploratory factor analysis of knowledge items yielded two factors (sequencing limitations knowledge; sequencing benefits knowledge). In multivariable analysis, high pre-consent sequencing limitations knowledge scores were significantly related to education [odds ratio (OR): 8.7, 95% confidence interval (CI): 2.45-31.10 for post-graduate education, and OR: 3.9; 95% CI: 1.05, 14.61 for college degree compared with less than college degree] and race/ethnicity (OR: 2.4, 95% CI: 1.09, 5.38 for non-Hispanic Whites compared with other racial/ethnic groups). Mean values increased significantly between pre- and post-consent for the sequencing limitations knowledge subscale (6.9-7.7, p benefits knowledge subscale (7.0-7.5, p < 0.0001); increase in knowledge did not differ by sociodemographic characteristics. This study highlights gaps in genome sequencing knowledge and underscores the need to target educational efforts toward participants with less education or from minority racial/ethnic groups. The informed consent process improved genome sequencing knowledge. Future studies could examine how genome sequencing knowledge influences informed decision making. © 2012 John Wiley & Sons A/S.

Development of Global Soil Information Facilities

Directory of Open Access Journals (Sweden)

N H Batjes

2013-02-01

Full Text Available ISRIC - World Soil Information has a mandate to serve the international community as custodian of global soil information and to increase awareness and understanding of the role of soils in major global issues. To adapt to the current demand for soil information, ISRIC is updating its enterprise data management system, including procedures for registering acquired data, such as lineage, versioning, quality assessment, and control. Data can be submitted, queried, and analysed using a growing range of web-based services - ultimately aiming at full and open exchange of data, metadata, and products - through the ICSU-accredited World Data Centre for Soils.
A metadata initiative for global information discovery

Science.gov (United States)

Christian, E.

2001-01-01

The Global Information Locator Service (GILS) encompasses a global vision framed by the fundamental values of open societies. Societal values such as a free flow of information impose certain requirements on the society's information infrastructure. These requirements in turn shape the various laws, policies, standards, and technologies that determine the infrastructure design. A particular focus of GILS is the requirement to provide the means for people to discover sources of data and information. Information discovery in the GILS vision is designed to be decentralized yet coherent, and globally comprehensive yet useful for detailed data. This article introduces basic concepts and design issues, with emphasis on the techniques by which GILS supports interoperability. It explains the practical implications of GILS for the common roles of organizations involved in handling information, from content provider through system engineer and intermediary to searcher. The article provides examples of GILS initiatives in various types of communities: bibliographic, geographic, environmental, and government. ?? 2001 Elsevier Science Inc.
Earth science information: Planning for the integration and use of global change information

Science.gov (United States)

Lousma, Jack R.

1992-01-01

Activities and accomplishments of the first six months of the Consortium for International Earth Science Information Network (CIESIN's) 1992 technical program have focused on four main missions: (1) the development and implementation of plans for initiation of the Socioeconomic Data and Applications Center (SEDAC) as part of the EOSDIS Program; (2) the pursuit and development of a broad-based global change information cooperative by providing systems analysis and integration between natural science and social science data bases held by numerous federal agencies and other sources; (3) the fostering of scientific research into the human dimensions of global change and providing integration between natural science and social science data and information; and (4) the serving of CIESIN as a gateway for global change data and information distribution through development of the Global Change Research Information Office and other comprehensive knowledge sharing systems.
Concepts for a global resources information system

Science.gov (United States)

Billingsley, F. C.; Urena, J. L.

1984-01-01

The objective of the Global Resources Information System (GRIS) is to establish an effective and efficient information management system to meet the data access requirements of NASA and NASA-related scientists conducting large-scale, multi-disciplinary, multi-mission scientific investigations. Using standard interfaces and operating guidelines, diverse data systems can be integrated to provide the capabilities to access and process multiple geographically dispersed data sets and to develop the necessary procedures and algorithms to derive global resource information.
Sharing Data to Build a Medical Information Commons: From Bermuda to the Global Alliance.

Science.gov (United States)

Cook-Deegan, Robert; Ankeny, Rachel A; Maxson Jones, Kathryn

2017-08-31

The Human Genome Project modeled its open science ethos on nematode biology, most famously through daily release of DNA sequence data based on the 1996 Bermuda Principles. That open science philosophy persists, but daily, unfettered release of data has had to adapt to constraints occasioned by the use of data from individual people, broader use of data not only by scientists but also by clinicians and individuals, the global reach of genomic applications and diverse national privacy and research ethics laws, and the rising prominence of a diverse commercial genomics sector. The Global Alliance for Genomics and Health was established to enable the data sharing that is essential for making meaning of genomic variation. Data-sharing policies and practices will continue to evolve as researchers, health professionals, and individuals strive to construct a global medical and scientific information commons.
The US Global Change Data and Information Management Program Plan

International Nuclear Information System (INIS)

1992-01-01

The US Global Change Research Program (USGCRP) requires massive quantities of highly diverse data and information to improve our understanding of global change processes. The Committee on Earth and Environmental Sciences (CEES) comprises Federal agencies that need to provide reliable data and information for this purpose from existing programs and archives and from new activities designed to improve upon the data and information. This US Global Change Data and Information Management Program Plan commits the participating Federal agencies to work with each other, with academia, and with the international community to make it as easy as possible for researchers and others to access and use global change data and information. Toward this end, the agencies are organizing a Global Change Data and Information System (GCDIS), which takes advantage of the mission resources and responsibilities of each agency. Sources for global change data and information are national and international agency programs, including those focused on the USGCRP, such as NASA's Earth Observing System [EOS] and other agency global change initiatives and those contributing to the USGCRP from other agency programs not focused on global change. Data and information include raw data from observation systems, value-added data from data assembly activities, and derived data and information from models and other investigations. Additional data and information are identified from appropriate sources including academia and the international community
An accurate and rapid continuous wavelet dynamic time warping algorithm for unbalanced global mapping in nanopore sequencing

KAUST Repository

Han, Renmin; Li, Yu; Wang, Sheng; Gao, Xin

2017-01-01

Long-reads, point-of-care, and PCR-free are the promises brought by nanopore sequencing. Among various steps in nanopore data analysis, the global mapping between the raw electrical current signal sequence and the expected signal sequence from
Image encryption using random sequence generated from generalized information domain

International Nuclear Information System (INIS)

Zhang Xia-Yan; Wu Jie-Hua; Zhang Guo-Ji; Li Xuan; Ren Ya-Zhou

2016-01-01

A novel image encryption method based on the random sequence generated from the generalized information domain and permutation–diffusion architecture is proposed. The random sequence is generated by reconstruction from the generalized information file and discrete trajectory extraction from the data stream. The trajectory address sequence is used to generate a P-box to shuffle the plain image while random sequences are treated as keystreams. A new factor called drift factor is employed to accelerate and enhance the performance of the random sequence generator. An initial value is introduced to make the encryption method an approximately one-time pad. Experimental results show that the random sequences pass the NIST statistical test with a high ratio and extensive analysis demonstrates that the new encryption scheme has superior security. (paper)
Highly accurate fluorogenic DNA sequencing with information theory-based error correction.

Science.gov (United States)

Chen, Zitian; Zhou, Wenxiong; Qiao, Shuo; Kang, Li; Duan, Haifeng; Xie, X Sunney; Huang, Yanyi

2017-12-01

Eliminating errors in next-generation DNA sequencing has proved challenging. Here we present error-correction code (ECC) sequencing, a method to greatly improve sequencing accuracy by combining fluorogenic sequencing-by-synthesis (SBS) with an information theory-based error-correction algorithm. ECC embeds redundancy in sequencing reads by creating three orthogonal degenerate sequences, generated by alternate dual-base reactions. This is similar to encoding and decoding strategies that have proved effective in detecting and correcting errors in information communication and storage. We show that, when combined with a fluorogenic SBS chemistry with raw accuracy of 98.1%, ECC sequencing provides single-end, error-free sequences up to 200 bp. ECC approaches should enable accurate identification of extremely rare genomic variations in various applications in biology and medicine.
MACSIMS : multiple alignment of complete sequences information management system

Directory of Open Access Journals (Sweden)

Plewniak Frédéric

2006-06-01

Full Text Available Abstract Background In the post-genomic era, systems-level studies are being performed that seek to explain complex biological systems by integrating diverse resources from fields such as genomics, proteomics or transcriptomics. New information management systems are now needed for the collection, validation and analysis of the vast amount of heterogeneous data available. Multiple alignments of complete sequences provide an ideal environment for the integration of this information in the context of the protein family. Results MACSIMS is a multiple alignment-based information management program that combines the advantages of both knowledge-based and ab initio sequence analysis methods. Structural and functional information is retrieved automatically from the public databases. In the multiple alignment, homologous regions are identified and the retrieved data is evaluated and propagated from known to unknown sequences with these reliable regions. In a large-scale evaluation, the specificity of the propagated sequence features is estimated to be >99%, i.e. very few false positive predictions are made. MACSIMS is then used to characterise mutations in a test set of 100 proteins that are known to be involved in human genetic diseases. The number of sequence features associated with these proteins was increased by 60%, compared to the features available in the public databases. An XML format output file allows automatic parsing of the MACSIM results, while a graphical display using the JalView program allows manual analysis. Conclusion MACSIMS is a new information management system that incorporates detailed analyses of protein families at the structural, functional and evolutionary levels. MACSIMS thus provides a unique environment that facilitates knowledge extraction and the presentation of the most pertinent information to the biologist. A web server and the source code are available at http://bips.u-strasbg.fr/MACSIMS/.
U.S. Global Change Research Program National Climate Assessment Global Change Information System

Science.gov (United States)

Tilmes, Curt

2012-01-01

The program: a) Coordinates Federal research to better understand and prepare the nation for global change. b) Priori4zes and supports cutting edge scientific work in global change. c) Assesses the state of scientific knowledge and the Nation s readiness to respond to global change. d) Communicates research findings to inform, educate, and engage the global community.
Children inhibit global information when the forest is dense and local information when the forest is sparse.

Science.gov (United States)

Krakowski, Claire-Sara; Borst, Grégoire; Vidal, Julie; Houdé, Olivier; Poirel, Nicolas

2018-09-01

Visual environments are composed of global shapes and local details that compete for attentional resources. In adults, the global level is processed more rapidly than the local level, and global information must be inhibited in order to process local information when the local information and global information are in conflict. Compared with adults, children present less of a bias toward global visual information and appear to be more sensitive to the density of local elements that constitute the global level. The current study aimed, for the first time, to investigate the key role of inhibition during global/local processing in children. By including two different conditions of global saliency during a negative priming procedure, the results showed that when the global level was salient (dense hierarchical figures), 7-year-old children and adults needed to inhibit the global level to process the local information. However, when the global level was less salient (sparse hierarchical figures), only children needed to inhibit the local level to process the global information. These results confirm a weaker global bias and the greater impact of saliency in children than in adults. Moreover, the results indicate that, regardless of age, inhibition of the most salient hierarchical level is systematically required to select the less salient but more relevant level. These findings have important implications for future research in this area. Copyright © 2018 Elsevier Inc. All rights reserved.
Thai Youths and Global Warming: Media Information, Awareness, and Lifestyle Activities

Science.gov (United States)

Chokriensukchai, Kanchana; Tamang, Ritendra

2010-01-01

This study examines the exposure of Thai youths to media information on global warming, the relationship between exposure to global warming information and awareness of global warming, and the relationship between that awareness and lifestyle activities that contribute to global warming. A focus group of eight Thai youths provided information that…
Human genome and genetic sequencing research and informed consent

International Nuclear Information System (INIS)

Iwakawa, Mayumi

2003-01-01

On March 29, 2001, the Ethical Guidelines for Human Genome and Genetic Sequencing Research were established. They have intended to serve as ethical guidelines for all human genome and genetic sequencing research practice, for the purpose of upholding respect for human dignity and rights and enforcing use of proper methods in the pursuit of human genome and genetic sequencing research, with the understanding and cooperation of the public. The RadGenomics Project has prepared a research protocol and informed consent document that follow these ethical guidelines. We have endeavored to protect the privacy of individual information, and have established a procedure for examination of research practices by an ethics committee. Here we report our procedure in order to offer this concept to the patients. (authors)
Knowledge Management and Global Information Dissemination

Science.gov (United States)

Umunadi, Ejiwoke Kennedy

2014-01-01

The paper looked at knowledge management and global information dissemination. Knowledge is a very powerful tool for survival, growth and development. It can be seen as the information, understanding and skills that you gain through education or experience. The paper was addressed under the following sub-headings: Knowledge management knowledge…
Information management for global environmental change, including the Carbon Dioxide Information Analysis Center

Energy Technology Data Exchange (ETDEWEB)

Stoss, F.W. [Oak Ridge National Lab., TN (United States). Carbon Dioxide Information Analysis Center

1994-06-01

The issue of global change is international in scope. A body of international organizations oversees the worldwide coordination of research and policy initiatives. In the US the National Science and Technology Council (NSTC) was established in November of 1993 to provide coordination of science, space, and technology policies throughout the federal government. NSTC is organized into nine proposed committees. The Committee on Environmental and Natural Resources (CERN) oversees the US Department of Energy`s Global Change Research Program (USGCRP). As part of the USGCRP, the US Department of Energy`s Global Change Research Program aims to improve the understanding of Earth systems and to strengthen the scientific basis for the evaluation of policy and government action in response to potential global environmental changes. This paper examines the information and data management roles of several international and national programs, including Oak Ridge National Laboratory`s (ORNL`s) global change information programs. An emphasis will be placed on the Carbon Dioxide Information Analysis Center (CDIAC), which also serves as the World Data Center-A for Atmospheric Trace Gases.
Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions

KAUST Repository

Odat, Enas M.

2011-05-01

The purpose of this dissertation is to present a methodology to model global sequence alignment problem as directed acyclic graph which helps to extract all possible optimal alignments. Moreover, a mechanism to sequentially optimize sequence alignment problem relative to different cost functions is suggested. Sequence alignment is mostly important in computational biology. It is used to find evolutionary relationships between biological sequences. There are many algo- rithms that have been developed to solve this problem. The most famous algorithms are Needleman-Wunsch and Smith-Waterman that are based on dynamic program- ming. In dynamic programming, problem is divided into a set of overlapping sub- problems and then the solution of each subproblem is found. Finally, the solutions to these subproblems are combined into a final solution. In this thesis it has been proved that for two sequences of length m and n over a fixed alphabet, the suggested optimization procedure requires O(mn) arithmetic operations per cost function on a single processor machine. The algorithm has been simulated using C#.Net programming language and a number of experiments have been done to verify the proved statements. The results of these experiments show that the number of optimal alignments is reduced after each step of optimization. Furthermore, it has been verified that as the sequence length increased linearly then the number of optimal alignments increased exponentially which also depends on the cost function that is used. Finally, the number of executed operations increases polynomially as the sequence length increase linearly.
Fast global sequence alignment technique

KAUST Repository

Bonny, Mohamed Talal

2011-11-01

Bioinformatics database is growing exponentially in size. Processing these large amount of data may take hours of time even if super computers are used. One of the most important processing tool in Bioinformatics is sequence alignment. We introduce fast alignment algorithm, called \\'Alignment By Scanning\\' (ABS), to provide an approximate alignment of two DNA sequences. We compare our algorithm with the wellknown sequence alignment algorithms, the \\'GAP\\' (which is heuristic) and the \\'Needleman-Wunsch\\' (which is optimal). The proposed algorithm achieves up to 51% enhancement in alignment score when it is compared with the GAP Algorithm. The evaluations are conducted using different lengths of DNA sequences. © 2011 IEEE.
The"minimum information about an environmental sequence" (MIENS) specification

Energy Technology Data Exchange (ETDEWEB)

Yilmaz, P.; Kottmann, R.; Field, D.; Knight, R.; Cole, J.R.; Amaral-Zettler, L.; Gilbert, J.A.; Karsch-Mizrachi, I.; Johnston, A.; Cochrane, G.; Vaughan, R.; Hunter, C.; Park, J.; Morrison, N.; Rocca-Serra, P.; Sterk, P.; Arumugam, M.; Baumgartner, L.; Birren, B.W.; Blaser, M.J.; Bonazzi, V.; Bork, P.; Buttigieg, P. L.; Chain, P.; Costello, E.K.; Huot-Creasy, H.; Dawyndt, P.; DeSantis, T.; Fierer, N.; Fuhrman, J.; Gallery, R.E.; Gibbs, R.A.; Giglio, M.G.; Gil, I. San; Gonzalez, A.; Gordon, J.I.; Guralnick, R.; Hankeln, W.; Highlander, S.; Hugenholtz, P.; Jansson, J.; Kennedy, J.; Knights, D.; Koren, O.; Kuczynski, J.; Kyrpides, N.; Larsen, R.; Lauber, C.L.; Legg, T.; Ley, R.E.; Lozupone, C.A.; Ludwig, W.; Lyons, D.; Maguire, E.; Methe, B.A.; Meyer, F.; Nakieny, S.; Nelson, K.E.; Nemergut, D.; Neufeld, J.D.; Pace, N.R.; Palanisamy, G.; Peplies, J.; Peterson, J.; Petrosino, J.; Proctor, L.; Raes, J.; Ratnasingham, S.; Ravel, J.; Relman, D.A.; Assunta-Sansone, S.; Schriml, L.; Sodergren, E.; Spor, A.; Stombaugh, J.; Tiedje, J.M.; Ward, D.V.; Weinstock, G.M.; Wendel, D.; White, O.; Wikle, A.; Wortman, J.R.; Glockner, F.O.; Bushman, F.D.; Charlson, E.; Gevers, D.; Kelley, S.T.; Neubold, L.K.; Oliver, A.E.; Pruesse, E.; Quast, C.; Schloss, P.D.; Sinha, R.; Whitely, A.

2010-10-15

We present the Genomic Standards Consortium's (GSC) 'Minimum Information about an ENvironmental Sequence' (MIENS) standard for describing marker genes. Adoption of MIENS will enhance our ability to analyze natural genetic diversity across the Tree of Life as it is currently being documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere.
Providing Global Change Information for Decision-Making: Capturing and Presenting Provenance

Science.gov (United States)

Ma, Xiaogang; Fox, Peter; Tilmes, Curt; Jacobs, Katherine; Waple, Anne

2014-01-01

Global change information demands access to data sources and well-documented provenance to provide evidence needed to build confidence in scientific conclusions and, in specific applications, to ensure the information's suitability for use in decision-making. A new generation of Web technology, the Semantic Web, provides tools for that purpose. The topic of global change covers changes in the global environment (including alterations in climate, land productivity, oceans or other water resources, atmospheric composition and or chemistry, and ecological systems) that may alter the capacity of the Earth to sustain life and support human systems. Data and findings associated with global change research are of great public, government, and academic concern and are used in policy and decision-making, which makes the provenance of global change information especially important. In addition, since different types of decisions benefit from different types of information, understanding how to capture and present the provenance of global change information is becoming more of an imperative in adaptive planning.

Information empowerment: predeparture resource training for students in global health.

Science.gov (United States)

Rana, Gurpreet K

2014-04-01

The Taubman Health Sciences Library (THL) collaborates with health sciences schools to provide information skills instruction for students preparing for international experiences. THL enhances students' global health learning through predeparture instruction for students who are involved in global health research, clinical internships, and international collaborations. This includes teaching international literature searching skills, providing country-specific data sources, building awareness of relevant mobile resources, and encouraging investigation of international news. Information skills empower creation of stronger global partnerships. Use of information resources has enhanced international research and training experiences, built lifelong learning foundations, and contributed to the university's global engagement. THL continues to assess predeparture instruction.
Provenance Representation in the Global Change Information System (GCIS)

Science.gov (United States)

Tilmes, Curt

2012-01-01

Global climate change is a topic that has become very controversial despite strong support within the scientific community. It is common for agencies releasing information about climate change to be served with Freedom of Information Act (FOIA) requests for everything that led to that conclusion. Capturing and presenting the provenance, linking to the research papers, data sets, models, analyses, observation instruments and satellites, etc. supporting key findings has the potential to mitigate skepticism in this domain. The U.S. Global Change Research Program (USGCRP) is now coordinating the production of a National Climate Assessment (NCA) that presents our best understanding of global change. We are now developing a Global Change Information System (GCIS) that will present the content of that report and its provenance, including the scientific support for the findings of the assessment. We are using an approach that will present this information both through a human accessible web site as well as a machine readable interface for automated mining of the provenance graph. We plan to use the developing W3C PROV Data Model and Ontology for this system.
Information-Theoretic Properties of Auditory Sequences Dynamically Influence Expectation and Memory.

Science.gov (United States)

Agres, Kat; Abdallah, Samer; Pearce, Marcus

2018-01-01

A basic function of cognition is to detect regularities in sensory input to facilitate the prediction and recognition of future events. It has been proposed that these implicit expectations arise from an internal predictive coding model, based on knowledge acquired through processes such as statistical learning, but it is unclear how different types of statistical information affect listeners' memory for auditory stimuli. We used a combination of behavioral and computational methods to investigate memory for non-linguistic auditory sequences. Participants repeatedly heard tone sequences varying systematically in their information-theoretic properties. Expectedness ratings of tones were collected during three listening sessions, and a recognition memory test was given after each session. Information-theoretic measures of sequential predictability significantly influenced listeners' expectedness ratings, and variations in these properties had a significant impact on memory performance. Predictable sequences yielded increasingly better memory performance with increasing exposure. Computational simulations using a probabilistic model of auditory expectation suggest that listeners dynamically formed a new, and increasingly accurate, implicit cognitive model of the information-theoretic structure of the sequences throughout the experimental session. Copyright © 2017 Cognitive Science Society, Inc.
Fast global sequence alignment technique

KAUST Repository

Bonny, Mohamed Talal; Salama, Khaled N.

2011-01-01

fast alignment algorithm, called 'Alignment By Scanning' (ABS), to provide an approximate alignment of two DNA sequences. We compare our algorithm with the wellknown sequence alignment algorithms, the 'GAP' (which is heuristic) and the 'Needleman
Fast convergence of spike sequences to periodic patterns in recurrent networks

International Nuclear Information System (INIS)

Jin, Dezhe Z.

2002-01-01

The dynamical attractors are thought to underlie many biological functions of recurrent neural networks. Here we show that stable periodic spike sequences with precise timings are the attractors of the spiking dynamics of recurrent neural networks with global inhibition. Almost all spike sequences converge within a finite number of transient spikes to these attractors. The convergence is fast, especially when the global inhibition is strong. These results support the possibility that precise spatiotemporal sequences of spikes are useful for information encoding and processing in biological neural networks
Agile Data Management with the Global Change Information System

Science.gov (United States)

Duggan, B.; Aulenbach, S.; Tilmes, C.; Goldstein, J.

2013-12-01

We describe experiences applying agile software development techniques to the realm of data management during the development of the Global Change Information System (GCIS), a web service and API for authoritative global change information under development by the US Global Change Research Program. Some of the challenges during system design and implementation have been : (1) balancing the need for a rigorous mechanism for ensuring information quality with the realities of large data sets whose contents are often in flux, (2) utilizing existing data to inform decisions about the scope and nature of new data, and (3) continuously incorporating new knowledge and concepts into a relational data model. The workflow for managing the content of the system has much in common with the development of the system itself. We examine various aspects of agile software development and discuss whether or how we have been able to use them for data curation as well as software development.
Elman RNN based classification of proteins sequences on account of their mutual information.

Science.gov (United States)

Mishra, Pooja; Nath Pandey, Paras

2012-10-21

In the present work we have employed the method of estimating residue correlation within the protein sequences, by using the mutual information (MI) of adjacent residues, based on structural and solvent accessibility properties of amino acids. The long range correlation between nonadjacent residues is improved by constructing a mutual information vector (MIV) for a single protein sequence, like this each protein sequence is associated with its corresponding MIVs. These MIVs are given to Elman RNN to obtain the classification of protein sequences. The modeling power of MIV was shown to be significantly better, giving a new approach towards alignment free classification of protein sequences. We also conclude that sequence structural and solvent accessible property based MIVs are better predictor. Copyright © 2012 Elsevier Ltd. All rights reserved.
Mixed multiscale finite element methods using approximate global information based on partial upscaling

KAUST Repository

Jiang, Lijian; Efendiev, Yalchin; Mishev, IIya

2009-01-01

The use of limited global information in multiscale simulations is needed when there is no scale separation. Previous approaches entail fine-scale simulations in the computation of the global information. The computation of the global information
Global information sampling in the honey bee

Science.gov (United States)

Johnson, Brian R.

2008-06-01

Central to the question of task allocation in social insects is how workers acquire information. Patrolling is a curious behavior in which bees meander over the face of the comb inspecting cells. Several authors have suggested it allows bees to collect global information, but this has never been formally evaluated. This study explores this hypothesis by answering three questions. First, do bees gather information in a consistent manner as they patrol? Second, do they move far enough to get a sense of task demand in distant areas of the nest? And third, is patrolling a commonly performed task? Focal animal observations were used to address the first two predictions, while a scan sampling study was used to address the third. The results were affirmative for each question. While patrolling, workers collected information by performing periodic clusters of cell inspections. Patrolling bees not only traveled far enough to frequently change work zone; they often visited every part of the nest. Finally, the majority of the bees in the middle-age caste were shown to move throughout the nest over the course of a few hours in a manner suggestive of patrolling. Global information collection is contrary to much current theory, which assumes that workers respond to local information only. This study thus highlights the nonmutually exclusive nature of various information collection regimes in social insects.
Analyticity and the Global Information Field

Directory of Open Access Journals (Sweden)

Evgeni A. Solov'ev

2015-03-01

Full Text Available The relation between analyticity in mathematics and the concept of a global information field in physics is reviewed. Mathematics is complete in the complex plane only. In the complex plane, a very powerful tool appears—analyticity. According to this property, if an analytic function is known on the countable set of points having an accumulation point, then it is known everywhere. This mysterious property has profound consequences in quantum physics. Analyticity allows one to obtain asymptotic (approximate results in terms of some singular points in the complex plane which accumulate all necessary data on a given process. As an example, slow atomic collisions are presented, where the cross-sections of inelastic transitions are determined by branch-points of the adiabatic energy surface at a complex internuclear distance. Common aspects of the non-local nature of analyticity and a recently introduced interpretation of classical electrodynamics and quantum physics as theories of a global information field are discussed.
Global sequence diversity of the lactate dehydrogenase gene in Plasmodium falciparum.

Science.gov (United States)

Simpalipan, Phumin; Pattaradilokrat, Sittiporn; Harnyuttanakorn, Pongchai

2018-01-09

Antigen-detecting rapid diagnostic tests (RDTs) have been recommended by the World Health Organization for use in remote areas to improve malaria case management. Lactate dehydrogenase (LDH) of Plasmodium falciparum is one of the main parasite antigens employed by various commercial RDTs. It has been hypothesized that the poor detection of LDH-based RDTs is attributed in part to the sequence diversity of the gene. To test this, the present study aimed to investigate the genetic diversity of the P. falciparum ldh gene in Thailand and to construct the map of LDH sequence diversity in P. falciparum populations worldwide. The ldh gene was sequenced for 50 P. falciparum isolates in Thailand and compared with hundreds of sequences from P. falciparum populations worldwide. Several indices of molecular variation were calculated, including the proportion of polymorphic sites, the average nucleotide diversity index (π), and the haplotype diversity index (H). Tests of positive selection and neutrality tests were performed to determine signatures of natural selection on the gene. Mean genetic distance within and between species of Plasmodium ldh was analysed to infer evolutionary relationships. Nucleotide sequences of P. falciparum ldh could be classified into 9 alleles, encoding 5 isoforms of LDH. L1a was the most common allelic type and was distributed in P. falciparum populations worldwide. Plasmodium falciparum ldh sequences were highly conserved, with haplotype and nucleotide diversity values of 0.203 and 0.0004, respectively. The extremely low genetic diversity was maintained by purifying selection, likely due to functional constraints. Phylogenetic analysis inferred the close genetic relationship of P. falciparum to malaria parasites of great apes, rather than to other human malaria parasites. This study revealed the global genetic variation of the ldh gene in P. falciparum, providing knowledge for improving detection of LDH-based RDTs and supporting the candidacy of
The limit space of a Cauchy sequence of globally hyperbolic spacetimes

Energy Technology Data Exchange (ETDEWEB)

Noldus, Johan [Universiteit Gent, Vakgroep Wiskundige analyse, Galglaan 2, 9000 Gent (Belgium)

2004-02-21

In this second paper, I construct a limit space of a Cauchy sequence of globally hyperbolic spacetimes. In section 2, I work gradually towards a construction of the limit space. I prove that the limit space is unique up to isometry. I also show that, in general, the limit space has quite complicated causal behaviour. This work prepares the final paper in which I shall study in more detail properties of the limit space and the moduli space of (compact) globally hyperbolic spacetimes (cobordisms). As a fait divers, I give in this paper a suitable definition of dimension of a Lorentz space in agreement with the one given by Gromov in the Riemannian case. The difference in philosophy between Lorentzian and Riemannian geometry is one of relativism versus absolutism. In the latter every point distinguishes itself while in the former in general two elements get distinguished by a third, different, one.
The limit space of a Cauchy sequence of globally hyperbolic spacetimes

International Nuclear Information System (INIS)

Noldus, Johan

2004-01-01

In this second paper, I construct a limit space of a Cauchy sequence of globally hyperbolic spacetimes. In section 2, I work gradually towards a construction of the limit space. I prove that the limit space is unique up to isometry. I also show that, in general, the limit space has quite complicated causal behaviour. This work prepares the final paper in which I shall study in more detail properties of the limit space and the moduli space of (compact) globally hyperbolic spacetimes (cobordisms). As a fait divers, I give in this paper a suitable definition of dimension of a Lorentz space in agreement with the one given by Gromov in the Riemannian case. The difference in philosophy between Lorentzian and Riemannian geometry is one of relativism versus absolutism. In the latter every point distinguishes itself while in the former in general two elements get distinguished by a third, different, one
China's Mission in Surveying, Mapping and Geographic Information during Global Governance

Science.gov (United States)

Jia, D.; Xue, C.; Chen, X.

2018-04-01

In the new era, it is proposed that China should be transformed from a participant and a cooperator into a designer, an impeller and a leader, continue taking an effect of responsible great power, increase public product supply, perfect a global governance system and contribute to China's wisdom and China's schemes during global governance, thus surveying and mapping geographic information takes on great mission. On the one hand, we have to timely grasp global geographic information data resources to provide an important scientific data support for China's wisdom and China's schemes. On the other hand, we have to provide surveying and mapping geographic information infrastructure construction and public products for developing countries, support location services within a global territorial scope, and realize the smoothness of talent flow, material flow and information flow between China and countries in the world. Meanwhile, external assistance and international communication and cooperation of surveying and mapping geographic information are also enhanced, and popularization and application of a geographic information technology in underdeveloped countries and regions are promoted.
Functional region prediction with a set of appropriate homologous sequences-an index for sequence selection by integrating structure and sequence information with spatial statistics

Science.gov (United States)

2012-01-01

Background The detection of conserved residue clusters on a protein structure is one of the effective strategies for the prediction of functional protein regions. Various methods, such as Evolutionary Trace, have been developed based on this strategy. In such approaches, the conserved residues are identified through comparisons of homologous amino acid sequences. Therefore, the selection of homologous sequences is a critical step. It is empirically known that a certain degree of sequence divergence in the set of homologous sequences is required for the identification of conserved residues. However, the development of a method to select homologous sequences appropriate for the identification of conserved residues has not been sufficiently addressed. An objective and general method to select appropriate homologous sequences is desired for the efficient prediction of functional regions. Results We have developed a novel index to select the sequences appropriate for the identification of conserved residues, and implemented the index within our method to predict the functional regions of a protein. The implementation of the index improved the performance of the functional region prediction. The index represents the degree of conserved residue clustering on the tertiary structure of the protein. For this purpose, the structure and sequence information were integrated within the index by the application of spatial statistics. Spatial statistics is a field of statistics in which not only the attributes but also the geometrical coordinates of the data are considered simultaneously. Higher degrees of clustering generate larger index scores. We adopted the set of homologous sequences with the highest index score, under the assumption that the best prediction accuracy is obtained when the degree of clustering is the maximum. The set of sequences selected by the index led to higher functional region prediction performance than the sets of sequences selected by other sequence
Global games with noisy sharing of information

KAUST Repository

Touri, Behrouz; Shamma, Jeff S.

2014-01-01

We provide a framework for the study of global games with noisy sharing of information. In contrast to the previous works where it is shown that an intuitive threshold policy is an equilibrium for such games, we show that noisy sharing of information leads to non-existence of such an equilibrium. We also investigate the group best-response dynamics of two groups of agents sharing the same information to threshold policies based on each group's observation and show the convergence of such dynamics.
Workshop Proceedings: Utilizing global information systems

Energy Technology Data Exchange (ETDEWEB)

NONE

1998-03-01

This one day workshop was attended by 50 delegates representing 20 countries. Fourteen papers dealing with cleaner production/cleaner technology global information systems were presented. The objective of the workshop was to increase cooperation and interaction on international clean production/clean technology information transfer activities and to identify ways to ensure continued cooperation and system improvements. Topics discussed included information format to meet user needs; coordination of effort to avoid duplication and to encourage consistency in information delivery; and marketing, to expand the dissemination of information on cleaner production/cleaner technology. In terms of information format, content, systems and reliability were identified as target issues. The group discussing coordination of effort suggested that a wholesale/retail approach to information dissemination be adopted. The group also called for regular meetings to supplement communication via the Internet. The marketing group suggested that there is a need to show the benefits of technologies and to establish links to industrial associations as being critical to success.
SoilGrids1km— global soil information based on automated mapping

NARCIS (Netherlands)

Hengl, T.; Mendes de Jesus, J.S.; Macmillan, R.A.; Batjes, N.H.; Heuvelink, G.B.M.; Carvalho Ribeiro, E.D.; Samuel Rosa, A.; Kempen, B.; Leenaars, J.G.B.; Walsh, M.G.; Ruiperez Gonzalez, M.

2014-01-01

Background Soils are widely recognized as a non-renewable natural resource and as biophysical carbon sinks. As such, there is a growing requirement for global soil information. Although several global soil information systems already exist, these tend to suffer from inconsistencies and limited
Global games with noisy sharing of information

KAUST Repository

Touri, Behrouz

2014-12-15

We provide a framework for the study of global games with noisy sharing of information. In contrast to the previous works where it is shown that an intuitive threshold policy is an equilibrium for such games, we show that noisy sharing of information leads to non-existence of such an equilibrium. We also investigate the group best-response dynamics of two groups of agents sharing the same information to threshold policies based on each group\\'s observation and show the convergence of such dynamics.
Applications of statistical physics and information theory to the analysis of DNA sequences

Science.gov (United States)

Grosse, Ivo

2000-10-01

DNA carries the genetic information of most living organisms, and the of genome projects is to uncover that genetic information. One basic task in the analysis of DNA sequences is the recognition of protein coding genes. Powerful computer programs for gene recognition have been developed, but most of them are based on statistical patterns that vary from species to species. In this thesis I address the question if there exist universal statistical patterns that are different in coding and noncoding DNA of all living species, regardless of their phylogenetic origin. In search for such species-independent patterns I study the mutual information function of genomic DNA sequences, and find that it shows persistent period-three oscillations. To understand the biological origin of the observed period-three oscillations, I compare the mutual information function of genomic DNA sequences to the mutual information function of stochastic model sequences. I find that the pseudo-exon model is able to reproduce the mutual information function of genomic DNA sequences. Moreover, I find that a generalization of the pseudo-exon model can connect the existence and the functional form of long-range correlations to the presence and the length distributions of coding and noncoding regions. Based on these theoretical studies I am able to find an information-theoretical quantity, the average mutual information (AMI), whose probability distributions are significantly different in coding and noncoding DNA, while they are almost identical in all studied species. These findings show that there exist universal statistical patterns that are different in coding and noncoding DNA of all studied species, and they suggest that the AMI may be used to identify genes in different living species, irrespective of their taxonomic origin.

Global Information Enterprise (GIE) Modeling and Simulation (GIESIM)

National Research Council Canada - National Science Library

Bell, Paul

2005-01-01

... AND S) toolkits into the Global Information Enterprise (GIE) Modeling and Simulation (GIESim) framework to create effective user analysis of candidate communications architectures and technologies...
Digitizing the Non-Digital: Creating a Global Context for Events, Artifacts, Ideas, and Information

Directory of Open Access Journals (Sweden)

Deborah L. MacPherson

2006-06-01

Full Text Available This paper discusses some of the problems associated with search and digital-rights management in the emerging age of interconnectivity. An open-source system called Context Driven Topologies (CDT is proposed to create one global context of geography, knowledge domains, and Internet addresses, using centralized spatial databases, geometry, and maps. The same concept can be described by different words, the same image can be interpreted a thousand ways by every viewer, but mathematics is a set of rules to ensure that certain relationships or sequences will be precisely regenerated. Therefore, unlike most of today’s digital records, CDTs are based on mathematics first, images second, words last. The aim is to permanently link the highest quality events, artifacts, ideas, and information into one record documenting the quickest paths to the most relevant information for specific data, users, and tasks. A model demonstration project using CDT to organize, search, and place information in new contexts while protecting the authors’ intent is also introduced.
Threshold policy for global games with noisy information sharing

KAUST Repository

Mahdavifar, Hessam

2015-12-15

It is known that global games with noisy sharing of information do not admit a certain type of threshold policies [1]. Motivated by this result, we investigate the existence of threshold-type policies on global games with noisy sharing of information and show that such equilibrium strategies exist and are unique if the sharing of information happens over a sufficiently noisy environment. To show this result, we establish that if a threshold function is an equilibrium strategy, then it will be a solution to a fixed point equation. Then, we show that for a sufficiently noisy environment, the functional fixed point equation leads to a contraction mapping, and hence, its iterations converge to a unique continuous threshold policy.
GRIN-Global: An International Project to Develop a Global Plant Genebank Information Management System

Science.gov (United States)

The mission of the GRIN-Global Project is to create a new, scalable version of the Germplasm Resource Information System (GRIN) to provide the world’s crop genebanks with a powerful, flexible, easy-to-use plant genetic resource (PGR) information management system. The system will help safeguard PGR ...
75 FR 70714 - Global Free Flow of Information on the Internet

Science.gov (United States)

2010-11-18

.... 100921457-0561-02] RIN 0660-XA20 Global Free Flow of Information on the Internet AGENCY: National... of comment period. SUMMARY: The Department of Commerce's Internet Policy Task Force announces that... on the global free flow of information on the Internet has been reopened and will extend until 5 p.m...
Linked Open Data in the Global Change Information System (GCIS)

Science.gov (United States)

Tilmes, Curt A.

2012-01-01

The U.S. Global Change Research Program (http://globalchange.gov) coordinates and integrates federal research on changes in the global environment and their implications for society. The USGCRP is developing a Global Change Information System (GCIS) that will centralize access to data and information related to global change across the U.S. federal government. The first implementation will focus on the 2013 National Climate Assessment (NCA) . (http://assessment.globalchange.gov) The NCA integrates, evaluates, and interprets the findings of the USGCRP; analyzes the effects of global change on the natural environment, agriculture, energy production and use, land and water resources, transportation, human health and welfare, human social systems, and biological diversity; and analyzes current trends in global change, both human-induced and natural, and projects major trends for the subsequent 25 to 100 years. The NCA has received over 500 distinct technical inputs to the process, many of which are reports distilling and synthesizing even more information, coming from thousands of individuals around the federal, state and local governments, academic institutions and non-governmental organizations. The GCIS will present a web-based version of the NCA including annotations linking the findings and content of the NCA with the scientific research, datasets, models, observations, etc. that led to its conclusions. It will use semantic tagging and a linked data approach, assigning globally unique, persistent, resolvable identifiers to all of the related entities and capturing and presenting the relationships between them, both internally and referencing out to other linked data sources and back to agency data centers. The developing W3C PROV Data Model and ontology will be used to capture the provenance trail and present it in both human readable web pages and machine readable formats such as RDF and SPARQL. This will improve visibility into the assessment process, increase
A method for partitioning the information contained in a protein sequence between its structure and function.

Science.gov (United States)

Possenti, Andrea; Vendruscolo, Michele; Camilloni, Carlo; Tiana, Guido

2018-05-23

Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that keeps into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the 'information gap') is very close to what needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially-designed protein sequences. This article is protected by copyright. All rights reserved. © 2018 Wiley Periodicals, Inc.
Globalization on Trial: The Human Condition and the Information ...

International Development Research Centre (IDRC) Digital Library (Canada)

He also focuses on our education system and how it will have to adapt to meet the new challenges of our global, information age. Globalization on Trial will interest ... Farhang Rajaee is a Visiting Associate Professor at the College of the Humanities, Carleton University, Ottawa, Canada. Professor Rajaee received his PhD ...
Magnetic resonance for T-staging of nasopharyngeal carcinoma. The most informative pair of sequences

International Nuclear Information System (INIS)

Lau, Kam Y.; Kan, Wai K.; Sze, Wai M.

2004-01-01

The objective of this study was to evaluate the most informative pair of sequences in magnetic resonance (MR) for T-staging of nasopharyngeal carcinoma (NPC). The MR images of 134 patients with newly diagnosed NRC, from 1996 to 2002, were retrospectively reviewed. All the patients were scanned using 1.5 Tesla MR systems. The images of the nasopharynx were reviewed by two qualified radiologists to determine the positive findings and the T-stage by Union Internationale Contre le Cancer (UICC) (6th edition) System, using each sequence separately. The T-stage derived from a single MR sequence was then compared with the T-stage based on the five selected sequences to assess the number and percentage of patients who were being understaged. Therefore, the overall percentage accuracy of each single sequence could be determined. A pair of sequences providing information to achieve almost 100% diagnostic accuracy was then derived. The overall percentage accuracy of five individual sequences of the nasopharynx is as follows: contrast-enhanced (CE) fat suppression (FS) axial T1 (94.8%), CE FS coronal T1 (88.1%), FS axial T2 (85.8%), non-contrast enhanced (NE) axial T1 (78.4%) and NE coronal T1 (77.6%). CE FS axial T1 has the best accuracy. All the structures that are missed in CE FS axial T1 which lead to apparent understaging, are appreciated in NE axial T1-weighted images. Individual sequences supplement each other in the NPC staging. CE FS axial T1 is the most informative individual sequence. Combination of CE FS axial T1 and NE axial T1 of the nasopharynx provides sufficient information to achieve almost 100% diagnostic accuracy in T-staging; therefore, both should be included in the MR-staging protocol. (author)
Information performances and illative sequences: Sequential organization of explanations of chemical phase equilibrium

Science.gov (United States)

Brown, Nathaniel James Swanton

While there is consensus that conceptual change is surprisingly difficult, many competing theories of conceptual change co-exist in the literature. This dissertation argues that this discord is partly the result of an inadequate account of the unwritten rules of human social interaction that underlie the field's preferred methodology---semi-structured interviewing. To better understand the contributions of interaction during explanations, I analyze eight undergraduate general chemistry students as they attempt to explain to various people, for various reasons, why phenomena involving chemical phase equilibrium occur. Using the methods of interaction analysis, I characterize the unwritten, but systematic, rules that these participants follow as they explain. The result is a description of the contributions of interaction to explaining. Each step in each explanation is a jointly performed expression of a subject-predicate relation, an interactive accomplishment I call an information performance (in-form, for short). Unlike clauses, in-forms need not have a coherent grammatical structure. Unlike speaker turns, in-forms have the clear function of expressing information. Unlike both clauses and speaker turns, in-forms are a co-construction, jointly performed by both the primary speaker and the other interlocutor. The other interlocutor strongly affects the form and content of each explanation by giving or withholding feedback at the end of each in-form, moments I call feedback-relevant places. While in-forms are the bricks out of which the explanation is constructed, they are secured by a series of inferential links I call an illative sequence. Illative sequences are forward-searching, starting with a remembered fact or observation and following a chain of inferences in the hope it leads to the target phenomenon. The participants treat an explanation as a success if the illative sequence generates an in-form that describes the phenomenon. If the illative sequence does
Information data systems for a global change technology initiative architecture trade study

Science.gov (United States)

Murray, Nicholas D.

1991-01-01

The Global Change Technology Initiative (GCTI) was established to develop technology which will enable use of satellite systems of Earth observations on a global scale, enable use of the observations to predictively model Earth's changes, and provide scientists, government, business, and industry with quick access to the resulting information. At LaRC, a GCTI Architecture Trade Study was undertaken to develop and evaluate the architectural implications to meet the requirements of the global change studies and the eventual implementation of a global change system. The output of the trade study are recommended technologies for the GCTI. That portion of the study concerned with the information data system is documented. The information data system for an earth global change modeling system can be very extensive and beyond affordability in terms of today's costs. Therefore, an incremental approach to gaining a system is most likely. An options approach to levels of capability versus needed technologies was developed. The primary drivers of the requirements for the information data system evaluation were the needed science products, the science measurements, the spacecraft orbits, the instruments configurations, and the spacecraft configurations and their attendant architectures. The science products requirements were not studied here; however, some consideration of the product needs were included in the evaluation results. The information data system technology items were identified from the viewpoint of the desirable overall information system characteristics.
Representation of protein-sequence information by amino acid subalphabets

DEFF Research Database (Denmark)

Andersen, C.A.F.; Brunak, Søren

2004-01-01

-sequence information, using machine learning strategies, where the primary goal is the discovery of novel powerful representations for use in AI techniques. In the case of proteins and the 20 different amino acids they typically contain, it is also a secondary goal to discover how the current selection of amino acids...
Automatic exchange of information: towards a new global standard of tax transparency

Directory of Open Access Journals (Sweden)

Miguel Eduardo Pecho Trigueros

2014-07-01

Full Text Available Tax authorities are increasingly relying on mutual cooperation with their foreign peers to enforce more effectively their internal tax laws. After the banking scandals of 2008 and the subsequent global financial crisis, the Global Forum on Transparency and Exchange of Information for TaxPurposes has proposed the exchange of information upon request as the fiscal transparency standard. However, some measures adopted by the European Union, previous initiatives from the Organization for Economic Cooperation and Development (OECD and, above all, the introduction of the Foreign Account Tax Compliance Act (Fatca by the United States in 2010 have promoted the need to adopt the automatic exchange of information as the new fiscal transparency standard. Automatic exchange of information allows home countries to verify whether their taxpayers have correctly included foreign income, allowing tax authorities to have early warning of possible noncompliance cases. In February 2014, the OECD published its proposal for a new global model of automatic exchange of financial account information. The new global model contains the necessary legal instruments and due diligence and reporting procedures, mainly for financial institutions.
Marketing library and information services II a global outlook

CERN Document Server

Gupta, Dinesh K; Massisimo, Angels

2013-01-01

With contributions from library and information professionals (practitioners, researchers, faculty members, consultants, and others), Marketing Library and Information Services: A Global Outlook highlights a variety of exemplary LIS marketing practices and efforts from around the globe. The following broad topics are explored: changing marketing concepts; marketing library
DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks

OpenAIRE

Yin, Zi; Chang, Keng-hao; Zhang, Ruofei

2017-01-01

Information extraction and user intention identification are central topics in modern query understanding and recommendation systems. In this paper, we propose DeepProbe, a generic information-directed interaction framework which is built around an attention-based sequence to sequence (seq2seq) recurrent neural network. DeepProbe can rephrase, evaluate, and even actively ask questions, leveraging the generative ability and likelihood estimation made possible by seq2seq models. DeepProbe makes...
A generalized global alignment algorithm.

Science.gov (United States)

Huang, Xiaoqiu; Chao, Kun-Mao

2003-01-22

Homologous sequences are sometimes similar over some regions but different over other regions. Homologous sequences have a much lower global similarity if the different regions are much longer than the similar regions. We present a generalized global alignment algorithm for comparing sequences with intermittent similarities, an ordered list of similar regions separated by different regions. A generalized global alignment model is defined to handle sequences with intermittent similarities. A dynamic programming algorithm is designed to compute an optimal general alignment in time proportional to the product of sequence lengths and in space proportional to the sum of sequence lengths. The algorithm is implemented as a computer program named GAP3 (Global Alignment Program Version 3). The generalized global alignment model is validated by experimental results produced with GAP3 on both DNA and protein sequences. The GAP3 program extends the ability of standard global alignment programs to recognize homologous sequences of lower similarity. The GAP3 program is freely available for academic use at http://bioinformatics.iastate.edu/aat/align/align.html.
Architectural Design for the Global Legal Information Network

Science.gov (United States)

Kalpakis, Konstantinos

1999-01-01

In this report, we provide a summary of our activities regarding the goals, requirements analysis, design, and prototype implementation for the Global Legal Information Network, a joint effort between the Law Library of Congress and NASA.
Incorporating Global Information Security and Assurance in I.S. Education

Science.gov (United States)

White, Garry L.; Hewitt, Barbara; Kruck, S. E.

2013-01-01

Over the years, the news media has reported numerous information security incidents. Because of identity theft, terrorism, and other criminal activities, President Obama has made information security a national priority. Not only is information security and assurance an American priority, it is also a global issue. This paper discusses the…
Cenozoic global sea level, sequences, and the New Jersey transect: Results from coastal plain and continental slope drilling

Science.gov (United States)

Miller, K.G.; Mountain, Gregory S.; Browning, J.V.; Kominz, M.; Sugarman, P.J.; Christie-Blick, N.; Katz, M.E.; Wright, J.D.

1998-01-01

The New Jersey Sea Level Transect was designed to evaluate the relationships among global sea level (eustatic) change, unconformity-bounded sequences, and variations in subsidence, sediment supply, and climate on a passive continental margin. By sampling and dating Cenozoic strata from coastal plain and continental slope locations, we show that sequence boundaries correlate (within ??0.5 myr) regionally (onshore-offshore) and interregionally (New Jersey-Alabama-Bahamas), implicating a global cause. Sequence boundaries correlate with ??18O increases for at least the past 42 myr, consistent with an ice volume (glacioeustatic) control, although a causal relationship is not required because of uncertainties in ages and correlations. Evidence for a causal connection is provided by preliminary Miocene data from slope Site 904 that directly link ??18O increases with sequence boundaries. We conclude that variation in the size of ice sheets has been a primary control on the formation of sequence boundaries since ~42 Ma. We speculate that prior to this, the growth and decay of small ice sheets caused small-amplitude sea level changes (changes on mid-ocean ridges. Although our results are consistent with the general number and timing of Paleocene to middle Miocene sequences published by workers at Exxon Production Research Company, our estimates of sea level amplitudes are substantially lower than theirs. Lithofacies patterns within sequences follow repetitive, predictable patterns: (1) coastal plain sequences consist of basal transgressive sands overlain by regressive highstand silts and quartz sands; and (2) although slope lithofacies variations are subdued, reworked sediments constitute lowstand deposits, causing the strongest, most extensive seismic reflections. Despite a primary eustatic control on sequence boundaries, New Jersey sequences were also influenced by changes in tectonics, sediment supply, and climate. During the early to middle Eocene, low siliciclastic and
The Role of Global and Local Visual Information during Gaze-Cued Orienting of Attention.

Science.gov (United States)

Munsters, Nicolette M; van den Boomen, Carlijn; Hooge, Ignace T C; Kemner, Chantal

2016-01-01

Gaze direction is an important social communication tool. Global and local visual information are known to play specific roles in processing socially relevant information from a face. The current study investigated whether global visual information has a primary role during gaze-cued orienting of attention and, as such, may influence quality of interaction. Adults performed a gaze-cueing task in which a centrally presented face cued (valid or invalid) the location of a peripheral target through a gaze shift. We measured brain activity (electroencephalography) towards the cue and target and behavioral responses (manual and saccadic reaction times) towards the target. The faces contained global (i.e. lower spatial frequencies), local (i.e. higher spatial frequencies), or a selection of both global and local (i.e. mid-band spatial frequencies) visual information. We found a gaze cue-validity effect (i.e. valid versus invalid), but no interaction effects with spatial frequency content. Furthermore, behavioral responses towards the target were in all cue conditions slower when lower spatial frequencies were not present in the gaze cue. These results suggest that whereas gaze-cued orienting of attention can be driven by both global and local visual information, global visual information determines the speed of behavioral responses towards other entities appearing in the surrounding of gaze cue stimuli.

Always look on both sides: phylogenetic information conveyed by simple sequence repeat allele sequences.

Directory of Open Access Journals (Sweden)

Stéphanie Barthe

Full Text Available Simple sequence repeat (SSR markers are widely used tools for inferences about genetic diversity, phylogeography and spatial genetic structure. Their applications assume that variation among alleles is essentially caused by an expansion or contraction of the number of repeats and that, accessorily, mutations in the target sequences follow the stepwise mutation model (SMM. Generally speaking, PCR amplicon sizes are used as direct indicators of the number of SSR repeats composing an allele with the data analysis either ignoring the extent of allele size differences or assuming that there is a direct correlation between differences in amplicon size and evolutionary distance. However, without precisely knowing the kind and distribution of polymorphism within an allele (SSR and the associated flanking region (FR sequences, it is hard to say what kind of evolutionary message is conveyed by such a synthetic descriptor of polymorphism as DNA amplicon size. In this study, we sequenced several SSR alleles in multiple populations of three divergent tree genera and disentangled the types of polymorphisms contained in each portion of the DNA amplicon containing an SSR. The patterns of diversity provided by amplicon size variation, SSR variation itself, insertions/deletions (indels, and single nucleotide polymorphisms (SNPs observed in the FRs were compared. Amplicon size variation largely reflected SSR repeat number. The amount of variation was as large in FRs as in the SSR itself. The former contributed significantly to the phylogenetic information and sometimes was the main source of differentiation among individuals and populations contained by FR and SSR regions of SSR markers. The presence of mutations occurring at different rates within a marker's sequence offers the opportunity to analyse evolutionary events occurring on various timescales, but at the same time calls for caution in the interpretation of SSR marker data when the distribution of within
Globalization and Trust: Non-financial Information

Directory of Open Access Journals (Sweden)

Oscar Banda Lefaure

2015-09-01

Full Text Available Changes in the way of doing business that have resulted from globalization of markets have enabled countless benefits, but also a significant number of risks, that have been evident as since 2001 revelations about financial scandals around the world have occurred one after another. These unfortunate events showed the vulnerability to which investors (and other stakeholders are exposed for not having timely, clear and accurate information of the business progress in which they invest, and therefore not being able to take precautions. In addition, these business disasters have shown how the most unscrupulous executives do not hesitate to act illegally to hide their shady financial and accounting manoeuvres, in order to promote their personal benefit. Then the executive compensations policy lies behind. At this juncture, the importance of migrating to a new model of disclosure where the global financial community can take shelter of these malpractices and trust the board controls and the management stablishes has been increasing. This new model of disclosure has one of its pillars in non-financial information reports. This is not an alchemist solution, but is one of many efforts to be undertaken by companies to recover damaged trust. The international financial crisis affecting the world economy at the moment is another example of the need to give greater trust to the stakeholders –through transparency in the information they provide–. Only then, their participation in the capital market will be maintained and increased, and the costs that brings widespread lack of trust in which we live will be reduced.
Entropy-based analysis and bioinformatics-inspired integration of global economic information transfer.

Directory of Open Access Journals (Sweden)

Jinkyu Kim

Full Text Available The assessment of information transfer in the global economic network helps to understand the current environment and the outlook of an economy. Most approaches on global networks extract information transfer based mainly on a single variable. This paper establishes an entirely new bioinformatics-inspired approach to integrating information transfer derived from multiple variables and develops an international economic network accordingly. In the proposed methodology, we first construct the transfer entropies (TEs between various intra- and inter-country pairs of economic time series variables, test their significances, and then use a weighted sum approach to aggregate information captured in each TE. Through a simulation study, the new method is shown to deliver better information integration compared to existing integration methods in that it can be applied even when intra-country variables are correlated. Empirical investigation with the real world data reveals that Western countries are more influential in the global economic network and that Japan has become less influential following the Asian currency crisis.
Entropy-based analysis and bioinformatics-inspired integration of global economic information transfer.

Science.gov (United States)

Kim, Jinkyu; Kim, Gunn; An, Sungbae; Kwon, Young-Kyun; Yoon, Sungroh

2013-01-01

The assessment of information transfer in the global economic network helps to understand the current environment and the outlook of an economy. Most approaches on global networks extract information transfer based mainly on a single variable. This paper establishes an entirely new bioinformatics-inspired approach to integrating information transfer derived from multiple variables and develops an international economic network accordingly. In the proposed methodology, we first construct the transfer entropies (TEs) between various intra- and inter-country pairs of economic time series variables, test their significances, and then use a weighted sum approach to aggregate information captured in each TE. Through a simulation study, the new method is shown to deliver better information integration compared to existing integration methods in that it can be applied even when intra-country variables are correlated. Empirical investigation with the real world data reveals that Western countries are more influential in the global economic network and that Japan has become less influential following the Asian currency crisis.
CISAPS: Complex Informational Spectrum for the Analysis of Protein Sequences

Directory of Open Access Journals (Sweden)

Charalambos Chrysostomou

2015-01-01

Full Text Available Complex informational spectrum analysis for protein sequences (CISAPS and its web-based server are developed and presented. As recent studies show, only the use of the absolute spectrum in the analysis of protein sequences using the informational spectrum analysis is proven to be insufficient. Therefore, CISAPS is developed to consider and provide results in three forms including absolute, real, and imaginary spectrum. Biologically related features to the analysis of influenza A subtypes as presented as a case study in this study can also appear individually either in the real or imaginary spectrum. As the results presented, protein classes can present similarities or differences according to the features extracted from CISAPS web server. These associations are probable to be related with the protein feature that the specific amino acid index represents. In addition, various technical issues such as zero-padding and windowing that may affect the analysis are also addressed. CISAPS uses an expanded list of 611 unique amino acid indices where each one represents a different property to perform the analysis. This web-based server enables researchers with little knowledge of signal processing methods to apply and include complex informational spectrum analysis to their work.
Prediction of glutathionylation sites in proteins using minimal sequence information and their experimental validation.

Science.gov (United States)

Pal, Debojyoti; Sharma, Deepak; Kumar, Mukesh; Sandur, Santosh K

2016-09-01

S-glutathionylation of proteins plays an important role in various biological processes and is known to be protective modification during oxidative stress. Since, experimental detection of S-glutathionylation is labor intensive and time consuming, bioinformatics based approach is a viable alternative. Available methods require relatively longer sequence information, which may prevent prediction if sequence information is incomplete. Here, we present a model to predict glutathionylation sites from pentapeptide sequences. It is based upon differential association of amino acids with glutathionylated and non-glutathionylated cysteines from a database of experimentally verified sequences. This data was used to calculate position dependent F-scores, which measure how a particular amino acid at a particular position may affect the likelihood of glutathionylation event. Glutathionylation-score (G-score), indicating propensity of a sequence to undergo glutathionylation, was calculated using position-dependent F-scores for each amino-acid. Cut-off values were used for prediction. Our model returned an accuracy of 58% with Matthew's correlation-coefficient (MCC) value of 0.165. On an independent dataset, our model outperformed the currently available model, in spite of needing much less sequence information. Pentapeptide motifs having high abundance among glutathionylated proteins were identified. A list of potential glutathionylation hotspot sequences were obtained by assigning G-scores and subsequent Protein-BLAST analysis revealed a total of 254 putative glutathionable proteins, a number of which were already known to be glutathionylated. Our model predicted glutathionylation sites in 93.93% of experimentally verified glutathionylated proteins. Outcome of this study may assist in discovering novel glutathionylation sites and finding candidate proteins for glutathionylation.
Evolution of biological sequences implies an extreme value distribution of type I for both global and local pairwise alignment scores.

Science.gov (United States)

Bastien, Olivier; Maréchal, Eric

2008-08-07

Confidence in pairwise alignments of biological sequences, obtained by various methods such as Blast or Smith-Waterman, is critical for automatic analyses of genomic data. Two statistical models have been proposed. In the asymptotic limit of long sequences, the Karlin-Altschul model is based on the computation of a P-value, assuming that the number of high scoring matching regions above a threshold is Poisson distributed. Alternatively, the Lipman-Pearson model is based on the computation of a Z-value from a random score distribution obtained by a Monte-Carlo simulation. Z-values allow the deduction of an upper bound of the P-value (1/Z-value2) following the TULIP theorem. Simulations of Z-value distribution is known to fit with a Gumbel law. This remarkable property was not demonstrated and had no obvious biological support. We built a model of evolution of sequences based on aging, as meant in Reliability Theory, using the fact that the amount of information shared between an initial sequence and the sequences in its lineage (i.e., mutual information in Information Theory) is a decreasing function of time. This quantity is simply measured by a sequence alignment score. In systems aging, the failure rate is related to the systems longevity. The system can be a machine with structured components, or a living entity or population. "Reliability" refers to the ability to operate properly according to a standard. Here, the "reliability" of a sequence refers to the ability to conserve a sufficient functional level at the folded and maturated protein level (positive selection pressure). Homologous sequences were considered as systems 1) having a high redundancy of information reflected by the magnitude of their alignment scores, 2) which components are the amino acids that can independently be damaged by random DNA mutations. From these assumptions, we deduced that information shared at each amino acid position evolved with a constant rate, corresponding to the
Effects of Information Capitalism and Globalization on Teaching and Learning

Science.gov (United States)

Adeoye, Blessing F., Ed.; Tomei, Lawrence, Ed.

2014-01-01

As computers and Internet connections become widely available in schools and classrooms, it is critical to examine cross-cultural issues in the utilization of information and communication technologies. "Effects of Information Capitalism and Globalization on Teaching and Learning" examines issues concerning emerging multimedia…
MendeLIMS: a web-based laboratory information management system for clinical genome sequencing.

Science.gov (United States)

Grimes, Susan M; Ji, Hanlee P

2014-08-27

Large clinical genomics studies using next generation DNA sequencing require the ability to select and track samples from a large population of patients through many experimental steps. With the number of clinical genome sequencing studies increasing, it is critical to maintain adequate laboratory information management systems to manage the thousands of patient samples that are subject to this type of genetic analysis. To meet the needs of clinical population studies using genome sequencing, we developed a web-based laboratory information management system (LIMS) with a flexible configuration that is adaptable to continuously evolving experimental protocols of next generation DNA sequencing technologies. Our system is referred to as MendeLIMS, is easily implemented with open source tools and is also highly configurable and extensible. MendeLIMS has been invaluable in the management of our clinical genome sequencing studies. We maintain a publicly available demonstration version of the application for evaluation purposes at http://mendelims.stanford.edu. MendeLIMS is programmed in Ruby on Rails (RoR) and accesses data stored in SQL-compliant relational databases. Software is freely available for non-commercial use at http://dna-discovery.stanford.edu/software/mendelims/.
Environmental safety of the global information space

Directory of Open Access Journals (Sweden)

В’ячеслав Степанович Волошин

2015-03-01

Databases of full-text publications – journals, articles, monographs- are surely a means of salvation for science. There already exist a large number of such portals. Besides, advantages and disadvantages of electronic subscriptions to periodicals should certainly be considered. The former include the following most evident ones: aggregation of large data arrays, saving money on a subscription, an opportunity to work with relevant publications, thematic collections of materials, availability of records, simultaneous access of an unlimited number of users and others. Nevertheless, there are many disadvantages that make it difficult to work with full-text publications. They are the following: selective representativeness of publication numbers, complexity of keyword search, occasional presence of obsolete text formats, printed versions, possible psychological barrier, physiological incompatibility with computer equipment, fatigue caused by prolonged work on the computer. The Internet was followed by the appearance of global control networks, their aims ranging from control of a human life support to a unified control of humanity. So, the formed global information space promises the man to get access to almost any information source. Meanwhile, environmental safety of the man, his/her objective biological psyche and abilities in harmonious development are at serious risk
Living is information processing: from molecules to global systems

OpenAIRE

Farnsworth, Keith D.; Nelson, John; Gershenson, Carlos

2012-01-01

We extend the concept that life is an informational phenomenon, at every level of organisation, from molecules to the global ecological system. According to this thesis: (a) living is information processing, in which memory is maintained by both molecular states and ecological states as well as the more obvious nucleic acid coding; (b) this information processing has one overall function - to perpetuate itself; and (c) the processing method is filtration (cognition) of, and synthesis of, info...
The Present and Future of Whole Genome Sequencing (WGS and Whole Metagenome Sequencing (WMS for Surveillance of Antimicrobial Resistant Microorganisms and Antimicrobial Resistance Genes across the Food Chain

Directory of Open Access Journals (Sweden)

Elena A. Oniciuc

2018-05-01

Full Text Available Antimicrobial resistance (AMR surveillance is a critical step within risk assessment schemes, as it is the basis for informing global strategies, monitoring the effectiveness of public health interventions, and detecting new trends and emerging threats linked to food. Surveillance of AMR is currently based on the isolation of indicator microorganisms and the phenotypic characterization of clinical, environmental and food strains isolated. However, this approach provides very limited information on the mechanisms driving AMR or on the presence or spread of AMR genes throughout the food chain. Whole-genome sequencing (WGS of bacterial pathogens has shown potential for epidemiological surveillance, outbreak detection, and infection control. In addition, whole metagenome sequencing (WMS allows for the culture-independent analysis of complex microbial communities, providing useful information on AMR genes occurrence. Both technologies can assist the tracking of AMR genes and mobile genetic elements, providing the necessary information for the implementation of quantitative risk assessments and allowing for the identification of hotspots and routes of transmission of AMR across the food chain. This review article summarizes the information currently available on the use of WGS and WMS for surveillance of AMR in foodborne pathogenic bacteria and food-related samples and discusses future needs that will have to be considered for the routine implementation of these next-generation sequencing methodologies with this aim. In particular, methodological constraints that impede the use at a global scale of these high-throughput sequencing (HTS technologies are identified, and the standardization of methods and protocols is suggested as a measure to upgrade HTS-based AMR surveillance schemes.
Mixed multiscale finite element methods using approximate global information based on partial upscaling

KAUST Repository

Jiang, Lijian

2009-10-02

The use of limited global information in multiscale simulations is needed when there is no scale separation. Previous approaches entail fine-scale simulations in the computation of the global information. The computation of the global information is expensive. In this paper, we propose the use of approximate global information based on partial upscaling. A requirement for partial homogenization is to capture long-range (non-local) effects present in the fine-scale solution, while homogenizing some of the smallest scales. The local information at these smallest scales is captured in the computation of basis functions. Thus, the proposed approach allows us to avoid the computations at the scales that can be homogenized. This results in coarser problems for the computation of global fields. We analyze the convergence of the proposed method. Mathematical formalism is introduced, which allows estimating the errors due to small scales that are homogenized. The proposed method is applied to simulate two-phase flows in heterogeneous porous media. Numerical results are presented for various permeability fields, including those generated using two-point correlation functions and channelized permeability fields from the SPE Comparative Project (Christie and Blunt, SPE Reserv Evalu Eng 4:308-317, 2001). We consider simple cases where one can identify the scales that can be homogenized. For more general cases, we suggest the use of upscaling on the coarse grid with the size smaller than the target coarse grid where multiscale basis functions are constructed. This intermediate coarse grid renders a partially upscaled solution that contains essential non-local information. Numerical examples demonstrate that the use of approximate global information provides better accuracy than purely local multiscale methods. © 2009 Springer Science+Business Media B.V.
Delayed processing of global shape information in developmental prosopagnosia

DEFF Research Database (Denmark)

Gerlach, Christian; Klargaard, Solja K.; Petersen, Anders

2017-01-01

individuals with DP in Navon’s paradigm we find evidence of a reduced global precedence effect: The DPs are slower than controls to process global but not local shape information. Importantly, and in contrast to previous studies, we demonstrate that the DPs perform normally in a comprehensive test of visual......There is accumulating evidence suggesting that a central deficit in developmental prosopagnosia (DP), a disorder characterized by profound and lifelong difficulties with face recognition, concerns impaired holistic processing. Some of this evidence comes from studies using Navon’s paradigm where...... individuals with DP show a greater local or reduced global bias compared with controls. However, it has not been established what gives rise to this altered processing bias. Is it a reduced global precedence effect, changes in susceptibility to interference effects or both? By analyzing the performance of 10...
Requirements for a Global Greenhouse Gas Information System

Science.gov (United States)

Duren, R.; Boland, S.; Lempert, R.; Miller, C.

2008-12-01

A global greenhouse gas information system will prove a critical component of any successful effort to mitigate climate change which relies on limiting the atmospheric concentration of greenhouse gases. The system will provide the situational awareness necessary to actively reduce emissions, influence land use change, and sequester carbon. The information from such a system will be subject to intense scrutiny. Therefore, an effective system must openly and transparently produce data of unassailable quality. A global greenhouse gas information system will likely require a combination of space-and air-based remote- sensing assets, ground-based measurements, carbon cycle modeling and self-reporting. The specific requirements on such a system will be shaped by the degree of international cooperation it enjoys and the needs of the policy regime it aims to support, which might range from verifying treaty obligations, to certifying the tradable permits and offsets underlying a market in greenhouse gas emission reductions, to providing a comprehensive inventory of high and low emitters that could be used by non-governmental organizations and other international actors. While some technical studies have examined particular system components in single scenarios, there remains a need for a comprehensive survey of the range of potential requirements, options, and strategies for the overall system. We have initiated such a survey and recently hosted a workshop which engaged a diverse community of stakeholders to begin synthesizing requirements for such a system, with an initial focus on carbon dioxide. In this paper we describe our plan for completing the definition of the requirements, options, and strategies for a global greenhouse gas monitoring system. We discuss our overall approach and provide a status on the initial requirements synthesis activity.
Information-Theoretical Analysis of EEG Microstate Sequences in Python

Directory of Open Access Journals (Sweden)

Frederic von Wegner

2018-06-01

Full Text Available We present an open-source Python package to compute information-theoretical quantities for electroencephalographic data. Electroencephalography (EEG measures the electrical potential generated by the cerebral cortex and the set of spatial patterns projected by the brain's electrical potential on the scalp surface can be clustered into a set of representative maps called EEG microstates. Microstate time series are obtained by competitively fitting the microstate maps back into the EEG data set, i.e., by substituting the EEG data at a given time with the label of the microstate that has the highest similarity with the actual EEG topography. As microstate sequences consist of non-metric random variables, e.g., the letters A–D, we recently introduced information-theoretical measures to quantify these time series. In wakeful resting state EEG recordings, we found new characteristics of microstate sequences such as periodicities related to EEG frequency bands. The algorithms used are here provided as an open-source package and their use is explained in a tutorial style. The package is self-contained and the programming style is procedural, focusing on code intelligibility and easy portability. Using a sample EEG file, we demonstrate how to perform EEG microstate segmentation using the modified K-means approach, and how to compute and visualize the recently introduced information-theoretical tests and quantities. The time-lagged mutual information function is derived as a discrete symbolic alternative to the autocorrelation function for metric time series and confidence intervals are computed from Markov chain surrogate data. The software package provides an open-source extension to the existing implementations of the microstate transform and is specifically designed to analyze resting state EEG recordings.
Microbial Culturomics Application for Global Health: Noncontiguous Finished Genome Sequence and Description of Pseudomonas massiliensis Strain CB-1T sp. nov. in Brazil.

Science.gov (United States)

Bardet, Lucie; Cimmino, Teresa; Buffet, Clémence; Michelle, Caroline; Rathored, Jaishriram; Tandina, Fatalmoudou; Lagier, Jean-Christophe; Khelaifia, Saber; Abrahão, Jônatas; Raoult, Didier; Rolain, Jean-Marc

2018-02-01

Culturomics is a new postgenomics field that explores the microbial diversity of the human gut coupled with taxono-genomic strategy. Culturomics, and the microbiome science more generally, are anticipated to transform global health diagnostics and inform the ways in which gut microbial diversity contributes to human health and disease, and by extension, to personalized medicine. Using culturomics, we report in this study the description of strain CB1 T ( = CSUR P1334 = DSM 29075), a new species isolated from a stool specimen from a 37-year-old Brazilian woman. This description includes phenotypic characteristics and complete genome sequence and annotation. Strain CB1 T is a gram-negative aerobic and motile bacillus, exhibits neither catalase nor oxidase activities, and presents a 98.3% 16S rRNA sequence similarity with Pseudomonas putida. The 4,723,534 bp long genome contains 4239 protein-coding genes and 74 RNA genes, including 15 rRNA genes (5 16S rRNA, 4 23S rRNA, and 6 5S rRNA) and 59 tRNA genes. Strain CB1 T was named Pseudomonas massiliensis sp. nov. and classified into the family Pseudomonadaceae. This study demonstrates the usefulness of microbial culturomics in exploration of human microbiota in diverse geographies and offers new promise for incorporating new omics technologies for innovation in diagnostic medicine and global health.
Evolution of biological sequences implies an extreme value distribution of type I for both global and local pairwise alignment scores

Directory of Open Access Journals (Sweden)

Maréchal Eric

2008-08-01

Full Text Available Abstract Background Confidence in pairwise alignments of biological sequences, obtained by various methods such as Blast or Smith-Waterman, is critical for automatic analyses of genomic data. Two statistical models have been proposed. In the asymptotic limit of long sequences, the Karlin-Altschul model is based on the computation of a P-value, assuming that the number of high scoring matching regions above a threshold is Poisson distributed. Alternatively, the Lipman-Pearson model is based on the computation of a Z-value from a random score distribution obtained by a Monte-Carlo simulation. Z-values allow the deduction of an upper bound of the P-value (1/Z-value2 following the TULIP theorem. Simulations of Z-value distribution is known to fit with a Gumbel law. This remarkable property was not demonstrated and had no obvious biological support. Results We built a model of evolution of sequences based on aging, as meant in Reliability Theory, using the fact that the amount of information shared between an initial sequence and the sequences in its lineage (i.e., mutual information in Information Theory is a decreasing function of time. This quantity is simply measured by a sequence alignment score. In systems aging, the failure rate is related to the systems longevity. The system can be a machine with structured components, or a living entity or population. "Reliability" refers to the ability to operate properly according to a standard. Here, the "reliability" of a sequence refers to the ability to conserve a sufficient functional level at the folded and maturated protein level (positive selection pressure. Homologous sequences were considered as systems 1 having a high redundancy of information reflected by the magnitude of their alignment scores, 2 which components are the amino acids that can independently be damaged by random DNA mutations. From these assumptions, we deduced that information shared at each amino acid position evolved with a
Running a network on a shoestring: the Global Invasive Species Information Network

Science.gov (United States)

Jarnevich, Catherine S.; Simpson, Annie; Graham, James J; Newman, Gregory J.; Bargeron, Chuck T.

2015-01-01

The Global Invasive Species Information Network (GISIN) was conceptualized in 2004 to aggregate and disseminate invasive species data in a standardized way. A decade later the GISIN community has implemented a data portal and three of six GISIN data aggregation models in the GISIN data exchange Protocol, including invasive species status information, resource URLs, and occurrence data. The portal is based on a protocol developed by representatives from 15 countries and 27 organizations of the global invasive species information management community. The GISIN has 19 data providers sharing 34,343 species status records, 1,693,073 occurrences, and 15,601 resource URLs. While the GISIN's goal is to be global, much of its data and funding are provided by the United States. Several initiatives use the GISIN as their information backbone, such as the Great Lakes Early Detection Network (GLEDN) and the North American Invasive Species Network (NAISN). Here we share several success stories and organizational challenges that remain.
Generation of global hourly radiation sequences using a Transition Markov matrix for Madrid. Generacion de secuencias horarias de radiacion global utilizando matrices de transicion de Markov, para la localidad de Madrid

Energy Technology Data Exchange (ETDEWEB)

Mora, Ll

1989-11-01

The aim of this work is the generation of sequences of hourly global radiation which have similar statistically characteristics of real sequences for the city of Madrid (Spain). For this generation, a first order Markov model has been proposed. The input parameters of simulation method are the following: The maximum value of hourly radiation and the average monthly value of the transparency normalized index. The maximum value of hourly radiation has been calculated as a function of the solar height by an empirical expression. The transparency normalized index has been defined as the ratio among the measured hourly global radiation to the maximum value for the corresponding solar height. The method is based on the following observations: -The transparency normalized index shows a significant correlation only for two consecutive hours. -The months with the same average transparency normalized indies have similar probability density function. Global solar radiation, time series, simulation, Markov transition matrix, solar energy.

Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering.

Science.gov (United States)

Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor; Essex, M

2015-05-01

To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice.
Global Information Justice: Rights, Responsibilities, and Caring Connections.

Science.gov (United States)

Smith, Martha

2001-01-01

Explains the concept of global information justice and describes it as an ethical ideal, as an organizing principle for a model for analysis, and as a direction for policy making. Discusses the use of new technologies; access to technology; ownership; privacy; security; community; and the Universal Declaration of Human Rights. (Author/LRW)
Progress toward an Integrated Global Greenhouse Gas Information System (IG3IS)

Science.gov (United States)

DeCola, P.; Butler, J. H.; Stanitski, D.; Tarasova, O. A.; Terblanche, D. E.; Duren, R. M.; Gurney, K. R.; Manning, A.; Reimann, S.; Ciais, P.; Arnold, T.; Burston, J.; Rayner, P. J.; Wofsy, S. C.; Hamburg, S.; Zavala-Araiza, D.; Miller, J. B.; Gerbig, C.; Vogel, F. R.; Canadell, J.

2016-12-01

Accurate and precise atmospheric long-term measurements of greenhouse gas (GHG) concentrations have revealed the rapid and unceasing rise of global GHG concentrations due to human socioeconomic activity. Long-term observations also show a resulting rise in global temperatures and evidence of negative impacts on society. In response to this mounting evidence, nations, sub-national governments, private enterprises and individuals are establishing and accelerating efforts to reduce GHG emissions while meeting the needs for global development and increasing energy access. With this motivation, WMO and its partners have called for an Integrated Global Greenhouse Information System (IG3IS). The IG3IS will serve as an international coordinating mechanism to establish and propagate consistent methods and standards to help assess emission-reduction actions. For the IG3IS initiative to succeed the end users must understand, trust, and recognize the value of the information they receive, and act more effectively in response. Over time, the IG3IS framework will be capable of promoting and accepting advancing technical capabilities (e.g., new satellite observations), continually improving the quality of and confidence in such information. By combining accurate atmospheric measurements with enhanced socioeconomic activity data and model analyses we can meet the overarching goals of IG3IS to: Reduce uncertainty of emission inventory reporting, Locate, quantify and prioritize previously unknown emission reduction opportunities, and Provide national and sub-national governments with timely and quantified information to support their assessment of progress towards their mitigation goals. An effective IG3IS will provide on-going, observation-based information on the relative success of GHG management efforts on policy-relevant scales and the response of the global carbon cycle to a warming world. The presentation will cover the principles and objectives of IG3IS, as well as progress
Automatic exchange of information: towards a new global standard of tax transparency

OpenAIRE

Miguel Eduardo Pecho Trigueros

2014-01-01

Tax authorities are increasingly relying on mutual cooperation with their foreign peers to enforce more effectively their internal tax laws. After the banking scandals of 2008 and the subsequent global financial crisis, the Global Forum on Transparency and Exchange of Information for TaxPurposes has proposed the exchange of information upon request as the fiscal transparency standard. However, some measures adopted by the European Union, previous initiatives from the Organization for Economic...
MIPS: a database for protein sequences, homology data and yeast genome information.

Science.gov (United States)

Mewes, H W; Albermann, K; Heumann, K; Liebl, S; Pfeiffer, F

1997-01-01

The MIPS group (Martinsried Institute for Protein Sequences) at the Max-Planck-Institute for Biochemistry, Martinsried near Munich, Germany, collects, processes and distributes protein sequence data within the framework of the tripartite association of the PIR-International Protein Sequence Database (,). MIPS contributes nearly 50% of the data input to the PIR-International Protein Sequence Database. The database is distributed on CD-ROM together with PATCHX, an exhaustive supplement of unique, unverified protein sequences from external sources compiled by MIPS. Through its WWW server (http://www.mips.biochem.mpg.de/ ) MIPS permits internet access to sequence databases, homology data and to yeast genome information. (i) Sequence similarity results from the FASTA program () are stored in the FASTA database for all proteins from PIR-International and PATCHX. The database is dynamically maintained and permits instant access to FASTA results. (ii) Starting with FASTA database queries, proteins have been classified into families and superfamilies (PROT-FAM). (iii) The HPT (hashed position tree) data structure () developed at MIPS is a new approach for rapid sequence and pattern searching. (iv) MIPS provides access to the sequence and annotation of the complete yeast genome (), the functional classification of yeast genes (FunCat) and its graphical display, the 'Genome Browser' (). A CD-ROM based on the JAVA programming language providing dynamic interactive access to the yeast genome and the related protein sequences has been compiled and is available on request. PMID:9016498
Subfamily logos: visualization of sequence deviations at alignment positions with high information content

Directory of Open Access Journals (Sweden)

Beitz Eric

2006-06-01

Full Text Available Abstract Background Recognition of relevant sequence deviations can be valuable for elucidating functional differences between protein subfamilies. Interesting residues at highly conserved positions can then be mutated and experimentally analyzed. However, identification of such sites is tedious because automated approaches are scarce. Results Subfamily logos visualize subfamily-specific sequence deviations. The display is similar to classical sequence logos but extends into the negative range. Positive, upright characters correspond to residues which are characteristic for the subfamily, negative, upside-down characters to residues typical for the remaining sequences. The symbol height is adjusted to the information content of the alignment position. Residues which are conserved throughout do not appear. Conclusion Subfamily logos provide an intuitive display of relevant sequence deviations. The method has proven to be valid using a set of 135 aligned aquaporin sequences in which established subfamily-specific positions were readily identified by the algorithm.
Working memory capacity and Stroop interference: global versus local indices of executive control.

Science.gov (United States)

Meier, Matt E; Kane, Michael J

2013-05-01

Two experiments examined the relations among working memory capacity (WMC), congruency-sequence effects, proportion-congruency effects, and the color-word Stroop effect to test whether congruency-sequence effects might inform theoretical claims regarding WMC's prediction of Stroop interference. In Experiment 1, subjects completed either a high-congruency or low-congruency Stroop task that restricted trial-to-trial repetitions of stimulus dimensions to examine WMC's relation to congruency-sequence effects while minimizing bottom-up, stimulus-driven contributions. Congruency-sequence effects and congruency-proportion effects were significant but did not interact. WMC predicted global Stroop interference under low-congruency conditions but neither local congruency-sequence effects nor global Stroop interference under high-congruency conditions, contrary to previous studies (e.g., Kane & Engle, 2003). A high-congruency Stroop task in Experiment 2 removed the Experiment 1 task constraints, and, here, we obtained the typical, global association between WMC and Stroop interference but still no relation between WMC and congruency-sequence effects. We thus examined the methodological differences between Experiments 1 and 2 to determine whether any of these were locally responsible for the global WMC-related differences. They were not, suggesting that the changes between Experiments 1 and 2 created a general task context that engaged (or disengaged) the executive processes associated with WMC.
Global Carrier Rates of Rare Inherited Disorders Using Population Exome Sequences.

Directory of Open Access Journals (Sweden)

Kohei Fujikura

Full Text Available Exome sequencing has revealed the causative mutations behind numerous rare, inherited disorders, but it is challenging to find reliable epidemiological values for rare disorders. Here, I provide a genetic epidemiology method to identify the causative mutations behind rare, inherited disorders using two population exome sequences (1000 Genomes and NHLBI. I created global maps of carrier rate distribution for 18 recessive disorders in 16 diverse ethnic populations. Out of a total of 161 mutations associated with 18 recessive disorders, I detected 24 mutations in either or both exome studies. The genetic mapping revealed strong international spatial heterogeneities in the carrier patterns of the inherited disorders. I next validated this methodology by statistically evaluating the carrier rate of one well-understood disorder, sickle cell anemia (SCA. The population exome-based epidemiology of SCA [African (allele frequency (AF = 0.0454, N = 2447, Asian (AF = 0, N = 286, European (AF = 0.000214, N = 4677, and Hispanic (AF = 0.0111, N = 362] was not significantly different from that obtained from a clinical prevalence survey. A pair-wise proportion test revealed no significant differences between the two exome projects in terms of AF (46/48 cases; P > 0.05. I conclude that population exome-based carrier rates can form the foundation for a prospectively maintained database of use to clinical geneticists. Similar modeling methods can be applied to many inherited disorders.
The minimum information about a genome sequence (MIGS) specification

DEFF Research Database (Denmark)

Field, D; Garrity, G; Gray, T

2008-01-01

With the quantity of genomic data increasing at an exponential rate, it is imperative that these data be captured electronically, in a standard format. Standardization activities must proceed within the auspices of open-access and international working bodies. To tackle the issues surrounding the...... that will be required to develop improved mechanisms of metadata capture and exchange. As part of its wider goals, the GSC also supports improving the 'transparency' of the information contained in existing genomic databases....... the development of better descriptions of genomic investigations, we have formed the Genomic Standards Consortium (GSC). Here, we introduce the minimum information about a genome sequence (MIGS) specification with the intent of promoting participation in its development and discussing the resources...
MIToS.jl: mutual information tools for protein sequence analysis in the Julia language

DEFF Research Database (Denmark)

Zea, Diego J.; Anfossi, Diego; Nielsen, Morten

2017-01-01

Motivation: MIToS is an environment for mutual information analysis and a framework for protein multiple sequence alignments (MSAs) and protein structures (PDB) management in Julia language. It integrates sequence and structural information through SIFTS, making Pfam MSAs analysis straightforward....... MIToS streamlines the implementation of any measure calculated from residue contingency tables and its optimization and testing in terms of protein contact prediction. As an example, we implemented and tested a BLOSUM62-based pseudo-count strategy in mutual information analysis. Availability...... and Implementation: The software is totally implemented in Julia and supported for Linux, OS X and Windows. It’s freely available on GitHub under MIT license: http://mitos.leloir.org.ar. Contacts:diegozea@gmail.com or cmb@leloir.org.ar Supplementary information: Supplementary data are available at Bioinformatics...
Information content in reflected global navigation satellite system signals

DEFF Research Database (Denmark)

Høeg, Per; Carlstrom, Anders

2011-01-01

The direct signals from satellites in global satellite navigation satellites systems (GNSS) as, GPS, GLONASS and GALILEO, constitute the primary source for positioning, navigation and timing from space. But also the reflected GNSS signals contain an important information content of signal travel...
SoilGrids1km--global soil information based on automated mapping.

Directory of Open Access Journals (Sweden)

Tomislav Hengl

Full Text Available BACKGROUND: Soils are widely recognized as a non-renewable natural resource and as biophysical carbon sinks. As such, there is a growing requirement for global soil information. Although several global soil information systems already exist, these tend to suffer from inconsistencies and limited spatial detail. METHODOLOGY/PRINCIPAL FINDINGS: We present SoilGrids1km--a global 3D soil information system at 1 km resolution--containing spatial predictions for a selection of soil properties (at six standard depths: soil organic carbon (g kg-1, soil pH, sand, silt and clay fractions (%, bulk density (kg m-3, cation-exchange capacity (cmol+/kg, coarse fragments (%, soil organic carbon stock (t ha-1, depth to bedrock (cm, World Reference Base soil groups, and USDA Soil Taxonomy suborders. Our predictions are based on global spatial prediction models which we fitted, per soil variable, using a compilation of major international soil profile databases (ca. 110,000 soil profiles, and a selection of ca. 75 global environmental covariates representing soil forming factors. Results of regression modeling indicate that the most useful covariates for modeling soils at the global scale are climatic and biomass indices (based on MODIS images, lithology, and taxonomic mapping units derived from conventional soil survey (Harmonized World Soil Database. Prediction accuracies assessed using 5-fold cross-validation were between 23-51%. CONCLUSIONS/SIGNIFICANCE: SoilGrids1km provide an initial set of examples of soil spatial data for input into global models at a resolution and consistency not previously available. Some of the main limitations of the current version of SoilGrids1km are: (1 weak relationships between soil properties/classes and explanatory variables due to scale mismatches, (2 difficulty to obtain covariates that capture soil forming factors, (3 low sampling density and spatial clustering of soil profile locations. However, as the SoilGrids system is
Medan City: Informality and the Historical Global City

Science.gov (United States)

Sudarmadji, N.; Tyaghita, B.; Astuti, P. T.; Etleen, D.

2018-05-01

As projected by UN that two-thirds of Indonesia’s population will live in urban areas by 2050, rapid urbanization is happening in Indonesian cities. Initial research on eight Indonesian Cities (which includes Medan, Jatinegara, Bandung, Surakarta, Yogyakarta, Surabaya, Balikpapan, and Manado) by Tunas Nusa Foundation since 2012 shows that urbanization of each city has happened throughout history creating cultural, economic, and environmental networks that are distinct from one city to another. While the networks remain until today and continuously shapes the urban agglomeration pattern, not all parts of the city could undergo subsequent development that confirms the existing pattern, leading to the creation informality. Nor could it make future planning that comprehends the nature of its integrated urban dynamic beyond its current administrative authority. In this paper, we would like to share our study for Medan, North Sumatra as it shows a portrait of a city with a long relationship to a global network since the Maritime trade era. Medan has become home to many ethnic groups which have sailed and migrated as part of a global economic agenda creating a strong economic network between port cities along the Malacca Strait. The city has kept its role in the global economic network until today, to name a few, becoming the frontier for the Indonesia-Malaysia-Thailand Growth Triangle. While we celebrate Medan’s potential to become a global city with major infrastructure development as well as cultural assets as its advantage in the future, we argue that microscale cohesion supported by government policy in agreed planning documents are fundamental for the city to thrive amidst the challenges it is facing. Yet, these cultural assets, as well as micro scale cohesion in Medan City today, are still undermined. Thus, informality in Medan exists as result of ignorance and marginalization of certain socio-cultural groups, abandoning places and identity, as well as the
Towards a Global Greenhouse Gas Information System (GHGIS)

Science.gov (United States)

Duren, Riley; Butler, James; Rotman, Doug; Miller, Charles; Decola, Phil; Sheffner, Edwin; Tucker, Compton; Mitchiner, John; Jonietz, Karl; Dimotakis, Paul

2010-05-01

Over the next few years, an increasing number of entities ranging from international, national, and regional governments, to businesses and private land-owners, are likely to become more involved in efforts to limit atmospheric concentrations of greenhouse gases. In such a world, geospatially resolved information about the location, amount, and rate of greenhouse gas (GHG) emissions will be needed, as well as the stocks and flows of all forms of carbon through terrestrial ecosystems and in the oceans. The ability to implement policies that limit GHG concentrations would be enhanced by a global, open, and transparent greenhouse gas information system (GHGIS). An operational and scientifically robust GHGIS would combine ground-based and space-based observations, carbon-cycle modeling, GHG inventories, meta-analysis, and an extensive data integration and distribution system, to provide information about sources, sinks, and fluxes of greenhouse gases at policy-relevant temporal and spatial scales. The GHGIS effort was initiated in 2008 as a grassroots inter-agency collaboration intended to rigorously identify the needs for such a system, assess the capabilities of current assets, and suggest priorities for future research and development. We will present a status of the GHGIS effort including our latest analysis and ideas for potential near-term pilot projects with potential relevance to European initiatives including the Global Monitoring for Environment and Security (GMES) and the Integrated Carbon Observing System (ICOS).
International earth science information network for global change decision making

Energy Technology Data Exchange (ETDEWEB)

Autrey-Hunley, C.; Kuhn, W.R.; Kasischke, E.; Trichel, M.T.; Coppola, R.

1991-01-01

Effective environmental decision making depends upon the ability to predict physical changes in the environment, societal responses to these changes, and how both the physical changes and societal responses will be affected by changes in government regulations, public perceptions and the environment. Technological advances in remote sensing have provided a wealth of earth science data necessary to study global change problems; the Earth Observatory System will provide an unprecedented data source in the late 1990's. The Consortium for an International Earth Science Information Network (CIESIN) will combine earth science data (both satellite and ground-based) with data on the social sciences (e.g., economics, demographics, public health) to support informed policy decisions and to transfer knowledge on global change and its causes to the public.
Global seismic inversion as the next standard step in the processing sequence

Energy Technology Data Exchange (ETDEWEB)

Maver, Kim G.; Hansen, Lars S.; Jepsen, Anne-Marie; Rasmussen, Klaus B.

1998-12-31

Seismic inversion of post stack seismic data has until recently been regarded as a reservoir oriented method since the standard inversion techniques rely on extensive well control and a detailed user derived input model. Most seismic inversion techniques further requires a stable wavelet. As a consequence seismic inversion is mainly utilised in mature areas focusing of specific zones only after the seismic data has been interpreted and is well understood. By using an advanced 3-D global technique, seismic inversion is presented as the next standard step in the processing sequence. The technique is robust towards noise within the seismic data, utilizes a time variant wavelet, and derives a low frequency model utilizing the stacking velocities and only limited well control. 4 figs.
Location of core diagnostic information across various sequences in brain MRI and implications for efficiency of MRI scanner utilization.

Science.gov (United States)

Sharma, Aseem; Chatterjee, Arindam; Goyal, Manu; Parsons, Matthew S; Bartel, Seth

2015-04-01

Targeting redundancy within MRI can improve its cost-effective utilization. We sought to quantify potential redundancy in our brain MRI protocols. In this retrospective review, we aggregated 207 consecutive adults who underwent brain MRI and reviewed their medical records to document clinical indication, core diagnostic information provided by MRI, and its clinical impact. Contributory imaging abnormalities constituted positive core diagnostic information whereas absence of imaging abnormalities constituted negative core diagnostic information. The senior author selected core sequences deemed sufficient for extraction of core diagnostic information. For validating core sequences selection, four readers assessed the relative ease of extracting core diagnostic information from the core sequences. Potential redundancy was calculated by comparing the average number of core sequences to the average number of sequences obtained. Scanning had been performed using 9.4±2.8 sequences over 37.3±12.3 minutes. Core diagnostic information was deemed extractable from 2.1±1.1 core sequences, with an assumed scanning time of 8.6±4.8 minutes, reflecting a potential redundancy of 74.5%±19.1%. Potential redundancy was least in scans obtained for treatment planning (14.9%±25.7%) and highest in scans obtained for follow-up of benign diseases (81.4%±12.6%). In 97.4% of cases, all four readers considered core diagnostic information to be either easily extractable from core sequences or the ease to be equivalent to that from the entire study. With only one MRI lacking clinical impact (0.48%), overutilization did not seem to contribute to potential redundancy. High potential redundancy that can be targeted for more efficient scanner utilization exists in brain MRI protocols.
The existence and global attractivity of almost periodic sequence solution of discrete-time neural networks

International Nuclear Information System (INIS)

Huang Zhenkun; Wang Xinghua; Gao Feng

2006-01-01

In this Letter, we discuss discrete-time analogue of a continuous-time cellular neural network. Sufficient conditions are obtained for the existence of a unique almost periodic sequence solution which is globally attractive. Our results demonstrate dynamics of the formulated discrete-time analogue as mathematical models for the continuous-time cellular neural network in almost periodic case. Finally, a computer simulation illustrates the suitability of our discrete-time analogue as numerical algorithms in simulating the continuous-time cellular neural network conveniently
The Global Heat Health Information Network (GHHIN): Putting the Pieces Together

Science.gov (United States)

Jones, H.; Shumake, J.; Trtanj, J.

2017-12-01

Human exposure to extreme heat is one of the principal and most manageable impacts of climate on human health. Yet, every year worldwide, tens of thousands of people die as a result of avoidable heat-induced health consequences and countless others experience reduced labor productivity, physiological stress and ill health. The IPCC predicts with high confidence, that the observed trend of longer lasting, more frequent, more intense, and earlier onset heat waves will continue into the future. This situation requires the global health community to aggressively confront this recognized risk. Many countries and cities worldwide have developed heat action plans or heat health early warning systems, but these efforts are only connected in an ad-hoc fashion, use a broad range of non-standardized tools, methods, and approaches, and lack a clear mechanism to learn from each other in order to more rapidly advance health protection. To address this gap and accelerate heat health protection, the Global Heat Health Information Network (GHHIN) was launched in June 2016, by the WMO/WHO joint office for Climate and Health and the NOAA Climate Program Office. GHHIN is envisioned to be an independent, voluntary, member driven forum of scientists, professionals, and policymakers focused on enhancing and multiplying the global and local learning and resilience-building for heat health that is already occurring. GHHIN seeks to serve as a catalyst, knowledge broker, disseminator of good practices, and a forum for facilitating exchange and identifying needs. GHHIN will promote evidence-driven interventions, shared-learning, co-production of information, synthesis of priorities and capacity building to empower actors to take more effective and informed life-saving preparedness and planning measures. GHHIN is working toward several activities in 2018. The first Global Heat Health Synthesis report will be published to synthesize the state of science and practice to monitor, predict, and
A fingerprint classification algorithm based on combination of local and global information

Science.gov (United States)

Liu, Chongjin; Fu, Xiang; Bian, Junjie; Feng, Jufu

2011-12-01

Fingerprint recognition is one of the most important technologies in biometric identification and has been wildly applied in commercial and forensic areas. Fingerprint classification, as the fundamental procedure in fingerprint recognition, can sharply decrease the quantity for fingerprint matching and improve the efficiency of fingerprint recognition. Most fingerprint classification algorithms are based on the number and position of singular points. Because the singular points detecting method only considers the local information commonly, the classification algorithms are sensitive to noise. In this paper, we propose a novel fingerprint classification algorithm combining the local and global information of fingerprint. Firstly we use local information to detect singular points and measure their quality considering orientation structure and image texture in adjacent areas. Furthermore the global orientation model is adopted to measure the reliability of singular points group. Finally the local quality and global reliability is weighted to classify fingerprint. Experiments demonstrate the accuracy and effectivity of our algorithm especially for the poor quality fingerprint images.

Ultra-deep sequencing reveals high prevalence and broad structural diversity of hepatitis B surface antigen mutations in a global population.

Science.gov (United States)

Gencay, Mikael; Hübner, Kirsten; Gohl, Peter; Seffner, Anja; Weizenegger, Michael; Neofytos, Dionysios; Batrla, Richard; Woeste, Andreas; Kim, Hyon-Suk; Westergaard, Gaston; Reinsch, Christine; Brill, Eva; Thu Thuy, Pham Thi; Hoang, Bui Huu; Sonderup, Mark; Spearman, C Wendy; Pabinger, Stephan; Gautier, Jérémie; Brancaccio, Giuseppina; Fasano, Massimo; Santantonio, Teresa; Gaeta, Giovanni B; Nauck, Markus; Kaminski, Wolfgang E

2017-01-01

The diversity of the hepatitis B surface antigen (HBsAg) has a significant impact on the performance of diagnostic screening tests and the clinical outcome of hepatitis B infection. Neutralizing or diagnostic antibodies against the HBsAg are directed towards its highly conserved major hydrophilic region (MHR), in particular towards its "a" determinant subdomain. Here, we explored, on a global scale, the genetic diversity of the HBsAg MHR in a large, multi-ethnic cohort of randomly selected subjects with HBV infection from four continents. A total of 1553 HBsAg positive blood samples of subjects originating from 20 different countries across Africa, America, Asia and central Europe were characterized for amino acid variation in the MHR. Using highly sensitive ultra-deep sequencing, we found 72.8% of the successfully sequenced subjects (n = 1391) demonstrated amino acid sequence variation in the HBsAg MHR. This indicates that the global variation frequency in the HBsAg MHR is threefold higher than previously reported. The majority of the amino acid mutations were found in the HBV genotypes B (28.9%) and C (25.4%). Collectively, we identified 345 distinct amino acid mutations in the MHR. Among these, we report 62 previously unknown mutations, which extends the worldwide pool of currently known HBsAg MHR mutations by 22%. Importantly, topological analysis identified the "a" determinant upstream flanking region as the structurally most diverse subdomain of the HBsAg MHR. The highest prevalence of "a" determinant region mutations was observed in subjects from Asia, followed by the African, American and European cohorts, respectively. Finally, we found that more than half (59.3%) of all HBV subjects investigated carried multiple MHR mutations. Together, this worldwide ultra-deep sequencing based genotyping study reveals that the global prevalence and structural complexity of variation in the hepatitis B surface antigen have, to date, been significantly underappreciated.
GLOBIL: WWF's Global Observation and Biodiversity Information Portal

Science.gov (United States)

Shapiro, A. C.; Nijsten, L.; Schmitt, S.; Tibaldeschi, P.

2015-04-01

Despite ever increasing availability of satellite imagery and spatial data, conservation managers, decision makers and planners are often unable to analyze data without special knowledge or software. WWF is bridging this gap by putting extensive spatial data into an easy to use online mapping environment, to allow visualization, manipulation and analysis of large data sets by any user. Consistent, reliable and repeatable ecosystem monitoring information for priority eco-regions is needed to increase transparency in WWF's global conservation work, to measure conservation impact, and to provide communications with the general public and organization members. Currently, much of this monitoring and evaluation data is isolated, incompatible, or inaccessible and not readily usable or available for those without specialized software or knowledge. Launched in 2013 by WWF Netherlands and WWF Germany, the Global Observation and Biodiversity Information Portal (GLOBIL) is WWF's new platform to unite, centralize, standardize and visualize geo-spatial data and information from more than 150 active GIS users worldwide via cloud-based ArcGIS Online. GLOBIL is increasing transparency, providing baseline data for monitoring and evaluation while communicating impacts and conservation successes to the public. GLOBIL is currently being used in the worldwide marine campaign as an advocacy tool for establishing more marine protected areas, and a monitoring interface to track the progress towards ocean protection goals. In the Kavango-Zambezi (KAZA) Transfrontier Conservation area, local partners are using the platform to monitor land cover changes, barriers to species migrations, potential human-wildlife conflict and local conservation impacts in vast wildlife corridor. In East Africa, an early warning system is providing conservation practitioners with real-time alerts of threats particularly to protected areas and World Heritage Sites by industrial extractive activities. And for
The minimum information about a genome sequence (MIGS) specification

Science.gov (United States)

Field, Dawn; Garrity, George; Gray, Tanya; Morrison, Norman; Selengut, Jeremy; Sterk, Peter; Tatusova, Tatiana; Thomson, Nicholas; Allen, Michael J; Angiuoli, Samuel V; Ashburner, Michael; Axelrod, Nelson; Baldauf, Sandra; Ballard, Stuart; Boore, Jeffrey; Cochrane, Guy; Cole, James; Dawyndt, Peter; De Vos, Paul; dePamphilis, Claude; Edwards, Robert; Faruque, Nadeem; Feldman, Robert; Gilbert, Jack; Gilna, Paul; Glöckner, Frank Oliver; Goldstein, Philip; Guralnick, Robert; Haft, Dan; Hancock, David; Hermjakob, Henning; Hertz-Fowler, Christiane; Hugenholtz, Phil; Joint, Ian; Kagan, Leonid; Kane, Matthew; Kennedy, Jessie; Kowalchuk, George; Kottmann, Renzo; Kolker, Eugene; Kravitz, Saul; Kyrpides, Nikos; Leebens-Mack, Jim; Lewis, Suzanna E; Li, Kelvin; Lister, Allyson L; Lord, Phillip; Maltsev, Natalia; Markowitz, Victor; Martiny, Jennifer; Methe, Barbara; Mizrachi, Ilene; Moxon, Richard; Nelson, Karen; Parkhill, Julian; Proctor, Lita; White, Owen; Sansone, Susanna-Assunta; Spiers, Andrew; Stevens, Robert; Swift, Paul; Taylor, Chris; Tateno, Yoshio; Tett, Adrian; Turner, Sarah; Ussery, David; Vaughan, Bob; Ward, Naomi; Whetzel, Trish; Gil, Ingio San; Wilson, Gareth; Wipat, Anil

2008-01-01

With the quantity of genomic data increasing at an exponential rate, it is imperative that these data be captured electronically, in a standard format. Standardization activities must proceed within the auspices of open-access and international working bodies. To tackle the issues surrounding the development of better descriptions of genomic investigations, we have formed the Genomic Standards Consortium (GSC). Here, we introduce the minimum information about a genome sequence (MIGS) specification with the intent of promoting participation in its development and discussing the resources that will be required to develop improved mechanisms of metadata capture and exchange. As part of its wider goals, the GSC also supports improving the ‘transparency’ of the information contained in existing genomic databases. PMID:18464787
Event Sequence Analysis of the Air Intelligence Agency Information Operations Center Flight Operations

National Research Council Canada - National Science Library

Larsen, Glen

1998-01-01

This report applies Event Sequence Analysis, methodology adapted from aircraft mishap investigation, to an investigation of the performance of the Air Intelligence Agency's Information Operations Center (IOC...
The Evolution of Global Politics

Directory of Open Access Journals (Sweden)

George Moldeski

1995-08-01

Full Text Available The rise and decline of world powers has attracted much scholarly attention in recent years. The theory of long cycles answers parsimoniously the question: why, in the past half millenium, have Portugal, the Dutch Republic, Britain (twice, and the United States risen to global leadership while others have failed to do so? This accounts for the success, or failure, of individual states, but to explain the entire sequence we need to employ an evolutionary paradigm that proposes that each of these long cycles is one mechanism in a spectrum of global evolutionary processes. The leadership succession is an intermediate stage in the evolution og global politics, whose next likely major phase, reaching a high point later in the 21st century, will be the gradual absorption of the informal role of global leadership, when embedded in a democratic community, into a network of more formal positions within an emerging global organization of a federalist character. The conditions of that process can now be specified.
An accurate and rapid continuous wavelet dynamic time warping algorithm for unbalanced global mapping in nanopore sequencing

KAUST Repository

Han, Renmin

2017-12-24

Long-reads, point-of-care, and PCR-free are the promises brought by nanopore sequencing. Among various steps in nanopore data analysis, the global mapping between the raw electrical current signal sequence and the expected signal sequence from the pore model serves as the key building block to base calling, reads mapping, variant identification, and methylation detection. However, the ultra-long reads of nanopore sequencing and an order of magnitude difference in the sampling speeds of the two sequences make the classical dynamic time warping (DTW) and its variants infeasible to solve the problem. Here, we propose a novel multi-level DTW algorithm, cwDTW, based on continuous wavelet transforms with different scales of the two signal sequences. Our algorithm starts from low-resolution wavelet transforms of the two sequences, such that the transformed sequences are short and have similar sampling rates. Then the peaks and nadirs of the transformed sequences are extracted to form feature sequences with similar lengths, which can be easily mapped by the original DTW. Our algorithm then recursively projects the warping path from a lower-resolution level to a higher-resolution one by building a context-dependent boundary and enabling a constrained search for the warping path in the latter. Comprehensive experiments on two real nanopore datasets on human and on Pandoraea pnomenusa, as well as two benchmark datasets from previous studies, demonstrate the efficiency and effectiveness of the proposed algorithm. In particular, cwDTW can almost always generate warping paths that are very close to the original DTW, which are remarkably more accurate than the state-of-the-art methods including FastDTW and PrunedDTW. Meanwhile, on the real nanopore datasets, cwDTW is about 440 times faster than FastDTW and 3000 times faster than the original DTW. Our program is available at https://github.com/realbigws/cwDTW.
Towards rationally redesigning bacterial signaling systems using information encoded in abundant sequence data

Science.gov (United States)

Cheng, Ryan; Morcos, Faruck; Levine, Herbert; Onuchic, Jose

2014-03-01

An important challenge in biology is to distinguish the subset of residues that allow bacterial two-component signaling (TCS) proteins to preferentially interact with their correct TCS partner such that they can bind and transfer signal. Detailed knowledge of this information would allow one to search sequence-space for mutations that can systematically tune the signal transmission between TCS partners as well as re-encode a TCS protein to preferentially transfer signals to a non-partner. Motivated by the notion that this detailed information is found in sequence data, we explore the mutual sequence co-evolution between signaling partners to infer how mutations can positively or negatively alter their interaction. Using Direct Coupling Analysis (DCA) for determining evolutionarily conserved interprotein interactions, we apply a DCA-based metric to quantify mutational changes in the interaction between TCS proteins and demonstrate that it accurately correlates with experimental mutagenesis studies probing the mutational change in the in vitro phosphotransfer. Our methodology serves as a potential framework for the rational design of TCS systems as well as a framework for the system-level study of protein-protein interactions in sequence-rich systems. This research has been supported by the NSF INSPIRE award MCB-1241332 and by the CTBP sponsored by the NSF (Grant PHY-1308264).
Unique features of a global human ectoparasite identified through sequencing of the bed bug genome.

Science.gov (United States)

Benoit, Joshua B; Adelman, Zach N; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C; Szuter, Elise M; Hagan, Richard W; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M; Nelson, David R; Rosendale, Andrew J; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R; Ioannidis, Panagiotis; Waterhouse, Robert M; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J Spencer; Gondhalekar, Ameya D; Scharf, Michael E; Peterson, Brittany F; Raje, Kapil R; Hottel, Benjamin A; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S T; Duncan, Elizabeth J; Murali, Shwetha C; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C; Muzny, Donna M; Wheeler, David; Panfilio, Kristen A; Vargas Jentzsch, Iris M; Vargo, Edward L; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T; Anderson, Michelle A E; Jones, Jeffery W; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D; Attardo, Geoffrey M; Robertson, Hugh M; Zdobnov, Evgeny M; Ribeiro, Jose M C; Gibbs, Richard A; Werren, John H; Palli, Subba R; Schal, Coby; Richards, Stephen

2016-02-02

The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host-symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human-bed bug and symbiont-bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite.
Unique features of a global human ectoparasite identified through sequencing of the bed bug genome

Science.gov (United States)

Benoit, Joshua B.; Adelman, Zach N.; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C.; Szuter, Elise M.; Hagan, Richard W.; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M.; Nelson, David R.; Rosendale, Andrew J.; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M.; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R.; Ioannidis, Panagiotis; Waterhouse, Robert M.; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J. Spencer; Gondhalekar, Ameya D.; Scharf, Michael E.; Peterson, Brittany F.; Raje, Kapil R.; Hottel, Benjamin A.; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S. T.; Duncan, Elizabeth J.; Murali, Shwetha C.; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L.; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C.; Muzny, Donna M.; Wheeler, David; Panfilio, Kristen A.; Vargas Jentzsch, Iris M.; Vargo, Edward L.; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T.; Anderson, Michelle A. E.; Jones, Jeffery W.; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D.; Attardo, Geoffrey M.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Ribeiro, Jose M. C.; Gibbs, Richard A.; Werren, John H.; Palli, Subba R.; Schal, Coby; Richards, Stephen

2016-01-01

The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host–symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human–bed bug and symbiont–bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite. PMID:26836814
Quantitative assessment of drivers of recent global temperature variability: an information theoretic approach

Science.gov (United States)

Bhaskar, Ankush; Ramesh, Durbha Sai; Vichare, Geeta; Koganti, Triven; Gurubaran, S.

2017-12-01

Identification and quantification of possible drivers of recent global temperature variability remains a challenging task. This important issue is addressed adopting a non-parametric information theory technique, the Transfer Entropy and its normalized variant. It distinctly quantifies actual information exchanged along with the directional flow of information between any two variables with no bearing on their common history or inputs, unlike correlation, mutual information etc. Measurements of greenhouse gases: CO2, CH4 and N2O; volcanic aerosols; solar activity: UV radiation, total solar irradiance ( TSI) and cosmic ray flux ( CR); El Niño Southern Oscillation ( ENSO) and Global Mean Temperature Anomaly ( GMTA) made during 1984-2005 are utilized to distinguish driving and responding signals of global temperature variability. Estimates of their relative contributions reveal that CO2 ({˜ } 24 %), CH4 ({˜ } 19 %) and volcanic aerosols ({˜ }23 %) are the primary contributors to the observed variations in GMTA. While, UV ({˜ } 9 %) and ENSO ({˜ } 12 %) act as secondary drivers of variations in the GMTA, the remaining play a marginal role in the observed recent global temperature variability. Interestingly, ENSO and GMTA mutually drive each other at varied time lags. This study assists future modelling efforts in climate science.
Gathering and using information on a global scale

Science.gov (United States)

Mathews, C. W.

1977-01-01

The importance of information gathered, integrated and analyzed over broad regions of the world is discussed. Means of acquiring information on critical areas are outlined, and the particular role that remote sensing can play is described in each case. The possible implementation of a global information system and some of the current difficulties in initiation of such a system on an operational basis are explored. In this way, issues will be surfaced for consideration. Topics include: the importance of innovative leadership, and some actions that the government might take, both in Congress and in the Executive Branch; the relationship of U.S. government activities to international interests and to industry; and the need to stimulate more private sector initiative and to transfer responsibilities from government to commercial interests.
Progress toward an Integrated Global GHG Information System (IG3IS)

Science.gov (United States)

DeCola, Philip

2016-04-01

Accurate and precise atmospheric measurements of greenhouse gas (GHG) concentrations have shown the inexorable rise of global GHG concentrations due to human socioeconomic activity. Scientific observations also show a resulting rise in global temperatures and evidence of negative impacts on society. In response to this amassing evidence, nations, states, cities and private enterprises are accelerating efforts to reduce emissions of GHGs, and the UNFCCC process recently forged the Paris Agreement. Emission reduction strategies will vary by nation, region, and economic sector (e.g., INDCs), but regardless of the strategies and mechanisms applied, the ability to implement policies and manage them effectively over time will require consistent, reliable and timely information. A number of studies [e.g., Verifying Greenhouse Gas Emissions: Methods to Support International Climate Agreements (2010); GEO Carbon Strategy (2010); IPCC Task Force on National GHG Inventories: Expert Meeting Report on Uncertainty and Validation of Emission Inventories (2010)] have reported on the state of carbon cycle research, observations and models and the ability of these atmospheric observations and models to independently validate and improve the accuracy of self-reported emission inventories based on fossil fuel usage and land use activities. These studies concluded that by enhancing our in situ and remote-sensing observations and atmospheric data assimilation modeling capabilities, a GHG information system could be achieved in the coming decade to serve the needs of policies and actions to reduce GHG emissions. Atmospheric measurements and models are already being used to provide emissions information on a global and continental scale through existing networks, but these efforts currently provide insufficient information at the human-dimensions where nations, states, cities, and private enterprises can take valuable, and additional action that can reduce emissions for a specific GHG
On the Concept of Cis-regulatory Information: From Sequence Motifs to Logic Functions

Science.gov (United States)

Tarpine, Ryan; Istrail, Sorin

The regulatory genome is about the “system level organization of the core genomic regulatory apparatus, and how this is the locus of causality underlying the twin phenomena of animal development and animal evolution” (E.H. Davidson. The Regulatory Genome: Gene Regulatory Networks in Development and Evolution, Academic Press, 2006). Information processing in the regulatory genome is done through regulatory states, defined as sets of transcription factors (sequence-specific DNA binding proteins which determine gene expression) that are expressed and active at the same time. The core information processing machinery consists of modular DNA sequence elements, called cis-modules, that interact with transcription factors. The cis-modules “read” the information contained in the regulatory state of the cell through transcription factor binding, “process” it, and directly or indirectly communicate with the basal transcription apparatus to determine gene expression. This endowment of each gene with the information-receiving capacity through their cis-regulatory modules is essential for the response to every possible regulatory state to which it might be exposed during all phases of the life cycle and in all cell types. We present here a set of challenges addressed by our CYRENE research project aimed at studying the cis-regulatory code of the regulatory genome. The CYRENE Project is devoted to (1) the construction of a database, the cis-Lexicon, containing comprehensive information across species about experimentally validated cis-regulatory modules; and (2) the software development of a next-generation genome browser, the cis-Browser, specialized for the regulatory genome. The presentation is anchored on three main computational challenges: the Gene Naming Problem, the Consensus Sequence Bottleneck Problem, and the Logic Function Inference Problem.
The earth knowledge base and the global information society

Directory of Open Access Journals (Sweden)

A Martynenko

2006-01-01

Full Text Available Today many countries have applied the strategy of developing an information-oriented society and data infrastructure. Although varying it their details and means of realization, all these policies have the same aim - to build a global information society. Here in Russia this crucial role belongs to the Electronic (Digital Earth initiative, which integrates geoinformation technologies in the Earth Knowledge Base (EKB. It i designed to promote the economic, social and scientific progress. An analysis of the problem has been done in the article.
Global Perspectives on Activated Sludge Community Composition analyzed using 16S rRNA amplicon sequencing

DEFF Research Database (Denmark)

Nierychlo, Marta; Saunders, Aaron Marc; Albertsen, Mads

communities, and in this study activated sludge sampled from 32 Wastewater Treatment Plants (WWTPs) around the world was described and compared. The top abundant bacteria in the global activated sludge ecosystem were found and the core population shared by multiple samples was investigated. The results......Activated sludge is the most commonly applied bioprocess throughout the world for wastewater treatment. Microorganisms are key to the process, yet our knowledge of their identity and function is still limited. High-througput16S rRNA amplicon sequencing can reliably characterize microbial...
Chronology of Eocene-Miocene sequences on the New Jersey shallow shelf: implications for regional, interregional, and global correlations

Science.gov (United States)

Browning, James V.; Miller, Kenneth G.; Sugarman, Peter J.; Barron, John; McCarthy, Francine M.G.; Kulhanek, Denise K.; Katz, Miriam E.; Feigenson, Mark D.

2013-01-01

Integrated Ocean Drilling Program Expedition 313 continuously cored and logged latest Eocene to early-middle Miocene sequences at three sites (M27, M28, and M29) on the inner-middle continental shelf offshore New Jersey, providing an opportunity to evaluate the ages, global correlations, and significance of sequence boundaries. We provide a chronology for these sequences using integrated strontium isotopic stratigraphy and biostratigraphy (primarily calcareous nannoplankton, diatoms, and dinocysts [dinoflagellate cysts]). Despite challenges posed by shallow-water sediments, age resolution is typically ±0.5 m.y. and in many sequences is as good as ±0.25 m.y. Three Oligocene sequences were sampled at Site M27 on sequence bottomsets. Fifteen early to early-middle Miocene sequences were dated at Sites M27, M28, and M29 across clinothems in topsets, foresets (where the sequences are thickest), and bottomsets. A few sequences have coarse (∼1 m.y.) or little age constraint due to barren zones; we constrain the age estimates of these less well dated sequences by applying the principle of superposition, i.e., sediments above sequence boundaries in any site are younger than the sediments below the sequence boundaries at other sites. Our age control provides constraints on the timing of deposition in the clinothem; sequences on the topsets are generally the youngest in the clinothem, whereas the bottomsets generally are the oldest. The greatest amount of time is represented on foresets, although we have no evidence for a correlative conformity. Our chronology provides a baseline for regional and interregional correlations and sea-level reconstructions: (1) we correlate a major increase in sedimentation rate precisely with the timing of the middle Miocene climate changes associated with the development of a permanent East Antarctic Ice Sheet; and (2) the timing of sequence boundaries matches the deep-sea oxygen isotopic record, implicating glacioeustasy as a major driver
RStrucFam: a web server to associate structure and cognate RNA for RNA-binding proteins from sequence information.

Science.gov (United States)

Ghosh, Pritha; Mathew, Oommen K; Sowdhamini, Ramanathan

2016-10-07

RNA-binding proteins (RBPs) interact with their cognate RNA(s) to form large biomolecular assemblies. They are versatile in their functionality and are involved in a myriad of processes inside the cell. RBPs with similar structural features and common biological functions are grouped together into families and superfamilies. It will be useful to obtain an early understanding and association of RNA-binding property of sequences of gene products. Here, we report a web server, RStrucFam, to predict the structure, type of cognate RNA(s) and function(s) of proteins, where possible, from mere sequence information. The web server employs Hidden Markov Model scan (hmmscan) to enable association to a back-end database of structural and sequence families. The database (HMMRBP) comprises of 437 HMMs of RBP families of known structure that have been generated using structure-based sequence alignments and 746 sequence-centric RBP family HMMs. The input protein sequence is associated with structural or sequence domain families, if structure or sequence signatures exist. In case of association of the protein with a family of known structures, output features like, multiple structure-based sequence alignment (MSSA) of the query with all others members of that family is provided. Further, cognate RNA partner(s) for that protein, Gene Ontology (GO) annotations, if any and a homology model of the protein can be obtained. The users can also browse through the database for details pertaining to each family, protein or RNA and their related information based on keyword search or RNA motif search. RStrucFam is a web server that exploits structurally conserved features of RBPs, derived from known family members and imprinted in mathematical profiles, to predict putative RBPs from sequence information. Proteins that fail to associate with such structure-centric families are further queried against the sequence-centric RBP family HMMs in the HMMRBP database. Further, all other essential
Influence of globalization on creation of the information society in Russia

Directory of Open Access Journals (Sweden)

Т А Полякова

2008-03-01

Full Text Available Legal problems of influence of globalization on creation of the information society in Russia, and also the basic directions of development of the international cooperation in the given area to use information and telecommunicational technologies for development of some new forms and methods of training and for improving quality of education.
Splice site prediction in Arabidopsis thaliana pre-mRNA by combining local and global sequence information

DEFF Research Database (Denmark)

Hebsgaard, Stefan M.; Korning, Peter G.; Tolstrup, Niels

1996-01-01

Artificial neural networks have been combined with a rule based system to predict intron splice sites in the dicot plant Arabidopsis thaliana. A two step prediction scheme, where a global prediction of the coding potential regulates a cutoff level for a local predicition of splice sites, is refin...
Global Information Resources on Rice for Research and Development

Directory of Open Access Journals (Sweden)

Shri RAM

2012-12-01

Full Text Available Various issues concerning the progress of rice research are related to ambiguous germplasm identification, difficulty in tracing pedigree information, and lack of integration between genetic resources, characterization, breeding, evaluation and utilization data. These issues are the constraints in developing knowledge-intensive crop improvement programs. The rapid growth, development and the global spread of modern information and communication technology allow quick adoption in fundamental research. Thus, there is a need to provide an opportunity for the establishment of services which describe the rice information for better accessibility to information resources used by researchers to enhance the competitiveness. This work reviews some of available resources on rice bioinformatics and their roles in elucidating and propagating biological and genomic information in rice research. These reviews will also enable stakeholders to understand and adopt the change in research and development and share knowledge with the global community of agricultural scientists. The establishment like International Rice Information System, Rice Genome Research Project and Integrated Rice Genome Explorer are major initiatives for the improvement of rice. Creation of databases for comparative studies of rice and other cereals are major steps in further improvement of genetic compositions. This paper will also highlight some of the initiatives and organizations working in the field of rice improvement and explore the availability of the various web resources for the purpose of research and development of rice. We are developing a meta web server for integration of online resources such as databases, web servers and journals in the area of bioinformatics. This integrated platform, with acronym iBIRA, is available online at ibiranet.in. The resources reviewed here are the excerpts from the resources integrated in iBIRA.

Information Processing and Firm-Internal Environment Contingencies: Performance Impact on Global New Product Development

DEFF Research Database (Denmark)

Kleinschmidt, Elko; de Brentani, Ulrike; Salomo, Søren

2010-01-01

, functionally, geographically and culturally. To this end, an IT-communication strength is essential, one that is nested in an internal organizational environment that ensures its effective functioning. Using organizational information processing (OIP) theory as a framework, superior global NPD program......Innovation in its essence is an information processing activity. Thus, a major factor impacting the success of new product development (NPD) programs, especially those responding to global markets, is the firm's ability to access, share and apply NPD information, which is often widely dispersed...
Alpha-gamma phase amplitude coupling subserves information transfer during perceptual sequence learning.

Science.gov (United States)

Tzvi, Elinor; Bauhaus, Leon J; Kessler, Till U; Liebrand, Matthias; Wöstmann, Malte; Krämer, Ulrike M

2018-03-01

Cross-frequency coupling is suggested to serve transfer of information between wide-spread neuronal assemblies and has been shown to underlie many cognitive functions including learning and memory. In previous work, we found that alpha (8-13 Hz) - gamma (30-48 Hz) phase amplitude coupling (αγPAC) is decreased during sequence learning in bilateral frontal cortex and right parietal cortex. We interpreted this to reflect decreased demands for visuo-motor mapping once the sequence has been encoded. In the present study, we put this hypothesis to the test by adding a "simple" condition to the standard serial reaction time task (SRTT) with minimal needs for visuo-motor mapping. The standard SRTT in our paradigm entailed a perceptual sequence allowing for implicit learning of a sequence of colors with randomly assigned motor responses. Sequence learning in this case was thus not associated with reduced demands for visuo-motor mapping. Analysis of oscillatory power revealed a learning-related alpha decrease pointing to a stronger recruitment of occipito-parietal areas when encoding the perceptual sequence. Replicating our previous findings but in contrast to our hypothesis, αγPAC was decreased in sequence compared to random trials over right frontal and parietal cortex. It also tended to be smaller compared to trials requiring a simple motor sequence. We additionally analyzed αγPAC in resting-state data of a separate cohort. PAC in electrodes over right parietal cortex was significantly stronger compared to sequence trials and tended to be higher compared to simple and random trials of the SRTT data. We suggest that αγPAC in right parietal cortex reflects a "default-mode" brain state, which gets perturbed to allow for encoding of visual regularities into memory. Copyright © 2018 Elsevier Inc. All rights reserved.
Constraint Satisfaction Inference : Non-probabilistic Global Inference for Sequence Labelling

NARCIS (Netherlands)

Canisius, S.V.M.; van den Bosch, A.; Daelemans, W.; Basili, R.; Moschitti, A.

2006-01-01

We present a new method for performing sequence labelling based on the idea of using a machine-learning classifier to generate several possible output sequences, and then applying an inference procedure to select the best sequence among those. Most sequence labelling methods following a similar
Application of the accident management information needs methodology to a severe accident sequence

International Nuclear Information System (INIS)

Ward, L.W.; Hanson, D.J.; Nelson, W.R.; Solberg, D.E.

1989-01-01

The U.S. Nuclear Regulatory Commission (NRC) is conducting an Accident Management Research Program that emphasizes the application of severe accident research results to enhance the capability of plant operating personnel to effectively manage severe accidents. A methodology to identify and assess the information needs of the operating staff of a nuclear power plant during a severe accident has been developed as part of the research program designed to resolve this issue. The methodology identifies the information needs of the plant personnel during a wide range of accident conditions, the existing plant measurements capable of supplying these information needs and what, if any minor additions to instrument and display systems would enhance the capability to manage accidents, known limitations on the capability of these measurements to function properly under the conditions that will be present during a wide range of severe accidents, and areas in which the information systems could mislead plant personnel. This paper presents an application of this methodology to a severe accident sequence to demonstrate its use in identifying the information which is available for management of the event. The methodology has been applied to a severe accident sequence in a Pressurized Water Reactor with a large dry containment. An examination of the capability of the existing measurements was then performed to determine whether the information needs can be supplied
Sequence exploration reveals information bias among molecular markers used in phylogenetic reconstruction for Colletotrichum species.

Science.gov (United States)

Rampersad, Sephra N; Hosein, Fazeeda N; Carrington, Christine Vf

2014-01-01

The Colletotrichum gloeosporioides species complex is among the most destructive fungal plant pathogens in the world, however, identification of isolates of quarantine importance to the intra-specific level is confounded by a number of factors that affect phylogenetic reconstruction. Information bias and quality parameters were investigated to determine whether nucleotide sequence alignments and phylogenetic trees accurately reflect the genetic diversity and phylogenetic relatedness of individuals. Sequence exploration of GAPDH, ACT, TUB2 and ITS markers indicated that the query sequences had different patterns of nucleotide substitution but were without evidence of base substitution saturation. Regions of high entropy were much more dispersed in the ACT and GAPDH marker alignments than for the ITS and TUB2 markers. A discernible bimodal gap in the genetic distance frequency histograms was produced for the ACT and GAPDH markers which indicated successful separation of intra- and inter-specific sequences in the data set. Overall, analyses indicated clear differences in the ability of these markers to phylogenetically separate individuals to the intra-specific level which coincided with information bias.
The Role of Information Professionals in Reducing the Effects of Global Warming through Knowledge Management

Directory of Open Access Journals (Sweden)

Lect. Ph. D. Priti Jain

2009-05-01

Full Text Available As a result of global environmental change, global warming is the greatest environmental challenge in the 21st century. It could lead to the ultimate end of existence of earth and man. Potential catastrophic effects on the environment and for human life are one of the biggest concerns and most widely discussed issues in the world. This paper will explore how Information Professionals can build knowledge management related to global warming and thus make their contribution towards a sustainable environment. With a brief discussion of causes, effects, solutions and challenges related to global warming, the conclusion suggests a way forward for librarians and information professionals.
Moving target detection based on temporal-spatial information fusion for infrared image sequences

Science.gov (United States)

Toing, Wu-qin; Xiong, Jin-yu; Zeng, An-jun; Wu, Xiao-ping; Xu, Hao-peng

2009-07-01

Moving target detection and localization is one of the most fundamental tasks in visual surveillance. In this paper, through analyzing the advantages and disadvantages of the traditional approaches about moving target detection, a novel approach based on temporal-spatial information fusion is proposed for moving target detection. The proposed method combines the spatial feature in single frame and the temporal properties within multiple frames of an image sequence of moving target. First, the method uses the spatial image segmentation for target separation from background and uses the local temporal variance for extracting targets and wiping off the trail artifact. Second, the logical "and" operator is used to fuse the temporal and spatial information. In the end, to the fusion image sequence, the morphological filtering and blob analysis are used to acquire exact moving target. The algorithm not only requires minimal computation and memory but also quickly adapts to the change of background and environment. Comparing with other methods, such as the KDE, the Mixture of K Gaussians, etc., the simulation results show the proposed method has better validity and higher adaptive for moving target detection, especially in infrared image sequences with complex illumination change, noise change, and so on.
Organizing Global IS Management to Meet Competitive Challenges: Experiences from the Pharmaceutical Industry

OpenAIRE

Bettina Schwarzer

1995-01-01

Despite the widely acknowledged importance information technology plays in multinational corporations, many companies lack an understanding of when and how to (re)organize global IS management. The issues of timing and organization of global IS management, however, seem to be of utmost importance in a companyâ€™s attempt to implement a new, global business strategy. Based on three case studies from the pharmaceutical industry, this paper analyzes the sequence in which business strategy, organ...
Limitations in global information on species occurrences

Directory of Open Access Journals (Sweden)

Carsten Meyer

2016-07-01

Full Text Available Detailed information on species distributions is crucial for answering central questions in biogeography, ecology, evolutionary biology and conservation. Millions of species occurrence records have been mobilized via international data-sharing networks, but inherent biases, gaps and uncertainties hamper broader application. In my PhD thesis, I presented the first comprehensive analyses of global patterns and drivers of these limitations across different taxonomic groups and spatial scales. Integrating 300 million occurrence records for terrestrial vertebrates and plants with comprehensive taxonomic databases, expert range maps and regional checklists, I demonstrated extensive taxonomic, geographical and temporal biases, gaps and uncertainties. I identified key socio-economic drivers of data bias across different taxonomic groups and spatial scales. The results of my dissertation provide an empirical baseline for effectively accounting for data limitations in distribution models, as well as for prioritizing and monitoring efforts to collate additional occurrence information.
Systems Factorial Technology provides new insights on global-local information processing in autism spectrum disorders.

Science.gov (United States)

Johnson, Shannon A; Blaha, Leslie M; Houpt, Joseph W; Townsend, James T

2010-02-01

Previous studies of global-local processing in autism spectrum disorders (ASDs) have indicated mixed findings, with some evidence of a local processing bias, or preference for detail-level information, and other results suggesting typical global advantage, or preference for the whole or gestalt. Findings resulting from this paradigm have been used to argue for or against a detail focused processing bias in ASDs, and thus have important theoretical implications. We applied Systems Factorial Technology, and the associated Double Factorial Paradigm (both defined in the text), to examine information processing characteristics during a divided attention global-local task in high-functioning individuals with an ASD and typically developing controls. Group data revealed global advantage for both groups, contrary to some current theories of ASDs. Information processing models applied to each participant revealed that task performance, although showing no differences at the group level, was supported by different cognitive mechanisms in ASD participants compared to controls. All control participants demonstrated inhibitory parallel processing and the majority demonstrated a minimum-time stopping rule. In contrast, ASD participants showed exhaustive parallel processing with mild facilitatory interactions between global and local information. Thus our results indicate fundamental differences in the stopping rules and channel dependencies in individuals with an ASD.
SoilGrids1km — Global Soil Information Based on Automated Mapping

Science.gov (United States)

Hengl, Tomislav; de Jesus, Jorge Mendes; MacMillan, Robert A.; Batjes, Niels H.; Heuvelink, Gerard B. M.; Ribeiro, Eloi; Samuel-Rosa, Alessandro; Kempen, Bas; Leenaars, Johan G. B.; Walsh, Markus G.; Gonzalez, Maria Ruiperez

2014-01-01

Background Soils are widely recognized as a non-renewable natural resource and as biophysical carbon sinks. As such, there is a growing requirement for global soil information. Although several global soil information systems already exist, these tend to suffer from inconsistencies and limited spatial detail. Methodology/Principal Findings We present SoilGrids1km — a global 3D soil information system at 1 km resolution — containing spatial predictions for a selection of soil properties (at six standard depths): soil organic carbon (g kg−1), soil pH, sand, silt and clay fractions (%), bulk density (kg m−3), cation-exchange capacity (cmol+/kg), coarse fragments (%), soil organic carbon stock (t ha−1), depth to bedrock (cm), World Reference Base soil groups, and USDA Soil Taxonomy suborders. Our predictions are based on global spatial prediction models which we fitted, per soil variable, using a compilation of major international soil profile databases (ca. 110,000 soil profiles), and a selection of ca. 75 global environmental covariates representing soil forming factors. Results of regression modeling indicate that the most useful covariates for modeling soils at the global scale are climatic and biomass indices (based on MODIS images), lithology, and taxonomic mapping units derived from conventional soil survey (Harmonized World Soil Database). Prediction accuracies assessed using 5–fold cross-validation were between 23–51%. Conclusions/Significance SoilGrids1km provide an initial set of examples of soil spatial data for input into global models at a resolution and consistency not previously available. Some of the main limitations of the current version of SoilGrids1km are: (1) weak relationships between soil properties/classes and explanatory variables due to scale mismatches, (2) difficulty to obtain covariates that capture soil forming factors, (3) low sampling density and spatial clustering of soil profile locations. However, as the Soil
Introducing difference recurrence relations for faster semi-global alignment of long sequences.

Science.gov (United States)

Suzuki, Hajime; Kasahara, Masahiro

2018-02-19

The read length of single-molecule DNA sequencers is reaching 1 Mb. Popular alignment software tools widely used for analyzing such long reads often take advantage of single-instruction multiple-data (SIMD) operations to accelerate calculation of dynamic programming (DP) matrices in the Smith-Waterman-Gotoh (SWG) algorithm with a fixed alignment start position at the origin. Nonetheless, 16-bit or 32-bit integers are necessary for storing the values in a DP matrix when sequences to be aligned are long; this situation hampers the use of the full SIMD width of modern processors. We proposed a faster semi-global alignment algorithm, "difference recurrence relations," that runs more rapidly than the state-of-the-art algorithm by a factor of 2.1. Instead of calculating and storing all the values in a DP matrix directly, our algorithm computes and stores mainly the differences between the values of adjacent cells in the matrix. Although the SWG algorithm and our algorithm can output exactly the same result, our algorithm mainly involves 8-bit integer operations, enabling us to exploit the full width of SIMD operations (e.g., 32) on modern processors. We also developed a library, libgaba, so that developers can easily integrate our algorithm into alignment programs. Our novel algorithm and optimized library implementation will facilitate accelerating nucleotide long-read analysis algorithms that use pairwise alignment stages. The library is implemented in the C programming language and available at https://github.com/ocxtal/libgaba .
Comprehensive Information Retrieval and Model Input Sequence (CIRMIS)

International Nuclear Information System (INIS)

Friedrichs, D.R.

1977-04-01

The Comprehensive Information Retrieval and Model Input Sequence (CIRMIS) was developed to provide the research scientist with man--machine interactive capabilities in a real-time environment, and thereby produce results more quickly and efficiently. The CIRMIS system was originally developed to increase data storage and retrieval capabilities and ground-water model control for the Hanford site. The overall configuration, however, can be used in other areas. The CIRMIS system provides the user with three major functions: retrieval of well-based data, special application for manipulating surface data or background maps, and the manipulation and control of ground-water models. These programs comprise only a portion of the entire CIRMIS system. A complete description of the CIRMIS system is given in this report. 25 figures, 7 tables
The EUSTACE project: delivering global, daily information on surface air temperature

Science.gov (United States)

Ghent, D.; Rayner, N. A.

2017-12-01

Day-to-day variations in surface air temperature affect society in many ways; however, daily surface air temperature measurements are not available everywhere. A global daily analysis cannot be achieved with measurements made in situ alone, so incorporation of satellite retrievals is needed. To achieve this, in the EUSTACE project (2015-2018, https://www.eustaceproject.eu) we have developed an understanding of the relationships between traditional (land and marine) surface air temperature measurements and retrievals of surface skin temperature from satellite measurements, i.e. Land Surface Temperature, Ice Surface Temperature, Sea Surface Temperature and Lake Surface Water Temperature. Here we discuss the science needed to produce a fully-global daily analysis (or ensemble of analyses) of surface air temperature on the centennial scale, integrating different ground-based and satellite-borne data types. Information contained in the satellite retrievals is used to create globally-complete fields in the past, using statistical models of how surface air temperature varies in a connected way from place to place. This includes developing new "Big Data" analysis methods as the data volumes involved are considerable. We will present recent progress along this road in the EUSTACE project, i.e.: • identifying inhomogeneities in daily surface air temperature measurement series from weather stations and correcting for these over Europe; • estimating surface air temperature over all surfaces of Earth from surface skin temperature retrievals; • using new statistical techniques to provide information on higher spatial and temporal scales than currently available, making optimum use of information in data-rich eras. Information will also be given on how interested users can become involved.
Ion torrent personal genome machine sequencing for genomic typing of Neisseria meningitidis for rapid determination of multiple layers of typing information.

Science.gov (United States)

Vogel, Ulrich; Szczepanowski, Rafael; Claus, Heike; Jünemann, Sebastian; Prior, Karola; Harmsen, Dag

2012-06-01

Neisseria meningitidis causes invasive meningococcal disease in infants, toddlers, and adolescents worldwide. DNA sequence-based typing, including multilocus sequence typing, analysis of genetic determinants of antibiotic resistance, and sequence typing of vaccine antigens, has become the standard for molecular epidemiology of the organism. However, PCR of multiple targets and consecutive Sanger sequencing provide logistic constraints to reference laboratories. Taking advantage of the recent development of benchtop next-generation sequencers (NGSs) and of BIGSdb, a database accommodating and analyzing genome sequence data, we therefore explored the feasibility and accuracy of Ion Torrent Personal Genome Machine (PGM) sequencing for genomic typing of meningococci. Three strains from a previous meningococcus serogroup B community outbreak were selected to compare conventional typing results with data generated by semiconductor chip-based sequencing. In addition, sequencing of the meningococcal type strain MC58 provided information about the general performance of the technology. The PGM technology generated sequence information for all target genes addressed. The results were 100% concordant with conventional typing results, with no further editing being necessary. In addition, the amount of typing information, i.e., nucleotides and target genes analyzed, could be substantially increased by the combined use of genome sequencing and BIGSdb compared to conventional methods. In the near future, affordable and fast benchtop NGS machines like the PGM might enable reference laboratories to switch to genomic typing on a routine basis. This will reduce workloads and rapidly provide information for laboratory surveillance, outbreak investigation, assessment of vaccine preventability, and antibiotic resistance gene monitoring.
How could disclosing incidental information from whole-genome sequencing affect patient behavior?

Science.gov (United States)

Christensen, Kurt D; Green, Robert C

2013-06-01

In this article, we argue that disclosure of incidental findings from whole-genome sequencing has the potential to motivate individuals to change health behaviors through psychological mechanisms that differ from typical risk assessment interventions. Their ability to do so, however, is likely to be highly contingent upon the nature of the incidental findings and how they are disclosed, the context of the disclosure and the characteristics of the patient. Moreover, clinicians need to be aware that behavioral responses may occur in unanticipated ways. This article argues for commentators and policy makers to take a cautious but optimistic perspective while empirical evidence is collected through ongoing research involving whole-genome sequencing and the disclosure of incidental information.
Global information network on chemicals (GINC) and its Asian component

International Nuclear Information System (INIS)

Kaminuma, Tsuguchika; Nakata, Kotoko

2003-01-01

The Global Information Network on Chemicals (GINC) is an effort to build a global information network that links international, national, and other organizations working for the safe management of chemicals in order to exchange information and improve communications. The project was originally proposed in 1993 by one of the authors then at the National Institute of Health Sciences (NIHS) of Japan to the International Program on Chemical Safety (IPCS), which is a joint project of World Health Organization (WHO), International Labor Organization (ILO), and United Nations Environment Program (UNEP). The base support system was first implemented at NIHS using the Internet/World Wide Web (WWW) technology in 1995. The project was then endorsed by the Intergovernmental Forum on Chemical Safety (IFCS) and was adopted by the Inter-Organization Program for the Sound Management of Chemicals (IOMC). However, the base system (http://www.nihs.go.jp/GINC/index.html) has been developed and maintained solely by the NIHS group under the support of the Ministry of Health and Welfare (MHW), Japan. Asia, particularly East Asia and the Pacific region, was chosen as the feasibility study region for this project. During the period from December 1994 to July 2002, NIHS hosted eight meetings on this project held in Tokyo
Globalization and advances in information and communication technologies: the impact on nursing and health.

Science.gov (United States)

Abbott, Patricia A; Coenen, Amy

2008-01-01

Globalization and information and communication technology (ICT) continue to change us and the world we live in. Nursing stands at an opportunity intersection where challenging global health issues, an international workforce shortage, and massive growth of ICT combine to create a very unique space for nursing leadership and nursing intervention. Learning from prior successes in the field can assist nurse leaders in planning and advancing strategies for global health using ICT. Attention to lessons learned will assist in combating the technological apartheid that is already present in many areas of the globe and will highlight opportunities for innovative applications in health. ICT has opened new channels of communication, creating the beginnings of a global information society that will facilitate access to isolated areas where health needs are extreme and where nursing can contribute significantly to the achievement of "Health for All." The purpose of this article is to discuss the relationships between globalization, health, and ICT, and to illuminate opportunities for nursing in this flattening and increasingly interconnected world.
Global and Local Processing of Incidental Information and Memory Retrieval at 6 Months.

Science.gov (United States)

Bhatt, Ramesh S.; And Others

1994-01-01

Five experiments examined the role of global and local cues in memory retrieval in infancy. Results showed that infants encode and remember for substantial periods of time not only the shape of figures displayed in their periphery but also the global organization of these figures. They also adapt this information when responding to new events.…
Integrating Global Open Geo-Information for Major Disaster Assessment: A Case Study of the Myanmar Flood

Directory of Open Access Journals (Sweden)

Suju Li

2017-07-01

Full Text Available Major disasters typically impact large areas, cause considerable damages, and result in significant human and economic losses. The timely and accurate estimation of impacts and damages is essential to better understand disaster conditions and to support emergency response operations. Geo-information drawn from various sources at multi spatial-temporal scales can be used for disaster assessments through a synthesis of hazard, exposure, and post disaster information based on pertinent approaches. Along with the increased availability of open sourced data and cooperation initiatives, more global scale geo-information, including global land cover datasets, has been produced and can be integrated with other information for disaster dynamic damage assessment (e.g., impact estimation immediately after a disaster occurs, physical damage assessment during the emergency response stage, and comprehensive assessment following an emergency response. Residential areas and arable lands affected by the flood disaster occurring from July to August 2015 in Myanmar were assessed based on satellite images, GlobeLand30 data, and other global open sourced information as a study case. The results show that integrating global open geo-information could serve as a practical and efficient means of assessing damage resulting from major disasters worldwide, especially at the early emergency response stage.

A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing

Directory of Open Access Journals (Sweden)

Pan-Gyu Kim

2008-01-01

Full Text Available We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures.
The GRIN-Global Information Management System – Public Interface Demonstration and Input Opportunity

Science.gov (United States)

The GRIN-Global (GG) Information Management System, under development for the past three years, provides the world's crop genebanks and plant genetic resource (PGR) users with a powerful, flexible, easy-to-use PGR information management system. Developed jointly by the USDA Agricultural Research Ser...
Global transcriptional profiling of the toxic dinoflagellate Alexandrium fundyense using Massively Parallel Signature Sequencing

Directory of Open Access Journals (Sweden)

Anderson Donald M

2006-04-01

Full Text Available Abstract Background Dinoflagellates are one of the most important classes of marine and freshwater algae, notable both for their functional diversity and ecological significance. They occur naturally as free-living cells, as endosymbionts of marine invertebrates and are well known for their involvement in "red tides". Dinoflagellates are also notable for their unusual genome content and structure, which suggests that the organization and regulation of dinoflagellate genes may be very different from that of most eukaryotes. To investigate the content and regulation of the dinoflagellate genome, we performed a global analysis of the transcriptome of the toxic dinoflagellate Alexandrium fundyense under nitrate- and phosphate-limited conditions using Massively Parallel Signature Sequencing (MPSS. Results Data from the two MPSS libraries showed that the number of unique signatures found in A. fundyense cells is similar to that of humans and Arabidopsis thaliana, two eukaryotes that have been extensively analyzed using this method. The general distribution, abundance and expression patterns of the A. fundyense signatures were also quite similar to other eukaryotes, and at least 10% of the A. fundyense signatures were differentially expressed between the two conditions. RACE amplification and sequencing of a subset of signatures showed that multiple signatures arose from sequence variants of a single gene. Single signatures also mapped to different sequence variants of the same gene. Conclusion The MPSS data presented here provide a quantitative view of the transcriptome and its regulation in these unusual single-celled eukaryotes. The observed signature abundance and distribution in Alexandrium is similar to that of other eukaryotes that have been analyzed using MPSS. Results of signature mapping via RACE indicate that many signatures result from sequence variants of individual genes. These data add to the growing body of evidence for widespread gene
Global Drought Services: Collaborations Toward an Information System for Early Warning

Science.gov (United States)

Hayes, M. J.; Pulwarty, R. S.; Svoboda, M.

2014-12-01

Drought is a hazard that lends itself well to diligent, sustained monitoring and early warning. However, unlike most hazards, the fact that droughts typically evolve slowly, can last for months or years and cover vast areas spanning multiple political boundaries/jurisdictions and economic sectors can make it a daunting task to monitor, develop plans for, and identify appropriate, proactive mitigation strategies. The National Drought Mitigation Center (NDMC) and National Integrated Drought Information System (NIDIS) have been working together to reduce societal vulnerability to drought by helping decision makers at all levels to: 1) implement drought early warning/forecasting and decision support systems; 2) support and advocate for better collection of, and understanding of drought impacts; and 3) increase long-term resilience to drought through proactive planning. The NDMC and NIDIS risk management approach has been the basis from which many partners around the world are developing a collaboration and coordination nexus with an ultimate goal of building comprehensive global drought early warning information systems (GDEWIS). The core emphasis of this model is on developing and applying useful and usable information that can be integrated and transferred freely to other regions around the globe. The High-Level Ministerial Declaration on Drought, the Integrated Drought Management Programme (IDMP) co-led by the WMO and the Global Water Partnership (GWP), and the Global Framework for Climate Services are drawing extensively from the integrated NDMC-NIDIS risk management framework. This presentation will describe, in detail, the various drought resources, tools, services, and collaborations already being provided and undertaken at the national and regional scales by the NDMC, NIDIS, and their partners. The presentation will be forward-looking, identifying improvements in existing and proposed mechanisms to help strengthen national and international drought early
Surveying the global virome

DEFF Research Database (Denmark)

Scheel, Troels K H; Simmonds, Peter; Kapoor, Amit

2015-01-01

Recent advances in sequencing technologies have greatly enhanced our abilities to identify novel microbial sequences. Thus, our understanding of the global virome and the virome of specific host species in particular is rapidly expanding. Identification of animal viruses is important for understa......Recent advances in sequencing technologies have greatly enhanced our abilities to identify novel microbial sequences. Thus, our understanding of the global virome and the virome of specific host species in particular is rapidly expanding. Identification of animal viruses is important....... Much remains to be learned on the novel hepaciviruses, including their association with disease, and thereby how relevant they will become as HCV model systems and for studies of animal disease. This review discusses how virome analysis led to identification of novel hepaci- and pegiviruses...
Diversity and Genome Analysis of Australian and Global Oilseed Brassica napus L. Germplasm Using Transcriptomics and Whole Genome Re-sequencing

Directory of Open Access Journals (Sweden)

M. Michelle Malmberg

2018-04-01

Full Text Available Intensive breeding of Brassica napus has resulted in relatively low diversity, such that B. napus would benefit from germplasm improvement schemes that sustain diversity. As such, samples representative of global germplasm pools need to be assessed for existing population structure, diversity and linkage disequilibrium (LD. Complexity reduction genotyping-by-sequencing (GBS methods, including GBS-transcriptomics (GBS-t, enable cost-effective screening of a large number of samples, while whole genome re-sequencing (WGR delivers the ability to generate large numbers of unbiased genomic single nucleotide polymorphisms (SNPs, and identify structural variants (SVs. Furthermore, the development of genomic tools based on whole genomes representative of global oilseed diversity and orientated by the reference genome has substantial industry relevance and will be highly beneficial for canola breeding. As recent studies have focused on European and Chinese varieties, a global diversity panel as well as a substantial number of Australian spring types were included in this study. Focusing on industry relevance, 633 varieties were initially genotyped using GBS-t to examine population structure using 61,037 SNPs. Subsequently, 149 samples representative of global diversity were selected for WGR and both data sets used for a side-by-side evaluation of diversity and LD. The WGR data was further used to develop genomic resources consisting of a list of 4,029,750 high-confidence SNPs annotated using SnpEff, and SVs in the form of 10,976 deletions and 2,556 insertions. These resources form the basis of a reliable and repeatable system allowing greater integration between canola genomics studies, with a strong focus on breeding germplasm and industry applicability.
Collecting Information for Rating Global Assessment of Functioning (GAF): Sources of Information and Methods for Information Collection.

Science.gov (United States)

I H, Monrad Aas

2014-11-01

Global Assessment of Functioning (GAF) is an assessment instrument that is known worldwide. It is widely used for rating the severity of illness. Results from evaluations in psychiatry should characterize the patients. Rating of GAF is based on collected information. The aim of the study is to identify the factors involved in collecting information that is relevant for rating GAF, and gaps in knowledge where it is likely that further development would play a role for improved scoring. A literature search was conducted with a combination of thorough hand search and search in the bibliographic databases PubMed, PsycINFO, Google Scholar, and Campbell Collaboration Library of Systematic Reviews. Collection of information for rating GAF depends on two fundamental factors: the sources of information and the methods for information collection. Sources of information are patients, informants, health personnel, medical records, letters of referral and police records about violence and substance abuse. Methods for information collection include the many different types of interview - unstructured, semi-structured, structured, interviews for Axis I and II disorders, semistructured interviews for rating GAF, and interviews of informants - as well as instruments for rating symptoms and functioning, and observation. The different sources of information, and methods for collection, frequently result in inconsistencies in the information collected. The variation in collected information, and lack of a generally accepted algorithm for combining collected information, is likely to be important for rated GAF values, but there is a fundamental lack of knowledge about the degree of importance. Research to improve GAF has not reached a high level. Rated GAF values are likely to be influenced by both the sources of information used and the methods employed for information collection, but the lack of research-based information about these influences is fundamental. Further development of
Linked Data for Fighting Global Hunger:Experiences in setting standards for Agricultural Information Management

Science.gov (United States)

Baker, Thomas; Keizer, Johannes

FAO, the Food and Agriculture Organization of the UN, has the global goal to defeat hunger and eliminate poverty. One of its core functions is the generation, dissemination and application of information and knowledge. Since 2000, the Agricultural InformationManagement Standards (AIMS) activity in FAO's Knowledge Exchange and Capacity Building Division has promoted the use of Semantic Web standards to improve information sharing within a global network of research institutes and related partner organizations. The strategy emphasizes the use of simple descriptive metadata, thesauri, and ontologies for integrating access to information from a wide range of sources for both scientific and non-expert audiences. An early adopter of Semantic Web technology, the AIMS strategy is evolving to help information providers in nineteen language areas use modern Linked Data methods to improve the quality of life in developing rural areas, home to seventy percent of the world's poor and hungry people.
(DeCentralization of the Global Informational Ecosystem

Directory of Open Access Journals (Sweden)

Johanna Möller

2017-09-01

Full Text Available Centralization and decentralization are key concepts in debates that focus on the (antidemocratic character of digital societies. Centralization is understood as the control over communication and data flows, and decentralization as giving it (back to users. Communication and media research focuses on centralization put forward by dominant digital media platforms, such as Facebook and Google, and governments. Decentralization is investigated regarding its potential in civil society, i.e., hacktivism, (encryption technologies, and grass-root technology movements. As content-based media companies increasingly engage with technology, they move into the focus of critical media studies. Moreover, as formerly nationally oriented companies now compete with global media platforms, they share several interests with civil society decentralization agents. Based on 26 qualitative interviews with leading media managers, we investigate (decentralization strategies applied by content-oriented media companies. Theoretically, this perspective on media companies as agents of (decentralization expands (decentralization research beyond traditional democratic stakeholders by considering economic actors within the “global informational ecosystem” (Birkinbine, Gómez, & Wasko, 2017. We provide a three-dimensional framework to empirically investigate (decentralization. From critical media studies, we borrow the (decentralization of data and infrastructures, from media business research, the (decentralization of content distribution.
Vision for an Open, Global Greenhouse Gas Information System (GHGIS)

Science.gov (United States)

Duren, R. M.; Butler, J. H.; Rotman, D.; Ciais, P.; Greenhouse Gas Information System Team

2010-12-01

Over the next few years, an increasing number of entities ranging from international, national, and regional governments, to businesses and private land-owners, are likely to become more involved in efforts to limit atmospheric concentrations of greenhouse gases. In such a world, geospatially resolved information about the location, amount, and rate of greenhouse gas (GHG) emissions will be needed, as well as the stocks and flows of all forms of carbon through the earth system. The ability to implement policies that limit GHG concentrations would be enhanced by a global, open, and transparent greenhouse gas information system (GHGIS). An operational and scientifically robust GHGIS would combine ground-based and space-based observations, carbon-cycle modeling, GHG inventories, synthesis analysis, and an extensive data integration and distribution system, to provide information about anthropogenic and natural sources, sinks, and fluxes of greenhouse gases at temporal and spatial scales relevant to decision making. The GHGIS effort was initiated in 2008 as a grassroots inter-agency collaboration intended to identify the needs for such a system, assess the capabilities of current assets, and suggest priorities for future research and development. We will present a vision for an open, global GHGIS including latest analysis of system requirements, critical gaps, and relationship to related efforts at various agencies, the Group on Earth Observations, and the Intergovernmental Panel on Climate Change.
Toward allotetraploid cotton genome assembly: integration of a high-density molecular genetic linkage map with DNA sequence information

Science.gov (United States)

2012-01-01

Background Cotton is the world’s most important natural textile fiber and a significant oilseed crop. Decoding cotton genomes will provide the ultimate reference and resource for research and utilization of the species. Integration of high-density genetic maps with genomic sequence information will largely accelerate the process of whole-genome assembly in cotton. Results In this paper, we update a high-density interspecific genetic linkage map of allotetraploid cultivated cotton. An additional 1,167 marker loci have been added to our previously published map of 2,247 loci. Three new marker types, InDel (insertion-deletion) and SNP (single nucleotide polymorphism) developed from gene information, and REMAP (retrotransposon-microsatellite amplified polymorphism), were used to increase map density. The updated map consists of 3,414 loci in 26 linkage groups covering 3,667.62 cM with an average inter-locus distance of 1.08 cM. Furthermore, genome-wide sequence analysis was finished using 3,324 informative sequence-based markers and publicly-available Gossypium DNA sequence information. A total of 413,113 EST and 195 BAC sequences were physically anchored and clustered by 3,324 sequence-based markers. Of these, 14,243 ESTs and 188 BACs from different species of Gossypium were clustered and specifically anchored to the high-density genetic map. A total of 2,748 candidate unigenes from 2,111 ESTs clusters and 63 BACs were mined for functional annotation and classification. The 337 ESTs/genes related to fiber quality traits were integrated with 132 previously reported cotton fiber quality quantitative trait loci, which demonstrated the important roles in fiber quality of these genes. Higher-level sequence conservation between different cotton species and between the A- and D-subgenomes in tetraploid cotton was found, indicating a common evolutionary origin for orthologous and paralogous loci in Gossypium. Conclusion This study will serve as a valuable genomic resource
MIPS: analysis and annotation of genome information in 2007.

Science.gov (United States)

Mewes, H W; Dietmann, S; Frishman, D; Gregory, R; Mannhaupt, G; Mayer, K F X; Münsterkötter, M; Ruepp, A; Spannagl, M; Stümpflen, V; Rattei, T

2008-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) combines automatic processing of large amounts of sequences with manual annotation of selected model genomes. Due to the massive growth of the available data, the depth of annotation varies widely between independent databases. Also, the criteria for the transfer of information from known to orthologous sequences are diverse. To cope with the task of global in-depth genome annotation has become unfeasible. Therefore, our efforts are dedicated to three levels of annotation: (i) the curation of selected genomes, in particular from fungal and plant taxa (e.g. CYGD, MNCDB, MatDB), (ii) the comprehensive, consistent, automatic annotation employing exhaustive methods for the computation of sequence similarities and sequence-related attributes as well as the classification of individual sequences (SIMAP, PEDANT and FunCat) and (iii) the compilation of manually curated databases for protein interactions based on scrutinized information from the literature to serve as an accepted set of reliable annotated interaction data (MPACT, MPPI, CORUM). All databases and tools described as well as the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).
Multi-scale coding of genomic information: From DNA sequence to genome structure and function

International Nuclear Information System (INIS)

Arneodo, Alain; Vaillant, Cedric; Audit, Benjamin; Argoul, Francoise; D'Aubenton-Carafa, Yves; Thermes, Claude

2011-01-01

Understanding how chromatin is spatially and dynamically organized in the nucleus of eukaryotic cells and how this affects genome functions is one of the main challenges of cell biology. Since the different orders of packaging in the hierarchical organization of DNA condition the accessibility of DNA sequence elements to trans-acting factors that control the transcription and replication processes, there is actually a wealth of structural and dynamical information to learn in the primary DNA sequence. In this review, we show that when using concepts, methodologies, numerical and experimental techniques coming from statistical mechanics and nonlinear physics combined with wavelet-based multi-scale signal processing, we are able to decipher the multi-scale sequence encoding of chromatin condensation-decondensation mechanisms that play a fundamental role in regulating many molecular processes involved in nuclear functions.
HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information

Directory of Open Access Journals (Sweden)

Hu Jianjun

2011-05-01

Full Text Available Abstract Background Accurate prediction of binding residues involved in the interactions between proteins and small ligands is one of the major challenges in structural bioinformatics. Heme is an essential and commonly used ligand that plays critical roles in electron transfer, catalysis, signal transduction and gene expression. Although much effort has been devoted to the development of various generic algorithms for ligand binding site prediction over the last decade, no algorithm has been specifically designed to complement experimental techniques for identification of heme binding residues. Consequently, an urgent need is to develop a computational method for recognizing these important residues. Results Here we introduced an efficient algorithm HemeBIND for predicting heme binding residues by integrating structural and sequence information. We systematically investigated the characteristics of binding interfaces based on a non-redundant dataset of heme-protein complexes. It was found that several sequence and structural attributes such as evolutionary conservation, solvent accessibility, depth and protrusion clearly illustrate the differences between heme binding and non-binding residues. These features can then be separately used or combined to build the structure-based classifiers using support vector machine (SVM. The results showed that the information contained in these features is largely complementary and their combination achieved the best performance. To further improve the performance, an attempt has been made to develop a post-processing procedure to reduce the number of false positives. In addition, we built a sequence-based classifier based on SVM and sequence profile as an alternative when only sequence information can be used. Finally, we employed a voting method to combine the outputs of structure-based and sequence-based classifiers, which demonstrated remarkably better performance than the individual classifier alone
A Teaching-Learning Sequence of Colour Informed by History and Philosophy of Science

Science.gov (United States)

Maurício, Paulo; Valente, Bianor; Chagas, Isabel

2017-01-01

In this work, we present a teaching-learning sequence on colour intended to a pre-service elementary teacher programme informed by History and Philosophy of Science. Working in a socio-constructivist framework, we made an excursion on the history of colour. Our excursion through history of colour, as well as the reported misconception on colour…
A laboratory information management system for DNA barcoding workflows

NARCIS (Netherlands)

Vu, D.; Eberhardt, U.; Szöke, S.; Groenewald, M.; Robert, V.

2012-01-01

This paper presents a laboratory information management system for DNA sequences (LIMS) created and based on the needs of a DNA barcoding project at the CBS-KNAW Fungal Biodiversity Centre (Utrecht, the Netherlands). DNA barcoding is a global initiative for species identification through simple DNA
Evaluation of haplotype diversity of Achatina fulica (Lissachatina) [Bowdich] from Indian sub-continent by means of 16S rDNA sequence and its phylogenetic relationships with other global populations.

Science.gov (United States)

Ayyagari, Vijaya Sai; Sreerama, Krupanidhi

2017-08-01

Achatina fulica (Lissachatina fulica) is one of the most invasive species found across the globe causing a significant damage to crops, vegetables, and horticultural plants. This terrestrial snail is native to east Africa and spread to different parts of the world by introductions. India, a hot spot for biodiversity of several endemic gastropods, has witnessed an outburst of this snail population in several parts of the country posing a serious threat to crop loss and also to human health. With an objective to evaluate the genetic diversity of this snail, we have sampled this snail from different parts of India and analyzed its haplotype diversity by means of 16S rDNA sequence information. Apart from this, we have studied the phylogenetic relationships of the isolates sequenced in the present study in relation with other global populations by Bayesian and Maximum-likelihood approaches. Of the isolates sequenced, haplotype 'C' is the predominant one. A new haplotype 'S' from the state of Odisha was observed. The isolates sequenced in the present study clustered with its conspecifics from the Indian sub-continent. Haplotype network analyses were also carried out for studying the evolution of different haplotypes. It was observed that haplotype 'S' was associated with a Mauritius haplotype 'H', indicating the possibility of multiple introductions of A. fulica to India.
Global monitoring of dynamic information systems a case study in the international supply chain

NARCIS (Netherlands)

Pruksasri, P.; Berg, J. van den; Hofman, W.J.

2014-01-01

Global information systems are becoming more complex and dynamic everyday: huge amounts of data and messages through those systems show dynamically changing traffic patterns. Because of this, diagnosing when sub-systems are not working properly is difficult. System failures or errors in information
Skeleton-Based Human Action Recognition With Global Context-Aware Attention LSTM Networks

Science.gov (United States)

Liu, Jun; Wang, Gang; Duan, Ling-Yu; Abdiyeva, Kamila; Kot, Alex C.

2018-04-01

Human action recognition in 3D skeleton sequences has attracted a lot of research attention. Recently, Long Short-Term Memory (LSTM) networks have shown promising performance in this task due to their strengths in modeling the dependencies and dynamics in sequential data. As not all skeletal joints are informative for action recognition, and the irrelevant joints often bring noise which can degrade the performance, we need to pay more attention to the informative ones. However, the original LSTM network does not have explicit attention ability. In this paper, we propose a new class of LSTM network, Global Context-Aware Attention LSTM (GCA-LSTM), for skeleton based action recognition. This network is capable of selectively focusing on the informative joints in each frame of each skeleton sequence by using a global context memory cell. To further improve the attention capability of our network, we also introduce a recurrent attention mechanism, with which the attention performance of the network can be enhanced progressively. Moreover, we propose a stepwise training scheme in order to train our network effectively. Our approach achieves state-of-the-art performance on five challenging benchmark datasets for skeleton based action recognition.
Telecommunication Sector of the Russian Economy: Transformation Into a Global Information and Telecommunication Infrastructure

Directory of Open Access Journals (Sweden)

Fokina Elena Anatolyevna

2014-12-01

Full Text Available The author concerns the current state and possible ways of telecommunication sector of the Russian economy development in the conditions of world economy globalization and suggests that the process of globalization reflects the current stage of telecommunication companies’ capital internationalization. The analysis of telecommunication sector shows that it is not only a perspective, highmargin and dynamically developing sector but is still one of the most integrated into the system of world economic relations. The stages of Russian telecommunication companies’ capital internationalization are determined, the internal connections between internationalization process and globalization are revealed. It is revealed that the new information and communication technologies development and expansion results in substantial increase in cooperation between economical entities and provides a sustainable long-term economical growth of telecommunication enterprises. The financial and operational data determining the effectiveness of telecommunication companies’ activity are presented. The analysis of tendencies promoting the extension of the market activity of Russian telecommunication companies at global information and telecommunication infrastructure shows that the main tendencies are the following ones: foreign capital inflow increase, capital integration and expansion of new services based on technologies convergence. The author reasonably concludes in recent times, the telecommunication sector of the Russian economy formation and development is determined by the existing global trends.

Space Applications and Global Information Infrastructure: a Global Approach against Epidemics

Science.gov (United States)

Bastos, C. R.

2002-01-01

Brazilian space expenditures correspond to a low-middle rank among the space-faring nations. In this regard, international partnerships have opened doors for the country to take part in a wider range of projects than it would be possible if carried out on its own. Within the above framework, this paper will address a concept in which countries join efforts in pursuit of common objectives and needs in the field of health, countries whose similarities tend to make them face the same types of health problems. Exactly for this reason, such countries can get together and share the costs, risks and ultimately the benefits of their joint efforts. Infectious diseases are mankind's leading causes of death. And their agents travel around the world by the action of their vectors: insects, birds, winds, infected individuals, and others. The ways how Global Information Infrastructure and Space applications can be very helpful in the detection, identification, tracking and fighting migratory diseases will then be discussed. A concept for an international cooperative initiative is presented, addressing its composition, its implementation, the international coordination requirements, the financial and funding issues related to its implementation and sustainability, and the roles to be played by such an organization. The funding issue deserves a closer attention, since many good ideas are killed by financial problems in their early implementation stages. Finally, a conclusion drives the audience's attention towards the potential advantages of space-based assets in covering large portions of the Earth, and consequently being suitable for global initiatives for the benefit of mankind.
Action-embedded transformational leadership in self-managing global information systems development teams

NARCIS (Netherlands)

Eseryel, U. Yeliz; Eseryel, Deniz

While software development teams are becoming more and more distributed around the globe, most software development methodologies used by global teams prescribe self-managing teams. Transformational leadership is the key to successful information systems development and use for competitive
77 FR 37430 - BSEE Information Collection Activity: Global Positioning System for MODUs, Extension of a...

Science.gov (United States)

2012-06-21

... major weather event, like a hurricane, lessees and operators need to report new GPS information to BSEE...-0012; OMB Control Number 1014-0013] BSEE Information Collection Activity: Global Positioning System for... Paperwork Reduction Act of 1995 (PRA), BSEE is inviting comments on a collection of information pertaining...
Global climate change and human health: Information needs, research priorities, and strategic considerations

Energy Technology Data Exchange (ETDEWEB)

Farrell, M.P.; Kanciruk, P. (Oak Ridge National Lab., TN (USA)); O' Hara, F.M. Jr. (O' Hara (Fred M., Jr.), Oak Ridge, TN (USA))

1989-01-01

The US Global Research Plan and the International Geosphere-Biosphere Programme were created to assess the effects of global climate change but have not been able to devote much attention to the consequences climate change will have on human health and welfare. Although researchers and policy makers recognize that climate change will have complex effects on resources, in general, the social and medical sciences have not received appropriate international attention under the banner of global change. To address this imbalance, the public health research community needs to launch a international coordinated effort so that the social and medical sciences are as fully represented as other scientific disciplines. This document discusses the information needs, research priorities and strategic considerations of the global change and its impact on human health.
Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

Science.gov (United States)

Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

2015-01-01

Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930
Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence.

Directory of Open Access Journals (Sweden)

Kacy L Gordon

2015-05-01

Full Text Available Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2 from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements.
Intergenic DNA sequences from the human X chromosome reveal high rates of global gene flow

Directory of Open Access Journals (Sweden)

Wall Jeffrey D

2008-11-01

Full Text Available Abstract Background Despite intensive efforts devoted to collecting human polymorphism data, little is known about the role of gene flow in the ancestry of human populations. This is partly because most analyses have applied one of two simple models of population structure, the island model or the splitting model, which make unrealistic biological assumptions. Results Here, we analyze 98-kb of DNA sequence from 20 independently evolving intergenic regions on the X chromosome in a sample of 90 humans from six globally diverse populations. We employ an isolation-with-migration (IM model, which assumes that populations split and subsequently exchange migrants, to independently estimate effective population sizes and migration rates. While the maximum effective size of modern humans is estimated at ~10,000, individual populations vary substantially in size, with African populations tending to be larger (2,300–9,000 than non-African populations (300–3,300. We estimate mean rates of bidirectional gene flow at 4.8 × 10-4/generation. Bidirectional migration rates are ~5-fold higher among non-African populations (1.5 × 10-3 than among African populations (2.7 × 10-4. Interestingly, because effective sizes and migration rates are inversely related in African and non-African populations, population migration rates are similar within Africa and Eurasia (e.g., global mean Nm = 2.4. Conclusion We conclude that gene flow has played an important role in structuring global human populations and that migration rates should be incorporated as critical parameters in models of human demography.
Global transformation of erythrocyte properties via engagement of an SH2-like sequence in band 3.

Science.gov (United States)

Puchulu-Campanella, Estela; Turrini, Francesco M; Li, Yen-Hsing; Low, Philip S

2016-11-29

Src homology 2 (SH2) domains are composed of weakly conserved sequences of ∼100 aa that bind phosphotyrosines in signaling proteins and thereby mediate intra- and intermolecular protein-protein interactions. In exploring the mechanism whereby tyrosine phosphorylation of the erythrocyte anion transporter, band 3, triggers membrane destabilization, vesiculation, and fragmentation, we discovered a SH2 signature motif positioned between membrane-spanning helices 4 and 5. Evidence that this exposed cytoplasmic sequence contributes to a functional SH2-like domain is provided by observations that: (i) it contains the most conserved sequence of SH2 domains, GSFLVR; (ii) it binds the tyrosine phosphorylated cytoplasmic domain of band 3 (cdb3-PO 4 ) with K d = 14 nM; (iii) binding of cdb3-PO 4 to erythrocyte membranes is inhibited both by antibodies against the SH2 signature sequence and dephosphorylation of cdb3-PO 4 ; (iv) label transfer experiments demonstrate the covalent transfer of photoactivatable biotin from isolated cdb3-PO 4 (but not cdb3) to band 3 in erythrocyte membranes; and (v) phosphorylation-induced binding of cdb3-PO 4 to the membrane-spanning domain of band 3 in intact cells causes global changes in membrane properties, including (i) displacement of a glycolytic enzyme complex from the membrane, (ii) inhibition of anion transport, and (iii) rupture of the band 3-ankyrin bridge connecting the spectrin-based cytoskeleton to the membrane. Because SH2-like motifs are not retrieved by normal homology searches for SH2 domains, but can be found in many tyrosine kinase-regulated transport proteins using modified search programs, we suggest that related cases of membrane transport proteins containing similar motifs are widespread in nature where they participate in regulation of cell properties.
Developing Information Services and Tools to Access and Evaluate Data Quality in Global Satellite-based Precipitation Products

Science.gov (United States)

Liu, Z.; Shie, C. L.; Meyer, D. J.

2017-12-01

Global satellite-based precipitation products have been widely used in research and applications around the world. Compared to ground-based observations, satellite-based measurements provide precipitation data on a global scale, especially in remote continents and over oceans. Over the years, satellite-based precipitation products have evolved from single sensor and single algorithm to multi-sensors and multi-algorithms. As a result, many satellite-based precipitation products have been enhanced such as spatial and temporal coverages. With inclusion of ground-based measurements, biases of satellite-based precipitation products have been significantly reduced. However, data quality issues still exist and can be caused by many factors such as observations, satellite platform anomaly, algorithms, production, calibration, validation, data services, etc. The NASA Goddard Earth Sciences (GES) Data and Information Services Center (DISC) is home to NASA global precipitation product archives including the Tropical Rainfall Measuring Mission (TRMM), the Global Precipitation Measurement (GPM), as well as other global and regional precipitation products. Precipitation is one of the top downloaded and accessed parameters in the GES DISC data archive. Meanwhile, users want to easily locate and obtain data quality information at regional and global scales to better understand how precipitation products perform and how reliable they are. As data service providers, it is necessary to provide an easy access to data quality information, however, such information normally is not available, and when it is available, it is not in one place and difficult to locate. In this presentation, we will present challenges and activities at the GES DISC to address precipitation data quality issues.
Global search demand for varicose vein information on the internet.

Science.gov (United States)

El-Sheikha, Joseph

2015-09-01

Changes in internet search trends can provide healthcare professionals detailed information on prevalence of disease and symptoms. Chronic venous disease, more commonly known as varicose veins, is a common symptomatic disease among the adult population. This study aims to measure the change in global search demand for varicose vein information using Google over the past 8 years. The Google Trends instrument was used to measure the change in demand for the use of the local name for varicose veins in several countries across the world between January 2006 and December 2012. The measurements were normalised onto a scale relative to the largest volume of search requests received during a designated time and geographical location. Comparison of national levels of private healthcare and healthcare spending per capita to search demand was also undertaken using Organisation for Economic Co-operation and development economic measurements. Global interest has increased significantly, with linear regression demonstrating a 3.72% year-on-year increase in demand over the 8-year time period (r(2 )= 0.385, p demand significantly increased in the northern hemisphere (p demand compared to cooler winter months (demand (r(2 )= 0.120 p = 0.306). Healthcare spending per capita did not relate to search demand (r(2 )= 0.450 p = 0.077). There is increasing demand for information about varicose veins on the internet, especially during the warmer months of the year. Online search demand does not appear to be related to healthcare spending. © The Author(s) 2014.
MatrixPlot: visualizing sequence constraints

DEFF Research Database (Denmark)

Gorodkin, Jan; Stærfeldt, Hans Henrik; Lund, Ole

1999-01-01

MatrixPlot: visualizing sequence constraints. Sub-title Abstract Summary : MatrixPlot is a program for making high-quality matrix plots, such as mutual information plots of sequence alignments and distance matrices of sequences with known three-dimensional coordinates. The user can add information...
Divide and conquer: enriching environmental sequencing data.

Directory of Open Access Journals (Sweden)

Anne Bergeron

2007-09-01

Full Text Available In environmental sequencing projects, a mix of DNA from a whole microbial community is fragmented and sequenced, with one of the possible goals being to reconstruct partial or complete genomes of members of the community. In communities with high diversity of species, a significant proportion of the sequences do not overlap any other fragment in the sample. This problem will arise not only in situations with a relatively even distribution of many species, but also when the community in a particular environment is routinely dominated by the same few species. In the former case, no genomes may be assembled at all, while in the latter case a few dominant species in an environment will always be sequenced at high coverage to the detriment of coverage of the greater number of sparse species.Here we show that, with the same global sequencing effort, separating the species into two or more sub-communities prior to sequencing can yield a much higher proportion of sequences that can be assembled. We first use the Lander-Waterman model to show that, if the expected percentage of singleton sequences is higher than 25%, then, under the uniform distribution hypothesis, splitting the community is always a wise choice. We then construct simulated microbial communities to show that the results hold for highly non-uniform distributions. We also show that, for the distributions considered in the experiments, it is possible to estimate quite accurately the relative diversity of the two sub-communities.Given the fact that several methods exist to split microbial communities based on physical properties such as size, density, surface biochemistry, or optical properties, we strongly suggest that groups involved in environmental sequencing, and expecting high diversity, consider splitting their communities in order to maximize the information content of their sequencing effort.
Core Genome Multilocus Sequence Typing for Identification of Globally Distributed Clonal Groups and Differentiation of Outbreak Strains of Listeria monocytogenes

OpenAIRE

Chen, Yi; Gonzalez-Escalona, Narjol; Hammack, Thomas S.; Allard, Marc W.; Strain, Errol A.; Brown, Eric W.

2016-01-01

ABSTRACT Many listeriosis outbreaks are caused by a few globally distributed clonal groups, designated clonal complexes or epidemic clones, of Listeria monocytogenes, several of which have been defined by classic multilocus sequence typing (MLST) schemes targeting 6 to 8 housekeeping or virulence genes. We have developed and evaluated core genome MLST (cgMLST) schemes and applied them to isolates from multiple clonal groups, including those associated with 39 listeriosis outbreaks. The cgMLST...
Power spectral density and scaling exponent of high frequency global solar radiation sequences

Science.gov (United States)

Calif, Rudy; Schmitt, François G.; Huang, Yongxiang

2013-04-01

The part of the solar power production from photovlotaïcs systems is constantly increasing in the electric grids. Solar energy converter devices such as photovoltaic cells are very sensitive to instantaneous solar radiation fluctuations. Thus rapid variation of solar radiation due to changes in the local meteorological condition can induce large amplitude fluctuations of the produced electrical power and reduce the overall efficiency of the system. When large amount of photovoltaic electricity is send into a weak or small electricity network such as island network, the electric grid security can be in jeopardy due to these power fluctuations. The integration of this energy in the electrical network remains a major challenge, due to the high variability of solar radiation in time and space. To palliate these difficulties, it is essential to identify the characteristic of these fluctuations in order to anticipate the eventuality of power shortage or power surge. The objective of this study is to present an approach based on Empirical Mode Decomposition (EMD) and Hilbert-Huang Transform (HHT) to highlight the scaling properties of global solar irradiance data G(t). The scale of invariance is detected on this dataset using the Empirical Mode Decomposition in association with arbitrary-order Hilbert spectral analysis, a generalization of (HHT) or Hilbert Spectral Analysis (HSA). The first step is the EMD, consists in decomposing the normalized global solar radiation data G'(t) into several Intrinsic Mode Functions (IMF) Ci(t) without giving an a priori basis. Consequently, the normalized original solar radiation sequence G'(t) can be written as a sum of Ci(t) with a residual rn. From all IMF modes, a joint PDF P(f,A) of locally and instantaneous frequency f and amplitude A, is estimated. To characterize the scaling behavior in amplitude-frequency space, an arbitrary-order Hilbert marginal spectrum is defined to: Iq(f) = 0 P (f,A)A dA (1) with q × 0 In case of scale
TRX-LOGOS - a graphical tool to demonstrate DNA information content dependent upon backbone dynamics in addition to base sequence.

Science.gov (United States)

Fortin, Connor H; Schulze, Katharina V; Babbitt, Gregory A

2015-01-01

It is now widely-accepted that DNA sequences defining DNA-protein interactions functionally depend upon local biophysical features of DNA backbone that are important in defining sites of binding interaction in the genome (e.g. DNA shape, charge and intrinsic dynamics). However, these physical features of DNA polymer are not directly apparent when analyzing and viewing Shannon information content calculated at single nucleobases in a traditional sequence logo plot. Thus, sequence logos plots are severely limited in that they convey no explicit information regarding the structural dynamics of DNA backbone, a feature often critical to binding specificity. We present TRX-LOGOS, an R software package and Perl wrapper code that interfaces the JASPAR database for computational regulatory genomics. TRX-LOGOS extends the traditional sequence logo plot to include Shannon information content calculated with regard to the dinucleotide-based BI-BII conformation shifts in phosphate linkages on the DNA backbone, thereby adding a visual measure of intrinsic DNA flexibility that can be critical for many DNA-protein interactions. TRX-LOGOS is available as an R graphics module offered at both SourceForge and as a download supplement at this journal. To demonstrate the general utility of TRX logo plots, we first calculated the information content for 416 Saccharomyces cerevisiae transcription factor binding sites functionally confirmed in the Yeastract database and matched to previously published yeast genomic alignments. We discovered that flanking regions contain significantly elevated information content at phosphate linkages than can be observed at nucleobases. We also examined broader transcription factor classifications defined by the JASPAR database, and discovered that many general signatures of transcription factor binding are locally more information rich at the level of DNA backbone dynamics than nucleobase sequence. We used TRX-logos in combination with MEGA 6.0 software
The International Nucleotide Sequence Database Collaboration.

Science.gov (United States)

Cochrane, Guy; Karsch-Mizrachi, Ilene; Nakamura, Yasukazu

2011-01-01

Under the International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org), globally comprehensive public domain nucleotide sequence is captured, preserved and presented. The partners of this long-standing collaboration work closely together to provide data formats and conventions that enable consistent data submission to their databases and support regular data exchange around the globe. Clearly defined policy and governance in relation to free access to data and relationships with journal publishers have positioned INSDC databases as a key provider of the scientific record and a core foundation for the global bioinformatics data infrastructure. While growth in sequence data volumes comes no longer as a surprise to INSDC partners, the uptake of next-generation sequencing technology by mainstream science that we have witnessed in recent years brings a step-change to growth, necessarily making a clear mark on INSDC strategy. In this article, we introduce the INSDC, outline data growth patterns and comment on the challenges of increased growth.
A platform-independent method for detecting errors in metagenomic sequencing data: DRISEE.

Directory of Open Access Journals (Sweden)

Kevin P Keegan

Full Text Available We provide a novel method, DRISEE (duplicate read inferred sequencing error estimation, to assess sequencing quality (alternatively referred to as "noise" or "error" within and/or between sequencing samples. DRISEE provides positional error estimates that can be used to inform read trimming within a sample. It also provides global (whole sample error estimates that can be used to identify samples with high or varying levels of sequencing error that may confound downstream analyses, particularly in the case of studies that utilize data from multiple sequencing samples. For shotgun metagenomic data, we believe that DRISEE provides estimates of sequencing error that are more accurate and less constrained by technical limitations than existing methods that rely on reference genomes or the use of scores (e.g. Phred. Here, DRISEE is applied to (non amplicon data sets from both the 454 and Illumina platforms. The DRISEE error estimate is obtained by analyzing sets of artifactual duplicate reads (ADRs, a known by-product of both sequencing platforms. We present DRISEE as an open-source, platform-independent method to assess sequencing error in shotgun metagenomic data, and utilize it to discover previously uncharacterized error in de novo sequence data from the 454 and Illumina sequencing platforms.
Rapid detection, classification and accurate alignment of up to a million or more related protein sequences.

Science.gov (United States)

Neuwald, Andrew F

2009-08-01

The patterns of sequence similarity and divergence present within functionally diverse, evolutionarily related proteins contain implicit information about corresponding biochemical similarities and differences. A first step toward accessing such information is to statistically analyze these patterns, which, in turn, requires that one first identify and accurately align a very large set of protein sequences. Ideally, the set should include many distantly related, functionally divergent subgroups. Because it is extremely difficult, if not impossible for fully automated methods to align such sequences correctly, researchers often resort to manual curation based on detailed structural and biochemical information. However, multiply-aligning vast numbers of sequences in this way is clearly impractical. This problem is addressed using Multiply-Aligned Profiles for Global Alignment of Protein Sequences (MAPGAPS). The MAPGAPS program uses a set of multiply-aligned profiles both as a query to detect and classify related sequences and as a template to multiply-align the sequences. It relies on Karlin-Altschul statistics for sensitivity and on PSI-BLAST (and other) heuristics for speed. Using as input a carefully curated multiple-profile alignment for P-loop GTPases, MAPGAPS correctly aligned weakly conserved sequence motifs within 33 distantly related GTPases of known structure. By comparison, the sequence- and structurally based alignment methods hmmalign and PROMALS3D misaligned at least 11 and 23 of these regions, respectively. When applied to a dataset of 65 million protein sequences, MAPGAPS identified, classified and aligned (with comparable accuracy) nearly half a million putative P-loop GTPase sequences. A C++ implementation of MAPGAPS is available at http://mapgaps.igs.umaryland.edu. Supplementary data are available at Bioinformatics online.
Timing and sequencing of events marking the transition to adulthood in two informal settlements in Nairobi, Kenya.

Science.gov (United States)

Beguy, Donatien; Kabiru, Caroline W; Zulu, Eliya M; Ezeh, Alex C

2011-06-01

Young people living in poor urban informal settlements face unique challenges as they transition to adulthood. This exploratory paper uses retrospective information from the baseline survey of a 3-year prospective study to examine the timing and sequencing of four key markers (first sex, marriage, birth, and independent housing) of the transition to adulthood among 3,944 adolescents in two informal settlements in Nairobi city, Kenya. Event history analysis techniques are employed to examine the timing of the events. Results indicate that there is no significant gender difference with regard to first sexual debut among adolescents. For many boys and girls, the first sexual experience occurs outside of marriage or other union. For males, the sequencing of entry begins with entry into first sex, followed by independent housing. Conversely, for females, the sequencing begins with first sex and then parenthood. Apart from sexual debut, the patterns of entry into union and parenthood do not differ much from what was observed for Nairobi as a whole. The space constraints that typify the two slums may have influenced the pattern of leaving home observed. We discuss these and other findings in light of their implications for young people's health and well-being in resource-poor settings in urban areas.
High-density rhesus macaque oligonucleotide microarray design using early-stage rhesus genome sequence information and human genome annotations

Directory of Open Access Journals (Sweden)

Magness Charles L

2007-01-01

Full Text Available Abstract Background Until recently, few genomic reagents specific for non-human primate research have been available. To address this need, we have constructed a macaque-specific high-density oligonucleotide microarray by using highly fragmented low-pass sequence contigs from the rhesus genome project together with the detailed sequence and exon structure of the human genome. Using this method, we designed oligonucleotide probes to over 17,000 distinct rhesus/human gene orthologs and increased by four-fold the number of available genes relative to our first-generation expressed sequence tag (EST-derived array. Results We constructed a database containing 248,000 exon sequences from 23,000 human RefSeq genes and compared each human exon with its best matching sequence in the January 2005 version of the rhesus genome project list of 486,000 DNA contigs. Best matching rhesus exon sequences for each of the 23,000 human genes were then concatenated in the proper order and orientation to produce a rhesus "virtual transcriptome." Microarray probes were designed, one per gene, to the region closest to the 3' untranslated region (UTR of each rhesus virtual transcript. Each probe was compared to a composite rhesus/human transcript database to test for cross-hybridization potential yielding a final probe set representing 18,296 rhesus/human gene orthologs, including transcript variants, and over 17,000 distinct genes. We hybridized mRNA from rhesus brain and spleen to both the EST- and genome-derived microarrays. Besides four-fold greater gene coverage, the genome-derived array also showed greater mean signal intensities for genes present on both arrays. Genome-derived probes showed 99.4% identity when compared to 4,767 rhesus GenBank sequence tag site (STS sequences indicating that early stage low-pass versions of complex genomes are of sufficient quality to yield valuable functional genomic information when combined with finished genome information from

Forecasting an invasive species’ distribution with global distribution data, local data, and physiological information

Science.gov (United States)

Jarnevich, Catherine S.; Young, Nicholas E.; Talbert, Marian; Talbert, Colin

2018-01-01

Understanding invasive species distributions and potential invasions often requires broad‐scale information on the environmental tolerances of the species. Further, resource managers are often faced with knowing these broad‐scale relationships as well as nuanced environmental factors related to their landscape that influence where an invasive species occurs and potentially could occur. Using invasive buffelgrass (Cenchrus ciliaris), we developed global models and local models for Saguaro National Park, Arizona, USA, based on location records and literature on physiological tolerances to environmental factors to investigate whether environmental relationships of a species at a global scale are also important at local scales. In addition to correlative models with five commonly used algorithms, we also developed a model using a priori user‐defined relationships between occurrence and environmental characteristics based on a literature review. All correlative models at both scales performed well based on statistical evaluations. The user‐defined curves closely matched those produced by the correlative models, indicating that the correlative models may be capturing mechanisms driving the distribution of buffelgrass. Given climate projections for the region, both global and local models indicate that conditions at Saguaro National Park may become more suitable for buffelgrass. Combining global and local data with correlative models and physiological information provided a holistic approach to forecasting invasive species distributions.
Reference evapotranspiration forecasting based on local meteorological and global climate information screened by partial mutual information

Science.gov (United States)

Fang, Wei; Huang, Shengzhi; Huang, Qiang; Huang, Guohe; Meng, Erhao; Luan, Jinkai

2018-06-01

In this study, reference evapotranspiration (ET0) forecasting models are developed for the least economically developed regions subject to meteorological data scarcity. Firstly, the partial mutual information (PMI) capable of capturing the linear and nonlinear dependence is investigated regarding its utility to identify relevant predictors and exclude those that are redundant through the comparison with partial linear correlation. An efficient input selection technique is crucial for decreasing model data requirements. Then, the interconnection between global climate indices and regional ET0 is identified. Relevant climatic indices are introduced as additional predictors to comprise information regarding ET0, which ought to be provided by meteorological data unavailable. The case study in the Jing River and Beiluo River basins, China, reveals that PMI outperforms the partial linear correlation in excluding the redundant information, favouring the yield of smaller predictor sets. The teleconnection analysis identifies the correlation between Nino 1 + 2 and regional ET0, indicating influences of ENSO events on the evapotranspiration process in the study area. Furthermore, introducing Nino 1 + 2 as predictors helps to yield more accurate ET0 forecasts. A model performance comparison also shows that non-linear stochastic models (SVR or RF with input selection through PMI) do not always outperform linear models (MLR with inputs screen by linear correlation). However, the former can offer quite comparable performance depending on smaller predictor sets. Therefore, efforts such as screening model inputs through PMI and incorporating global climatic indices interconnected with ET0 can benefit the development of ET0 forecasting models suitable for data-scarce regions.
[Complete genome sequencing and sequence analysis of BCG Tice].

Science.gov (United States)

Wang, Zhiming; Pan, Yuanlong; Wu, Jun; Zhu, Baoli

2012-10-04

The objective of this study is to obtain the complete genome sequence of Bacillus Calmette-Guerin Tice (BCG Tice), in order to provide more information about the molecular biology of BCG Tice and design more reasonable vaccines to prevent tuberculosis. We assembled the data from high-throughput sequencing with SOAPdenovo software, with many contigs and scaffolds obtained. There are many sequence gaps and physical gaps remained as a result of regional low coverage and low quality. We designed primers at the end of contigs and performed PCR amplification in order to link these contigs and scaffolds. With various enzymes to perform PCR amplification, adjustment of PCR reaction conditions, and combined with clone construction to sequence, all the gaps were finished. We obtained the complete genome sequence of BCG Tice and submitted it to GenBank of National Center for Biotechnology Information (NCBI). The genome of BCG Tice is 4334064 base pairs in length, with GC content 65.65%. The problems and strategies during the finishing step of BCG Tice sequencing are illuminated here, with the hope of affording some experience to those who are involved in the finishing step of genome sequencing. The microarray data were verified by our results.
DEVELOPMENT OF INFORMATION SERVICES AND PRODUCTS IN UZBEKISTAN DURING GLOBALIZATION

Directory of Open Access Journals (Sweden)

Feruza Khayrullaevna Sidikova

2014-10-01

Full Text Available The main purpose of the article is investigation of the issues of development of information services and products in Uzbekistan during globalization role of information and communicative technologies in development international business and trade. As the results of the research there revealed a connection of introducing electronic commerce and business in practice of firms, corporation and banks there conducted changes in the character of carrying out commercial and financial transactions, interrelations with partners and clients, elaborations and introduction business strategies and competition itself. In the conclusion there offered the suggestions of joining and adjusting to each other varying legislation of different countries and developing international system of taxation of Internet commerce satisfying all participants of electronic trading transactions.
Noninvasive estimation of global activation sequence using the extended Kalman filter.

Science.gov (United States)

Liu, Chenguang; He, Bin

2011-03-01

A new algorithm for 3-D imaging of the activation sequence from noninvasive body surface potentials is proposed. After formulating the nonlinear relationship between the 3-D activation sequence and the body surface recordings during activation, the extended Kalman filter (EKF) is utilized to estimate the activation sequence in a recursive way. The state vector containing the activation sequence is optimized during iteration by updating the error variance/covariance matrix. A new regularization scheme is incorporated into the "predict" procedure of EKF to tackle the ill-posedness of the inverse problem. The EKF-based algorithm shows good performance in simulation under single-site pacing. Between the estimated activation sequences and true values, the average correlation coefficient (CC) is 0.95, and the relative error (RE) is 0.13. The average localization error (LE) when localizing the pacing site is 3.0 mm. Good results are also obtained under dual-site pacing (CC = 0.93, RE = 0.16, and LE = 4.3 mm). Furthermore, the algorithm shows robustness to noise. The present promising results demonstrate that the proposed EKF-based inverse approach can noninvasively estimate the 3-D activation sequence with good accuracy and the new algorithm shows good features due to the application of EKF.
The GRIN-Global Information Management System – A Preview and Opportunity for Public User Input

Science.gov (United States)

The GRIN-Global Information Management System, under development for the past two years, will provide the world's crop genebanks and plant genetic resource (PGR) users with a powerful, flexible, easy-to-use PGR information management system. Developed jointly by the USDA Agricultural Research Servi...
77 FR 58412 - Meeting of the Global Justice Information Sharing Initiative Federal Advisory Committee

Science.gov (United States)

2012-09-20

... support of the Administration's justice priorities. The GAC will guide and monitor the development of the Global information sharing concept. It will advise the Assistant Attorney General, OJP; the Attorney...
Through Increasing "Information Literacy" Capital and Habitus (Agency): The Complementary Impact on Composition Skills When Appropriately Sequenced

Science.gov (United States)

Karas, Timothy

2017-01-01

Through a case study approach of a cohort of community college students at a single community college, the impact on success rates in composition courses was analyzed based on the sequence of completing an information literacy course. Two student cohorts were sampled based on completing an information literacy course prior to, or concurrently with…
Hardware Accelerated Sequence Alignment with Traceback

Directory of Open Access Journals (Sweden)

Scott Lloyd

2009-01-01

in a timely manner. Known methods to accelerate alignment on reconfigurable hardware only address sequence comparison, limit the sequence length, or exhibit memory and I/O bottlenecks. A space-efficient, global sequence alignment algorithm and architecture is presented that accelerates the forward scan and traceback in hardware without memory and I/O limitations. With 256 processing elements in FPGA technology, a performance gain over 300 times that of a desktop computer is demonstrated on sequence lengths of 16000. For greater performance, the architecture is scalable to more processing elements.
[Study on quality evaluation of sequence and SSR information in transcriptome of Astragalus membranacus].

Science.gov (United States)

Chang, Yue; Yang, Song; Liu, Zhen-Peng; Ren, Wei-Chao; Liu, Jie; Ma, Wei

2016-04-01

In this study, 454/Roche GS FLX sequencing technology was used to obtain the data of the Astragalus membranaceus. Four hundred and fifty-four Sequencing System Software was applied to carry out the transcription of the group from scratch. Using MISA tools, 9 893 unigenes were selected for the sequence of the genome of A. membranaceus, and the information of SSR locus was analyzed. According to the result, the average length of reads was 413 bp, about 86% of the reads was involved in the splicing, the length of the N50 was 1 205 bp, the number of unigenes was measured by the whole transcript. 1 729 SSR loci in the A. membranaceus transcriptome were searched, the occurrence frequency of SSR was 9.24%, the frequency of SSR in the whole transcriptome was 13.42%, the average length of SSR was 7.97 kb. One hundred and twenty-seven kinds of core repeat sequences were found, the dominant type was TG/AC type of dinucleotide, it appeared to account for 4.25% of the total SSR locus. The results of the sequence of the transcription of the A. membranaceus transcriptome revealed the overall expression, and a large number of unigenessequence was obtained, and the SSR locus in the genome of the A. membranaceus is high, and the type is diverse, and the polymorphism of the gene is high. Copyright© by the Chinese Pharmaceutical Association.
Examining the global health arena: strengths and weaknesses of a convention approach to global health challenges.

Science.gov (United States)

Haffeld, Just Balstad; Siem, Harald; Røttingen, John-Arne

2010-01-01

The article comprises a conceptual framework to analyze the strengths and weaknesses of a global health convention. The analyses are inspired by Lawrence Gostin's suggested Framework Convention on Global Health. The analytical model takes a starting-point in events tentatively following a logic sequence: Input (global health funding), Processes (coordination, cooperation, accountability, allocation of aid), Output (definition of basic survival needs), Outcome (access to health services), and Impact (health for all). It then examines to what degree binding international regulations can create order in such a sequence of events. We conclude that a global health convention could be an appropriate instrument to deal with some of the problems of global health. We also show that some of the tasks preceding a convention approach might be to muster international support for supra-national health regulations, negotiate compromises between existing stakeholders in the global health arena, and to utilize WHO as a platform for further discussions on a global health convention. © 2010 American Society of Law, Medicine & Ethics, Inc.
Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences

Directory of Open Access Journals (Sweden)

Jiang Tao

2011-10-01

Full Text Available Abstract Background Population levels of microbial phylotypes can be examined using a hybridization-based method that utilizes a small set of computationally-designed DNA probes targeted to a gene common to all. Our previous algorithm attempts to select a set of probes such that each training sequence manifests a unique theoretical hybridization pattern (a binary fingerprint to a probe set. It does so without taking into account similarity between training gene sequences or their putative taxonomic classifications, however. We present an improved algorithm for probe set selection that utilizes the available taxonomic information of training gene sequences and attempts to choose probes such that the resultant binary fingerprints cluster into real taxonomic groups. Results Gene sequences manifesting identical fingerprints with probes chosen by the new algorithm are more likely to be from the same taxonomic group than probes chosen by the previous algorithm. In cases where they are from different taxonomic groups, underlying DNA sequences of identical fingerprints are more similar to each other in probe sets made with the new versus the previous algorithm. Complete removal of large taxonomic groups from training data does not greatly decrease the ability of probe sets to distinguish those groups. Conclusions Probe sets made from the new algorithm create fingerprints that more reliably cluster into biologically meaningful groups. The method can readily distinguish microbial phylotypes that were excluded from the training sequences, suggesting novel microbes can also be detected.
Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences.

Science.gov (United States)

Ruegger, Paul M; Della Vedova, Gianluca; Jiang, Tao; Borneman, James

2011-10-10

Population levels of microbial phylotypes can be examined using a hybridization-based method that utilizes a small set of computationally-designed DNA probes targeted to a gene common to all. Our previous algorithm attempts to select a set of probes such that each training sequence manifests a unique theoretical hybridization pattern (a binary fingerprint) to a probe set. It does so without taking into account similarity between training gene sequences or their putative taxonomic classifications, however. We present an improved algorithm for probe set selection that utilizes the available taxonomic information of training gene sequences and attempts to choose probes such that the resultant binary fingerprints cluster into real taxonomic groups. Gene sequences manifesting identical fingerprints with probes chosen by the new algorithm are more likely to be from the same taxonomic group than probes chosen by the previous algorithm. In cases where they are from different taxonomic groups, underlying DNA sequences of identical fingerprints are more similar to each other in probe sets made with the new versus the previous algorithm. Complete removal of large taxonomic groups from training data does not greatly decrease the ability of probe sets to distinguish those groups. Probe sets made from the new algorithm create fingerprints that more reliably cluster into biologically meaningful groups. The method can readily distinguish microbial phylotypes that were excluded from the training sequences, suggesting novel microbes can also be detected.
Exploiting sequence and stability information for directing nanobody stability engineering.

Science.gov (United States)

Kunz, Patrick; Flock, Tilman; Soler, Nicolas; Zaiss, Moritz; Vincke, Cécile; Sterckx, Yann; Kastelic, Damjana; Muyldermans, Serge; Hoheisel, Jörg D

2017-09-01

Variable domains of camelid heavy-chain antibodies, commonly named nanobodies, have high biotechnological potential. In view of their broad range of applications in research, diagnostics and therapy, engineering their stability is of particular interest. One important aspect is the improvement of thermostability, because it can have immediate effects on conformational stability, protease resistance and aggregation propensity of the protein. We analyzed the sequences and thermostabilities of 78 purified nanobody binders. From this data, potentially stabilizing amino acid variations were identified and studied experimentally. Some mutations improved the stability of nanobodies by up to 6.1°C, with an average of 2.3°C across eight modified nanobodies. The stabilizing mechanism involves an improvement of both conformational stability and aggregation behavior, explaining the variable degree of stabilization in individual molecules. In some instances, variations predicted to be stabilizing actually led to thermal destabilization of the proteins. The reasons for this contradiction between prediction and experiment were investigated. The results reveal a mutational strategy to improve the biophysical behavior of nanobody binders and indicate a species-specificity of nanobody architecture. This study illustrates the potential and limitations of engineering nanobody thermostability by merging sequence information with stability data, an aspect that is becoming increasingly important with the recent development of high-throughput biophysical methods. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Globalization, Information Technology and Higher Education in Nigeria: The Roles of Library Professionals

Science.gov (United States)

Uwhekadom, Ejimaji Emmanuel; Olawolu, Oladunni Elizabeth

2013-01-01

The influence of globalization and information technology on higher education in Nigeria was investigated through a descriptive survey design. Forty-five professional librarians from University of Port Harcourt, Port Harcourt, Ignatius Ajuru University of Education, Rumuolumeni Port Harcourt, Federal College of Education (Technical) Omoku Rivers…
Information communications technologies that surpass the global communications network. Sekai tsushinmo o koeru joho tsushin gijutsu

Energy Technology Data Exchange (ETDEWEB)

1990-05-01

Development of information communications technologies that surpass the global communications network is being pushed forward in order to establish the global village that McLuhan foretold in 1964. Effects of hybrid intensification with the intensification of communications technologies and computer technologies have become evident as facsimiles, automated teller machines of banks, home videos, automatic response telephones with synthetic voices, compact disks, portable telephones, video games and high-definition televisions were developed and put to use in a wide range. Intensification and integration of computer technologies and communications technologies has every possibility, but it also has a peculiar aspect of lacking guiding principles. Uncertain factors of the values of informations in the market are ever increasing, and their true values are yet to be found. Anyhow, it is a long way to the goal of the global village.
Complete Genome Sequences of Isolates of Enterococcus faecium Sequence Type 117, a Globally Disseminated Multidrug-Resistant Clone

Science.gov (United States)

Tedim, Ana P.; Lanza, Val F.; Manrique, Marina; Pareja, Eduardo; Ruiz-Garbajosa, Patricia; Cantón, Rafael; Baquero, Fernando; Tobes, Raquel

2017-01-01

ABSTRACT The emergence of nosocomial infections by multidrug-resistant sequence type 117 (ST117) Enterococcus faecium has been reported in several European countries. ST117 has been detected in Spanish hospitals as one of the main causes of bloodstream infections. We analyzed genome variations of ST117 strains isolated in Madrid and describe the first ST117 closed genome sequences. PMID:28360174
Dynamic Sequence Assignment.

Science.gov (United States)

1983-12-01

D-136 548 DYNAMIIC SEQUENCE ASSIGNMENT(U) ADVANCED INFORMATION AND 1/2 DECISION SYSTEMS MOUNTAIN YIELW CA C A 0 REILLY ET AL. UNCLSSIIED DEC 83 AI/DS...I ADVANCED INFORMATION & DECISION SYSTEMS Mountain View. CA 94040 84 u ,53 V,..’. Unclassified _____ SCURITY CLASSIFICATION OF THIS PAGE REPORT...reviews some important heuristic algorithms developed for fas- ter solution of the sequence assignment problem. 3.1. DINAMIC MOGRAMUNIG FORMULATION FOR
A proposed clinical decision support architecture capable of supporting whole genome sequence information.

Science.gov (United States)

Welch, Brandon M; Loya, Salvador Rodriguez; Eilbeck, Karen; Kawamoto, Kensaku

2014-04-04

Whole genome sequence (WGS) information may soon be widely available to help clinicians personalize the care and treatment of patients. However, considerable barriers exist, which may hinder the effective utilization of WGS information in a routine clinical care setting. Clinical decision support (CDS) offers a potential solution to overcome such barriers and to facilitate the effective use of WGS information in the clinic. However, genomic information is complex and will require significant considerations when developing CDS capabilities. As such, this manuscript lays out a conceptual framework for a CDS architecture designed to deliver WGS-guided CDS within the clinical workflow. To handle the complexity and breadth of WGS information, the proposed CDS framework leverages service-oriented capabilities and orchestrates the interaction of several independently-managed components. These independently-managed components include the genome variant knowledge base, the genome database, the CDS knowledge base, a CDS controller and the electronic health record (EHR). A key design feature is that genome data can be stored separately from the EHR. This paper describes in detail: (1) each component of the architecture; (2) the interaction of the components; and (3) how the architecture attempts to overcome the challenges associated with WGS information. We believe that service-oriented CDS capabilities will be essential to using WGS information for personalized medicine.
Whole genome shotgun sequencing of Indian strains of Streptococcus agalactiae

Directory of Open Access Journals (Sweden)

Balaji Veeraraghavan

2017-12-01

Full Text Available Group B streptococcus is known as a leading cause of neonatal infections in developing countries. The present study describes the whole genome shotgun sequences of four Group B Streptococcus (GBS isolates. Molecular data on clonality is lacking for GBS in India. The present genome report will add important information on the scarce genome data of GBS and will help in deriving comparative genome studies of GBS isolates at global level. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession numbers NHPL00000000 – NHPO00000000.

Globalization: Ecological consequences of global-scale connectivity in people, resources and information

Science.gov (United States)

Globalization is a phenomenon affecting all facets of the Earth System. Within the context of ecological systems, it is becoming increasingly apparent that global connectivity among terrestrial systems, the atmosphere, and oceans is driving many ecological dynamics at finer scales and pushing thresh...
A laboratory information management system for DNA barcoding workflows.

Science.gov (United States)

Vu, Thuy Duong; Eberhardt, Ursula; Szöke, Szániszló; Groenewald, Marizeth; Robert, Vincent

2012-07-01

This paper presents a laboratory information management system for DNA sequences (LIMS) created and based on the needs of a DNA barcoding project at the CBS-KNAW Fungal Biodiversity Centre (Utrecht, the Netherlands). DNA barcoding is a global initiative for species identification through simple DNA sequence markers. We aim at generating barcode data for all strains (or specimens) included in the collection (currently ca. 80 k). The LIMS has been developed to better manage large amounts of sequence data and to keep track of the whole experimental procedure. The system has allowed us to classify strains more efficiently as the quality of sequence data has improved, and as a result, up-to-date taxonomic names have been given to strains and more accurate correlation analyses have been carried out.
The States of Sub Saharan Africa on the way to the Global Information Society

Directory of Open Access Journals (Sweden)

Konstantin A. Pantserev

2010-12-01

Full Text Available The paper devotes to the problem of overcoming of the digital divide in the Sub Saharan African States. On the example of Kenya the author speaks about the comparative success of the development of the information technologies in Africa and in turn underlines the most significant obstacles on the way of African states to the global information society and suggests the means how to overcome them.
Pairwise Sequence Alignment Library

Energy Technology Data Exchange (ETDEWEB)

2015-05-20

Vector extensions, such as SSE, have been part of the x86 CPU since the 1990s, with applications in graphics, signal processing, and scientific applications. Although many algorithms and applications can naturally benefit from automatic vectorization techniques, there are still many that are difficult to vectorize due to their dependence on irregular data structures, dense branch operations, or data dependencies. Sequence alignment, one of the most widely used operations in bioinformatics workflows, has a computational footprint that features complex data dependencies. The trend of widening vector registers adversely affects the state-of-the-art sequence alignment algorithm based on striped data layouts. Therefore, a novel SIMD implementation of a parallel scan-based sequence alignment algorithm that can better exploit wider SIMD units was implemented as part of the Parallel Sequence Alignment Library (parasail). Parasail features: Reference implementations of all known vectorized sequence alignment approaches. Implementations of Smith Waterman (SW), semi-global (SG), and Needleman Wunsch (NW) sequence alignment algorithms. Implementations across all modern CPU instruction sets including AVX2 and KNC. Language interfaces for C/C++ and Python.
SDSS-IV MaNGA: Spatially Resolved Star Formation Main Sequence and LI(N)ER Sequence

Science.gov (United States)

Hsieh, B. C.; Lin, Lihwai; Lin, J. H.; Pan, H. A.; Hsu, C. H.; Sánchez, S. F.; Cano-Díaz, M.; Zhang, K.; Yan, R.; Barrera-Ballesteros, J. K.; Boquien, M.; Riffel, R.; Brownstein, J.; Cruz-González, I.; Hagen, A.; Ibarra, H.; Pan, K.; Bizyaev, D.; Oravetz, D.; Simmons, A.

2017-12-01

We present our study on the spatially resolved Hα and M * relation for 536 star-forming and 424 quiescent galaxies taken from the MaNGA survey. We show that the star formation rate surface density ({{{Σ }}}{SFR}), derived based on the Hα emissions, is strongly correlated with the M * surface density ({{{Σ }}}* ) on kiloparsec scales for star-forming galaxies and can be directly connected to the global star-forming sequence. This suggests that the global main sequence may be a consequence of a more fundamental relation on small scales. On the other hand, our result suggests that ∼20% of quiescent galaxies in our sample still have star formation activities in the outer region with lower specific star formation rate (SSFR) than typical star-forming galaxies. Meanwhile, we also find a tight correlation between {{{Σ }}}{{H}α } and {{{Σ }}}* for LI(N)ER regions, named the resolved “LI(N)ER” sequence, in quiescent galaxies, which is consistent with the scenario that LI(N)ER emissions are primarily powered by the hot, evolved stars as suggested in the literature.
Extracting information from an ensemble of GCMs to reliably assess future global runoff change

NARCIS (Netherlands)

Sperna Weiland, F.C.; Beek, L.P.H. van; Weerts, A.H.; Bierkens, M.F.P.

2011-01-01

Future runoff projections derived from different global climate models (GCMs) show large differences. Therefore, within this study the, information from multiple GCMs has been combined to better assess hydrological changes. For projections of precipitation and temperature the Reliability ensemble
Global MLST of Salmonella Typhi Revisited in Post-Genomic Era: Genetic conservation, Population Structure and Comparative genomics of rare sequence types

Directory of Open Access Journals (Sweden)

Kien-Pong eYap

2016-03-01

Full Text Available Typhoid fever, caused by Salmonella enterica serovar Typhi, remains an important public health burden in Southeast Asia and other endemic countries. Various genotyping methods have been applied to study the genetic variations of this human-restricted pathogen. Multilocus Sequence Typing (MLST is one of the widely accepted methods, and recently, there is a growing interest in the re-application of MLST in the post-genomic era. In this study, we provide the global MLST distribution of S. Typhi utilizing both publicly available 1,826 S. Typhi genome sequences in addition to performing conventional MLST on S. Typhi strains isolated from various endemic regions spanning over a century. Our global MLST analysis confirms the predominance of two sequence types (ST1 and ST2 co-existing in the endemic regions. Interestingly, S. Typhi strains with ST8 are currently confined within the African continent. Comparative genomic analyses of ST8 and other rare STs with genomes of ST1/ST2 revealed unique mutations in important virulence genes such as flhB, sipC and tviD that may explain the variations that differentiate between seemingly successful (widespread and unsuccessful (poor dissemination S. Typhi populations. Large scale whole-genome phylogeny demonstrated evidence of phylogeographical structuring and showed that ST8 may have diverged from the earlier ancestral population of ST1 and ST2, which later lost some of its fitness advantages, leading to poor worldwide dissemination. In response to the unprecedented increase in genomic data, this study demonstrates and highlights the utility of large-scale genome-based MLST as a quick and effective approach to narrow the scope of in-depth comparative genomic analysis and consequently provide new insights into the fine scale of pathogen evolution and population structure.
A Secure and Efficient Communications Architecture for Global Information Grid Users Via Cooperating Space Assets

National Research Council Canada - National Science Library

Hubenko, Jr, Victor P

2008-01-01

With the Information Age in full and rapid development, users expect to have global, seamless, ubiquitous, secure, and efficient communications capable of providing access to real-time applications and collaboration...
76 FR 13226 - Meeting of the Department of Justice Global Justice Information Sharing Initiative Federal...

Science.gov (United States)

2011-03-10

... coordination of national policy, practices, and technical solutions in support of the Administration's justice priorities. The GAC will guide and monitor the development of the Global information sharing concept. It will...
Global alignment algorithms implementations | Fatumo ...

African Journals Online (AJOL)

In this paper, we implemented the two routes for sequence comparison, that is; the dotplot and Needleman-wunsch algorithm for global sequence alignment. Our algorithms were implemented in python programming language and were tested on Linux platform 1.60GHz, 512 MB of RAM SUSE 9.2 and 10.1 versions.
Socio-Economic Correlates of Information Security Threats and Controls in Global Financial Services Industry: An Analysis

OpenAIRE

Princely Ifinedo

2015-01-01

Threats to data and information assets of Global Financial Services Industry (GFSI) are ever-present; such problems, if not well understood, could lead to huge negative impact. To some extent, the environment where a business operates does matter for its success. This study presents information about the relationships between selected socio-economic factors and information security threats and controls in the financial services industry. Essentially, it seeks to enrich the information provide...
A Proposed Clinical Decision Support Architecture Capable of Supporting Whole Genome Sequence Information

Directory of Open Access Journals (Sweden)

Brandon M. Welch

2014-04-01

Full Text Available Whole genome sequence (WGS information may soon be widely available to help clinicians personalize the care and treatment of patients. However, considerable barriers exist, which may hinder the effective utilization of WGS information in a routine clinical care setting. Clinical decision support (CDS offers a potential solution to overcome such barriers and to facilitate the effective use of WGS information in the clinic. However, genomic information is complex and will require significant considerations when developing CDS capabilities. As such, this manuscript lays out a conceptual framework for a CDS architecture designed to deliver WGS-guided CDS within the clinical workflow. To handle the complexity and breadth of WGS information, the proposed CDS framework leverages service-oriented capabilities and orchestrates the interaction of several independently-managed components. These independently-managed components include the genome variant knowledge base, the genome database, the CDS knowledge base, a CDS controller and the electronic health record (EHR. A key design feature is that genome data can be stored separately from the EHR. This paper describes in detail: (1 each component of the architecture; (2 the interaction of the components; and (3 how the architecture attempts to overcome the challenges associated with WGS information. We believe that service-oriented CDS capabilities will be essential to using WGS information for personalized medicine.
Effects of global and local contexts on chord processing: An ERP study.

Science.gov (United States)

Zhang, Jingjing; Zhou, Xuefeng; Chang, Ruohan; Yang, Yufang

2018-01-31

In real life, the processing of an incoming event is continuously influenced by prior information at multiple timescales. The present study investigated how harmonic contexts at both local and global levels influence the processing of an incoming chord in an event-related potentials experiment. Chord sequences containing two phrases were presented to musically trained listeners, with the last critical chord either harmonically related or less related to its preceding context at local and/or global levels. ERPs data showed an ERAN-like effect for local context in early time window and a N5-like component for later interaction between the local context and global context. These results suggest that both the local and global contexts influence the processing of an incoming music event, and the local effect happens earlier than the global. Moreover, the interaction between the local context and global context in N5 may suggest that music syntactic integration at local level takes place prior to the integration at global level. Copyright © 2017 Elsevier Ltd. All rights reserved.
THE FACTOR OF ENERGY-INFORMATION SECURITY IN THE FRAMEWORK OF GLOBAL CIVILIZATION-RELATED CHANGES

OpenAIRE

Alexey Viktorovich SUHORUKHIH

2015-01-01

The paper examined the grounds having involved global social and cultural changes, and emphasized the precedence taken by an energy-information component to the geopolitical dynamics of the civilization continuum. The study emphasized the relevance of new facets in social and cultural insight urged to respond to challenges of direct mental hazards emerging over the world, and requirement of energy-information security the civilization has sought for, assumed to be the framework for considerin...
The NIAID Division of AIDS enterprise information system: integrated decision support for global clinical research programs

Science.gov (United States)

Gupta, Nitin; Varghese, Suresh; Virkar, Hemant

2011-01-01

The National Institute of Allergy and Infectious Diseases (NIAID) Division of AIDS (DAIDS) Enterprise Information System (DAIDS-ES) is a web-based system that supports NIAID in the scientific, strategic, and tactical management of its global clinical research programs for HIV/AIDS vaccines, prevention, and therapeutics. Different from most commercial clinical trials information systems, which are typically protocol-driven, the DAIDS-ES was built to exchange information with those types of systems and integrate it in ways that help scientific program directors lead the research effort and keep pace with the complex and ever-changing global HIV/AIDS pandemic. Whereas commercially available clinical trials support systems are not usually disease-focused, DAIDS-ES was specifically designed to capture and incorporate unique scientific, demographic, and logistical aspects of HIV/AIDS treatment, prevention, and vaccine research in order to provide a rich source of information to guide informed decision-making. Sharing data across its internal components and with external systems, using defined vocabularies, open standards and flexible interfaces, the DAIDS-ES enables NIAID, its global collaborators and stakeholders, access to timely, quality information about NIAID-supported clinical trials which is utilized to: (1) analyze the research portfolio, assess capacity, identify opportunities, and avoid redundancies; (2) help support study safety, quality, ethics, and regulatory compliance; (3) conduct evidence-based policy analysis and business process re-engineering for improved efficiency. This report summarizes how the DAIDS-ES was conceptualized, how it differs from typical clinical trial support systems, the rationale for key design choices, and examples of how it is being used to advance the efficiency and effectiveness of NIAID's HIV/AIDS clinical research programs. PMID:21816958
Complete Genome Sequence of Ikoma Lyssavirus

OpenAIRE

Marston, Denise A.; Ellis, Richard J.; Horton, Daniel L.; Kuzmin, Ivan V.; Wise, Emma L.; McElhinney, Lorraine M.; Banyard, Ashley C.; Ngeleja, Chanasa; Keyyu, Julius; Cleaveland, Sarah; Lembo, Tiziana; Rupprecht, Charles E.; Fooks, Anthony R.

2012-01-01

Lyssaviruses (family Rhabdoviridae) constitute one of the most important groups of viral zoonoses globally. All lyssaviruses cause the disease rabies, an acute progressive encephalitis for which, once symptoms occur, there is no effective cure. Currently available vaccines are highly protective against the predominantly circulating lyssavirus species. Using next-generation sequencing technologies, we have obtained the whole-genome sequence for a novel lyssavirus, Ikoma lyssavirus (IKOV), isol...
Is central dogma a global property of cellular information flow?

Science.gov (United States)

Piras, Vincent; Tomita, Masaru; Selvarajoo, Kumar

2012-01-01

The central dogma of molecular biology has come under scrutiny in recent years. Here, we reviewed high-throughput mRNA and protein expression data of Escherichia coli, Saccharomyces cerevisiae, and several mammalian cells. At both single cell and population scales, the statistical comparisons between the entire transcriptomes and proteomes show clear correlation structures. In contrast, the pair-wise correlations of single transcripts to proteins show nullity. These data suggest that the organizing structure guiding cellular processes is observed at omics-wide scale, and not at single molecule level. The central dogma, thus, globally emerges as an average integrated flow of cellular information.
Allele Re-sequencing Technologies

DEFF Research Database (Denmark)

Byrne, Stephen; Farrell, Jacqueline Danielle; Asp, Torben

2013-01-01

The development of next-generation sequencing technologies has made sequencing an affordable approach for detection of genetic variations associated with various traits. However, the cost of whole genome re-sequencing still remains too high to be feasible for many plant species with large...... alternative to whole genome re-sequencing to identify causative genetic variations in plants. One challenge, however, will be efficient bioinformatics strategies for data handling and analysis from the increasing amount of sequence information....
LPTAU, Quasi Random Sequence Generator

International Nuclear Information System (INIS)

Sobol, Ilya M.

1993-01-01

1 - Description of program or function: LPTAU generates quasi random sequences. These are uniformly distributed sets of L=M N points in the N-dimensional unit cube: I N =[0,1]x...x[0,1]. These sequences are used as nodes for multidimensional integration; as searching points in global optimization; as trial points in multi-criteria decision making; as quasi-random points for quasi Monte Carlo algorithms. 2 - Method of solution: Uses LP-TAU sequence generation (see references). 3 - Restrictions on the complexity of the problem: The number of points that can be generated is L 30 . The dimension of the space cannot exceed 51
High-Throughput Next-Generation Sequencing of Polioviruses

Science.gov (United States)

Montmayeur, Anna M.; Schmidt, Alexander; Zhao, Kun; Magaña, Laura; Iber, Jane; Castro, Christina J.; Chen, Qi; Henderson, Elizabeth; Ramos, Edward; Shaw, Jing; Tatusov, Roman L.; Dybdahl-Sissoko, Naomi; Endegue-Zanga, Marie Claire; Adeniji, Johnson A.; Oberste, M. Steven; Burns, Cara C.

2016-01-01

ABSTRACT The poliovirus (PV) is currently targeted for worldwide eradication and containment. Sanger-based sequencing of the viral protein 1 (VP1) capsid region is currently the standard method for PV surveillance. However, the whole-genome sequence is sometimes needed for higher resolution global surveillance. In this study, we optimized whole-genome sequencing protocols for poliovirus isolates and FTA cards using next-generation sequencing (NGS), aiming for high sequence coverage, efficiency, and throughput. We found that DNase treatment of poliovirus RNA followed by random reverse transcription (RT), amplification, and the use of the Nextera XT DNA library preparation kit produced significantly better results than other preparations. The average viral reads per total reads, a measurement of efficiency, was as high as 84.2% ± 15.6%. PV genomes covering >99 to 100% of the reference length were obtained and validated with Sanger sequencing. A total of 52 PV genomes were generated, multiplexing as many as 64 samples in a single Illumina MiSeq run. This high-throughput, sequence-independent NGS approach facilitated the detection of a diverse range of PVs, especially for those in vaccine-derived polioviruses (VDPV), circulating VDPV, or immunodeficiency-related VDPV. In contrast to results from previous studies on other viruses, our results showed that filtration and nuclease treatment did not discernibly increase the sequencing efficiency of PV isolates. However, DNase treatment after nucleic acid extraction to remove host DNA significantly improved the sequencing results. This NGS method has been successfully implemented to generate PV genomes for molecular epidemiology of the most recent PV isolates. Additionally, the ability to obtain full PV genomes from FTA cards will aid in facilitating global poliovirus surveillance. PMID:27927929

Refined repetitive sequence searches utilizing a fast hash function and cross species information retrievals

Directory of Open Access Journals (Sweden)

Reneker Jeff

2005-05-01

Full Text Available Abstract Background Searching for small tandem/disperse repetitive DNA sequences streamlines many biomedical research processes. For instance, whole genomic array analysis in yeast has revealed 22 PHO-regulated genes. The promoter regions of all but one of them contain at least one of the two core Pho4p binding sites, CACGTG and CACGTT. In humans, microsatellites play a role in a number of rare neurodegenerative diseases such as spinocerebellar ataxia type 1 (SCA1. SCA1 is a hereditary neurodegenerative disease caused by an expanded CAG repeat in the coding sequence of the gene. In bacterial pathogens, microsatellites are proposed to regulate expression of some virulence factors. For example, bacteria commonly generate intra-strain diversity through phase variation which is strongly associated with virulence determinants. A recent analysis of the complete sequences of the Helicobacter pylori strains 26695 and J99 has identified 46 putative phase-variable genes among the two genomes through their association with homopolymeric tracts and dinucleotide repeats. Life scientists are increasingly interested in studying the function of small sequences of DNA. However, current search algorithms often generate thousands of matches – most of which are irrelevant to the researcher. Results We present our hash function as well as our search algorithm to locate small sequences of DNA within multiple genomes. Our system applies information retrieval algorithms to discover knowledge of cross-species conservation of repeat sequences. We discuss our incorporation of the Gene Ontology (GO database into these algorithms. We conduct an exhaustive time analysis of our system for various repetitive sequence lengths. For instance, a search for eight bases of sequence within 3.224 GBases on 49 different chromosomes takes 1.147 seconds on average. To illustrate the relevance of the search results, we conduct a search with and without added annotation terms for the
HydroSHEDS: A global comprehensive hydrographic dataset

Science.gov (United States)

Wickel, B. A.; Lehner, B.; Sindorf, N.

2007-12-01

The Hydrological data and maps based on SHuttle Elevation Derivatives at multiple Scales (HydroSHEDS) is an innovative product that, for the first time, provides hydrographic information in a consistent and comprehensive format for regional and global-scale applications. HydroSHEDS offers a suite of geo-referenced data sets, including stream networks, watershed boundaries, drainage directions, and ancillary data layers such as flow accumulations, distances, and river topology information. The goal of developing HydroSHEDS was to generate key data layers to support regional and global watershed analyses, hydrological modeling, and freshwater conservation planning at a quality, resolution and extent that had previously been unachievable. Available resolutions range from 3 arc-second (approx. 90 meters at the equator) to 5 minute (approx. 10 km at the equator) with seamless near-global extent. HydroSHEDS is derived from elevation data of the Shuttle Radar Topography Mission (SRTM) at 3 arc-second resolution. The original SRTM data have been hydrologically conditioned using a sequence of automated procedures. Existing methods of data improvement and newly developed algorithms have been applied, including void filling, filtering, stream burning, and upscaling techniques. Manual corrections were made where necessary. Preliminary quality assessments indicate that the accuracy of HydroSHEDS significantly exceeds that of existing global watershed and river maps. HydroSHEDS was developed by the Conservation Science Program of the World Wildlife Fund (WWF) in partnership with the U.S. Geological Survey (USGS), the International Centre for Tropical Agriculture (CIAT), The Nature Conservancy (TNC), and the Center for Environmental Systems Research (CESR) of the University of Kassel, Germany.
A bibliometric analysis of global research on genome sequencing ...

African Journals Online (AJOL)

The results show that disease and protein related researches were the leading research focuses, and comparative genomics and evolution related research had strong potential in the near future. Key words: Genome sequencing, research trend, scientometrics, science citation index expanded (SCI-Expanded), word cluster ...
Simultaneous and complete genome sequencing of influenza A and B with high coverage by Illumina MiSeq Platform.

Science.gov (United States)

Rutvisuttinunt, Wiriya; Chinnawirotpisan, Piyawan; Simasathien, Sriluck; Shrestha, Sanjaya K; Yoon, In-Kyu; Klungthong, Chonticha; Fernandez, Stefan

2013-11-01

Active global surveillance and characterization of influenza viruses are essential for better preparation against possible pandemic events. Obtaining comprehensive information about the influenza genome can improve our understanding of the evolution of influenza viruses and emergence of new strains, and improve the accuracy when designing preventive vaccines. This study investigated the use of deep sequencing by the next-generation sequencing (NGS) Illumina MiSeq Platform to obtain complete genome sequence information from influenza virus isolates. The influenza virus isolates were cultured from 6 respiratory acute clinical specimens collected in Thailand and Nepal. DNA libraries obtained from each viral isolate were mixed and all were sequenced simultaneously. Total information of 2.6 Gbases was obtained from a 455±14 K/mm2 density with 95.76% (8,571,655/8,950,724 clusters) of the clusters passing quality control (QC) filters. Approximately 93.7% of all sequences from Read1 and 83.5% from Read2 contained high quality sequences that were ≥Q30, a base calling QC score standard. Alignments analysis identified three seasonal influenza A H3N2 strains, one 2009 pandemic influenza A H1N1 strain and two influenza B strains. The nearly entire genomes of all six virus isolates yielded equal or greater than 600-fold sequence coverage depth. MiSeq Platform identified seasonal influenza A H3N2, 2009 pandemic influenza A H1N1and influenza B in the DNA library mixtures efficiently. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.
Patent challenges for standard-setting in the global economy : lessons from information and communication industry

NARCIS (Netherlands)

Maskus, K.; Merrill, S.A.; Bekkers, R.N.A.; Sandy Block, Marc; Contreras, Jorge; Gilbert, Richard; Goodman, David; Marasco, Amy; Simcoe, Tim; Smoot, Oliver; Suttmeier, Richard; Updegrove, Andrew

2014-01-01

Patent Challenges for Standard-Setting in the Global Economy: Lessons from Information and Communication Technology examines how leading national and multinational standard-setting organizations (SSOs) address patent disclosures, licensing terms, transfers of patent ownership, and other issues that
Is central dogma a global property of cellular information flow?

Directory of Open Access Journals (Sweden)

Vincent ePiras

2012-11-01

Full Text Available The central dogma of molecular biology has come under scrutiny in recent years. Here, we reviewed high-throughput mRNA and protein expression data of Escherichia coli, Saccharomyces cerevisiae, and several mammalian cells. At both single cell and population scales, the statistical comparisons between the entire transcriptomes and proteomes show clear correlation structures. In contrast, the pair-wise correlations of single transcript to protein show nullity. These data suggest that the organizing structure guiding cellular processes is observed at omics-wide scale and not at single molecule level. The central dogma, thus, globally emerges as an average integrated flow of cellular information.
Clinical decision support for whole genome sequence information leveraging a service-oriented architecture: a prototype.

Science.gov (United States)

Welch, Brandon M; Rodriguez-Loya, Salvador; Eilbeck, Karen; Kawamoto, Kensaku

2014-01-01

Whole genome sequence (WGS) information could soon be routinely available to clinicians to support the personalized care of their patients. At such time, clinical decision support (CDS) integrated into the clinical workflow will likely be necessary to support genome-guided clinical care. Nevertheless, developing CDS capabilities for WGS information presents many unique challenges that need to be overcome for such approaches to be effective. In this manuscript, we describe the development of a prototype CDS system that is capable of providing genome-guided CDS at the point of care and within the clinical workflow. To demonstrate the functionality of this prototype, we implemented a clinical scenario of a hypothetical patient at high risk for Lynch Syndrome based on his genomic information. We demonstrate that this system can effectively use service-oriented architecture principles and standards-based components to deliver point of care CDS for WGS information in real-time.
Imaging 2015 Mw 7.8 Gorkha Earthquake and Its Aftershock Sequence Combining Multiple Calibrated Global Seismic Arrays

Science.gov (United States)

LI, B.; Ghosh, A.

2016-12-01

The 2015 Mw 7.8 Gorkha earthquake provides a good opportunity to study the tectonics and earthquake hazards in the Himalayas, one of the most seismically active plate boundaries. Details of the seismicity patterns and associated structures in the Himalayas are poorly understood mainly due to limited instrumentation. Here, we apply a back-projection method to study the mainshock rupture and the following aftershock sequence using four large aperture global seismic arrays. All the arrays show eastward rupture propagation of about 130 km and reveal similar evolution of seismic energy radiation, with strong high-frequency energy burst about 50 km north of Kathmandu. Each single array, however, is typically limited by large azimuthal gap, low resolution, and artifacts due to unmodeled velocity structures. Therefore, we use a self-consistent empirical calibration method to combine four different arrays to image the Gorkha event. It greatly improves the resolution, can better track rupture and reveal details that cannot be resolved by any individual array. In addition, we also use the same arrays at teleseismic distances and apply a back-projection technique to detect and locate the aftershocks immediately following the Gorkha earthquake. We detect about 2.5 times the aftershocks recorded by the Advance National Seismic System comprehensive earthquake catalog during the 19 days following the mainshock. The aftershocks detected by the arrays show an east-west trend in general, with majority of the aftershocks located at the eastern part of the rupture patch and surrounding the rupture zone of the largest Mw 7.3 aftershock. Overall spatiotemporal aftershock pattern agrees well with global catalog, with our catalog showing more details relative to the standard global catalog. The improved aftershock catalog enables us to better study the aftershock dynamics, stress evolution in this region. Moreover, rapid and better imaging of aftershock distribution may aid rapid response
Taxonomic evaluation of selected Ganoderma species and database sequence validation

Directory of Open Access Journals (Sweden)

Suldbold Jargalmaa

2017-07-01

Full Text Available Species in the genus Ganoderma include several ecologically important and pathogenic fungal species whose medicinal and economic value is substantial. Due to the highly similar morphological features within the Ganoderma, identification of species has relied heavily on DNA sequencing using BLAST searches, which are only reliable if the GenBank submissions are accurately labeled. In this study, we examined 113 specimens collected from 1969 to 2016 from various regions in Korea using morphological features and multigene analysis (internal transcribed spacer, translation elongation factor 1-α, and the second largest subunit of RNA polymerase II. These specimens were identified as four Ganoderma species: G. sichuanense, G. cf. adspersum, G. cf. applanatum, and G. cf. gibbosum. With the exception of G. sichuanense, these species were difficult to distinguish based solely on morphological features. However, phylogenetic analysis at three different loci yielded concordant phylogenetic information, and supported the four species distinctions with high bootstrap support. A survey of over 600 Ganoderma sequences available on GenBank revealed that 65% of sequences were either misidentified or ambiguously labeled. Here, we suggest corrected annotations for GenBank sequences based on our phylogenetic validation and provide updated global distribution patterns for these Ganoderma species.
Taxonomic evaluation of selected Ganoderma species and database sequence validation

Science.gov (United States)

Jargalmaa, Suldbold; Eimes, John A.; Park, Myung Soo; Park, Jae Young; Oh, Seung-Yoon

2017-01-01

Species in the genus Ganoderma include several ecologically important and pathogenic fungal species whose medicinal and economic value is substantial. Due to the highly similar morphological features within the Ganoderma, identification of species has relied heavily on DNA sequencing using BLAST searches, which are only reliable if the GenBank submissions are accurately labeled. In this study, we examined 113 specimens collected from 1969 to 2016 from various regions in Korea using morphological features and multigene analysis (internal transcribed spacer, translation elongation factor 1-α, and the second largest subunit of RNA polymerase II). These specimens were identified as four Ganoderma species: G. sichuanense, G. cf. adspersum, G. cf. applanatum, and G. cf. gibbosum. With the exception of G. sichuanense, these species were difficult to distinguish based solely on morphological features. However, phylogenetic analysis at three different loci yielded concordant phylogenetic information, and supported the four species distinctions with high bootstrap support. A survey of over 600 Ganoderma sequences available on GenBank revealed that 65% of sequences were either misidentified or ambiguously labeled. Here, we suggest corrected annotations for GenBank sequences based on our phylogenetic validation and provide updated global distribution patterns for these Ganoderma species. PMID:28761785
Optimizing multiple sequence alignments using a genetic algorithm based on three objectives: structural information, non-gaps percentage and totally conserved columns.

Science.gov (United States)

Ortuño, Francisco M; Valenzuela, Olga; Rojas, Fernando; Pomares, Hector; Florido, Javier P; Urquiza, Jose M; Rojas, Ignacio

2013-09-01

Multiple sequence alignments (MSAs) are widely used approaches in bioinformatics to carry out other tasks such as structure predictions, biological function analyses or phylogenetic modeling. However, current tools usually provide partially optimal alignments, as each one is focused on specific biological features. Thus, the same set of sequences can produce different alignments, above all when sequences are less similar. Consequently, researchers and biologists do not agree about which is the most suitable way to evaluate MSAs. Recent evaluations tend to use more complex scores including further biological features. Among them, 3D structures are increasingly being used to evaluate alignments. Because structures are more conserved in proteins than sequences, scores with structural information are better suited to evaluate more distant relationships between sequences. The proposed multiobjective algorithm, based on the non-dominated sorting genetic algorithm, aims to jointly optimize three objectives: STRIKE score, non-gaps percentage and totally conserved columns. It was significantly assessed on the BAliBASE benchmark according to the Kruskal-Wallis test (P algorithm also outperforms other aligners, such as ClustalW, Multiple Sequence Alignment Genetic Algorithm (MSA-GA), PRRP, DIALIGN, Hidden Markov Model Training (HMMT), Pattern-Induced Multi-sequence Alignment (PIMA), MULTIALIGN, Sequence Alignment Genetic Algorithm (SAGA), PILEUP, Rubber Band Technique Genetic Algorithm (RBT-GA) and Vertical Decomposition Genetic Algorithm (VDGA), according to the Wilcoxon signed-rank test (P 0.05) with the advantage of being able to use less structures. Structural information is included within the objective function to evaluate more accurately the obtained alignments. The source code is available at http://www.ugr.es/~fortuno/MOSAStrE/MO-SAStrE.zip.
Global polar geospatial information service retrieval based on search engine and ontology reasoning

Science.gov (United States)

Chen, Nengcheng; E, Dongcheng; Di, Liping; Gong, Jianya; Chen, Zeqiang

2007-01-01

In order to improve the access precision of polar geospatial information service on web, a new methodology for retrieving global spatial information services based on geospatial service search and ontology reasoning is proposed, the geospatial service search is implemented to find the coarse service from web, the ontology reasoning is designed to find the refined service from the coarse service. The proposed framework includes standardized distributed geospatial web services, a geospatial service search engine, an extended UDDI registry, and a multi-protocol geospatial information service client. Some key technologies addressed include service discovery based on search engine and service ontology modeling and reasoning in the Antarctic geospatial context. Finally, an Antarctica multi protocol OWS portal prototype based on the proposed methodology is introduced.
Design of Long Period Pseudo-Random Sequences from the Addition of -Sequences over

Directory of Open Access Journals (Sweden)

Ren Jian

2004-01-01

Full Text Available Pseudo-random sequence with good correlation property and large linear span is widely used in code division multiple access (CDMA communication systems and cryptology for reliable and secure information transmission. In this paper, sequences with long period, large complexity, balance statistics, and low cross-correlation property are constructed from the addition of -sequences with pairwise-prime linear spans (AMPLS. Using -sequences as building blocks, the proposed method proved to be an efficient and flexible approach to construct long period pseudo-random sequences with desirable properties from short period sequences. Applying the proposed method to , a signal set is constructed.
SoilInfo App: global soil information on your palm

Science.gov (United States)

Hengl, Tomislav; Mendes de Jesus, Jorge

2015-04-01

ISRIC ' World Soil Information has released in 2014 and app for mobile de- vices called 'SoilInfo' (http://soilinfo-app.org) and which aims at providing free access to the global soil data. SoilInfo App (available for Android v.4.0 Ice Cream Sandwhich or higher, and Apple v.6.x and v.7.x iOS) currently serves the Soil- Grids1km data ' a stack of soil property and class maps at six standard depths at a resolution of 1 km (30 arc second) predicted using automated geostatistical mapping and global soil data models. The list of served soil data includes: soil organic carbon (), soil pH, sand, silt and clay fractions (%), bulk density (kg/m3), cation exchange capacity of the fine earth fraction (cmol+/kg), coarse fragments (%), World Reference Base soil groups, and USDA Soil Taxonomy suborders (DOI: 10.1371/journal.pone.0105992). New soil properties and classes will be continuously added to the system. SoilGrids1km are available for download under a Creative Commons non-commercial license via http://soilgrids.org. They are also accessible via a Representational State Transfer API (http://rest.soilgrids.org) service. SoilInfo App mimics common weather apps, but is also largely inspired by the crowdsourcing systems such as the OpenStreetMap, Geo-wiki and similar. Two development aspects of the SoilInfo App and SoilGrids are constantly being worked on: Data quality in terms of accuracy of spatial predictions and derived information, and Data usability in terms of ease of access and ease of use (i.e. flexibility of the cyberinfrastructure / functionalities such as the REST SoilGrids API, SoilInfo App etc). The development focus in 2015 is on improving the thematic and spatial accuracy of SoilGrids predictions, primarily by using finer resolution covariates (250 m) and machine learning algorithms (such as random forests) to improve spatial predictions.
Personal efficacy, the information environment, and attitudes toward global warming and climate change in the United States.

Science.gov (United States)

Kellstedt, Paul M; Zahran, Sammy; Vedlitz, Arnold

2008-02-01

Despite the growing scientific consensus about the risks of global warming and climate change, the mass media frequently portray the subject as one of great scientific controversy and debate. And yet previous studies of the mass public's subjective assessments of the risks of global warming and climate change have not sufficiently examined public informedness, public confidence in climate scientists, and the role of personal efficacy in affecting global warming outcomes. By examining the results of a survey on an original and representative sample of Americans, we find that these three forces-informedness, confidence in scientists, and personal efficacy-are related in interesting and unexpected ways, and exert significant influence on risk assessments of global warming and climate change. In particular, more informed respondents both feel less personally responsible for global warming, and also show less concern for global warming. We also find that confidence in scientists has unexpected effects: respondents with high confidence in scientists feel less responsible for global warming, and also show less concern for global warming. These results have substantial implications for the interaction between scientists and the public in general, and for the public discussion of global warming and climate change in particular.
Roles of repetitive sequences

Energy Technology Data Exchange (ETDEWEB)

Bell, G.I.

1991-12-31

The DNA of higher eukaryotes contains many repetitive sequences. The study of repetitive sequences is important, not only because many have important biological function, but also because they provide information on genome organization, evolution and dynamics. In this paper, I will first discuss some generic effects that repetitive sequences will have upon genome dynamics and evolution. In particular, it will be shown that repetitive sequences foster recombination among, and turnover of, the elements of a genome. I will then consider some examples of repetitive sequences, notably minisatellite sequences and telomere sequences as examples of tandem repeats, without and with respectively known function, and Alu sequences as an example of interspersed repeats. Some other examples will also be considered in less detail.
Lessons learned in building a global information network on chemicals (GINC)

International Nuclear Information System (INIS)

Kaminuma, Tsuguchika

2005-01-01

The Global Information Network on Chemicals (GINC) was a project to construct a worldwide information network linking international, national, and other organizations working for the safe management of chemicals. Proposed in 1993, the project started the next year and lasted almost 10 years. It was begun as a joint project of World Health Organization (WHO), International Labor Organization (ILO), and United Nations Environment Program (UNEP), and later endorsed by the Intergovernmental Forum on Chemical Safety (IFCS). Asia, particularly East Asia and the Pacific islands, was chosen as the feasibility study region. The author's group then at the National Institute of Health Sciences (NIHS) of Japan led this initiative and hosted numerous meetings. At these meetings, tutorial sessions for communicating chemical safety expertise and emerging new information technologies relevant to the safe management of chemicals were offered. Our experience with this project, particularly the Web-based system and the tutorial sessions, may be of use to others involved with Web-based instruction and the training of chemical safety specialists from both developed and developing countries
O papel das sequências narrativas na estrutura global de reportagens = The role of narrative sequences in the global structure of reports

Directory of Open Access Journals (Sweden)

Gustavo Ximenes Cunha

2013-04-01

Full Text Available Este artigo estuda a função macroestrutural que as sequências narrativas exercem no gênero reportagem. Com base no Modelo de Análise Modular do Discurso (ROULET et al., 2001, analisamos seis reportagens. Após as análises, constatamos que as sequências desses textos não exercem papel meramente informativo. Ao contrário, a maior parte delas tem o estatuto de subordinadas e funcionam como argumentos com que o jornalista defende uma opinião. Nesse sentido, este trabalho mostra que, em reportagens, a narração é um recurso que auxilia o jornalista a produzir os efeitos de objetividade e de imparcialidade, porque baseia suas afirmações nos acontecimentos narrados. As sequências não são meramente informativas.This paper studies the macroestrutural function of narrative sequences of reports. We analyze six reports and we use the principles of Modular Approach to Discourse Analysis (ROULET et al., 2001. We observe that the narrative sequences are not merely informative. They have subordinate status and they act as arguments to defend an opinion. So, this work shows that in reports the narration is a resource to produce the effects of objectivity and impartiality.
Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions

KAUST Repository

Odat, Enas M.

2011-01-01

The algorithm has been simulated using C#.Net programming language and a number of experiments have been done to verify the proved statements. The results of these experiments show that the number of optimal alignments is reduced after each step of optimization. Furthermore, it has been verified that as the sequence length increased linearly then the number of optimal alignments increased exponentially which also depends on the cost function that is used. Finally, the number of executed operations increases polynomially as the sequence length increase linearly.
A Framework for Effective Assessment of Model-based Projections of Biodiversity to Inform the Next Generation of Global Conservation Targets

Science.gov (United States)

Myers, B.; Beard, T. D.; Weiskopf, S. R.; Jackson, S. T.; Tittensor, D.; Harfoot, M.; Senay, G. B.; Casey, K.; Lenton, T. M.; Leidner, A. K.; Ruane, A. C.; Ferrier, S.; Serbin, S.; Matsuda, H.; Shiklomanov, A. N.; Rosa, I.

2017-12-01

Biodiversity and ecosystems services underpin political targets for the conservation of biodiversity; however, previous incarnations of these biodiversity-related targets have not relied on integrated model based projections of possible outcomes based on climate and land use change. Although a few global biodiversity models are available, most biodiversity models lie along a continuum of geography and components of biodiversity. Model-based projections of the future of global biodiversity are critical to support policymakers in the development of informed global conservation targets, but the scientific community lacks a clear strategy for integrating diverse data streams in developing, and evaluating the performance of, such biodiversity models. Therefore, in this paper, we propose a framework for ongoing testing and refinement of model-based projections of biodiversity trends and change, by linking a broad variety of biodiversity models with data streams generated by advances in remote sensing, coupled with new and emerging in-situ observation technologies to inform development of essential biodiversity variables, future global biodiversity targets, and indicators. Our two main objectives are to (1) develop a framework for model testing and refining projections of a broad range of biodiversity models, focusing on global models, through the integration of diverse data streams and (2) identify the realistic outputs that can be developed and determine coupled approaches using remote sensing and new and emerging in-situ observations (e.g., metagenomics) to better inform the next generation of global biodiversity targets.

The role of INGVterremoti blog in information management during the earthquake sequence in central Italy

Directory of Open Access Journals (Sweden)

Maurizio Pignone

2017-01-01

Full Text Available In this paper, we describe the role the INGVterremoti blog in information management during the first part of the earthquake sequence in central Italy (August 24 to September 30. In the last four years, we have been working on the INGVterremoti blog in order to provide quick updates on the ongoing seismic activity in Italy and in-depth scientific information. These include articles on specific historical earthquakes, seismic hazard, geological interpretations, source models from different type of data, effects at the surface, and so on. We have delivered information in quasi-real-time also about all the recent magnitude M≥4.0 earthquakes in Italy, the strongest events in the Mediterranean and in the world. During the 2016 central Italy, the INGVterremoti blog has continuously released information about seismic sequences with three types of posts: i updates on the ongoing seismic activity; ii reports on the activities carried out by the INGV teams in the field and any other working groups; iii in-depth scientific articles describing some specific analysis and results. All the blog posts have been shared automatically and in real time on the other social media of the INGVterremoti platform, also to counter the bad information and to fight rumors. These include Facebook, Twitter and INGVterremoti App on IOS and Android. As well, both the main INGV home page (http://www.ingv.it and the INGV earthquake portal (http://terremoti.ingv.it have published the contents of the blog on dedicated pages that were fed automatically. The work done day by day on the INGVterremoti blog has been coordinated with the INGV Press Office that has written several press releases based on the contents of the blog. Since August 24, 53 articles were published on the blog they have had more than 1.9 million views and 1 million visitors. The peak in the number of views, which was more than 800,000 in a single day, was registered on August 24, 2016, following the M 6
Informational and linguistic analysis of large genomic sequence collections via efficient Hadoop cluster algorithms.

Science.gov (United States)

Ferraro Petrillo, Umberto; Roscigno, Gianluca; Cattaneo, Giuseppe; Giancarlo, Raffaele

2018-06-01

Information theoretic and compositional/linguistic analysis of genomes have a central role in bioinformatics, even more so since the associated methodologies are becoming very valuable also for epigenomic and meta-genomic studies. The kernel of those methods is based on the collection of k-mer statistics, i.e. how many times each k-mer in {A,C,G,T}k occurs in a DNA sequence. Although this problem is computationally very simple and efficiently solvable on a conventional computer, the sheer amount of data available now in applications demands to resort to parallel and distributed computing. Indeed, those type of algorithms have been developed to collect k-mer statistics in the realm of genome assembly. However, they are so specialized to this domain that they do not extend easily to the computation of informational and linguistic indices, concurrently on sets of genomes. Following the well-established approach in many disciplines, and with a growing success also in bioinformatics, to resort to MapReduce and Hadoop to deal with 'Big Data' problems, we present KCH, the first set of MapReduce algorithms able to perform concurrently informational and linguistic analysis of large collections of genomic sequences on a Hadoop cluster. The benchmarking of KCH that we provide indicates that it is quite effective and versatile. It is also competitive with respect to the parallel and distributed algorithms highly specialized to k-mer statistics collection for genome assembly problems. In conclusion, KCH is a much needed addition to the growing number of algorithms and tools that use MapReduce for bioinformatics core applications. The software, including instructions for running it over Amazon AWS, as well as the datasets are available at http://www.di-srv.unisa.it/KCH. umberto.ferraro@uniroma1.it. Supplementary data are available at Bioinformatics online.
Exploration of noncoding sequences in metagenomes.

Directory of Open Access Journals (Sweden)

Fabián Tobar-Tosse

Full Text Available Environment-dependent genomic features have been defined for different metagenomes, whose genes and their associated processes are related to specific environments. Identification of ORFs and their functional categories are the most common methods for association between functional and environmental features. However, this analysis based on finding ORFs misses noncoding sequences and, therefore, some metagenome regulatory or structural information could be discarded. In this work we analyzed 23 whole metagenomes, including coding and noncoding sequences using the following sequence patterns: (G+C content, Codon Usage (Cd, Trinucleotide Usage (Tn, and functional assignments for ORF prediction. Herein, we present evidence of a high proportion of noncoding sequences discarded in common similarity-based methods in metagenomics, and the kind of relevant information present in those. We found a high density of trinucleotide repeat sequences (TRS in noncoding sequences, with a regulatory and adaptive function for metagenome communities. We present associations between trinucleotide values and gene function, where metagenome clustering correlate with microorganism adaptations and kinds of metagenomes. We propose here that noncoding sequences have relevant information to describe metagenomes that could be considered in a whole metagenome analysis in order to improve their organization, classification protocols, and their relation with the environment.
Global copy number profiling of cancer genomes | Office of Cancer Genomics

Science.gov (United States)

In this article, we introduce a robust and efficient strategy for deriving global and allele-specific copy number alternations (CNA) from cancer whole exome sequencing data based on Log R ratios and B-allele frequencies. Applying the approach to the analysis of over 200 skin cancer samples, we demonstrate its utility for discovering distinct CNA events and for deriving ancillary information such as tumor purity. Availability and implementation: https://github.com/xfwang/CLOSE CONTACT: xuefeng.wang@stonybrook.edu or michael.krauthammer@yale.edu. (Publication Abstract)
Neisseria gonorrhoeae Sequence Typing for Antimicrobial Resistance, a Novel Antimicrobial Resistance Multilocus Typing Scheme for Tracking Global Dissemination of N. gonorrhoeae Strains.

Science.gov (United States)

Demczuk, W; Sidhu, S; Unemo, M; Whiley, D M; Allen, V G; Dillon, J R; Cole, M; Seah, C; Trembizki, E; Trees, D L; Kersh, E N; Abrams, A J; de Vries, H J C; van Dam, A P; Medina, I; Bharat, A; Mulvey, M R; Van Domselaar, G; Martin, I

2017-05-01

A curated Web-based user-friendly sequence typing tool based on antimicrobial resistance determinants in Neisseria gonorrhoeae was developed and is publicly accessible (https://ngstar.canada.ca). The N. gonorrhoeae Sequence Typing for Antimicrobial Resistance (NG-STAR) molecular typing scheme uses the DNA sequences of 7 genes ( penA , mtrR , porB , ponA , gyrA , parC , and 23S rRNA) associated with resistance to β-lactam antimicrobials, macrolides, or fluoroquinolones. NG-STAR uses the entire penA sequence, combining the historical nomenclature for penA types I to XXXVIII with novel nucleotide sequence designations; the full mtrR sequence and a portion of its promoter region; portions of ponA , porB , gyrA , and parC ; and 23S rRNA sequences. NG-STAR grouped 768 isolates into 139 sequence types (STs) ( n = 660) consisting of 29 clonal complexes (CCs) having a maximum of a single-locus variation, and 76 NG-STAR STs ( n = 109) were identified as unrelated singletons. NG-STAR had a high Simpson's diversity index value of 96.5% (95% confidence interval [CI] = 0.959 to 0.969). The most common STs were NG-STAR ST-90 ( n = 100; 13.0%), ST-42 and ST-91 ( n = 45; 5.9%), ST-64 ( n = 44; 5.72%), and ST-139 ( n = 42; 5.5%). Decreased susceptibility to azithromycin was associated with NG-STAR ST-58, ST-61, ST-64, ST-79, ST-91, and ST-139 ( n = 156; 92.3%); decreased susceptibility to cephalosporins was associated with NG-STAR ST-90, ST-91, and ST-97 ( n = 162; 94.2%); and ciprofloxacin resistance was associated with NG-STAR ST-26, ST-90, ST-91, ST-97, ST-150, and ST-158 ( n = 196; 98.0%). All isolates of NG-STAR ST-42, ST-43, ST-63, ST-81, and ST-160 ( n = 106) were susceptible to all four antimicrobials. The standardization of nomenclature associated with antimicrobial resistance determinants through an internationally available database will facilitate the monitoring of the global dissemination of antimicrobial-resistant N. gonorrhoeae strains. © Crown copyright 2017.
Sequence Quality Analysis Tool for HIV Type 1 Protease and Reverse Transcriptase

OpenAIRE

DeLong, Allison K.; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W.; Kantor, Rami

2012-01-01

Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802...
Entropic fluctuations in DNA sequences

Science.gov (United States)

Thanos, Dimitrios; Li, Wentian; Provata, Astero

2018-03-01

The Local Shannon Entropy (LSE) in blocks is used as a complexity measure to study the information fluctuations along DNA sequences. The LSE of a DNA block maps the local base arrangement information to a single numerical value. It is shown that despite this reduction of information, LSE allows to extract meaningful information related to the detection of repetitive sequences in whole chromosomes and is useful in finding evolutionary differences between organisms. More specifically, large regions of tandem repeats, such as centromeres, can be detected based on their low LSE fluctuations along the chromosome. Furthermore, an empirical investigation of the appropriate block sizes is provided and the relationship of LSE properties with the structure of the underlying repetitive units is revealed by using both computational and mathematical methods. Sequence similarity between the genomic DNA of closely related species also leads to similar LSE values at the orthologous regions. As an application, the LSE covariance function is used to measure the evolutionary distance between several primate genomes.
Tidying up international nucleotide sequence databases: ecological, geographical and sequence quality annotation of its sequences of mycorrhizal fungi.

Science.gov (United States)

Tedersoo, Leho; Abarenkov, Kessy; Nilsson, R Henrik; Schüssler, Arthur; Grelet, Gwen-Aëlle; Kohout, Petr; Oja, Jane; Bonito, Gregory M; Veldre, Vilmar; Jairus, Teele; Ryberg, Martin; Larsson, Karl-Henrik; Kõljalg, Urmas

2011-01-01

Sequence analysis of the ribosomal RNA operon, particularly the internal transcribed spacer (ITS) region, provides a powerful tool for identification of mycorrhizal fungi. The sequence data deposited in the International Nucleotide Sequence Databases (INSD) are, however, unfiltered for quality and are often poorly annotated with metadata. To detect chimeric and low-quality sequences and assign the ectomycorrhizal fungi to phylogenetic lineages, fungal ITS sequences were downloaded from INSD, aligned within family-level groups, and examined through phylogenetic analyses and BLAST searches. By combining the fungal sequence database UNITE and the annotation and search tool PlutoF, we also added metadata from the literature to these accessions. Altogether 35,632 sequences belonged to mycorrhizal fungi or originated from ericoid and orchid mycorrhizal roots. Of these sequences, 677 were considered chimeric and 2,174 of low read quality. Information detailing country of collection, geographical coordinates, interacting taxon and isolation source were supplemented to cover 78.0%, 33.0%, 41.7% and 96.4% of the sequences, respectively. These annotated sequences are publicly available via UNITE (http://unite.ut.ee/) for downstream biogeographic, ecological and taxonomic analyses. In European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena/), the annotated sequences have a special link-out to UNITE. We intend to expand the data annotation to additional genes and all taxonomic groups and functional guilds of fungi.
Urban Environmental Education for Global Transformation Initiatives - Integrating Information and Communication Systems for Urban Sustainability in 2050.

Science.gov (United States)

Chaudhari, K.

2017-12-01

The Urban population of developing countries is predicted to rise from one third in 1990 to over 50% by 2025. In 1950 the world's total urban population was 734 million, of whom 448 million were living in developed countries and remaining 286 were in developing region. The total population on earth is predicted to increase by more than one billion people within the next 15 years, reaching 8.5 billion in 2030, and to increase further to 9.7 billion in 2050 and 11.2 billion by 2100. Looking at the ever increasing urbanization.In 2016, an estimated 54.5 per cent of the world's populations inhabited in urban region. By 2030, urban areas are projected to shelter 60 per cent of people worldwide and one in every three people will live in cities with at least half a million inhabitants.On the basis of these figures and other global trends, it would appear that Africa and Asia will have the highest share of world's urban growth in next 25 years, resulting consideration rise of large number of metropolitan cities and towns. Therefore issues related to urban climate change will be important for socio economic development for urban transformation through environmental sustainability.The information and communication systems plays an important role in achieving the social sustainability through environmental sustainability for urban transformation. This presentation aims to start the Global initiatives on the problem identifications in environment education for global transformation, education for socio-economic and environmental sustainability due to urbanization in 2050 to investigate problems related to social-economic risks and management issues resulting from urbanization to aid mitigation planning in globalized world and to educate scientists and local populations to form a basis for sustainable solutions in environment learning.The presentation aims to assess the potential of information and communication technology for environment education,both within different
Next-Generation Sequencing Platforms

Science.gov (United States)

Mardis, Elaine R.

2013-06-01

Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.
Degree product rule tempers explosive percolation in the absence of global information

Science.gov (United States)

Trevelyan, Alexander J.; Tsekenis, Georgios; Corwin, Eric I.

2018-02-01

We introduce a guided network growth model, which we call the degree product rule process, that uses solely local information when adding new edges. For small numbers of candidate edges our process gives rise to a second-order phase transition, but becomes first order in the limit of global choice. We provide the set of critical exponents required to characterize the nature of this percolation transition. Such a process permits interventions which can delay the onset of percolation while tempering the explosiveness caused by cluster product rule processes.
Distress vocalization sequences broadcasted by bats carry redundant information.

Science.gov (United States)

Hechavarría, Julio C; Beetz, M Jerome; Macias, Silvio; Kössl, Manfred

2016-07-01

Distress vocalizations (also known as alarm or screams) are an important component of the vocal repertoire of a number of animal species, including bats, humans, monkeys and birds, among others. Although the behavioral relevance of distress vocalizations is undeniable, at present, little is known about the rules that govern vocalization production when in alarmful situations. In this article, we show that when distressed, bats of the species Carollia perspicillata produce repetitive vocalization sequences in which consecutive syllables are likely to be similar to one another regarding their physical attributes. The uttered distress syllables are broadband (12-73 kHz) with most of their energy focussing at 23 kHz. Distress syllables are short (~4 ms), their average sound pressure level is close to 70 dB SPL, and they are produced at high repetition rates (every 14 ms). We discuss that, because of their physical attributes, bat distress vocalizations could serve a dual purpose: (1) advertising threatful situations to conspecifics, and (2) informing the threatener that the bats are ready to defend themselves. We also discuss possible advantages of advertising danger/discomfort using repetitive utterances, a calling strategy that appears to be ubiquitous across the animal kingdom.
iDNA at Sea: Recovery of Whale Shark (Rhincodon typus Mitochondrial DNA Sequences from the Whale Shark Copepod (Pandarus rhincodonicus Confirms Global Population Structure

Directory of Open Access Journals (Sweden)

Mark Meekan

2017-12-01

Full Text Available The whale shark (Rhincodon typus is an iconic and endangered species with a broad distribution spanning warm-temperate and tropical oceans. Effective conservation management of the species requires an understanding of the degree of genetic connectivity among populations, which is hampered by the need for sampling that involves invasive techniques. Here, the feasibility of minimally-invasive sampling was explored by isolating and sequencing whale shark DNA from a commensal or possibly parasitic copepod, Pandarus rhincodonicus that occurs on the skin of the host. We successfully recovered mitochondrial control region DNA sequences (~1,000 bp of the host via DNA extraction and polymerase chain reaction from whole copepod specimens. DNA sequences obtained from multiple copepods collected from the same shark exhibited 100% sequence similarity, suggesting a persistent association of copepods with individual hosts. Newly-generated mitochondrial haplotypes of whale shark hosts derived from the copepods were included in an analysis of the genetic structure of the global population of whale sharks (644 sequences; 136 haplotypes. Our results supported those of previous studies and suggested limited genetic structuring across most of the species range, but the presence of a genetically unique and potentially isolated population in the Atlantic Ocean. Furthermore, we recovered the mitogenome and nuclear ribosomal genes of a whale shark using a shotgun sequencing approach on copepod tissue. The recovered mitogenome is the third mitogenome reported for the species and the first from the Mozambique population. Our invertebrate DNA (iDNA approach could be used to better understand the population structure of whale sharks, particularly in the Atlantic Ocean, and also for genetic analyses of other elasmobranchs parasitized by pandarid copepods.
Plastid, nuclear and reverse transcriptase sequences in the mitochondrial genome of Oenothera: is genetic information transferred between organelles via RNA?

Science.gov (United States)

Schuster, W; Brennicke, A

1987-01-01

We describe an open reading frame (ORF) with high homology to reverse transcriptase in the mitochondrial genome of Oenothera. This ORF displays all the characteristics of an active plant mitochondrial gene with a possible ribosome binding site and 39% T in the third codon position. It is located between a sequence fragment from the plastid genome and one of nuclear origin downstream from the gene encoding subunit 5 of the NADH dehydrogenase. The nuclear derived sequence consists of 528 nucleotides from the small ribosomal RNA and contains an expansion segment unique to nuclear rRNAs. The plastid sequence contains part of the ribosomal protein S4 and the complete tRNA(Ser). The observation that only transcribed sequences have been found i more than one subcellular compartment in higher plants suggests that interorganellar transfer of genetic information may occur via RNA and subsequent local reverse transcription and genomic integration. PMID:14650433
Managing Identifiers for Elements of Provenance of the Third National Climate Assessment in the Global Change Information System (Invited)

Science.gov (United States)

Tilmes, C.; Aulenbach, S.; Duggan, B.; Goldstein, J.

2013-12-01

A Federal Advisory Committee (The "National Climate Assessment and Development Advisory Committee" or NCADAC) has overseen the development of a draft climate report that after extensive review will be considered by the Federal Government in the Third National Climate Assessment (NCA). This comprehensive report (1) Integrates, evaluates, and interprets the findings of the Program and discusses the scientific uncertainties associated with such findings; (2) Analyzes the effects of global change on the natural environment, agriculture, energy production and use, land and water resources, transportation, human health and welfare, human social systems, and biological diversity; and (3) Analyzes current trends in global change, both human-induced and natural, and projects major trends for the subsequent 25 to 100 years. The U.S. Global Change Program (USGCRP), composed of the 13 federal agencies most concerned with global change, is building a Global Change Information System (GCIS) that will ultimately organize access to all of the research, data, and information about global change from across the system. A prototype of the system has been constructed that captures and presents all of the elements of provenance of the NCA through a coherent data model and friendly front end web site. This work will focus on the globally unique and persistent identifiers used to reference and organize those items. These include externally referenced items, such as DOIs used by scientific journal publishers for research articles or by agencies as dataset identifiers, as well as our own internal approach to identifiers, our overall data model and experiences managing persistent identifiers within the GCIS.
Importance of the temporal structure of movement sequences on the ability of monkeys to use serial order information.

Science.gov (United States)

Deffains, Marc; Legallet, Eric; Apicella, Paul

2011-10-01

also used in the repeated sequence. This performance advantage was most prominently detectable when temporal prediction of forthcoming target stimuli was optimized. Taken together, the present findings demonstrate that the monkey's capacity to make use of serial order information to speed task performance was dependent on the temporal structure of the motor sequence.
Coordinating the Global Information Grid Initiative with the NG9-1-1 Initiative

Energy Technology Data Exchange (ETDEWEB)

Michael Schmitt

2008-05-01

As the Department of Defense develops the Global Information Grid, the Department of Transportation develops the Next Generation 9-1-1 system. Close examinations of these initiatives show that the two are similar in architectures, applications, and communications interoperability. These similarities are extracted from the lowest user level to the highest commander rank that will be involved in each network. Once the similarities are brought into perspective, efforts should be made to collaborate between the two departments.
Phylo-mLogo: an interactive and hierarchical multiple-logo visualization tool for alignment of many sequences

Directory of Open Access Journals (Sweden)

Lee DT

2007-02-01

Full Text Available Abstract Background When aligning several hundreds or thousands of sequences, such as epidemic virus sequences or homologous/orthologous sequences of some big gene families, to reconstruct the epidemiological history or their phylogenies, how to analyze and visualize the alignment results of many sequences has become a new challenge for computational biologists. Although there are several tools available for visualization of very long sequence alignments, few of them are applicable to the alignments of many sequences. Results A multiple-logo alignment visualization tool, called Phylo-mLogo, is presented in this paper. Phylo-mLogo calculates the variabilities and homogeneities of alignment sequences by base frequencies or entropies. Different from the traditional representations of sequence logos, Phylo-mLogo not only displays the global logo patterns of the whole alignment of multiple sequences, but also demonstrates their local homologous logos for each clade hierarchically. In addition, Phylo-mLogo also allows the user to focus only on the analysis of some important, structurally or functionally constrained sites in the alignment selected by the user or by built-in automatic calculation. Conclusion With Phylo-mLogo, the user can symbolically and hierarchically visualize hundreds of aligned sequences simultaneously and easily check the changes of their amino acid sites when analyzing many homologous/orthologous or influenza virus sequences. More information of Phylo-mLogo can be found at URL http://biocomp.iis.sinica.edu.tw/phylomlogo.
Position-specific prediction of methylation sites from sequence conservation based on information theory.

Science.gov (United States)

Shi, Yinan; Guo, Yanzhi; Hu, Yayun; Li, Menglong

2015-07-23

Protein methylation plays vital roles in many biological processes and has been implicated in various human diseases. To fully understand the mechanisms underlying methylation for use in drug design and work in methylation-related diseases, an initial but crucial step is to identify methylation sites. The use of high-throughput bioinformatics methods has become imperative to predict methylation sites. In this study, we developed a novel method that is based only on sequence conservation to predict protein methylation sites. Conservation difference profiles between methylated and non-methylated peptides were constructed by the information entropy (IE) in a wider neighbor interval around the methylation sites that fully incorporated all of the environmental information. Then, the distinctive neighbor residues were identified by the importance scores of information gain (IG). The most representative model was constructed by support vector machine (SVM) for Arginine and Lysine methylation, respectively. This model yielded a promising result on both the benchmark dataset and independent test set. The model was used to screen the entire human proteome, and many unknown substrates were identified. These results indicate that our method can serve as a useful supplement to elucidate the mechanism of protein methylation and facilitate hypothesis-driven experimental design and validation.
Global Transcriptome Sequencing Identifies Chlamydospore Specific Markers in Candida albicans and Candida dubliniensis

LENUS (Irish Health Repository)

Palige, Katja

2013-04-15

Candida albicans and Candida dubliniensis are pathogenic fungi that are highly related but differ in virulence and in some phenotypic traits. During in vitro growth on certain nutrient-poor media, C. albicans and C. dubliniensis are the only yeast species which are able to produce chlamydospores, large thick-walled cells of unknown function. Interestingly, only C. dubliniensis forms pseudohyphae with abundant chlamydospores when grown on Staib medium, while C. albicans grows exclusively as a budding yeast. In order to further our understanding of chlamydospore development and assembly, we compared the global transcriptional profile of both species during growth in liquid Staib medium by RNA sequencing. We also included a C. albicans mutant in our study which lacks the morphogenetic transcriptional repressor Nrg1. This strain, which is characterized by its constitutive pseudohyphal growth, specifically produces masses of chlamydospores in Staib medium, similar to C. dubliniensis. This comparative approach identified a set of putatively chlamydospore-related genes. Two of the homologous C. albicans and C. dubliniensis genes (CSP1 and CSP2) which were most strongly upregulated during chlamydospore development were analysed in more detail. By use of the green fluorescent protein as a reporter, the encoded putative cell wall related proteins were found to exclusively localize to C. albicans and C. dubliniensis chlamydospores. Our findings uncover the first chlamydospore specific markers in Candida species and provide novel insights in the complex morphogenetic development of these important fungal pathogens.

Ethical issues in consumer genome sequencing: Use of consumers' samples and data.

Science.gov (United States)

Niemiec, Emilia; Howard, Heidi Carmen

2016-03-01

High throughput approaches such as whole genome sequencing (WGS) and whole exome sequencing (WES) create an unprecedented amount of data providing powerful resources for clinical care and research. Recently, WGS and WES services have been made available by commercial direct-to-consumer (DTC) companies. The DTC offer of genetic testing (GT) has already brought attention to potentially problematic issues such as the adequacy of consumers' informed consent and transparency of companies' research activities. In this study, we analysed the websites of four DTC GT companies offering WGS and/or WES with regard to their policies governing storage and future use of consumers' data and samples. The results are discussed in relation to recommendations and guiding principles such as the "Statement of the European Society of Human Genetics on DTC GT for health-related purposes" (2010) and the "Framework for responsible sharing of genomic and health-related data" (Global Alliance for Genomics and Health, 2014). The analysis reveals that some companies may store and use consumers' samples or sequencing data for unspecified research and share the data with third parties. Moreover, the companies do not provide sufficient or clear information to consumers about this, which can undermine the validity of the consent process. Furthermore, while all companies state that they provide privacy safeguards for data and mention the limitations of these, information about the possibility of re-identification is lacking. Finally, although the companies that may conduct research do include information regarding proprietary claims and commercialisation of the results, it is not clear whether consumers are aware of the consequences of these policies. These results indicate that DTC GT companies still need to improve the transparency regarding handling of consumers' samples and data, including having an explicit and clear consent process for research activities.
Ethical issues in consumer genome sequencing: Use of consumers' samples and data

Directory of Open Access Journals (Sweden)

Emilia Niemiec

2016-03-01

Full Text Available High throughput approaches such as whole genome sequencing (WGS and whole exome sequencing (WES create an unprecedented amount of data providing powerful resources for clinical care and research. Recently, WGS and WES services have been made available by commercial direct-to-consumer (DTC companies. The DTC offer of genetic testing (GT has already brought attention to potentially problematic issues such as the adequacy of consumers' informed consent and transparency of companies' research activities. In this study, we analysed the websites of four DTC GT companies offering WGS and/or WES with regard to their policies governing storage and future use of consumers' data and samples. The results are discussed in relation to recommendations and guiding principles such as the “Statement of the European Society of Human Genetics on DTC GT for health-related purposes” (2010 and the “Framework for responsible sharing of genomic and health-related data” (Global Alliance for Genomics and Health, 2014. The analysis reveals that some companies may store and use consumers' samples or sequencing data for unspecified research and share the data with third parties. Moreover, the companies do not provide sufficient or clear information to consumers about this, which can undermine the validity of the consent process. Furthermore, while all companies state that they provide privacy safeguards for data and mention the limitations of these, information about the possibility of re-identification is lacking. Finally, although the companies that may conduct research do include information regarding proprietary claims and commercialisation of the results, it is not clear whether consumers are aware of the consequences of these policies. These results indicate that DTC GT companies still need to improve the transparency regarding handling of consumers' samples and data, including having an explicit and clear consent process for research activities.
Extraction of High Molecular Weight DNA from Fungal Rust Spores for Long Read Sequencing.

Science.gov (United States)

Schwessinger, Benjamin; Rathjen, John P

2017-01-01

Wheat rust fungi are complex organisms with a complete life cycle that involves two different host plants and five different spore types. During the asexual infection cycle on wheat, rusts produce massive amounts of dikaryotic urediniospores. These spores are dikaryotic (two nuclei) with each nucleus containing one haploid genome. This dikaryotic state is likely to contribute to their evolutionary success, making them some of the major wheat pathogens globally. Despite this, most published wheat rust genomes are highly fragmented and contain very little haplotype-specific sequence information. Current long-read sequencing technologies hold great promise to provide more contiguous and haplotype-phased genome assemblies. Long reads are able to span repetitive regions and phase structural differences between the haplomes. This increased genome resolution enables the identification of complex loci and the study of genome evolution beyond simple nucleotide polymorphisms. Long-read technologies require pure high molecular weight DNA as an input for sequencing. Here, we describe a DNA extraction protocol for rust spores that yields pure double-stranded DNA molecules with molecular weight of >50 kilo-base pairs (kbp). The isolated DNA is of sufficient purity for PacBio long-read sequencing, but may require additional purification for other sequencing technologies such as Nanopore and 10× Genomics.
Googling DNA sequences on the World Wide Web.

Science.gov (United States)

Hajibabaei, Mehrdad; Singer, Gregory A C

2009-11-10

New web-based technologies provide an excellent opportunity for sharing and accessing information and using web as a platform for interaction and collaboration. Although several specialized tools are available for analyzing DNA sequence information, conventional web-based tools have not been utilized for bioinformatics applications. We have developed a novel algorithm and implemented it for searching species-specific genomic sequences, DNA barcodes, by using popular web-based methods such as Google. We developed an alignment independent character based algorithm based on dividing a sequence library (DNA barcodes) and query sequence to words. The actual search is conducted by conventional search tools such as freely available Google Desktop Search. We implemented our algorithm in two exemplar packages. We developed pre and post-processing software to provide customized input and output services, respectively. Our analysis of all publicly available DNA barcode sequences shows a high accuracy as well as rapid results. Our method makes use of conventional web-based technologies for specialized genetic data. It provides a robust and efficient solution for sequence search on the web. The integration of our search method for large-scale sequence libraries such as DNA barcodes provides an excellent web-based tool for accessing this information and linking it to other available categories of information on the web.
Democracy or Informational Autocracy? The Internet's Role in the Global Society of the 21st Century

Directory of Open Access Journals (Sweden)

Tarcisio Teixeira

2016-10-01

Full Text Available Employing the deductive method, it analyzes the effects of the Internet, combined with the globalization context, in different sectors, from the economic aspect of the global ecommerce to the political consequences, which is inferred when ideological movements emerge on social networks. This incurs the current crisis scenario of state sovereignty, given the failure of states to regulate the virtual space, now marked by neoliberal practices and poorly distributed information. Finally, it contrasts with the Internet as an instrument of domination and social emancipation, in a scenario of excessive consumption of electronic devices and their use to achieve effective democracy.
Stock return predictability and market integration: The role of global and local information

Directory of Open Access Journals (Sweden)

David G. McMillan

2016-12-01

Full Text Available This paper examines the predictability of a range of international stock markets where we allow the presence of both local and global predictive factors. Recent research has argued that US returns have predictive power for international stock returns. We expand this line of research, following work on market integration, to include a more general definition of the global factor, based on principal components analysis. Results identify three global expected returns factors, one related to the major stock markets of the US, UK and Asia and one related to the other markets analysed. The third component is related to dividend growth. A single dominant realised returns factor is also noted. A forecasting exercise comparing the principal components based factors to a US return factor and local market only factors, as well as the historical mean benchmark finds supportive evidence for the former approach. It is hoped that the results from this paper will be informative on three counts. First, to academics interested in understanding the dynamics asset price movement. Second, to market participants who aim to time the market and engage in portfolio and risk management. Third, to those (policy makers and others who are interested in linkages across international markets and the nature and degree of integration.
Financial time series analysis based on information categorization method

Science.gov (United States)

Tian, Qiang; Shang, Pengjian; Feng, Guochen

2014-12-01

The paper mainly applies the information categorization method to analyze the financial time series. The method is used to examine the similarity of different sequences by calculating the distances between them. We apply this method to quantify the similarity of different stock markets. And we report the results of similarity in US and Chinese stock markets in periods 1991-1998 (before the Asian currency crisis), 1999-2006 (after the Asian currency crisis and before the global financial crisis), and 2007-2013 (during and after global financial crisis) by using this method. The results show the difference of similarity between different stock markets in different time periods and the similarity of the two stock markets become larger after these two crises. Also we acquire the results of similarity of 10 stock indices in three areas; it means the method can distinguish different areas' markets from the phylogenetic trees. The results show that we can get satisfactory information from financial markets by this method. The information categorization method can not only be used in physiologic time series, but also in financial time series.
DNA Sequencing as a Tool to Monitor Marine Ecological Status

Directory of Open Access Journals (Sweden)

Kelly D. Goodwin

2017-05-01

Full Text Available Many ocean policies mandate integrated, ecosystem-based approaches to marine monitoring, driving a global need for efficient, low-cost bioindicators of marine ecological quality. Most traditional methods to assess biological quality rely on specialized expertise to provide visual identification of a limited set of specific taxonomic groups, a time-consuming process that can provide a narrow view of ecological status. In addition, microbial assemblages drive food webs but are not amenable to visual inspection and thus are largely excluded from detailed inventory. Molecular-based assessments of biodiversity and ecosystem function offer advantages over traditional methods and are increasingly being generated for a suite of taxa using a “microbes to mammals” or “barcodes to biomes” approach. Progress in these efforts coupled with continued improvements in high-throughput sequencing and bioinformatics pave the way for sequence data to be employed in formal integrated ecosystem evaluation, including food web assessments, as called for in the European Union Marine Strategy Framework Directive. DNA sequencing of bioindicators, both traditional (e.g., benthic macroinvertebrates, ichthyoplankton and emerging (e.g., microbial assemblages, fish via eDNA, promises to improve assessment of marine biological quality by increasing the breadth, depth, and throughput of information and by reducing costs and reliance on specialized taxonomic expertise.
Globalization and information access tools: the way forward for ...

African Journals Online (AJOL)

PROMOTING ACCESS TO AFRICAN RESEARCH ... Globalization is a process of interaction and integration among people, ... face the new challenges so as to render more effective services to library clientele in the globalized environment.
ThinkHazard!: an open-source, global tool for understanding hazard information

Science.gov (United States)

Fraser, Stuart; Jongman, Brenden; Simpson, Alanna; Nunez, Ariel; Deparday, Vivien; Saito, Keiko; Murnane, Richard; Balog, Simone

2016-04-01

Rapid and simple access to added-value natural hazard and disaster risk information is a key issue for various stakeholders of the development and disaster risk management (DRM) domains. Accessing available data often requires specialist knowledge of heterogeneous data, which are often highly technical and can be difficult for non-specialists in DRM to find and exploit. Thus, availability, accessibility and processing of these information sources are crucial issues, and an important reason why many development projects suffer significant impacts from natural hazards. The World Bank's Global Facility for Disaster Reduction and Recovery (GFDRR) is currently developing a new open-source tool to address this knowledge gap: ThinkHazard! The main aim of the ThinkHazard! project is to develop an analytical tool dedicated to facilitating improvements in knowledge and understanding of natural hazards among non-specialists in DRM. It also aims at providing users with relevant guidance and information on handling the threats posed by the natural hazards present in a chosen location. Furthermore, all aspects of this tool will be open and transparent, in order to give users enough information to understand its operational principles. In this presentation, we will explain the technical approach behind the tool, which translates state-of-the-art probabilistic natural hazard data into understandable hazard classifications and practical recommendations. We will also demonstrate the functionality of the tool, and discuss limitations from a scientific as well as an operational perspective.
ReRep: Computational detection of repetitive sequences in genome survey sequences (GSS

Directory of Open Access Journals (Sweden)

Alves-Ferreira Marcelo

2008-09-01

Full Text Available Abstract Background Genome survey sequences (GSS offer a preliminary global view of a genome since, unlike ESTs, they cover coding as well as non-coding DNA and include repetitive regions of the genome. A more precise estimation of the nature, quantity and variability of repetitive sequences very early in a genome sequencing project is of considerable importance, as such data strongly influence the estimation of genome coverage, library quality and progress in scaffold construction. Also, the elimination of repetitive sequences from the initial assembly process is important to avoid errors and unnecessary complexity. Repetitive sequences are also of interest in a variety of other studies, for instance as molecular markers. Results We designed and implemented a straightforward pipeline called ReRep, which combines bioinformatics tools for identifying repetitive structures in a GSS dataset. In a case study, we first applied the pipeline to a set of 970 GSSs, sequenced in our laboratory from the human pathogen Leishmania braziliensis, the causative agent of leishmaniosis, an important public health problem in Brazil. We also verified the applicability of ReRep to new sequencing technologies using a set of 454-reads of an Escheria coli. The behaviour of several parameters in the algorithm is evaluated and suggestions are made for tuning of the analysis. Conclusion The ReRep approach for identification of repetitive elements in GSS datasets proved to be straightforward and efficient. Several potential repetitive sequences were found in a L. braziliensis GSS dataset generated in our laboratory, and further validated by the analysis of a more complete genomic dataset from the EMBL and Sanger Centre databases. ReRep also identified most of the E. coli K12 repeats prior to assembly in an example dataset obtained by automated sequencing using 454 technology. The parameters controlling the algorithm behaved consistently and may be tuned to the properties
FCJ-198 New International Information Order (NIIO Revisited: Global Algorithmic Governance and Neocolonialism

Directory of Open Access Journals (Sweden)

Danny Butt

2016-03-01

Full Text Available The field of Internet governance has been dominated by Euro-American actors and has largely resisted consideration of a holistic and integrative rights-based agenda, confining itself to narrow discussions on the technical stability of Internet Protocol resources and debates about nation-state involvement in multistakeholder governance of those resources. In light of the work of Edward Snowden documenting the close relationship between government security agencies and dominant social media platforms, this paper revisits the relevance of the New International Information Order (NIIO, a conceptualisation of the global politics of information described at the 1973 Fourth Summit Conference of the Non-Aligned Movement of nations in Algiers. This paper argues that critical analysis of the oligopolistic structure of “platforms” and their algorithmic forms of governance can build a more inclusive movement toward social justice by extending the NIIO framework’s emphasis on decolonisation, collective ownership of strategic information resources, and documentation of powerful transnational entities.
Genome and Transcriptome Sequencing of the Ostreid herpesvirus 1 From Tomales Bay, California

Science.gov (United States)

Burge, C. A.; Langevin, S.; Closek, C. J.; Roberts, S. B.; Friedman, C. S.

2016-02-01

Mass mortalities of larval and seed bivalve molluscs attributed to the Ostreid herpesvirus 1 (OsHV-1) occur globally. OsHV-1 was fully sequenced and characterized as a member of the Family Malacoherpesviridae. Multiple strains of OsHV-1 exist and may vary in virulence, i.e. OsHV-1 µvar. For most global variants of OsHV-1, sequence data is limited to PCR-based sequencing of segments, including two recent genomes. In the United States, OsHV-1 is limited to detection in adjacent embayments in California, Tomales and Drakes bays. Limited DNA sequence data of OsHV-1 infecting oysters in Tomales Bay indicates the virus detected in Tomales Bay is similar but not identical to any one global variant of OsHV-1. In order to better understand both strain variation and virulence of OsHV-1 infecting oysters in Tomales Bay, we used genomic and transcriptomic sequencing. Meta-genomic sequencing (Illumina MiSeq) was conducted from infected oysters (n=4 per year) collected in 2003, 2007, and 2014, where full OsHV-1 genome sequences and low overall microbial diversity were achieved from highly infected oysters. Increased microbial diversity was detected in three of four samples sequenced from 2003, where qPCR based genome copy numbers of OsHV-1 were lower. Expression analysis (SOLiD RNA sequencing) of OsHV-1 genes expressed in oyster larvae at 24 hours post exposure revealed a nearly complete transcriptome, with several highly expressed genes, which are similar to recent transcriptomic analyses of other OsHV-1 variants. Taken together, our results indicate that genome and transcriptome sequencing may be powerful tools in understanding both strain variation and virulence of non-culturable marine viruses.
Communications satellites in the national and global health care information infrastructure: their role, impact, and issues

Science.gov (United States)

Zuzek, J. E.; Bhasin, K. B.

1996-01-01

Health care services delivered from a distance, known collectively as telemedicine, are being increasingly demonstrated on various transmission media. Telemedicine activities have included diagnosis by a doctor at a remote location, emergency and disaster medical assistance, medical education, and medical informatics. The ability of communications satellites to offer communication channels and bandwidth on demand, connectivity to mobile, remote and under served regions, and global access will afford them a critical role for telemedicine applications within the National and Global Information Infrastructure (NII/GII). The importance that communications satellites will have in telemedicine applications within the NII/GII the differences in requirements for NII vs. GII, the major issues such as interoperability, confidentiality, quality, availability, and costs, and preliminary conclusions for future usability based on the review of several recent trails at national and global levels are presented.
Controlling the structure of sequence-defined poly(phosphodiester)s for optimal MS/MS reading of digital information.

Science.gov (United States)

Amalian, J-A; Al Ouahabi, A; Cavallo, G; König, N F; Poyer, S; Lutz, J-F; Charles, L

2017-11-01

Digital polymers are monodisperse chains with a controlled sequence of co-monomers, defined as letters of an alphabet, and are used to store information at the molecular level. Reading such messages is hence a sequencing task that can be efficiently achieved by tandem mass spectrometry. To improve their readability, structure of sequence-controlled synthetic polymers can be optimized, based on considerations regarding their fragmentation behavior. This strategy is described here for poly(phosphodiester)s, which were synthesized as monodisperse chains with more than 100 units but exhibited extremely complex dissociation spectra. In these polymers, two repeating units that differ by a simple H/CH 3 variation were defined as the 0 and 1 bit of the ASCII code and spaced by a phosphate moiety. They were readily ionized in negative ion mode electrospray but dissociated via cleavage at all phosphate bonds upon collisional activation. Although allowing a complete sequence coverage of digital poly(phosphodiester)s, this fragmentation behavior was not efficient for macromolecules with more than 50 co-monomers, and data interpretation was very tedious. The structure of these polymers was then modified by introducing alkoxyamine linkages at appropriate location throughout the chain. A first design consisted of placing these low dissociation energy bonds between each monomeric bit: while cleavage of this sole bond greatly simplified MS/MS spectra, efficient sequencing was limited to chains with up to about 50 units. In contrast, introduction of alkoxyamine bonds between each byte (i.e. a set of eight co-monomers) was a more successful strategy. Long messages (so far, up to 8 bytes) could be read in MS 3 experiments, where single-byte containing fragments released during the first activation stage were further dissociated for sequencing. The whole sequence of such byte-truncated poly(phosphodiester)s could be easily re-constructed based on a mass tagging system which permits
Automated sequence-specific protein NMR assignment using the memetic algorithm MATCH

International Nuclear Information System (INIS)

Volk, Jochen; Herrmann, Torsten; Wuethrich, Kurt

2008-01-01

MATCH (Memetic Algorithm and Combinatorial Optimization Heuristics) is a new memetic algorithm for automated sequence-specific polypeptide backbone NMR assignment of proteins. MATCH employs local optimization for tracing partial sequence-specific assignments within a global, population-based search environment, where the simultaneous application of local and global optimization heuristics guarantees high efficiency and robustness. MATCH thus makes combined use of the two predominant concepts in use for automated NMR assignment of proteins. Dynamic transition and inherent mutation are new techniques that enable automatic adaptation to variable quality of the experimental input data. The concept of dynamic transition is incorporated in all major building blocks of the algorithm, where it enables switching between local and global optimization heuristics at any time during the assignment process. Inherent mutation restricts the intrinsically required randomness of the evolutionary algorithm to those regions of the conformation space that are compatible with the experimental input data. Using intact and artificially deteriorated APSY-NMR input data of proteins, MATCH performed sequence-specific resonance assignment with high efficiency and robustness
Sequence- vs. chip-assisted genomic selection: accurate biological information is advised.

Science.gov (United States)

Pérez-Enciso, Miguel; Rincón, Juan C; Legarra, Andrés

2015-05-09

The development of next-generation sequencing technologies (NGS) has made the use of whole-genome sequence data for routine genetic evaluations possible, which has triggered a considerable interest in animal and plant breeding fields. Here, we investigated whether complete or partial sequence data can improve upon existing SNP (single nucleotide polymorphism) array-based selection strategies by simulation using a mixed coalescence - gene-dropping approach. We simulated 20 or 100 causal mutations (quantitative trait nucleotides, QTN) within 65 predefined 'gene' regions, each 10 kb long, within a genome composed of ten 3-Mb chromosomes. We compared prediction accuracy by cross-validation using a medium-density chip (7.5 k SNPs), a high-density (HD, 17 k) and sequence data (335 k). Genetic evaluation was based on a GBLUP method. The simulations showed: (1) a law of diminishing returns with increasing number of SNPs; (2) a modest effect of SNP ascertainment bias in arrays; (3) a small advantage of using whole-genome sequence data vs. HD arrays i.e. ~4%; (4) a minor effect of NGS errors except when imputation error rates are high (≥20%); and (5) if QTN were known, prediction accuracy approached 1. Since this is obviously unrealistic, we explored milder assumptions. We showed that, if all SNPs within causal genes were included in the prediction model, accuracy could also dramatically increase by ~40%. However, this criterion was highly sensitive to either misspecification (including wrong genes) or to the use of an incomplete gene list; in these cases, accuracy fell rapidly towards that reached when all SNPs from sequence data were blindly included in the model. Our study shows that, unless an accurate prior estimate on the functionality of SNPs can be included in the predictor, there is a law of diminishing returns with increasing SNP density. As a result, use of whole-genome sequence data may not result in a highly increased selection response over high
LigandRFs: random forest ensemble to identify ligand-binding residues from sequence information alone

KAUST Repository

Chen, Peng

2014-12-03

Background Protein-ligand binding is important for some proteins to perform their functions. Protein-ligand binding sites are the residues of proteins that physically bind to ligands. Despite of the recent advances in computational prediction for protein-ligand binding sites, the state-of-the-art methods search for similar, known structures of the query and predict the binding sites based on the solved structures. However, such structural information is not commonly available. Results In this paper, we propose a sequence-based approach to identify protein-ligand binding residues. We propose a combination technique to reduce the effects of different sliding residue windows in the process of encoding input feature vectors. Moreover, due to the highly imbalanced samples between the ligand-binding sites and non ligand-binding sites, we construct several balanced data sets, for each of which a random forest (RF)-based classifier is trained. The ensemble of these RF classifiers forms a sequence-based protein-ligand binding site predictor. Conclusions Experimental results on CASP9 and CASP8 data sets demonstrate that our method compares favorably with the state-of-the-art protein-ligand binding site prediction methods.
Learning Sequences of Actions in Collectives of Autonomous Agents

Science.gov (United States)

Turner, Kagan; Agogino, Adrian K.; Wolpert, David H.; Clancy, Daniel (Technical Monitor)

2001-01-01

In this paper we focus on the problem of designing a collective of autonomous agents that individually learn sequences of actions such that the resultant sequence of joint actions achieves a predetermined global objective. We are particularly interested in instances of this problem where centralized control is either impossible or impractical. For single agent systems in similar domains, machine learning methods (e.g., reinforcement learners) have been successfully used. However, applying such solutions directly to multi-agent systems often proves problematic, as agents may work at cross-purposes, or have difficulty in evaluating their contribution to achievement of the global objective, or both. Accordingly, the crucial design step in multiagent systems centers on determining the private objectives of each agent so that as the agents strive for those objectives, the system reaches a good global solution. In this work we consider a version of this problem involving multiple autonomous agents in a grid world. We use concepts from collective intelligence to design goals for the agents that are 'aligned' with the global goal, and are 'learnable' in that agents can readily see how their behavior affects their utility. We show that reinforcement learning agents using those goals outperform both 'natural' extensions of single agent algorithms and global reinforcement, learning solutions based on 'team games'.
Global Oncology; Harvard Global Health Catalyst summit lecture notes

Science.gov (United States)

Ngwa, Wilfred; Nguyen, Paul

2017-08-01

The material presented in this book is at the cutting-edge of global oncology and provides highly illuminating examples, addresses frequently asked questions, and provides information and a reference for future work in global oncology care, research, education, and outreach.

Mitochondrial DNA sequencing of cat hair: an informative forensic tool.

Science.gov (United States)

Tarditi, Christy R; Grahn, Robert A; Evans, Jeffrey J; Kurushima, Jennifer D; Lyons, Leslie A

2011-01-01

Approximately 81.7 million cats are in 37.5 million U.S. households. Shed fur can be criminal evidence because of transfer to victims, suspects, and/or their belongings. To improve cat hairs as forensic evidence, the mtDNA control region from single hairs, with and without root tags, was sequenced. A dataset of a 402-bp control region segment from 174 random-bred cats representing four U.S. geographic areas was generated to determine the informativeness of the mtDNA region. Thirty-two mtDNA mitotypes were observed ranging in frequencies from 0.6-27%. Four common types occurred in all populations. Low heteroplasmy, 1.7%, was determined. Unique mitotypes were found in 18 individuals, 10.3% of the population studied. The calculated discrimination power implied that 8.3 of 10 randomly selected individuals can be excluded by this region. The genetic characteristics of the region and the generated dataset support the use of this cat mtDNA region in forensic applications. 2010 American Academy of Forensic Sciences. Published 2010. This article is a U.S. Government work and is in the public domain in the U.S.A.
ASAP: Amplification, sequencing & annotation of plastomes

Directory of Open Access Journals (Sweden)

Folta Kevin M

2005-12-01

Full Text Available Abstract Background Availability of DNA sequence information is vital for pursuing structural, functional and comparative genomics studies in plastids. Traditionally, the first step in mining the valuable information within a chloroplast genome requires sequencing a chloroplast plasmid library or BAC clones. These activities involve complicated preparatory procedures like chloroplast DNA isolation or identification of the appropriate BAC clones to be sequenced. Rolling circle amplification (RCA is being used currently to amplify the chloroplast genome from purified chloroplast DNA and the resulting products are sheared and cloned prior to sequencing. Herein we present a universal high-throughput, rapid PCR-based technique to amplify, sequence and assemble plastid genome sequence from diverse species in a short time and at reasonable cost from total plant DNA, using the large inverted repeat region from strawberry and peach as proof of concept. The method exploits the highly conserved coding regions or intergenic regions of plastid genes. Using an informatics approach, chloroplast DNA sequence information from 5 available eudicot plastomes was aligned to identify the most conserved regions. Cognate primer pairs were then designed to generate ~1 – 1.2 kb overlapping amplicons from the inverted repeat region in 14 diverse genera. Results 100% coverage of the inverted repeat region was obtained from Arabidopsis, tobacco, orange, strawberry, peach, lettuce, tomato and Amaranthus. Over 80% coverage was obtained from distant species, including Ginkgo, loblolly pine and Equisetum. Sequence from the inverted repeat region of strawberry and peach plastome was obtained, annotated and analyzed. Additionally, a polymorphic region identified from gel electrophoresis was sequenced from tomato and Amaranthus. Sequence analysis revealed large deletions in these species relative to tobacco plastome thus exhibiting the utility of this method for structural and
The politics of agenda setting at the global level: key informant interviews regarding the International Labour Organization Decent Work Agenda.

Science.gov (United States)

Di Ruggiero, Erica; Cohen, Joanna E; Cole, Donald C

2014-07-01

Global labour markets continue to undergo significant transformations resulting from socio-political instability combined with rises in structural inequality, employment insecurity, and poor working conditions. Confronted by these challenges, global institutions are providing policy guidance to protect and promote the health and well-being of workers. This article provides an account of how the International Labour Organization's Decent Work Agenda contributes to the work policy agendas of the World Health Organization and the World Bank. This qualitative study involved semi-structured interviews with representatives from three global institutions--the International Labour Organization (ILO), the World Health Organization and the World Bank. Of the 25 key informants invited to participate, 16 took part in the study. Analysis for key themes was followed by interpretation using selected agenda setting theories. Interviews indicated that through the Decent Work Agenda, the International Labour Organization is shaping the global policy narrative about work among UN agencies, and that the pursuit of decent work and the Agenda were perceived as important goals with the potential to promote just policies. The Agenda was closely linked to the World Health Organization's conception of health as a human right. However, decent work was consistently identified by World Bank informants as ILO terminology in contrast to terms such as job creation and job access. The limited evidence base and its conceptual nature were offered as partial explanations for why the Agenda has yet to fully influence other global institutions. Catalytic events such as the economic crisis were identified as creating the enabling conditions to influence global work policy agendas. Our evidence aids our understanding of how an issue like decent work enters and stays on the policy agendas of global institutions, using the Decent Work Agenda as an illustrative example. Catalytic events and policy
The politics of agenda setting at the global level: key informant interviews regarding the International Labour Organization Decent Work Agenda

Science.gov (United States)

2014-01-01

Background Global labour markets continue to undergo significant transformations resulting from socio-political instability combined with rises in structural inequality, employment insecurity, and poor working conditions. Confronted by these challenges, global institutions are providing policy guidance to protect and promote the health and well-being of workers. This article provides an account of how the International Labour Organization’s Decent Work Agenda contributes to the work policy agendas of the World Health Organization and the World Bank. Methods This qualitative study involved semi-structured interviews with representatives from three global institutions – the International Labour Organization (ILO), the World Health Organization and the World Bank. Of the 25 key informants invited to participate, 16 took part in the study. Analysis for key themes was followed by interpretation using selected agenda setting theories. Results Interviews indicated that through the Decent Work Agenda, the International Labour Organization is shaping the global policy narrative about work among UN agencies, and that the pursuit of decent work and the Agenda were perceived as important goals with the potential to promote just policies. The Agenda was closely linked to the World Health Organization’s conception of health as a human right. However, decent work was consistently identified by World Bank informants as ILO terminology in contrast to terms such as job creation and job access. The limited evidence base and its conceptual nature were offered as partial explanations for why the Agenda has yet to fully influence other global institutions. Catalytic events such as the economic crisis were identified as creating the enabling conditions to influence global work policy agendas. Conclusions Our evidence aids our understanding of how an issue like decent work enters and stays on the policy agendas of global institutions, using the Decent Work Agenda as an illustrative
Fractals in DNA sequence analysis

Institute of Scientific and Technical Information of China (English)

Yu Zu-Guo(喻祖国); Vo Anh; Gong Zhi-Min(龚志民); Long Shun-Chao(龙顺潮)

2002-01-01

Fractal methods have been successfully used to study many problems in physics, mathematics, engineering, finance,and even in biology. There has been an increasing interest in unravelling the mysteries of DNA; for example, how can we distinguish coding and noncoding sequences, and the problems of classification and evolution relationship of organisms are key problems in bioinformatics. Although much research has been carried out by taking into consideration the long-range correlations in DNA sequences, and the global fractal dimension has been used in these works by other people, the models and methods are somewhat rough and the results are not satisfactory. In recent years, our group has introduced a time series model (statistical point of view) and a visual representation (geometrical point of view)to DNA sequence analysis. We have also used fractal dimension, correlation dimension, the Hurst exponent and the dimension spectrum (multifractal analysis) to discuss problems in this field. In this paper, we introduce these fractal models and methods and the results of DNA sequence analysis.
SeqLib: a C ++ API for rapid BAM manipulation, sequence alignment and sequence assembly.

Science.gov (United States)

Wala, Jeremiah; Beroukhim, Rameen

2017-03-01

We present SeqLib, a C ++ API and command line tool that provides a rapid and user-friendly interface to BAM/SAM/CRAM files, global sequence alignment operations and sequence assembly. Four C libraries perform core operations in SeqLib: HTSlib for BAM access, BWA-MEM and BLAT for sequence alignment and Fermi for error correction and sequence assembly. Benchmarking indicates that SeqLib has lower CPU and memory requirements than leading C ++ sequence analysis APIs. We demonstrate an example of how minimal SeqLib code can extract, error-correct and assemble reads from a CRAM file and then align with BWA-MEM. SeqLib also provides additional capabilities, including chromosome-aware interval queries and read plotting. Command line tools are available for performing integrated error correction, micro-assemblies and alignment. SeqLib is available on Linux and OSX for the C ++98 standard and later at github.com/walaj/SeqLib. SeqLib is released under the Apache2 license. Additional capabilities for BLAT alignment are available under the BLAT license. jwala@broadinstitue.org ; rameen@broadinstitute.org. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
E-business Environment in the Global Information Society

OpenAIRE

Vymětal, Dominik; Suchánek, Petr

2009-01-01

In today´s digital 21st century, almost all businesses face intense competition from competitors all around the globe. There are no borders and business area for the all companies is almost unlimited. As the main supports of mentioned fact are globalization and ICT´s development. Influences such as globalization, increased popularity of outsourcing and offshoring have recently combined to produce an environment where ICT graduates need to have up-to-date and industry-relevant knowledge and sk...
Sequence Capture versus Restriction Site Associated DNA Sequencing for Shallow Systematics.

Science.gov (United States)

Harvey, Michael G; Smith, Brian Tilston; Glenn, Travis C; Faircloth, Brant C; Brumfield, Robb T

2016-09-01

Sequence capture and restriction site associated DNA sequencing (RAD-Seq) are two genomic enrichment strategies for applying next-generation sequencing technologies to systematics studies. At shallow timescales, such as within species, RAD-Seq has been widely adopted among researchers, although there has been little discussion of the potential limitations and benefits of RAD-Seq and sequence capture. We discuss a series of issues that may impact the utility of sequence capture and RAD-Seq data for shallow systematics in non-model species. We review prior studies that used both methods, and investigate differences between the methods by re-analyzing existing RAD-Seq and sequence capture data sets from a Neotropical bird (Xenops minutus). We suggest that the strengths of RAD-Seq data sets for shallow systematics are the wide dispersion of markers across the genome, the relative ease and cost of laboratory work, the deep coverage and read overlap at recovered loci, and the high overall information that results. Sequence capture's benefits include flexibility and repeatability in the genomic regions targeted, success using low-quality samples, more straightforward read orthology assessment, and higher per-locus information content. The utility of a method in systematics, however, rests not only on its performance within a study, but on the comparability of data sets and inferences with those of prior work. In RAD-Seq data sets, comparability is compromised by low overlap of orthologous markers across species and the sensitivity of genetic diversity in a data set to an interaction between the level of natural heterozygosity in the samples examined and the parameters used for orthology assessment. In contrast, sequence capture of conserved genomic regions permits interrogation of the same loci across divergent species, which is preferable for maintaining comparability among data sets and studies for the purpose of drawing general conclusions about the impact of
Correction of projective distortion in long-image-sequence mosaics without prior information

Science.gov (United States)

Yang, Chenhui; Mao, Hongwei; Abousleman, Glen; Si, Jennie

2010-04-01

Image mosaicking is the process of piecing together multiple video frames or still images from a moving camera to form a wide-area or panoramic view of the scene being imaged. Mosaics have widespread applications in many areas such as security surveillance, remote sensing, geographical exploration, agricultural field surveillance, virtual reality, digital video, and medical image analysis, among others. When mosaicking a large number of still images or video frames, the quality of the resulting mosaic is compromised by projective distortion. That is, during the mosaicking process, the image frames that are transformed and pasted to the mosaic become significantly scaled down and appear out of proportion with respect to the mosaic. As more frames continue to be transformed, important target information in the frames can be lost since the transformed frames become too small, which eventually leads to the inability to continue further. Some projective distortion correction techniques make use of prior information such as GPS information embedded within the image, or camera internal and external parameters. Alternatively, this paper proposes a new algorithm to reduce the projective distortion without using any prior information whatsoever. Based on the analysis of the projective distortion, we approximate the projective matrix that describes the transformation between image frames using an affine model. Using singular value decomposition, we can deduce the affine model scaling factor that is usually very close to 1. By resetting the image scale of the affine model to 1, the transformed image size remains unchanged. Even though the proposed correction introduces some error in the image matching, this error is typically acceptable and more importantly, the final mosaic preserves the original image size after transformation. We demonstrate the effectiveness of this new correction algorithm on two real-world unmanned air vehicle (UAV) sequences. The proposed method is
Sequencing at sea : challenges and experiences in Ion Torrent PGM sequencing during the 2013 Southern Line Islands Research Expedition

NARCIS (Netherlands)

Lim, Yan Wei; Cuevas, Daniel A; Silva, Genivaldo Gueiros Z; Aguinaldo, Kristen; Dinsdale, Elizabeth A; Haas, Andreas F; Hatay, Mark; Sanchez, Savannah E; Wegley-Kelly, Linda; Dutilh, Bas E; Harkins, Timothy T; Lee, Clarence C; Tom, Warren; Sandin, Stuart A; Smith, Jennifer E; Zgliczynski, Brian; Vermeij, Mark J A; Rohwer, Forest; Edwards, Robert A

2014-01-01

Genomics and metagenomics have revolutionized our understanding of marine microbial ecology and the importance of microbes in global geochemical cycles. However, the process of DNA sequencing has always been an abstract extension of the research expedition, completed once the samples were returned
Sequencing at sea: challenges and experiences in Ion Torrent PGM sequencing during the 2013 Southern Line Islands Research Expedition

NARCIS (Netherlands)

Lim, Y.W.; Cuevas, D.A.; Silva, G.G.Z.; Aguinaldo, K.; Dinsdale, E.A.; Haas, A.F.; Hatay, M.; Sanchez, S.E.; Wegley-Kelly, L.; Dutilh, B.E.; Harkins, T.T.; Lee, C.C.; Tom, W.; Sandin, S.A.; Smith, J.E.; Zgliczynski, B.; Vermeij, M.J.A.; Rohwer, F.; Edwards, R.A.

2014-01-01

Genomics and metagenomics have revolutionized our understanding of marine microbial ecology and the importance of microbes in global geochemical cycles. However, the process of DNA sequencing has always been an abstract extension of the research expedition, completed once the samples were returned
Sequence complexity and work extraction

International Nuclear Information System (INIS)

Merhav, Neri

2015-01-01

We consider a simplified version of a solvable model by Mandal and Jarzynski, which constructively demonstrates the interplay between work extraction and the increase of the Shannon entropy of an information reservoir which is in contact with a physical system. We extend Mandal and Jarzynski’s main findings in several directions: first, we allow sequences of correlated bits rather than just independent bits. Secondly, at least for the case of binary information, we show that, in fact, the Shannon entropy is only one measure of complexity of the information that must increase in order for work to be extracted. The extracted work can also be upper bounded in terms of the increase in other quantities that measure complexity, like the predictability of future bits from past ones. Third, we provide an extension to the case of non-binary information (i.e. a larger alphabet), and finally, we extend the scope to the case where the incoming bits (before the interaction) form an individual sequence, rather than a random one. In this case, the entropy before the interaction can be replaced by the Lempel–Ziv (LZ) complexity of the incoming sequence, a fact that gives rise to an entropic meaning of the LZ complexity, not only in information theory, but also in physics. (paper)
Global epidemiology of capsular group W meningococcal disease (1970-2015): Multifocal emergence and persistence of hypervirulent sequence type (ST)-11 clonal complex.

Science.gov (United States)

Mustapha, Mustapha M; Marsh, Jane W; Harrison, Lee H

2016-03-18

Following an outbreak in Mecca Saudi Arabia in 2000, meningococcal strains expressing capsular group W (W) emerged as a major cause of invasive meningococcal disease (IMD) worldwide. The Saudi Arabian outbreak strain (Hajj clone) belonging to the ST-11 clonal complex (cc11) is similar to W cc11 causing occasional sporadic disease before 2000. Since 2000, W cc11 has caused large meningococcal disease epidemics in the African meningitis belt and endemic disease in South America, Europe and China. Traditional molecular epidemiologic typing suggested that a majority of current W cc11 burden represented global spread of the Hajj clone. However, recent whole genome sequencing (WGS) analyses revealed significant genetic heterogeneity among global W cc11 strains. While continued spread of the Hajj clone occurs in the Middle East, the meningitis belt and South Africa have co-circulation of the Hajj clone and other unrelated W cc11 strains. Notably, South America, the UK, and France share a genetically distinct W cc11 strain. Other W lineages persist in low numbers in Europe, North America and the meningitis belt. In summary, WGS is helping to unravel the complex genomic epidemiology of group W meningococcal strains. Wider application of WGS and strengthening of global IMD surveillance is necessary to monitor the continued evolution of group W lineages. Copyright © 2016 Elsevier Ltd. All rights reserved.
Sequencing of BAC pools by different next generation sequencing platforms and strategies

Directory of Open Access Journals (Sweden)

Scholz Uwe

2011-10-01

Full Text Available Abstract Background Next generation sequencing of BACs is a viable option for deciphering the sequence of even large and highly repetitive genomes. In order to optimize this strategy, we examined the influence of read length on the quality of Roche/454 sequence assemblies, to what extent Illumina/Solexa mate pairs (MPs improve the assemblies by scaffolding and whether barcoding of BACs is dispensable. Results Sequencing four BACs with both FLX and Titanium technologies revealed similar sequencing accuracy, but showed that the longer Titanium reads produce considerably less misassemblies and gaps. The 454 assemblies of 96 barcoded BACs were improved by scaffolding 79% of the total contig length with MPs from a non-barcoded library. Assembly of the unmasked 454 sequences without separation by barcodes revealed chimeric contig formation to be a major problem, encompassing 47% of the total contig length. Masking the sequences reduced this fraction to 24%. Conclusion Optimal BAC pool sequencing should be based on the longest available reads, with barcoding essential for a comprehensive assessment of both repetitive and non-repetitive sequence information. When interest is restricted to non-repetitive regions and repeats are masked prior to assembly, barcoding is non-essential. In any case, the assemblies can be improved considerably by scaffolding with non-barcoded BAC pool MPs.
Act local, think global: how the Malawi experience of scaling up antiretroviral treatment has informed global policy

Directory of Open Access Journals (Sweden)

Anthony D. Harries

2016-09-01

Full Text Available Abstract The scale-up of antiretroviral therapy (ART in Malawi was based on a public health approach adapted to its resource-poor setting, with principles and practices borrowed from the successful tuberculosis control framework. From 2004 to 2015, the number of new patients started on ART increased from about 3000 to over 820,000. Despite being a small country, Malawi has made a significant contribution to the 15 million people globally on ART and has also contributed policy and service delivery innovations that have supported international guidelines and scale up in other countries. The first set of global guidelines for scaling up ART released by the World Health Organization (WHO in 2002 focused on providing clinical guidance. In Malawi, the ART guidelines adopted from the outset a more operational and programmatic approach with recommendations on health systems and services that were needed to deliver HIV treatment to affected populations. Seven years after the start of national scale-up, Malawi launched a new strategy offering all HIV-infected pregnant women lifelong ART regardless of the CD4-cell count, named Option B+. This strategy was subsequently incorporated into a WHO programmatic guide in 2012 and WHO ART guidelines in 2013, and has since then been adopted by the majority of countries worldwide. In conclusion, the Malawi experience of ART scale-up has become a blueprint for a public health response to HIV and has informed international efforts to end the AIDS epidemic by 2030.
Act local, think global: how the Malawi experience of scaling up antiretroviral treatment has informed global policy.

Science.gov (United States)

Harries, Anthony D; Ford, Nathan; Jahn, Andreas; Schouten, Erik J; Libamba, Edwin; Chimbwandira, Frank; Maher, Dermot

2016-09-06

The scale-up of antiretroviral therapy (ART) in Malawi was based on a public health approach adapted to its resource-poor setting, with principles and practices borrowed from the successful tuberculosis control framework. From 2004 to 2015, the number of new patients started on ART increased from about 3000 to over 820,000. Despite being a small country, Malawi has made a significant contribution to the 15 million people globally on ART and has also contributed policy and service delivery innovations that have supported international guidelines and scale up in other countries. The first set of global guidelines for scaling up ART released by the World Health Organization (WHO) in 2002 focused on providing clinical guidance. In Malawi, the ART guidelines adopted from the outset a more operational and programmatic approach with recommendations on health systems and services that were needed to deliver HIV treatment to affected populations. Seven years after the start of national scale-up, Malawi launched a new strategy offering all HIV-infected pregnant women lifelong ART regardless of the CD4-cell count, named Option B+. This strategy was subsequently incorporated into a WHO programmatic guide in 2012 and WHO ART guidelines in 2013, and has since then been adopted by the majority of countries worldwide. In conclusion, the Malawi experience of ART scale-up has become a blueprint for a public health response to HIV and has informed international efforts to end the AIDS epidemic by 2030.
Feasibility of integrating other federal information systems into the Global Network of Environment and Technology, GNET{reg_sign}

Energy Technology Data Exchange (ETDEWEB)

NONE

1998-05-01

The Global Environment and Technology Enterprise (GETE) of the Global Environment and Technology Foundation (GETF) has been tasked by the US Department of Energy`s (DOE), Federal Energy Technology Center (FETC) to assist in reducing DOE`s cost for the Global Network of Environment and Technology (GNET{reg_sign}). As part of this task, GETE is seeking federal partners to invest in GNET{reg_sign}. The authors are also seeking FETC`s commitment to serve as GNET`s federal agency champion promoting the system to potential agency partners. This report assesses the benefits of partnering with GNET{reg_sign} and provides recommendations for identifying and integrating other federally funded (non-DOE) environmental information management systems into GNET{reg_sign}.
Mitochondrial DNA sequence evolution in shorebird populations

NARCIS (Netherlands)

Wenink, P.W.

1994-01-01

This thesis describes the global molecular population structure of two shorebird species, in particular of the dunlin, Calidris alpina, by means of comparative sequence analysis of the most variable part of the mitochondrial DNA (mtDNA) genome. There are several reasons
Medical Information Exchange: Pattern of Global Mobile Messenger Usage among Otolaryngologists.

Science.gov (United States)

Siegal, Gil; Dagan, Elad; Wolf, Michael; Duvdevani, Shay; Alon, Eran E

2016-11-01

Information technology has revolutionized health care. However, the development of dedicated mobile health software has been lagging, leading to the use of general mobile applications to fill in the void. The use of such applications has several legal, ethical, and regulatory implications. We examined the experience and practices governing the usage of a global mobile messenger application (WhatsApp) for mobile health purposes in a national cohort of practicing otolaryngologists in Israel, a known early adaptor information technology society. Cross-sectional data were collected from practicing otolaryngologists and otolaryngology residents via self-administered questionnaire. The questionnaire was composed of a demographic section, a section surveying the practices of mobile application use, mobile health application use, and knowledge regarding institutional policies governing the transmission of medical data. The sample included 22 otolaryngology residents and 47 practicing otolaryngologists. Of the physicians, 83% worked in academic centers, and 88% and 40% of the physicians who worked in a hospital setting or a community clinic used WhatsApp for medical use, respectively. Working with residents increased the medical usage of WhatsApp from 50% to 91% (P = .006). Finally, 72% were unfamiliar with any institutional policy regarding the transfer of medical information by personal smartphones. Mobile health is becoming an integral part of modern medical systems, improving accessibility, efficiency, and possibly quality of medical care. The need to incorporate personal mobile devices in the overall information technology standards, guidelines, and regulation is becoming more acute. Nonetheless, practices must be properly instituted to prevent unwanted consequences. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Towards predicting the encoding capability of MR fingerprinting sequences.

Science.gov (United States)

Sommer, K; Amthor, T; Doneva, M; Koken, P; Meineke, J; Börnert, P

2017-09-01

Sequence optimization and appropriate sequence selection is still an unmet need in magnetic resonance fingerprinting (MRF). The main challenge in MRF sequence design is the lack of an appropriate measure of the sequence's encoding capability. To find such a measure, three different candidates for judging the encoding capability have been investigated: local and global dot-product-based measures judging dictionary entry similarity as well as a Monte Carlo method that evaluates the noise propagation properties of an MRF sequence. Consistency of these measures for different sequence lengths as well as the capability to predict actual sequence performance in both phantom and in vivo measurements was analyzed. While the dot-product-based measures yielded inconsistent results for different sequence lengths, the Monte Carlo method was in a good agreement with phantom experiments. In particular, the Monte Carlo method could accurately predict the performance of different flip angle patterns in actual measurements. The proposed Monte Carlo method provides an appropriate measure of MRF sequence encoding capability and may be used for sequence optimization. Copyright © 2017 Elsevier Inc. All rights reserved.

Global information infrastructure.

Science.gov (United States)

Lindberg, D A

1994-01-01

The High Performance Computing and Communications Program (HPCC) is a multiagency federal initiative under the leadership of the White House Office of Science and Technology Policy, established by the High Performance Computing Act of 1991. It has been assigned a critical role in supporting the international collaboration essential to science and to health care. Goals of the HPCC are to extend USA leadership in high performance computing and networking technologies; to improve technology transfer for economic competitiveness, education, and national security; and to provide a key part of the foundation for the National Information Infrastructure. The first component of the National Institutes of Health to participate in the HPCC, the National Library of Medicine (NLM), recently issued a solicitation for proposals to address a range of issues, from privacy to 'testbed' networks, 'virtual reality,' and more. These efforts will build upon the NLM's extensive outreach program and other initiatives, including the Unified Medical Language System (UMLS), MEDLARS, and Grateful Med. New Internet search tools are emerging, such as Gopher and 'Knowbots'. Medicine will succeed in developing future intelligent agents to assist in utilizing computer networks. Our ability to serve patients is so often restricted by lack of information and knowledge at the time and place of medical decision-making. The new technologies, properly employed, will also greatly enhance our ability to serve the patient.
Assessing the Primary Data Hosted by the Spanish Node of the Global Biodiversity Information Facility (GBIF)

Science.gov (United States)

Otegui, Javier; Ariño, Arturo H.; Encinas, María A.; Pando, Francisco

2013-01-01

In order to effectively understand and cope with the current ‘biodiversity crisis’, having large-enough sets of qualified data is necessary. Information facilitators such as the Global Biodiversity Information Facility (GBIF) are ensuring increasing availability of primary biodiversity records by linking data collections spread over several institutions that have agreed to publish their data in a common access schema. We have assessed the primary records that one such publisher, the Spanish node of GBIF (GBIF.ES), hosts on behalf of a number of institutions, considered to be a highly representative sample of the total mass of available data for a country in order to know the quantity and quality of the information made available. Our results may provide an indication of the overall fitness-for-use in these data. We have found a number of patterns in the availability and accrual of data that seem to arise naturally from the digitization processes. Knowing these patterns and features may help deciding when and how these data can be used. Broadly, the error level seems low. The available data may be of capital importance for the development of biodiversity research, both locally and globally. However, wide swaths of records lack data elements such as georeferencing or taxonomical levels. Although the remaining information is ample and fit for many uses, improving the completeness of the records would likely increase the usability span for these data. PMID:23372828
The Global Invasive Species Information Network: contributing to GEO Task BI-07-01b

Science.gov (United States)

Graham, J.; Morisette, J. T.; Simpson, A.

2009-12-01

Invasive alien species (IAS) threaten biodiversity and exert a tremendous cost on society for IAS prevention and eradication. They endanger natural ecosystem functioning and seriously impact biodiversity and agricultural production. The task definition for the GEO task BI-07-01b: Invasive Species Monitoring System is to characterize, monitor, and predict changes in the distribution of invasive species. This includes characterizing the current requirements and capacity for invasive species monitoring and developing strategies for implementing cross-search functionality among existing online invasive species information systems from around the globe. The Task is being coordinated by members of the Global Invasive Species Information Network (GISIN) and their partners. Information on GISIN and a prototype of the network is available at www.gisin.org. This talk will report on the current status of GISIN and review how researchers can either contribute to or utilize data from this network.
Global features of the Alcanivorax borkumensis SK2 genome

DEFF Research Database (Denmark)

Reva, Oleg N.; Hallin, Peter Fischer; Willenbrock, Hanni

2008-01-01

The global feature of the completely sequenced Alcanivorax borkumensis SK2 type strain chromosome is its symmetry and homogeneity. The origin and terminus of replication are located opposite to each other in the chromosome and are discerned with high signal to noise ratios by maximal oligonucleot......The global feature of the completely sequenced Alcanivorax borkumensis SK2 type strain chromosome is its symmetry and homogeneity. The origin and terminus of replication are located opposite to each other in the chromosome and are discerned with high signal to noise ratios by maximal...... oligonucleotide usage biases on the leading and lagging strand. Genomic DNA structure is rather uniform throughout the chromosome with respect to intrinsic curvature, position preference or base stacking energy. The orthologs and paralogs of A. borkumensis genes with the highest sequence homology were found...
Establishing a framework for comparative analysis of genome sequences

Energy Technology Data Exchange (ETDEWEB)

Bansal, A.K.

1995-06-01

This paper describes a framework and a high-level language toolkit for comparative analysis of genome sequence alignment The framework integrates the information derived from multiple sequence alignment and phylogenetic tree (hypothetical tree of evolution) to derive new properties about sequences. Multiple sequence alignments are treated as an abstract data type. Abstract operations have been described to manipulate a multiple sequence alignment and to derive mutation related information from a phylogenetic tree by superimposing parsimonious analysis. The framework has been applied on protein alignments to derive constrained columns (in a multiple sequence alignment) that exhibit evolutionary pressure to preserve a common property in a column despite mutation. A Prolog toolkit based on the framework has been implemented and demonstrated on alignments containing 3000 sequences and 3904 columns.
Global Dispersal Pattern of HIV Type 1 Subtype CRF01_AE

OpenAIRE

Poljak, Mario; Angelis, Konstantinos; Albert, Jan; Mamais, Ioannis; Magiorkinis, Gkikas; Hatzakis, Angelos; Hamouda, Osamah; Stuck, Daniel; Vercauteren, Jurgen; Wensing, Annemarie; Alexiev, Ivailo

2016-01-01

Background. Human immunodeficiency virus type 1 (HIV-1) subtype CRF01_AE originated in Africa and then passed to Thailand, where it established a major epidemic. Despite the global presence of CRF01_AE, little is known about its subsequent dispersal pattern. Methods. We assembled a global data set of 2736 CRF01_AE sequences by pooling sequences from public databases and patient-cohort studies. We estimated viral dispersal patterns, using statistical phylogeographic analysis run over bootstrap...
Prediction of membrane transport proteins and their substrate specificities using primary sequence information.

Directory of Open Access Journals (Sweden)

Nitish K Mishra

Full Text Available Membrane transport proteins (transporters move hydrophilic substrates across hydrophobic membranes and play vital roles in most cellular functions. Transporters represent a diverse group of proteins that differ in topology, energy coupling mechanism, and substrate specificity as well as sequence similarity. Among the functional annotations of transporters, information about their transporting substrates is especially important. The experimental identification and characterization of transporters is currently costly and time-consuming. The development of robust bioinformatics-based methods for the prediction of membrane transport proteins and their substrate specificities is therefore an important and urgent task.Support vector machine (SVM-based computational models, which comprehensively utilize integrative protein sequence features such as amino acid composition, dipeptide composition, physico-chemical composition, biochemical composition, and position-specific scoring matrices (PSSM, were developed to predict the substrate specificity of seven transporter classes: amino acid, anion, cation, electron, protein/mRNA, sugar, and other transporters. An additional model to differentiate transporters from non-transporters was also developed. Among the developed models, the biochemical composition and PSSM hybrid model outperformed other models and achieved an overall average prediction accuracy of 76.69% with a Mathews correlation coefficient (MCC of 0.49 and a receiver operating characteristic area under the curve (AUC of 0.833 on our main dataset. This model also achieved an overall average prediction accuracy of 78.88% and MCC of 0.41 on an independent dataset.Our analyses suggest that evolutionary information (i.e., the PSSM and the AAIndex are key features for the substrate specificity prediction of transport proteins. In comparison, similarity-based methods such as BLAST, PSI-BLAST, and hidden Markov models do not provide accurate predictions
Nucleotide sequence preservation of human mitochondrial DNA

International Nuclear Information System (INIS)

Monnat, R.J. Jr.; Loeb, L.A.

1985-01-01

Recombinant DNA techniques have been used to quantitate the amount of nucleotide sequence divergence in the mitochondrial DNA population of individual normal humans. Mitochondrial DNA was isolated from the peripheral blood lymphocytes of five normal humans and cloned in M13 mp11; 49 kilobases of nucleotide sequence information was obtained from 248 independently isolated clones from the five normal donors. Both between- and within-individual differences were identified. Between-individual differences were identified in approximately = to 1/200 nucleotides. In contrast, only one within-individual difference was identified in 49 kilobases of nucleotide sequence information. This high degree of mitochondrial nucleotide sequence homogeneity in human somatic cells is in marked contrast to the rapid evolutionary divergence of human mitochondrial DNA and suggests the existence of mechanisms for the concerted preservation of mammalian mitochondrial DNA sequences in single organisms
Rapid and Accurate Sequencing of Enterovirus Genomes Using MinION Nanopore Sequencer.

Science.gov (United States)

Wang, Ji; Ke, Yue Hua; Zhang, Yong; Huang, Ke Qiang; Wang, Lei; Shen, Xin Xin; Dong, Xiao Ping; Xu, Wen Bo; Ma, Xue Jun

2017-10-01

Knowledge of an enterovirus genome sequence is very important in epidemiological investigation to identify transmission patterns and ascertain the extent of an outbreak. The MinION sequencer is increasingly used to sequence various viral pathogens in many clinical situations because of its long reads, portability, real-time accessibility of sequenced data, and very low initial costs. However, information is lacking on MinION sequencing of enterovirus genomes. In this proof-of-concept study using Enterovirus 71 (EV71) and Coxsackievirus A16 (CA16) strains as examples, we established an amplicon-based whole genome sequencing method using MinION. We explored the accuracy, minimum sequencing time, discrimination and high-throughput sequencing ability of MinION, and compared its performance with Sanger sequencing. Within the first minute (min) of sequencing, the accuracy of MinION was 98.5% for the single EV71 strain and 94.12%-97.33% for 10 genetically-related CA16 strains. In as little as 14 min, 99% identity was reached for the single EV71 strain, and in 17 min (on average), 99% identity was achieved for 10 CA16 strains in a single run. MinION is suitable for whole genome sequencing of enteroviruses with sufficient accuracy and fine discrimination and has the potential as a fast, reliable and convenient method for routine use. Copyright © 2017 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
INFORMATION THREATS IN A GLOBALIZED WORLD: ECONOMICS, POLITICS, SOCIETY (EXPERIENCE OF UKRAINE

Directory of Open Access Journals (Sweden)

Anatoliy Holovka

2016-11-01

Full Text Available The scientific article deals with both integral vision of the contemporary informative risks in the globalized world and their classification. The essence of the informative security is exposed, which is one of main factors of steady development of the modern informative society. In consideration of the foreign practice, the experience of Ukraine is also analyzed in counteraction to the contemporary informative threats. The effective policy of safety and counteraction to the informative threats is one of the basic constituents of the state national safety system and at the same time testifies to the correct character of connections between the public organs and the society. Under the conditions of unrestrained progress of information technologies and general informatization in all sectors of people’s life (politics, economy, defense, energy etc., providing of control and defense of informative space of the country becomes much more difficult task. Modern Ukrainian realities certify convincingly, that Ukraine is in an extremely difficult political situation that influences all spheres of Ukrainians’ life. The key reason of such situation is a military-informative aggression against Ukraine from Russia, which is the fact of waging a «hybrid war». As it is known, this type of war combines the application of both classic soldiery instruments (military technique, firearms, regular troops and methods of informative influence (cyber-attack, informative diversions, aggressive propaganda, impact on public opinion. This factor encourages such research. The object of the study is the phenomenon of information risks in the modern world. Subject of research – is the impact of modern information threats to the state and society, namely the economic, political and social spheres. For a holistic analysis of the subject of research was used appropriate methodology – systematic approach, method of comparative analysis, general scientific methods
Why Replacing Legacy Systems Is So Hard in Global Software Development: An Information Infrastructure Perspective

DEFF Research Database (Denmark)

Matthiesen, Stina; Bjørn, Pernille

2015-01-01

We report on an ethnographic study of an outsourcing global software development (GSD) setup between a Danish IT company and an Indian IT vendor developing a system to replace a legacy system for social services administration in Denmark. Physical distance and GSD collaboration issues tend...... to be obvious explanations for why GSD tasks fail to reach completion; however, we account for the difficulties within the technical nature of software system task. We use the framework of information infrastructure to show how replacing a legacy system in governmental information infrastructures includes...... the work of tracing back to knowledge concerning law, technical specifications, as well as how information infrastructures have dynamically evolved over time. Not easily carried out in a GSD setup is the work around technical tasks that requires careful examination of mundane technical aspects, standards...
Global analysis of small molecule binding to related protein targets.

Directory of Open Access Journals (Sweden)

Felix A Kruger

2012-01-01

Full Text Available We report on the integration of pharmacological data and homology information for a large scale analysis of small molecule binding to related targets. Differences in small molecule binding have been assessed for curated pairs of human to rat orthologs and also for recently diverged human paralogs. Our analysis shows that in general, small molecule binding is conserved for pairs of human to rat orthologs. Using statistical tests, we identified a small number of cases where small molecule binding is different between human and rat, some of which had previously been reported in the literature. Knowledge of species specific pharmacology can be advantageous for drug discovery, where rats are frequently used as a model system. For human paralogs, we demonstrate a global correlation between sequence identity and the binding of small molecules with equivalent affinity. Our findings provide an initial general model relating small molecule binding and sequence divergence, containing the foundations for a general model to anticipate and predict within-target-family selectivity.
Protein Function Prediction Based on Sequence and Structure Information

KAUST Repository

Smaili, Fatima Z.

2016-01-01

operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching
Stratigraphical analysis of the neoproterozoic sedimentary sequences of the Sao Francisco Basin

International Nuclear Information System (INIS)

Martins, Mariela; Lemos, Valesca Brasil

2007-01-01

A stratigraphic analysis was performed under the principles of Sequence Stratigraphy on the neoproterozoic sedimentary sequences of the Sao Francisco Basin (Central Brazil). Three periods of deposition separated by unconformities were recognized in the Sao Francisco Megasequence: (1) Sequences 1 and 2, a cryogenian glaciogenic sequence, followed by a distal scarp carbonate ramp, developed during stable conditions, (2) Sequence 3, a Upper Cryogenian stack homoclinal ramps with mixed carbonate-siliciclastic sedimentation, deposited under a progressive influence of compressional stresses of the Brasiliano Cycle, (3) Sequence 4, a Lower Ediacaran shallow platform dominated by siliciclastic sedimentation of molassic nature, the erosion product of the nearby uplifted thrust sheets. Each of the carbonate-bearing sequences presents a distinct δ 13 C isotopic signature. The superposition to the global curve for carbon isotopic variation allowed the recognition of a major depositional hiatus between the Paranoa and Sao Francisco Megasequences, and suggested that the glacial diamictite deposition (Jequitai Formation) took place most probably around 800 Ma. This constrains the Sao Francisco Megasequence deposition to the interval between 800 and 600 Ma (the known ages of the Brasiliano Orogeny defines the upper limit). A minor depositional hiatus (700.680 Ma) was also identified separating sequences 2 and 3. Isotopic analyses suggest that from then on, more restricted environmental conditions were established in the basin, probably associated with a first order global event, which prevailed throughout deposition of the Sequence 3. (author)
Computing handbook information systems and information technology

CERN Document Server

Topi, Heikki

2014-01-01

Disciplinary Foundations and Global ImpactEvolving Discipline of Information Systems Heikki TopiDiscipline of Information Technology Barry M. Lunt and Han ReichgeltInformation Systems as a Practical Discipline Juhani IivariInformation Technology Han Reichgelt, Joseph J. Ekstrom, Art Gowan, and Barry M. LuntSociotechnical Approaches to the Study of Information Systems Steve Sawyer and Mohammad Hossein JarrahiIT and Global Development Erkki SutinenUsing ICT for Development, Societal Transformation, and Beyond Sherif KamelTechnical Foundations of Data and Database ManagementData Models Avi Silber
Human Genome Sequencing in Health and Disease

Science.gov (United States)

Gonzaga-Jauregui, Claudia; Lupski, James R.; Gibbs, Richard A.

2013-01-01

Following the “finished,” euchromatic, haploid human reference genome sequence, the rapid development of novel, faster, and cheaper sequencing technologies is making possible the era of personalized human genomics. Personal diploid human genome sequences have been generated, and each has contributed to our better understanding of variation in the human genome. We have consequently begun to appreciate the vastness of individual genetic variation from single nucleotide to structural variants. Translation of genome-scale variation into medically useful information is, however, in its infancy. This review summarizes the initial steps undertaken in clinical implementation of personal genome information, and describes the application of whole-genome and exome sequencing to identify the cause of genetic diseases and to suggest adjuvant therapies. Better analysis tools and a deeper understanding of the biology of our genome are necessary in order to decipher, interpret, and optimize clinical utility of what the variation in the human genome can teach us. Personal genome sequencing may eventually become an instrument of common medical practice, providing information that assists in the formulation of a differential diagnosis. We outline herein some of the remaining challenges. PMID:22248320
Information Seeking about Global Climate Change among Adolescents: The Role of Risk Perceptions, Efficacy Beliefs and Parental Influences

Science.gov (United States)

Mead, Erin; Roser-Renouf, Connie; Rimal, Rajiv N.; Flora, June A.; Maibach, Edward W.; Leiserowitz, Anthony

2012-01-01

Global climate change is likely to have significant impacts on public health. Effective communication is critical to informing public decision making and behavior to mitigate climate change. An effective method of audience segmentation, the risk perception attitude (RPA) framework has been previously tested with other health behaviors and classifies people into 4 groups on the basis of their perceptions of risk and beliefs about personal efficacy. The 4 groups – indifference (low risk, weak efficacy), proactive (low risk, strong efficacy), avoidance (high risk, weak efficacy), and responsive (high risk, strong efficacy) – are hypothesized to differ in their self-protective behaviors and in their motivations to seek information. In this paper, we extend the RPA framework in two ways. First, we use it at the household level to determine whether parental classifications into the 4 groups are associated with their teenage children’s classification into the same 4 groups. Second, we predict adolescent information-seeking behaviors on the basis of their and their parents’ membership in the 4 RPA groups. Results (N = 523 parent-adolescent pairs) indicated that parental membership in the 4 RPA groups was significantly associated with children’s membership in the same 4 groups. Furthermore, the RPA framework was a significant predictor of adolescent information-seeking: those in the responsive and avoidance groups sought more information on climate change than the indifference group. Family communication on global warming was positively associated with adolescents’ information-seeking. Implications for interventions are discussed. PMID:22866024
Rfam: annotating families of non-coding RNA sequences.

Science.gov (United States)

Daub, Jennifer; Eberhardt, Ruth Y; Tate, John G; Burge, Sarah W

2015-01-01

The primary task of the Rfam database is to collate experimentally validated noncoding RNA (ncRNA) sequences from the published literature and facilitate the prediction and annotation of new homologues in novel nucleotide sequences. We group homologous ncRNA sequences into "families" and related families are further grouped into "clans." We collate and manually curate data cross-references for these families from other databases and external resources. Our Web site offers researchers a simple interface to Rfam and provides tools with which to annotate their own sequences using our covariance models (CMs), through our tools for searching, browsing, and downloading information on Rfam families. In this chapter, we will work through examples of annotating a query sequence, collating family information, and searching for data.
Information as Wealth.

Science.gov (United States)

Deruchie, Douglas M.

1992-01-01

Discusses the value of information-based services in today's global economy. The combination of information technology with library and information services at an international accounting and auditing firm is described; the Global Information Network is explained; and the importance of the appropriate use of information is discussed. (LRW)
Global properties of cellular automata

International Nuclear Information System (INIS)

Jen, E.

1986-01-01

Cellular automata are discrete mathematical systems that generate diverse, often complicated, behavior using simple deterministic rules. Analysis of the local structure of these rules makes possible a description of the global properties of the associated automata. A class of cellular automata that generate infinitely many aperoidic temporal sequences is defined,a s is the set of rules for which inverses exist. Necessary and sufficient conditions are derived characterizing the classes of ''nearest-neighbor'' rules for which arbitrary finite initial conditions (i) evolve to a homogeneous state; (ii) generate at least one constant temporal sequence

Sequence History Update Tool

Science.gov (United States)

Khanampompan, Teerapat; Gladden, Roy; Fisher, Forest; DelGuercio, Chris

2008-01-01

The Sequence History Update Tool performs Web-based sequence statistics archiving for Mars Reconnaissance Orbiter (MRO). Using a single UNIX command, the software takes advantage of sequencing conventions to automatically extract the needed statistics from multiple files. This information is then used to populate a PHP database, which is then seamlessly formatted into a dynamic Web page. This tool replaces a previous tedious and error-prone process of manually editing HTML code to construct a Web-based table. Because the tool manages all of the statistics gathering and file delivery to and from multiple data sources spread across multiple servers, there is also a considerable time and effort savings. With the use of The Sequence History Update Tool what previously took minutes is now done in less than 30 seconds, and now provides a more accurate archival record of the sequence commanding for MRO.
Complete genome sequence of Ikoma lyssavirus.

Science.gov (United States)

Marston, Denise A; Ellis, Richard J; Horton, Daniel L; Kuzmin, Ivan V; Wise, Emma L; McElhinney, Lorraine M; Banyard, Ashley C; Ngeleja, Chanasa; Keyyu, Julius; Cleaveland, Sarah; Lembo, Tiziana; Rupprecht, Charles E; Fooks, Anthony R

2012-09-01

Lyssaviruses (family Rhabdoviridae) constitute one of the most important groups of viral zoonoses globally. All lyssaviruses cause the disease rabies, an acute progressive encephalitis for which, once symptoms occur, there is no effective cure. Currently available vaccines are highly protective against the predominantly circulating lyssavirus species. Using next-generation sequencing technologies, we have obtained the whole-genome sequence for a novel lyssavirus, Ikoma lyssavirus (IKOV), isolated from an African civet in Tanzania displaying clinical signs of rabies. Genetically, this virus is the most divergent within the genus Lyssavirus. Characterization of the genome will help to improve our understanding of lyssavirus diversity and enable investigation into vaccine-induced immunity and protection.
Effects of Sequences of Cognitions on Group Performance Over Time.

Science.gov (United States)

Molenaar, Inge; Chiu, Ming Ming

2017-04-01

Extending past research showing that sequences of low cognitions (low-level processing of information) and high cognitions (high-level processing of information through questions and elaborations) influence the likelihoods of subsequent high and low cognitions, this study examines whether sequences of cognitions are related to group performance over time; 54 primary school students (18 triads) discussed and wrote an essay about living in another country (32,375 turns of talk). Content analysis and statistical discourse analysis showed that within each lesson, groups with more low cognitions or more sequences of low cognition followed by high cognition added more essay words. Groups with more high cognitions, sequences of low cognition followed by low cognition, or sequences of high cognition followed by an action followed by low cognition, showed different words and sequences, suggestive of new ideas. The links between cognition sequences and group performance over time can inform facilitation and assessment of student discussions.
Sequencing at sea: challenges and experiences in Ion Torrent PGM sequencing during the 2013 Southern Line Islands Research Expedition

Directory of Open Access Journals (Sweden)

Yan Wei Lim

2014-08-01

Full Text Available Genomics and metagenomics have revolutionized our understanding of marine microbial ecology and the importance of microbes in global geochemical cycles. However, the process of DNA sequencing has always been an abstract extension of the research expedition, completed once the samples were returned to the laboratory. During the 2013 Southern Line Islands Research Expedition, we started the first effort to bring next generation sequencing to some of the most remote locations on our planet. We successfully sequenced twenty six marine microbial genomes, and two marine microbial metagenomes using the Ion Torrent PGM platform on the Merchant Yacht Hanse Explorer. Onboard sequence assembly, annotation, and analysis enabled us to investigate the role of the microbes in the coral reef ecology of these islands and atolls. This analysis identified phosphonate as an important phosphorous source for microbes growing in the Line Islands and reinforced the importance of L-serine in marine microbial ecosystems. Sequencing in the field allowed us to propose hypotheses and conduct experiments and further sampling based on the sequences generated. By eliminating the delay between sampling and sequencing, we enhanced the productivity of the research expedition. By overcoming the hurdles associated with sequencing on a boat in the middle of the Pacific Ocean we proved the flexibility of the sequencing, annotation, and analysis pipelines.
LookSeq: A browser-based viewer for deep sequencing data

OpenAIRE

Manske, Heinrich Magnus; Kwiatkowski, Dominic P.

2009-01-01

Sequencing a genome to great depth can be highly informative about heterogeneity within an individual or a population. Here we address the problem of how to visualize the multiple layers of information contained in deep sequencing data. We propose an interactive AJAX-based web viewer for browsing large data sets of aligned sequence reads. By enabling seamless browsing and fast zooming, the LookSeq program assists the user to assimilate information at different levels of resolution, from an ov...
Next-generation sequencing

DEFF Research Database (Denmark)

Rieneck, Klaus; Bak, Mads; Jønson, Lars

2013-01-01

, Illumina); several millions of PCR sequences were analyzed. RESULTS: The results demonstrated the feasibility of diagnosing the fetal KEL1 or KEL2 blood group from cell-free DNA purified from maternal plasma. CONCLUSION: This method requires only one primer pair, and the large amount of sequence...... information obtained allows well for statistical analysis of the data. This general approach can be integrated into current laboratory practice and has numerous applications. Besides DNA-based predictions of blood group phenotypes, platelet phenotypes, or sickle cell anemia, and the determination of zygosity...
[Sex differences in relationship between creativity and hemispheric information processing in global and local levels].

Science.gov (United States)

Razumnikova, O M; Vol'f, N V

2012-01-01

Sex differences in creativity related global-local hemispheric selective processing were examined by hierarchical letter presenting in conditions of their perception and comparison. Fifty-six right-handed males and 68 females (aged 17-22 years) participated in the experiments. Originality-imagery was assessed by a computer-based Torrance 'Incomplete Figures' test software. Verbal creativity was valued by original sentence using of three nouns from remote semantic categories. The results show that irrespectively of the sex factor and the type of creative thinking, its originality is provided by high speed of right-hemispheric processes of information selection on the global level and delay in the interhemispheric communication. Relationships between originality of ideas and hemispheric attentional characteristics are presented mostly in men while verbal creative problem solving, and in women while figurative original thinking. Originality of verbal activity in men is more associated with success of selective processes in the left hemisphere, but in women--with selective functions of both hemispheres. Figurative thinking in men is less related to hemispheric characteristics of attention compared with women. Increase of figurative originality in women is accompanied acceleration of processes of selection of the information in the right hemisphere, and also higher efficiency of local attention as well as speeds ofglobal processing in the left hemisphere.
ASSESSMENT OF THE BUSINESS ENVIRONMENT FOR DEVELOPMENT OF GLOBAL MARKETING STRATEGY

Directory of Open Access Journals (Sweden)

V. Savelyev

2014-03-01

Full Text Available The article concerns with essence of assessment of the business environment and specific directions of analysis during the working out of global marketing strategy. The classification of the global marketing environment researches and tasks sequence in the context of the decisions made on each stage of global marketing strategy is proposed.
Transcriptome analysis of carnation (Dianthus caryophyllus L.) based on next-generation sequencing technology.

Science.gov (United States)

Tanase, Koji; Nishitani, Chikako; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Ohmiya, Akemi; Onozaki, Takashi

2012-07-02

Carnation (Dianthus caryophyllus L.), in the family Caryophyllaceae, can be found in a wide range of colors and is a model system for studies of flower senescence. In addition, it is one of the most important flowers in the global floriculture industry. However, few genomics resources, such as sequences and markers are available for carnation or other members of the Caryophyllaceae. To increase our understanding of the genetic control of important characters in carnation, we generated an expressed sequence tag (EST) database for a carnation cultivar important in horticulture by high-throughput sequencing using 454 pyrosequencing technology. We constructed a normalized cDNA library and a 3'-UTR library of carnation, obtaining a total of 1,162,126 high-quality reads. These reads were assembled into 300,740 unigenes consisting of 37,844 contigs and 262,896 singlets. The contigs were searched against an Arabidopsis sequence database, and 61.8% (23,380) of them had at least one BLASTX hit. These contigs were also annotated with Gene Ontology (GO) and were found to cover a broad range of GO categories. Furthermore, we identified 17,362 potential simple sequence repeats (SSRs) in 14,291 of the unigenes. We focused on gene discovery in the areas of flower color and ethylene biosynthesis. Transcripts were identified for almost every gene involved in flower chlorophyll and carotenoid metabolism and in anthocyanin biosynthesis. Transcripts were also identified for every step in the ethylene biosynthesis pathway. We present the first large-scale sequence data set for carnation, generated using next-generation sequencing technology. The large EST database generated from these sequences is an informative resource for identifying genes involved in various biological processes in carnation and provides an EST resource for understanding the genetic diversity of this plant.
Transcriptome analysis of carnation (Dianthus caryophyllus L. based on next-generation sequencing technology

Directory of Open Access Journals (Sweden)

Tanase Koji

2012-07-01

Full Text Available Abstract Background Carnation (Dianthus caryophyllus L., in the family Caryophyllaceae, can be found in a wide range of colors and is a model system for studies of flower senescence. In addition, it is one of the most important flowers in the global floriculture industry. However, few genomics resources, such as sequences and markers are available for carnation or other members of the Caryophyllaceae. To increase our understanding of the genetic control of important characters in carnation, we generated an expressed sequence tag (EST database for a carnation cultivar important in horticulture by high-throughput sequencing using 454 pyrosequencing technology. Results We constructed a normalized cDNA library and a 3’-UTR library of carnation, obtaining a total of 1,162,126 high-quality reads. These reads were assembled into 300,740 unigenes consisting of 37,844 contigs and 262,896 singlets. The contigs were searched against an Arabidopsis sequence database, and 61.8% (23,380 of them had at least one BLASTX hit. These contigs were also annotated with Gene Ontology (GO and were found to cover a broad range of GO categories. Furthermore, we identified 17,362 potential simple sequence repeats (SSRs in 14,291 of the unigenes. We focused on gene discovery in the areas of flower color and ethylene biosynthesis. Transcripts were identified for almost every gene involved in flower chlorophyll and carotenoid metabolism and in anthocyanin biosynthesis. Transcripts were also identified for every step in the ethylene biosynthesis pathway. Conclusions We present the first large-scale sequence data set for carnation, generated using next-generation sequencing technology. The large EST database generated from these sequences is an informative resource for identifying genes involved in various biological processes in carnation and provides an EST resource for understanding the genetic diversity of this plant.
Integrating remote sensing, geographic information systems and global positioning system techniques with hydrological modeling

Science.gov (United States)

Thakur, Jay Krishna; Singh, Sudhir Kumar; Ekanthalu, Vicky Shettigondahalli

2017-07-01

Integration of remote sensing (RS), geographic information systems (GIS) and global positioning system (GPS) are emerging research areas in the field of groundwater hydrology, resource management, environmental monitoring and during emergency response. Recent advancements in the fields of RS, GIS, GPS and higher level of computation will help in providing and handling a range of data simultaneously in a time- and cost-efficient manner. This review paper deals with hydrological modeling, uses of remote sensing and GIS in hydrological modeling, models of integrations and their need and in last the conclusion. After dealing with these issues conceptually and technically, we can develop better methods and novel approaches to handle large data sets and in a better way to communicate information related with rapidly decreasing societal resources, i.e. groundwater.
Global Ethics Applied: Global Ethics, Economic Ethics

OpenAIRE

Stückelberger, Christoph

2016-01-01

Global Ethics Applied’ in four volumes is a reader of 88 selected articles from the author on 13 domains: Vol. 1 Global Ethics, Economic Ethics; Vol. 2 Environmental Ethics; Vol. 3 Development Ethics, Political Ethics, Dialogue and Peace Ethics, Innovation and Research Ethics, Information and Communication Ethics; Vol. 4 Bioethics and Medical Ethics, Family Ethics and Sexual Ethics, Leadership Ethics, Theological Ethics and Ecclesiology, Methods of Ethics. It concludes with the extended Bibli...
How information technology can help sustainability and aid in combating global warming[ACI SP-234-44

Energy Technology Data Exchange (ETDEWEB)

Kondratova, I.L.; Goldfarb, I. [National Research Council of Canada, Ottawa, ON (Canada). Inst. for Information Technology

2006-07-01

This presentation addressed the need to reduce the environmental impact of concrete production. Unit based carbon dioxide emissions in cement production vary from 0.73 to 0.99 kg carbon dioxide per kg of cement. As such, annual cement manufacturing contributes significantly to global warming. The challenge facing the concrete industry regarding sustainable growth was discussed. It was suggested that sustainable development in the cement industry can be accomplished not only by making an industry wide shift to conservation of energy and materials, but by making greater use of the Internet for information technology on sustainable construction materials such as lightweight aggregates and lightweight concrete. The paper outlined the evolution of various methods of disseminating research results on the durability of concrete at the United States Army Corps of Engineers Treat Island marine exposure site. The results indicated that structural lightweight and semi-lightweight concrete provides long-term durability in a marine environment. It was noted that knowledge utilization includes technology transfer, information dissemination and utilization, research utilization, innovation, and organizational change. The paper emphasized the use of web portals as a tool for improving access to practical information on a full range of sustainable industry practices, products and resources. These tools allow side-by side comparison of testing results for different concrete mixtures and support decision-making on the choice of environmentally sound and durable concrete. The authors demonstrated by advantages of using modern information technology tools by suggesting that with the development of a full scale Portal, the Expanded Shale, Clay, and Slate Institute (ESCSI) could become a global source of credible information and expertise in the area of lightweight concrete. As such ESCSI could be in a position to influence innovation and technology transfer to the industry. The paper
Globalization and American Education

Science.gov (United States)

Merriman, William; Nicoletti, Augustine

2008-01-01

Globalization is a potent force in today's world. The welfare of the United States is tied to the welfare of other countries by economics, the environment, politics, culture, information, and technology. This paper identifies the implications of globalization for education, presents applications of important aspects of globalization that teachers…
Genomic analysis of expressed sequence tags in American black bear Ursus americanus

Science.gov (United States)

2010-01-01

Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065
Genomic analysis of expressed sequence tags in American black bear Ursus americanus.

Science.gov (United States)

Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun

2010-03-26

Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.
Globally COnstrained Local Function Approximation via Hierarchical Modelling, a Framework for System Modelling under Partial Information

DEFF Research Database (Denmark)

Øjelund, Henrik; Sadegh, Payman

2000-01-01

be obtained. This paper presents a new approach for system modelling under partial (global) information (or the so called Gray-box modelling) that seeks to perserve the benefits of the global as well as local methodologies sithin a unified framework. While the proposed technique relies on local approximations......Local function approximations concern fitting low order models to weighted data in neighbourhoods of the points where the approximations are desired. Despite their generality and convenience of use, local models typically suffer, among others, from difficulties arising in physical interpretation...... simultaneously with the (local estimates of) function values. The approach is applied to modelling of a linear time variant dynamic system under prior linear time invariant structure where local regression fails as a result of high dimensionality....
LookSeq: a browser-based viewer for deep sequencing data.

Science.gov (United States)

Manske, Heinrich Magnus; Kwiatkowski, Dominic P

2009-11-01

Sequencing a genome to great depth can be highly informative about heterogeneity within an individual or a population. Here we address the problem of how to visualize the multiple layers of information contained in deep sequencing data. We propose an interactive AJAX-based web viewer for browsing large data sets of aligned sequence reads. By enabling seamless browsing and fast zooming, the LookSeq program assists the user to assimilate information at different levels of resolution, from an overview of a genomic region to fine details such as heterogeneity within the sample. A specific problem, particularly if the sample is heterogeneous, is how to depict information about structural variation. LookSeq provides a simple graphical representation of paired sequence reads that is more revealing about potential insertions and deletions than are conventional methods.
Information security protecting the global enterprise

CERN Document Server

Pipkin, Donald L

2000-01-01

In this book, IT security expert Donald Pipkin addresses every aspect of information security: the business issues, the technical process issues, and the legal issues. Pipkin starts by reviewing the key business issues: estimating the value of information assets, evaluating the cost to the organization if they are lost or disclosed, and determining the appropriate levels of protection and response to security incidents. Next, he walks through the technical processes required to build a consistent, reasonable information security system, with appropriate intrusion detection and reporting features. Finally, Pipkin reviews the legal issues associated with information security, including corporate officers' personal liability for taking care that information is protected. The book's coverage is applicable to businesses of any size, from 50 employees to 50,000 or more, and ideal for everyone who needs at least a basic understanding of information security: network/system administrators, managers, planners, archite...
Design of Long Period Pseudo-Random Sequences from the Addition of m -Sequences over 𝔽 p

Directory of Open Access Journals (Sweden)

Ren Jian

2004-01-01

Full Text Available Pseudo-random sequence with good correlation property and large linear span is widely used in code division multiple access (CDMA communication systems and cryptology for reliable and secure information transmission. In this paper, sequences with long period, large complexity, balance statistics, and low cross-correlation property are constructed from the addition of m -sequences with pairwise-prime linear spans (AMPLS. Using m -sequences as building blocks, the proposed method proved to be an efficient and flexible approach to construct long period pseudo-random sequences with desirable properties from short period sequences. Applying the proposed method to 𝔽 2 , a signal set ( ( 2 n − 1 ( 2 m − 1 , ( 2 n + 1 ( 2 m + 1 , ( 2 ( n + 1 / 2 + 1 ( 2 ( m + 1 / 2 + 1 is constructed.

Human Capital Response to Globalization: Education and Information Technology in India

Science.gov (United States)

Shastry, Gauri Kartini

2012-01-01

Recent studies suggest that globalization increases inequality, by increasing skilled wage premiums in developing countries. This effect may be mitigated, however, if human capital responds to global opportunities. I study how the impact of globalization varies across Indian districts with different costs of learning English. Linguistic diversity…
Developing a framework to assess the cost-effectiveness of COMPARE -A global platform for the exchange of sequence-based pathogen data

DEFF Research Database (Denmark)

Alleweldt, F.; Kara, Sami; Osinski, A.

2017-01-01

Analysing the genomic data of pathogens with the help of next-generation sequencing (NGS) is an increasingly important part of disease outbreak investigations and helps guide responses. While this technology has already been successfully employed to elucidate and control disease outbreaks, wider...... implementation of NGS also depends on its cost-effectiveness. COMPARE - short for 'Collaborative Management Platform for detection and Analyses of (Re-) emerging and foodborne outbreaks' - is a major project, funded by the European Union, to develop a global platform for sharing and analysing NGS data...... and thereby improve the rapid identification, containment and mitigation of emerging infectious diseases and foodborne outbreaks. This article introduces the project and presents the results of a review of the literature, composed of previous relevant cost-benefit and cost-effectiveness analyses. The authors...
The Latest Books on Globalization

Directory of Open Access Journals (Sweden)

Elia Zaru

2016-12-01

Full Text Available The latest books published on globalization raise interesting issues which reflect upon the very complexity of the process we are facing. In The Great Convergence: Information Technology and the New Globalization Richard Baldwin proposes a history of globalization divided into two stages. As Baldwin argues, the process of globalization has to be divided into “old” and “new” age. The “old” globalization took place between 1820 and 1990. It was characterized by the “great divergence”, that is by the centralization of world income in today’s wealthy nations. However, since 1990 the sharing of world income has plummeted to where it was in 1900. According to Baldwin, this reversal of fortune is a symptom of a shift in the globalization process. The “new” globalization, driven by information technology, has combined high tech with low wages, and lead simultaneously to the industrialization of developing nations and deindustrialization of developed ones. This is the “great convergence”: in the “new” globalization rich and developing nations are alike and they face equal global challenges.
Doing global science a guide to responsible conduct in the global research enterprise

CERN Document Server

InterAcademy Partnership

2016-01-01

This concise introductory guide explains the values that should inform the responsible conduct of scientific research in today's global setting. Featuring accessible discussions and ample real-world scenarios, Doing Global Science covers proper conduct, fraud and bias, the researcher's responsibilities to society, communication with the public, and much more. The book places special emphasis on the international and highly networked environment in which modern research is done, presenting science as an enterprise that is being transformed by globalization, interdisciplinary research projects, team science, and information technologies. Accessibly written by an InterAcademy Partnership committee comprised of leading scientists from around the world, Doing Global Science is required reading for students, practitioners, and anyone concerned about the responsible conduct of science today.
Genomic multiple sequence alignments: refinement using a genetic algorithm

Directory of Open Access Journals (Sweden)

Lefkowitz Elliot J

2005-08-01

Full Text Available Abstract Background Genomic sequence data cannot be fully appreciated in isolation. Comparative genomics – the practice of comparing genomic sequences from different species – plays an increasingly important role in understanding the genotypic differences between species that result in phenotypic differences as well as in revealing patterns of evolutionary relationships. One of the major challenges in comparative genomics is producing a high-quality alignment between two or more related genomic sequences. In recent years, a number of tools have been developed for aligning large genomic sequences. Most utilize heuristic strategies to identify a series of strong sequence similarities, which are then used as anchors to align the regions between the anchor points. The resulting alignment is globally correct, but in many cases is suboptimal locally. We describe a new program, GenAlignRefine, which improves the overall quality of global multiple alignments by using a genetic algorithm to improve local regions of alignment. Regions of low quality are identified, realigned using the program T-Coffee, and then refined using a genetic algorithm. Because a better COFFEE (Consistency based Objective Function For alignmEnt Evaluation score generally reflects greater alignment quality, the algorithm searches for an alignment that yields a better COFFEE score. To improve the intrinsic slowness of the genetic algorithm, GenAlignRefine was implemented as a parallel, cluster-based program. Results We tested the GenAlignRefine algorithm by running it on a Linux cluster to refine sequences from a simulation, as well as refine a multiple alignment of 15 Orthopoxvirus genomic sequences approximately 260,000 nucleotides in length that initially had been aligned by Multi-LAGAN. It took approximately 150 minutes for a 40-processor Linux cluster to optimize some 200 fuzzy (poorly aligned regions of the orthopoxvirus alignment. Overall sequence identity increased only
Global phylogenetic analysis of contemporary aleutian mink disease viruses (AMDVs)

DEFF Research Database (Denmark)

Ryt-Hansen, Pia; Hagberg, E. E.; Chriél, Mariann

2017-01-01

a strain originating from Sweden. In contrast, we did not identify any potential source for the other and more widespread outbreak strain. To the authors knowledge this is the first major global phylogenetic study of contemporary AMDV partial NS1 sequences. The study proved that partial NS1 sequencing can...
Transcriptome sequencing of the Microarray Quality Control (MAQC RNA reference samples using next generation sequencing

Directory of Open Access Journals (Sweden)

Thierry-Mieg Danielle

2009-06-01

Full Text Available Abstract Background Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC reference RNA samples using Roche's 454 Genome Sequencer FLX. Results We generated more that 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well annotated genes with e-values ≤ 10-20. We measured gene expression levels in the A and B samples by counting the numbers of reads that mapped to individual RefSeq genes in multiple sequencing runs to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy and compared the results with DNA microarrays and Quantitative RT-PCR (QRTPCR from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs with an average 90% sequence similarity to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database. Conclusion Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome including the discovery of thousands of new splice variants.
PseudoMLSA: a database for multigenic sequence analysis of Pseudomonas species

Directory of Open Access Journals (Sweden)

Lalucat Jorge

2010-04-01

Full Text Available Abstract Background The genus Pseudomonas comprises more than 100 species of environmental, clinical, agricultural, and biotechnological interest. Although, the recommended method for discriminating bacterial species is DNA-DNA hybridisation, alternative techniques based on multigenic sequence analysis are becoming a common practice in bacterial species discrimination studies. Since there is not a general criterion for determining which genes are more useful for species resolution; the number of strains and genes analysed is increasing continuously. As a result, sequences of different genes are dispersed throughout several databases. This sequence information needs to be collected in a common database, in order to be useful for future identification-based projects. Description The PseudoMLSA Database is a comprehensive database of multiple gene sequences from strains of Pseudomonas species. The core of the database is composed of selected gene sequences from all Pseudomonas type strains validly assigned to the genus through 2008. The database is aimed to be useful for MultiLocus Sequence Analysis (MLSA procedures, for the identification and characterisation of any Pseudomonas bacterial isolate. The sequences are available for download via a direct connection to the National Center for Biotechnology Information (NCBI. Additionally, the database includes an online BLAST interface for flexible nucleotide queries and similarity searches with the user's datasets, and provides a user-friendly output for easily parsing, navigating, and analysing BLAST results. Conclusions The PseudoMLSA database amasses strains and sequence information of validly described Pseudomonas species, and allows free querying of the database via a user-friendly, web-based interface available at http://www.uib.es/microbiologiaBD/Welcome.html. The web-based platform enables easy retrieval at strain or gene sequence information level; including references to published peer
Relationships between emm and multilocus sequence types within a global collection of Streptococcus pyogenes

Directory of Open Access Journals (Sweden)

McGregor Karen F

2008-04-01

Full Text Available Abstract Background The M type-specific surface protein antigens encoded by the 5' end of emm genes are targets of protective host immunity and attractive vaccine candidates against infection by Streptococcus pyogenes, a global human pathogen. A history of genetic change in emm was evaluated for a worldwide collection of > 500 S. pyogenes isolates that were defined for genetic background by multilocus sequence typing of housekeeping genes. Results Organisms were categorized by genotypes that roughly correspond to throat specialists, skin specialists, and generalists often recovered from infections at either tissue site. Recovery of distant clones sharing the same emm type was ~4-fold higher for skin specialists and generalists, as compared to throat specialists. Importantly, emm type was often a poor marker for clone. Recovery of clones that underwent recombinational replacement with a new emm type was most evident for the throat and skin specialists. The average ratio of nonsynonymous substitutions per nonsynonymous site (Ka and synonymous substitutions per synonymous site (Ks was 4.9, 1.5 and 1.3 for emm types of the throat specialist, skin specialist and generalist groups, respectively. Conclusion Data indicate that the relationships between emm type and genetic background differ among the three host tissue-related groups, and that the selection pressures acting on emm appear to be strongest for the throat specialists. Since positive selection is likely due in part to a protective host immune response, the findings may have important implications for vaccine design and vaccination strategies.
Meditation increases the depth of information processing and improves the allocation of attention in space

Directory of Open Access Journals (Sweden)

Sara evan Leeuwen

2012-05-01

Full Text Available During meditation, practitioners are required to center their attention on a specific object for extended periods of time. When their thoughts get diverted, they learn to quickly disengage from the distracter. We hypothesized that learning to respond to the dual demand of engaging attention on specific objects and disengaging quickly from distracters enhances the efficiency by which meditation practitioners can allocate attention. We tested this hypothesis in a global-to-local task while measuring electroencephalographic activity from a group of eight highly trained Buddhist monks and nuns and a group of eight age and education matched controls with no previous meditation experience. Specifically, we investigated the effect of attentional training on the global precedence effect, i.e., faster detection of targets on a global than on a local level. We expected to find a reduced global precedence effect in meditation practitioners but not in controls, reflecting that meditators can more quickly disengage their attention from the dominant global level. Analysis of reaction times confirmed this prediction. To investigate the underlying changes in brain activity and their time course, we analyzed event-related potentials. Meditators showed an enhanced ability to select the respective target level, as reflected by enhanced processing of target level information. In contrast with control group, which showed a local target selection effect only in the P1 and a global target selection effect in the P3 component, meditators showed effects of local information processing in the P1, N2 and P3 and of global processing for the N1, N2 and P3. Thus, meditators seem to display enhanced depth of processing. In addition, meditation altered the uptake of information such that meditators selected target level information earlier in the processing sequence than controls. In a longitudinal experiment, we could replicate the behavioral effects, suggesting that
Prediction of Human Activity by Discovering Temporal Sequence Patterns.

Science.gov (United States)

Li, Kang; Fu, Yun

2014-08-01

Early prediction of ongoing human activity has become more valuable in a large variety of time-critical applications. To build an effective representation for prediction, human activities can be characterized by a complex temporal composition of constituent simple actions and interacting objects. Different from early detection on short-duration simple actions, we propose a novel framework for long -duration complex activity prediction by discovering three key aspects of activity: Causality, Context-cue, and Predictability. The major contributions of our work include: (1) a general framework is proposed to systematically address the problem of complex activity prediction by mining temporal sequence patterns; (2) probabilistic suffix tree (PST) is introduced to model causal relationships between constituent actions, where both large and small order Markov dependencies between action units are captured; (3) the context-cue, especially interactive objects information, is modeled through sequential pattern mining (SPM), where a series of action and object co-occurrence are encoded as a complex symbolic sequence; (4) we also present a predictive accumulative function (PAF) to depict the predictability of each kind of activity. The effectiveness of our approach is evaluated on two experimental scenarios with two data sets for each: action-only prediction and context-aware prediction. Our method achieves superior performance for predicting global activity classes and local action units.
A Proteomic Workflow Using High-Throughput De Novo Sequencing Towards Complementation of Genome Information for Improved Comparative Crop Science.

Science.gov (United States)

Turetschek, Reinhard; Lyon, David; Desalegn, Getinet; Kaul, Hans-Peter; Wienkoop, Stefanie

2016-01-01

The proteomic study of non-model organisms, such as many crop plants, is challenging due to the lack of comprehensive genome information. Changing environmental conditions require the study and selection of adapted cultivars. Mutations, inherent to cultivars, hamper protein identification and thus considerably complicate the qualitative and quantitative comparison in large-scale systems biology approaches. With this workflow, cultivar-specific mutations are detected from high-throughput comparative MS analyses, by extracting sequence polymorphisms with de novo sequencing. Stringent criteria are suggested to filter for confidential mutations. Subsequently, these polymorphisms complement the initially used database, which is ready to use with any preferred database search algorithm. In our example, we thereby identified 26 specific mutations in two cultivars of Pisum sativum and achieved an increased number (17 %) of peptide spectrum matches.
On the efficiency of chaos optimization algorithms for global optimization

International Nuclear Information System (INIS)

Yang Dixiong; Li Gang; Cheng Gengdong

2007-01-01

Chaos optimization algorithms as a novel method of global optimization have attracted much attention, which were all based on Logistic map. However, we have noticed that the probability density function of the chaotic sequences derived from Logistic map is a Chebyshev-type one, which may affect the global searching capacity and computational efficiency of chaos optimization algorithms considerably. Considering the statistical property of the chaotic sequences of Logistic map and Kent map, the improved hybrid chaos-BFGS optimization algorithm and the Kent map based hybrid chaos-BFGS algorithm are proposed. Five typical nonlinear functions with multimodal characteristic are tested to compare the performance of five hybrid optimization algorithms, which are the conventional Logistic map based chaos-BFGS algorithm, improved Logistic map based chaos-BFGS algorithm, Kent map based chaos-BFGS algorithm, Monte Carlo-BFGS algorithm, mesh-BFGS algorithm. The computational performance of the five algorithms is compared, and the numerical results make us question the high efficiency of the chaos optimization algorithms claimed in some references. It is concluded that the efficiency of the hybrid optimization algorithms is influenced by the statistical property of chaotic/stochastic sequences generated from chaotic/stochastic algorithms, and the location of the global optimum of nonlinear functions. In addition, it is inappropriate to advocate the high efficiency of the global optimization algorithms only depending on several numerical examples of low-dimensional functions
Global Education: What the Research Shows. Information Capsule. Volume 0604

Science.gov (United States)

Blazer, Christie

2006-01-01

Teaching from a global perspective is important because the lives of people around the world are increasingly interconnected through politics, economics, technology, and the environment. Global education teaches students to understand and appreciate people from different cultural backgrounds; view events from a variety of perspectives; recognize…
10KP: A phylodiverse genome sequencing plan.

Science.gov (United States)

Cheng, Shifeng; Melkonian, Michael; Smith, Stephen A; Brockington, Samuel; Archibald, John M; Delaux, Pierre-Marc; Li, Fay-Wei; Melkonian, Barbara; Mavrodiev, Evgeny V; Sun, Wenjing; Fu, Yuan; Yang, Huanming; Soltis, Douglas E; Graham, Sean W; Soltis, Pamela S; Liu, Xin; Xu, Xun; Wong, Gane Ka-Shu

2018-03-01

Understanding plant evolution and diversity in a phylogenomic context is an enormous challenge due, in part, to limited availability of genome-scale data across phylodiverse species. The 10KP (10,000 Plants) Genome Sequencing Project will sequence and characterize representative genomes from every major clade of embryophytes, green algae, and protists (excluding fungi) within the next 5 years. By implementing and continuously improving leading-edge sequencing technologies and bioinformatics tools, 10KP will catalogue the genome content of plant and protist diversity and make these data freely available as an enduring foundation for future scientific discoveries and applications. 10KP is structured as an international consortium, open to the global community, including botanical gardens, plant research institutes, universities, and private industry. Our immediate goal is to establish a policy framework for this endeavor, the principles of which are outlined here.
10KP: A phylodiverse genome sequencing plan

Science.gov (United States)

Cheng, Shifeng; Melkonian, Michael; Brockington, Samuel; Archibald, John M; Delaux, Pierre-Marc; Melkonian, Barbara; Mavrodiev, Evgeny V; Sun, Wenjing; Fu, Yuan; Yang, Huanming; Soltis, Douglas E; Graham, Sean W; Soltis, Pamela S; Liu, Xin; Xu, Xun

2018-01-01

Abstract Understanding plant evolution and diversity in a phylogenomic context is an enormous challenge due, in part, to limited availability of genome-scale data across phylodiverse species. The 10KP (10,000 Plants) Genome Sequencing Project will sequence and characterize representative genomes from every major clade of embryophytes, green algae, and protists (excluding fungi) within the next 5 years. By implementing and continuously improving leading-edge sequencing technologies and bioinformatics tools, 10KP will catalogue the genome content of plant and protist diversity and make these data freely available as an enduring foundation for future scientific discoveries and applications. 10KP is structured as an international consortium, open to the global community, including botanical gardens, plant research institutes, universities, and private industry. Our immediate goal is to establish a policy framework for this endeavor, the principles of which are outlined here. PMID:29618049
The Applied Development of a Tiered Multilocus Sequence Typing (MLST) Scheme for Dichelobacter nodosus.

Science.gov (United States)

Blanchard, Adam M; Jolley, Keith A; Maiden, Martin C J; Coffey, Tracey J; Maboni, Grazieli; Staley, Ceri E; Bollard, Nicola J; Warry, Andrew; Emes, Richard D; Davies, Peers L; Tötemeyer, Sabine

2018-01-01

Dichelobacter nodosus ( D. nodosus ) is the causative pathogen of ovine footrot, a disease that has a significant welfare and financial impact on the global sheep industry. Previous studies into the phylogenetics of D. nodosus have focused on Australia and Scandinavia, meaning the current diversity in the United Kingdom (U.K.) population and its relationship globally, is poorly understood. Numerous epidemiological methods are available for bacterial typing; however, few account for whole genome diversity or provide the opportunity for future application of new computational techniques. Multilocus sequence typing (MLST) measures nucleotide variations within several loci with slow accumulation of variation to enable the designation of allele numbers to determine a sequence type. The usage of whole genome sequence data enables the application of MLST, but also core and whole genome MLST for higher levels of strain discrimination with a negligible increase in experimental cost. An MLST database was developed alongside a seven loci scheme using publically available whole genome data from the sequence read archive. Sequence type designation and strain discrimination was compared to previously published data to ensure reproducibility. Multiple D. nodosus isolates from U.K. farms were directly compared to populations from other countries. The U.K. isolates define new clades within the global population of D. nodosus and predominantly consist of serogroups A, B and H, however serogroups C, D, E, and I were also found. The scheme is publically available at https://pubmlst.org/dnodosus/.
Rapid Diagnostics of Onboard Sequences

Science.gov (United States)

Starbird, Thomas W.; Morris, John R.; Shams, Khawaja S.; Maimone, Mark W.

2012-01-01

Keeping track of sequences onboard a spacecraft is challenging. When reviewing Event Verification Records (EVRs) of sequence executions on the Mars Exploration Rover (MER), operators often found themselves wondering which version of a named sequence the EVR corresponded to. The lack of this information drastically impacts the operators diagnostic capabilities as well as their situational awareness with respect to the commands the spacecraft has executed, since the EVRs do not provide argument values or explanatory comments. Having this information immediately available can be instrumental in diagnosing critical events and can significantly enhance the overall safety of the spacecraft. This software provides auditing capability that can eliminate that uncertainty while diagnosing critical conditions. Furthermore, the Restful interface provides a simple way for sequencing tools to automatically retrieve binary compiled sequence SCMFs (Space Command Message Files) on demand. It also enables developers to change the underlying database, while maintaining the same interface to the existing applications. The logging capabilities are also beneficial to operators when they are trying to recall how they solved a similar problem many days ago: this software enables automatic recovery of SCMF and RML (Robot Markup Language) sequence files directly from the command EVRs, eliminating the need for people to find and validate the corresponding sequences. To address the lack of auditing capability for sequences onboard a spacecraft during earlier missions, extensive logging support was added on the Mars Science Laboratory (MSL) sequencing server. This server is responsible for generating all MSL binary SCMFs from RML input sequences. The sequencing server logs every SCMF it generates into a MySQL database, as well as the high-level RML file and dictionary name inputs used to create the SCMF. The SCMF is then indexed by a hash value that is automatically included in all command
Internal Transcribed Spacer 1 (ITS1 based sequence typing reveals phylogenetically distinct Ascaris population

Directory of Open Access Journals (Sweden)

Koushik Das

2015-01-01

Full Text Available Taxonomic differentiation among morphologically identical Ascaris species is a debatable scientific issue in the context of Ascariasis epidemiology. To explain the disease epidemiology and also the taxonomic position of different Ascaris species, genome information of infecting strains from endemic areas throughout the world is certainly crucial. Ascaris population from human has been genetically characterized based on the widely used genetic marker, internal transcribed spacer1 (ITS1. Along with previously reported and prevalent genotype G1, 8 new sequence variants of ITS1 have been identified. Genotype G1 was significantly present among female patients aged between 10 to 15 years. Intragenic linkage disequilibrium (LD analysis at target locus within our study population has identified an incomplete LD value with potential recombination events. A separate cluster of Indian isolates with high bootstrap value indicate their distinct phylogenetic position in comparison to the global Ascaris population. Genetic shuffling through recombination could be a possible reason for high population diversity and frequent emergence of new sequence variants, identified in present and other previous studies. This study explores the genetic organization of Indian Ascaris population for the first time which certainly includes some fundamental information on the molecular epidemiology of Ascariasis.
Parallel algorithms for testing finite state machines:Generating UIO sequences

OpenAIRE

Hierons, RM; Turker, UC

2016-01-01

This paper describes an efficient parallel algorithm that uses many-core GPUs for automatically deriving Unique Input Output sequences (UIOs) from Finite State Machines. The proposed algorithm uses the global scope of the GPU's global memory through coalesced memory access and minimises the transfer between CPU and GPU memory. The results of experiments indicate that the proposed method yields considerably better results compared to a single core UIO construction algorithm. Our algorithm is s...

Galaxy LIMS for next-generation sequencing

NARCIS (Netherlands)

Scholtalbers, J.; Rossler, J.; Sorn, P.; Graaf, J. de; Boisguerin, V.; Castle, J.; Sahin, U.

2013-01-01

SUMMARY: We have developed a laboratory information management system (LIMS) for a next-generation sequencing (NGS) laboratory within the existing Galaxy platform. The system provides lab technicians standard and customizable sample information forms, barcoded submission forms, tracking of input
Globalization, the Information Society and New Crimes : the Challenge for the XXI Century / Mondialisation, société de l’information et crimes nouveaux : le défi du 21ème siècle

Directory of Open Access Journals (Sweden)

Viano Emilio

2012-08-01

Full Text Available This paper examines in depth the current phenomenon of globalization and its trends; its impact, positive or negative which is the cornerstone of the current restructuring of the global economy; the nature and consequences of the globalization of trade and financial services; the operations and impact of the multinationals; and the hierarchy of countries based on their relative importance in the global economy, and its consequences. The paper then distinguishes between real and virtual globalization and its impact on economic growth; its inclusive and exclusive dynamics and their consequences for individual and corporate economic actors. The paper then addresses the information society, a phenomenon that accompanies and significantly facilitates globalization through drastic and significant improvements in communications and transport. The Internet and electronic devices used for massive surveillance, the collection of personal information, and the systematic erosion of privacy - the Internet as Panopticon – are also examined.Finally, the paper analyzes the vulnerability of the global information society to crime, especially economic and identity theft crimes, ironically facilitated especially by its global and inter-connected nature. Even the global improvement in the financial conditions of most people worldwide, creating more wealth and economic well being, has negative, and at times criminal, repercussions, impacting especially indigenous or vulnerable populations, fauna and flora. Most illustrative examples of these criminal trends are trafficking in people, endangered species, animal organs and products, antiquities, art and various types of counterfeiting. In conclusion there is a clear and close connection between globalization, the information society, criminal behavior and a society’s ability to effectively protect itself from piracy and the violations of a country’s laws and treaties.Ce document examine en profondeur le ph
Integrating genomic information with protein sequence and 3D atomic level structure at the RCSB protein data bank.

Science.gov (United States)

Prlic, Andreas; Kalro, Tara; Bhattacharya, Roshni; Christie, Cole; Burley, Stephen K; Rose, Peter W

2016-12-15

The Protein Data Bank (PDB) now contains more than 120,000 three-dimensional (3D) structures of biological macromolecules. To allow an interpretation of how PDB data relates to other publicly available annotations, we developed a novel data integration platform that maps 3D structural information across various datasets. This integration bridges from the human genome across protein sequence to 3D structure space. We developed novel software solutions for data management and visualization, while incorporating new libraries for web-based visualization using SVG graphics. The new views are available from http://www.rcsb.org and software is available from https://github.com/rcsb/. andreas.prlic@rcsb.orgSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Measuring global oil trade dependencies: An application of the point-wise mutual information method

International Nuclear Information System (INIS)

Kharrazi, Ali; Fath, Brian D.

2016-01-01

Oil trade is one of the most vital networks in the global economy. In this paper, we analyze the 1998–2012 oil trade networks using the point-wise mutual information (PMI) method and determine the pairwise trade preferences and dependencies. Using examples of the USA's trade partners, this research demonstrates the usefulness of the PMI method as an additional methodological tool to evaluate the outcomes from countries' decisions to engage in preferred trading partners. A positive PMI value indicates trade preference where trade is larger than would be expected. For example, in 2012 the USA imported 2,548.7 kbpd despite an expected 358.5 kbpd of oil from Canada. Conversely, a negative PMI value indicates trade dis-preference where the amount of trade is smaller than what would be expected. For example, the 15-year average of annual PMI between Saudi Arabia and the U.S.A. is −0.130 and between Russia and the USA −1.596. We reflect the three primary reasons of discrepancies between actual and neutral model trade can be related to position, price, and politics. The PMI can quantify the political success or failure of trade preferences and can more accurately account temporal variation of interdependencies. - Highlights: • We analyzed global oil trade networks using the point-wise mutual information method. • We identified position, price, & politics as drivers of oil trade preference. • The PMI method is useful in research on complex trade networks and dependency theory. • A time-series analysis of PMI can track dependencies & evaluate policy decisions.
Quantitative comparison between a multiecho sequence and a single-echo sequence for susceptibility-weighted phase imaging.

Science.gov (United States)

Gilbert, Guillaume; Savard, Geneviève; Bard, Céline; Beaudoin, Gilles

2012-06-01

The aim of this study was to investigate the benefits arising from the use of a multiecho sequence for susceptibility-weighted phase imaging using a quantitative comparison with a standard single-echo acquisition. Four healthy adult volunteers were imaged on a clinical 3-T system using a protocol comprising two different three-dimensional susceptibility-weighted gradient-echo sequences: a standard single-echo sequence and a multiecho sequence. Both sequences were repeated twice in order to evaluate the local noise contribution by a subtraction of the two acquisitions. For the multiecho sequence, the phase information from each echo was independently unwrapped, and the background field contribution was removed using either homodyne filtering or the projection onto dipole fields method. The phase information from all echoes was then combined using a weighted linear regression. R2 maps were also calculated from the multiecho acquisitions. The noise standard deviation in the reconstructed phase images was evaluated for six manually segmented regions of interest (frontal white matter, posterior white matter, globus pallidus, putamen, caudate nucleus and lateral ventricle). The use of the multiecho sequence for susceptibility-weighted phase imaging led to a reduction of the noise standard deviation for all subjects and all regions of interest investigated in comparison to the reference single-echo acquisition. On average, the noise reduction ranged from 18.4% for the globus pallidus to 47.9% for the lateral ventricle. In addition, the amount of noise reduction was found to be strongly inversely correlated to the estimated R2 value (R=-0.92). In conclusion, the use of a multiecho sequence is an effective way to decrease the noise contribution in susceptibility-weighted phase images, while preserving both contrast and acquisition time. The proposed approach additionally permits the calculation of R2 maps. Copyright © 2012 Elsevier Inc. All rights reserved.
A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.

Science.gov (United States)

Luo, Li; Zhu, Yun; Xiong, Momiao

2012-06-01

The genome-wide association studies (GWAS) designed for next-generation sequencing data involve testing association of genomic variants, including common, low frequency, and rare variants. The current strategies for association studies are well developed for identifying association of common variants with the common diseases, but may be ill-suited when large amounts of allelic heterogeneity are present in sequence data. Recently, group tests that analyze their collective frequency differences between cases and controls shift the current variant-by-variant analysis paradigm for GWAS of common variants to the collective test of multiple variants in the association analysis of rare variants. However, group tests ignore differences in genetic effects among SNPs at different genomic locations. As an alternative to group tests, we developed a novel genome-information content-based statistics for testing association of the entire allele frequency spectrum of genomic variation with the diseases. To evaluate the performance of the proposed statistics, we use large-scale simulations based on whole genome low coverage pilot data in the 1000 Genomes Project to calculate the type 1 error rates and power of seven alternative statistics: a genome-information content-based statistic, the generalized T(2), collapsing method, multivariate and collapsing (CMC) method, individual χ(2) test, weighted-sum statistic, and variable threshold statistic. Finally, we apply the seven statistics to published resequencing dataset from ANGPTL3, ANGPTL4, ANGPTL5, and ANGPTL6 genes in the Dallas Heart Study. We report that the genome-information content-based statistic has significantly improved type 1 error rates and higher power than the other six statistics in both simulated and empirical datasets.
Highly accurate sequence imputation enables precise QTL mapping in Brown Swiss cattle.

Science.gov (United States)

Frischknecht, Mirjam; Pausch, Hubert; Bapst, Beat; Signer-Hasler, Heidi; Flury, Christine; Garrick, Dorian; Stricker, Christian; Fries, Ruedi; Gredler-Grandl, Birgit

2017-12-29

Within the last few years a large amount of genomic information has become available in cattle. Densities of genomic information vary from a few thousand variants up to whole genome sequence information. In order to combine genomic information from different sources and infer genotypes for a common set of variants, genotype imputation is required. In this study we evaluated the accuracy of imputation from high density chips to whole genome sequence data in Brown Swiss cattle. Using four popular imputation programs (Beagle, FImpute, Impute2, Minimac) and various compositions of reference panels, the accuracy of the imputed sequence variant genotypes was high and differences between the programs and scenarios were small. We imputed sequence variant genotypes for more than 1600 Brown Swiss bulls and performed genome-wide association studies for milk fat percentage at two stages of lactation. We found one and three quantitative trait loci for early and late lactation fat content, respectively. Known causal variants that were imputed from the sequenced reference panel were among the most significantly associated variants of the genome-wide association study. Our study demonstrates that whole-genome sequence information can be imputed at high accuracy in cattle populations. Using imputed sequence variant genotypes in genome-wide association studies may facilitate causal variant detection.
Inferring influenza global transmission networks without complete phylogenetic information.

Science.gov (United States)

Aris-Brosou, Stéphane

2014-03-01

Influenza is one of the most severe respiratory infections affecting humans throughout the world, yet the dynamics of its global transmission network are still contentious. Here, I describe a novel combination of phylogenetics, time series, and graph theory to analyze 14.25 years of data stratified in space and in time, focusing on the main target of the human immune response, the hemagglutinin gene. While bypassing the complete phylogenetic inference of huge data sets, the method still extracts information suggesting that waves of genetic or of nucleotide diversity circulate continuously around the globe for subtypes that undergo sustained transmission over several seasons, such as H3N2 and pandemic H1N1/09, while diversity of prepandemic H1N1 viruses had until 2009 a noncontinuous transmission pattern consistent with a source/sink model. Irrespective of the shift in the structure of H1N1 diversity circulation with the emergence of the pandemic H1N1/09 strain, US prevalence peaks during the winter months when genetic diversity is at its lowest. This suggests that a dominant strain is generally responsible for epidemics and that monitoring genetic and/or nucleotide diversity in real time could provide public health agencies with an indirect estimate of prevalence.
Enrichment and genome sequence of the group I.1a ammonia-oxidizing Archaeon "Ca. Nitrosotenuis uzonensis" representing a clade globally distributed in thermal habitats.

Directory of Open Access Journals (Sweden)

Elena V Lebedeva

Full Text Available The discovery of ammonia-oxidizing archaea (AOA of the phylum Thaumarchaeota and the high abundance of archaeal ammonia monooxygenase subunit A encoding gene sequences in many environments have extended our perception of nitrifying microbial communities. Moreover, AOA are the only aerobic ammonia oxidizers known to be active in geothermal environments. Molecular data indicate that in many globally distributed terrestrial high-temperature habits a thaumarchaeotal lineage within the Nitrosopumilus cluster (also called "marine" group I.1a thrives, but these microbes have neither been isolated from these systems nor functionally characterized in situ yet. In this study, we report on the enrichment and genomic characterization of a representative of this lineage from a thermal spring in Kamchatka. This thaumarchaeote, provisionally classified as "Candidatus Nitrosotenuis uzonensis", is a moderately thermophilic, non-halophilic, chemolithoautotrophic ammonia oxidizer. The nearly complete genome sequence (assembled into a single scaffold of this AOA confirmed the presence of the typical thaumarchaeotal pathways for ammonia oxidation and carbon fixation, and indicated its ability to produce coenzyme F420 and to chemotactically react to its environment. Interestingly, like members of the genus Nitrosoarchaeum, "Candidatus N. uzonensis" also possesses a putative artubulin-encoding gene. Genome comparisons to related AOA with available genome sequences confirmed that the newly cultured AOA has an average nucleotide identity far below the species threshold and revealed a substantial degree of genomic plasticity with unique genomic regions in "Ca. N. uzonensis", which potentially include genetic determinants of ecological niche differentiation.
Enrichment and genome sequence of the group I.1a ammonia-oxidizing Archaeon "Ca. Nitrosotenuis uzonensis" representing a clade globally distributed in thermal habitats.

Science.gov (United States)

Lebedeva, Elena V; Hatzenpichler, Roland; Pelletier, Eric; Schuster, Nathalie; Hauzmayer, Sandra; Bulaev, Aleksandr; Grigor'eva, Nadezhda V; Galushko, Alexander; Schmid, Markus; Palatinszky, Marton; Le Paslier, Denis; Daims, Holger; Wagner, Michael

2013-01-01

The discovery of ammonia-oxidizing archaea (AOA) of the phylum Thaumarchaeota and the high abundance of archaeal ammonia monooxygenase subunit A encoding gene sequences in many environments have extended our perception of nitrifying microbial communities. Moreover, AOA are the only aerobic ammonia oxidizers known to be active in geothermal environments. Molecular data indicate that in many globally distributed terrestrial high-temperature habits a thaumarchaeotal lineage within the Nitrosopumilus cluster (also called "marine" group I.1a) thrives, but these microbes have neither been isolated from these systems nor functionally characterized in situ yet. In this study, we report on the enrichment and genomic characterization of a representative of this lineage from a thermal spring in Kamchatka. This thaumarchaeote, provisionally classified as "Candidatus Nitrosotenuis uzonensis", is a moderately thermophilic, non-halophilic, chemolithoautotrophic ammonia oxidizer. The nearly complete genome sequence (assembled into a single scaffold) of this AOA confirmed the presence of the typical thaumarchaeotal pathways for ammonia oxidation and carbon fixation, and indicated its ability to produce coenzyme F420 and to chemotactically react to its environment. Interestingly, like members of the genus Nitrosoarchaeum, "Candidatus N. uzonensis" also possesses a putative artubulin-encoding gene. Genome comparisons to related AOA with available genome sequences confirmed that the newly cultured AOA has an average nucleotide identity far below the species threshold and revealed a substantial degree of genomic plasticity with unique genomic regions in "Ca. N. uzonensis", which potentially include genetic determinants of ecological niche differentiation.
Model-free aftershock forecasts constructed from similar sequences in the past

Science.gov (United States)

van der Elst, N.; Page, M. T.

2017-12-01

The basic premise behind aftershock forecasting is that sequences in the future will be similar to those in the past. Forecast models typically use empirically tuned parametric distributions to approximate past sequences, and project those distributions into the future to make a forecast. While parametric models do a good job of describing average outcomes, they are not explicitly designed to capture the full range of variability between sequences, and can suffer from over-tuning of the parameters. In particular, parametric forecasts may produce a high rate of "surprises" - sequences that land outside the forecast range. Here we present a non-parametric forecast method that cuts out the parametric "middleman" between training data and forecast. The method is based on finding past sequences that are similar to the target sequence, and evaluating their outcomes. We quantify similarity as the Poisson probability that the observed event count in a past sequence reflects the same underlying intensity as the observed event count in the target sequence. Event counts are defined in terms of differential magnitude relative to the mainshock. The forecast is then constructed from the distribution of past sequences outcomes, weighted by their similarity. We compare the similarity forecast with the Reasenberg and Jones (RJ95) method, for a set of 2807 global aftershock sequences of M≥6 mainshocks. We implement a sequence-specific RJ95 forecast using a global average prior and Bayesian updating, but do not propagate epistemic uncertainty. The RJ95 forecast is somewhat more precise than the similarity forecast: 90% of observed sequences fall within a factor of two of the median RJ95 forecast value, whereas the fraction is 85% for the similarity forecast. However, the surprise rate is much higher for the RJ95 forecast; 10% of observed sequences fall in the upper 2.5% of the (Poissonian) forecast range. The surprise rate is less than 3% for the similarity forecast. The similarity
A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution.

Science.gov (United States)

Reinharz, Vladimir; Ponty, Yann; Waldispühl, Jérôme

2013-07-01

The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/. Supplementary data are available at Bioinformatics online.
Protecting genomic sequence anonymity with generalization lattices.

Science.gov (United States)

Malin, B A

2005-01-01

Current genomic privacy technologies assume the identity of genomic sequence data is protected if personal information, such as demographics, are obscured, removed, or encrypted. While demographic features can directly compromise an individual's identity, recent research demonstrates such protections are insufficient because sequence data itself is susceptible to re-identification. To counteract this problem, we introduce an algorithm for anonymizing a collection of person-specific DNA sequences. The technique is termed DNA lattice anonymization (DNALA), and is based upon the formal privacy protection schema of k -anonymity. Under this model, it is impossible to observe or learn features that distinguish one genetic sequence from k-1 other entries in a collection. To maximize information retained in protected sequences, we incorporate a concept generalization lattice to learn the distance between two residues in a single nucleotide region. The lattice provides the most similar generalized concept for two residues (e.g. adenine and guanine are both purines). The method is tested and evaluated with several publicly available human population datasets ranging in size from 30 to 400 sequences. Our findings imply the anonymization schema is feasible for the protection of sequences privacy. The DNALA method is the first computational disclosure control technique for general DNA sequences. Given the computational nature of the method, guarantees of anonymity can be formally proven. There is room for improvement and validation, though this research provides the groundwork from which future researchers can construct genomics anonymization schemas tailored to specific datasharing scenarios.
Complete genome sequence of Arcanobacterium haemolyticum type strain (11018T)

Energy Technology Data Exchange (ETDEWEB)

Yasawong, Montri [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Teshima, Hazuki [Los Alamos National Laboratory (LANL); Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Tapia, Roxanne [Los Alamos National Laboratory (LANL); Han, Cliff [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute; Chen, Amy [U.S. Department of Energy, Joint Genome Institute; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Chang, Yun-Juan [ORNL; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL); Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Pukall, Rudiger [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Bristow, James [U.S. Department of Energy, Joint Genome Institute; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany

2010-01-01

Vulcanisaeta distributa Itoh et al. 2002 belongs to the family Thermoproteaceae in the phylum Crenarchaeota. The genus Vulcanisaeta is characterized by a global distribution in hot and acidic springs. This is the first genome sequence from a member of the genus Vulcanisaeta and seventh genome sequence in the family Thermoproteaceae. The 2,374,137 bp long genome with its 2,544 protein-coding and 49 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.
Implicit sequence learning in deaf children with cochlear implants.

Science.gov (United States)

Conway, Christopher M; Pisoni, David B; Anaya, Esperanza M; Karpicke, Jennifer; Henning, Shirley C

2011-01-01

Deaf children with cochlear implants (CIs) represent an intriguing opportunity to study neurocognitive plasticity and reorganization when sound is introduced following a period of auditory deprivation early in development. Although it is common to consider deafness as affecting hearing alone, it may be the case that auditory deprivation leads to more global changes in neurocognitive function. In this paper, we investigate implicit sequence learning abilities in deaf children with CIs using a novel task that measured learning through improvement to immediate serial recall for statistically consistent visual sequences. The results demonstrated two key findings. First, the deaf children with CIs showed disturbances in their visual sequence learning abilities relative to the typically developing normal-hearing children. Second, sequence learning was significantly correlated with a standardized measure of language outcome in the CI children. These findings suggest that a period of auditory deprivation has secondary effects related to general sequencing deficits, and that disturbances in sequence learning may at least partially explain why some deaf children still struggle with language following cochlear implantation. © 2010 Blackwell Publishing Ltd.
A safe an easy method for building consensus HIV sequences from 454 massively parallel sequencing data.

Science.gov (United States)

Fernández-Caballero Rico, Jose Ángel; Chueca Porcuna, Natalia; Álvarez Estévez, Marta; Mosquera Gutiérrez, María Del Mar; Marcos Maeso, María Ángeles; García, Federico

2018-02-01

To show how to generate a consensus sequence from the information of massive parallel sequences data obtained from routine HIV anti-retroviral resistance studies, and that may be suitable for molecular epidemiology studies. Paired Sanger (Trugene-Siemens) and next-generation sequencing (NGS) (454 GSJunior-Roche) HIV RT and protease sequences from 62 patients were studied. NGS consensus sequences were generated using Mesquite, using 10%, 15%, and 20% thresholds. Molecular evolutionary genetics analysis (MEGA) was used for phylogenetic studies. At a 10% threshold, NGS-Sanger sequences from 17/62 patients were phylogenetically related, with a median bootstrap-value of 88% (IQR83.5-95.5). Association increased to 36/62 sequences, median bootstrap 94% (IQR85.5-98)], using a 15% threshold. Maximum association was at the 20% threshold, with 61/62 sequences associated, and a median bootstrap value of 99% (IQR98-100). A safe method is presented to generate consensus sequences from HIV-NGS data at 20% threshold, which will prove useful for molecular epidemiological studies. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
Microarrays for global expression constructed with a low redundancy set of 27,500 sequenced cDNAs representing an array of developmental stages and physiological conditions of the soybean plant

Directory of Open Access Journals (Sweden)

Retzel Ernest

2004-09-01

Full Text Available Abstract Background Microarrays are an important tool with which to examine coordinated gene expression. Soybean (Glycine max is one of the most economically valuable crop species in the world food supply. In order to accelerate both gene discovery as well as hypothesis-driven research in soybean, global expression resources needed to be developed. The applications of microarray for determining patterns of expression in different tissues or during conditional treatments by dual labeling of the mRNAs are unlimited. In addition, discovery of the molecular basis of traits through examination of naturally occurring variation in hundreds of mutant lines could be enhanced by the construction and use of soybean cDNA microarrays. Results We report the construction and analysis of a low redundancy 'unigene' set of 27,513 clones that represent a variety of soybean cDNA libraries made from a wide array of source tissue and organ systems, developmental stages, and stress or pathogen-challenged plants. The set was assembled from the 5' sequence data of the cDNA clones using cluster analysis programs. The selected clones were then physically reracked and sequenced at the 3' end. In order to increase gene discovery from immature cotyledon libraries that contain abundant mRNAs representing storage protein gene families, we utilized a high density filter normalization approach to preferentially select more weakly expressed cDNAs. All 27,513 cDNA inserts were amplified by polymerase chain reaction. The amplified products, along with some repetitively spotted control or 'choice' clones, were used to produce three 9,728-element microarrays that have been used to examine tissue specific gene expression and global expression in mutant isolines. Conclusions Global expression studies will be greatly aided by the availability of the sequence-validated and low redundancy cDNA sets described in this report. These cDNAs and ESTs represent a wide array of developmental
Global comparative analysis of ESTs from the southern cattle tick, Rhipicephalus (Boophilus microplus

Directory of Open Access Journals (Sweden)

Pertea Geo

2007-10-01

Full Text Available Abstract Background The southern cattle tick, Rhipicephalus (Boophilus microplus, is an economically important parasite of cattle and can transmit several pathogenic microorganisms to its cattle host during the feeding process. Understanding the biology and genomics of R. microplus is critical to developing novel methods for controlling these ticks. Results We present a global comparative genomic analysis of a gene index of R. microplus comprised of 13,643 unique transcripts assembled from 42,512 expressed sequence tags (ESTs, a significant fraction of the complement of R. microplus genes. The source material for these ESTs consisted of polyA RNA from various tissues, lifestages, and strains of R. microplus, including larvae exposed to heat, cold, host odor, and acaricide. Functional annotation using RPS-Blast analysis identified conserved protein domains in the conceptually translated gene index and assigned GO terms to those database transcripts which had informative BlastX hits. Blast Score Ratio and SimiTri analysis compared the conceptual transcriptome of the R. microplus database to other eukaryotic proteomes and EST databases, including those from 3 ticks. The most abundant protein domains in BmiGI were also analyzed by SimiTri methodology. Conclusion These results indicate that a large fraction of BmiGI entries have no homologs in other sequenced genomes. Analysis with the PartiGene annotation pipeline showed 64% of the members of BmiGI could not be assigned GO annotation, thus minimal information is available about a significant fraction of the tick genome. This highlights the important insights in tick biology which are likely to result from a tick genome sequencing project. Global comparative analysis identified some tick genes with unexpected phylogenetic relationships which detailed analysis attributed to gene losses in some members of the animal kingdom. Some tick genes were identified which had close orthologues to mammalian genes
Real-time global illumination on mobile device

Science.gov (United States)

Ahn, Minsu; Ha, Inwoo; Lee, Hyong-Euk; Kim, James D. K.

2014-02-01

We propose a novel method for real-time global illumination on mobile devices. Our approach is based on instant radiosity, which uses a sequence of virtual point lights in order to represent the e ect of indirect illumination. Our rendering process consists of three stages. With the primary light, the rst stage generates a local illumination with the shadow map on GPU The second stage of the global illumination uses the re ective shadow map on GPU and generates the sequence of virtual point lights on CPU. Finally, we use the splatting method of Dachsbacher et al 1 and add the indirect illumination to the local illumination on GPU. With the limited computing resources in mobile devices, a small number of virtual point lights are allowed for real-time rendering. Our approach uses the multi-resolution sampling method with 3D geometry and attributes simultaneously and reduce the total number of virtual point lights. We also use the hybrid strategy, which collaboratively combines the CPUs and GPUs available in a mobile SoC due to the limited computing resources in mobile devices. Experimental results demonstrate the global illumination performance of the proposed method.
Absence of auditory 'global interference' in autism.

Science.gov (United States)

Foxton, Jessica M; Stewart, Mary E; Barnard, Louise; Rodgers, Jacqui; Young, Allan H; O'Brien, Gregory; Griffiths, Timothy D

2003-12-01

There has been considerable recent interest in the cognitive style of individuals with Autism Spectrum Disorder (ASD). One theory, that of weak central coherence, concerns an inability to combine stimulus details into a coherent whole. Here we test this theory in the case of sound patterns, using a new definition of the details (local structure) and the coherent whole (global structure). Thirteen individuals with a diagnosis of autism or Asperger's syndrome and 15 control participants were administered auditory tests, where they were required to match local pitch direction changes between two auditory sequences. When the other local features of the sequence pairs were altered (the actual pitches and relative time points of pitch direction change), the control participants obtained lower scores compared with when these details were left unchanged. This can be attributed to interference from the global structure, defined as the combination of the local auditory details. In contrast, the participants with ASD did not obtain lower scores in the presence of such mismatches. This was attributed to the absence of interference from an auditory coherent whole. The results are consistent with the presence of abnormal interactions between local and global auditory perception in ASD.

Spatiotemporal coding of inputs for a system of globally coupled phase oscillators

Science.gov (United States)

Wordsworth, John; Ashwin, Peter

2008-12-01

We investigate the spatiotemporal coding of low amplitude inputs to a simple system of globally coupled phase oscillators with coupling function g(ϕ)=-sin(ϕ+α)+rsin(2ϕ+β) that has robust heteroclinic cycles (slow switching between cluster states). The inputs correspond to detuning of the oscillators. It was recently noted that globally coupled phase oscillators can encode their frequencies in the form of spatiotemporal codes of a sequence of cluster states [P. Ashwin, G. Orosz, J. Wordsworth, and S. Townley, SIAM J. Appl. Dyn. Syst. 6, 728 (2007)]. Concentrating on the case of N=5 oscillators we show in detail how the spatiotemporal coding can be used to resolve all of the information that relates the individual inputs to each other, providing that a long enough time series is considered. We investigate robustness to the addition of noise and find a remarkable stability, especially of the temporal coding, to the addition of noise even for noise of a comparable magnitude to the inputs.
MIPS: a database for genomes and protein sequences.

Science.gov (United States)

Mewes, H W; Frishman, D; Güldener, U; Mannhaupt, G; Mayer, K; Mokrejs, M; Morgenstern, B; Münsterkötter, M; Rudd, S; Weil, B

2002-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz-Netzwerk Bioinformatik) networks. The Arabidospsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91-93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155-158; Barker et al. (2001) Nucleic Acids Res., 29, 29-32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de).
Isolation, sequence identification and tissue expression profile of a ...

African Journals Online (AJOL)

The complete expressed sequence tag (CDS) sequence of Banna mini-pig inbred line (BMI) ribokinase gene (RBKS) was amplified using the reverse transcription-polymerase chain reaction (RT-PCR) based on the conserved sequence information of the cattle or other mammals and known highly homologous swine ESTs.
Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

Directory of Open Access Journals (Sweden)

Maley Carlo C

2008-10-01

Full Text Available Abstract Background Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. Results We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12 genomes. Virtually all possible (> 98% 12 bp oligomers appear in vertebrate genomes while 98% to D. melanogaster (12–17 bp, C. elegans (11–17 bp, A. thaliana (11–17 bp, S. cerevisiae (10–16 bp and E. coli (9–15 bp. Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Conclusion Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden structure that had not been previously recognized. This information may be used to detect
Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

Science.gov (United States)

Liu, Zhandong; Venkatesh, Santosh S; Maley, Carlo C

2008-01-01

Background Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. Results We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12) genomes. Virtually all possible (> 98%) 12 bp oligomers appear in vertebrate genomes while 98% to < 2% of possible oligomers in D. melanogaster (12–17 bp), C. elegans (11–17 bp), A. thaliana (11–17 bp), S. cerevisiae (10–16 bp) and E. coli (9–15 bp). Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Conclusion Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden structure that had not been previously recognized. This information may be used to
Situation models and memory: the effects of temporal and causal information on recall sequence.

Science.gov (United States)

Brownstein, Aaron L; Read, Stephen J

2007-10-01

Participants watched an episode of the television show Cheers on video and then reported free recall. Recall sequence followed the sequence of events in the story; if one concept was observed immediately after another, it was recalled immediately after it. We also made a causal network of the show's story and found that recall sequence followed causal links; effects were recalled immediately after their causes. Recall sequence was more likely to follow causal links than temporal sequence, and most likely to follow causal links that were temporally sequential. Results were similar at 10-minute and 1-week delayed recall. This is the most direct and detailed evidence reported on sequential effects in recall. The causal network also predicted probability of recall; concepts with more links and concepts on the main causal chain were most likely to be recalled. This extends the causal network model to more complex materials than previous research.
MIPS: a database for protein sequences and complete genomes.

Science.gov (United States)

Mewes, H W; Hani, J; Pfeiffer, F; Frishman, D

1998-01-01

The MIPS group [Munich Information Center for Protein Sequences of the German National Center for Environment and Health (GSF)] at the Max-Planck-Institute for Biochemistry, Martinsried near Munich, Germany, is involved in a number of data collection activities, including a comprehensive database of the yeast genome, a database reflecting the progress in sequencing the Arabidopsis thaliana genome, the systematic analysis of other small genomes and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database (described elsewhere in this volume). Through its WWW server (http://www.mips.biochem.mpg.de ) MIPS provides access to a variety of generic databases, including a database of protein families as well as automatically generated data by the systematic application of sequence analysis algorithms. The yeast genome sequence and its related information was also compiled on CD-ROM to provide dynamic interactive access to the 16 chromosomes of the first eukaryotic genome unraveled. PMID:9399795
Storing and managing information artifacts collected by information analysts using a computing device

Science.gov (United States)

Pike, William A; Riensche, Roderick M; Best, Daniel M; Roberts, Ian E; Whyatt, Marie V; Hart, Michelle L; Carr, Norman J; Thomas, James J

2012-09-18

Systems and computer-implemented processes for storage and management of information artifacts collected by information analysts using a computing device. The processes and systems can capture a sequence of interactive operation elements that are performed by the information analyst, who is collecting an information artifact from at least one of the plurality of software applications. The information artifact can then be stored together with the interactive operation elements as a snippet on a memory device, which is operably connected to the processor. The snippet comprises a view from an analysis application, data contained in the view, and the sequence of interactive operation elements stored as a provenance representation comprising operation element class, timestamp, and data object attributes for each interactive operation element in the sequence.
Information technology ethics

DEFF Research Database (Denmark)

Hongladarom, Soraj; Ess, Charles

This book was the first publication to take a genuinely global approach to the diverse ethical issues evoked by Information and Communication Technologies and their possible resolutions. Readers will gain a greater appreciation for the problems and possibilities of genuinely global information...... ethics, which are urgently needed as information and communication technologies continue their exponential growth...
The Flickering Global City

Directory of Open Access Journals (Sweden)

Eric Slater

2015-08-01

Full Text Available This article explores new dimensions of the global city in light of the correlation between hegemonic transition and the prominence of financial centers. It counterposes Braudels historical sequence of dominant cities to extant approaches in the literature, shifting the emphasis from a convergence of form and function to variations in history and structure. The marked increase of finance in the composition of London, New York and Tokyo has paralleled each citys occupation of a distinct niche in world financial markets: London is the principal center of currency exchange, New York is the primary equities market, and Tokyo is the leader in international banking. This division expresses the progression of world-economies since the nineteenth century and unfolds in the context of the present hegemonic transition. By combining world-historical and city-centered approaches, the article seeks to reframe the global city and overcome the limits inherent in the paradigm of globalization.
Stochastic process of pragmatic information for 2D spiral wave turbulence in globally and locally coupled Alief-Panfilov oscillators

Science.gov (United States)

Kuwahara, Jun; Miyata, Hajime; Konno, Hidetoshi

2017-09-01

Recently, complex dynamics of globally coupled oscillators have been attracting many researcher's attentions. In spite of their numerous studies, their features of nonlinear oscillator systems with global and local couplings in two-dimension (2D) are not understood fully. The paper focuses on 2D states of coherent, clustered and chaotic oscillation especially under the effect of negative global coupling (NGC) in 2D Alief-Panfilov model. It is found that the tuning NGC can cause various new coupling-parameter dependency on the features of oscillations. Then quantitative characterization of various states of oscillations (so called spiral wave turbulence) is examined by using the pragmatic information (PI) which have been utilized in analyzing multimode laser, solar activity and neuronal systems. It is demonstrated that the dynamics of the PI for various oscillations can be characterized successfully by the Hyper-Gamma stochastic process.
The vulnerability of being ill informed: the Trans-Pacific Partnership Agreement and Global Public Health.

Science.gov (United States)

Greenberg, Henry; Shiau, Stephanie

2014-09-01

The Trans Pacific Partnership Agreement (TPPA) is a regional trade agreement currently being negotiated by 11 Pacific Rim countries, excluding China. While the negotiations are being conducted under a veil of secrecy, substantive leaks over the past 4 years have revealed a broad view of the proposed contents. As it stands the TPPA poses serious risks to global public health, particularly chronic, non-communicable diseases. At greatest risk are national tobacco regulations, regulations governing the emergence of generic drugs and controls over food imports by transnational corporations. Aside from a small group of public health professionals from Australia, the academic public health community has missed these threats to the global community, although many other health-related entities, international lawyers and health-conscious politicians have voiced serious concerns. As of mid-2014 there has been no comment in the leading public health journals. This large lacuna in interest or recognition reflects the larger problem that the public health education community has all but ignored global non-communicable diseases. Without such a focus, the risks are unseen and the threats not perceived. This cautionary tale of the TPPA reflects the vulnerability of being ill informed of contemporary realities. © The Author 2014. Published by Oxford University Press on behalf of Faculty of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
DIALIGN: multiple DNA and protein sequence alignment at BiBiServ.

OpenAIRE

Morgenstern, Burkhard

2004-01-01

DIALIGN is a widely used software tool for multiple DNA and protein sequence alignment. The program combines local and global alignment features and can therefore be applied to sequence data that cannot be correctly aligned by more traditional approaches. DIALIGN is available online through Bielefeld Bioinformatics Server (BiBiServ). The downloadable version of the program offers several new program features. To compare the output of different alignment programs, we developed the program AltA...
Climate Services Information System Activities in Support of The Global Framework for Climate Services Implementation

Science.gov (United States)

Timofeyeva-Livezey, M. M.; Horsfall, F. M. C.; Pulwarty, R. S.; Klein-Tank, A.; Kolli, R. K.; Hechler, P.; Dilley, M.; Ceron, J. P.; Goodess, C.

2017-12-01

The WMO Commission on Climatology (CCl) supports the implementation of the Global Framework for Climate Services (GFCS) with a particular focus on the Climate Services Information System (CSIS), which is the core operational component of GFCS at the global, regional, and national level. CSIS is designed for producing, packaging and operationally delivering authoritative climate information data and products through appropriate operational systems, practices, data exchange, technical standards, authentication, communication, and product delivery. Its functions include climate analysis and monitoring, assessment and attribution, prediction (monthly, seasonal, decadal), and projection (centennial scale) as well as tailoring the associated products tUEAo suit user requirements. A central, enabling piece of implementation of CSIS is a Climate Services Toolkit (CST). In its development phase, CST exists as a prototype (www.wmo.int/cst) as a compilation of tools for generating tailored data and products for decision-making, with a special focus on national requirements in developing countries. WMO provides a server to house the CST prototype as well as support operations and maintenance. WMO members provide technical expertise and other in-kind support, including leadership of the CSIS development team. Several recent WMO events have helped with the deployment of CST within the eight countries that have been recognized by GFCS as illustrative for developing their climate services at national levels. Currently these countries are developing climate services projects focusing service development and delivery for selected economic sectors, such as for health, agriculture, energy, water resources, and hydrometeorological disaster risk reduction. These countries are working together with their respective WMO Regional Climate Centers (RCCs), which provide technical assistance with implementation of climate services projects at the country level and facilitate development of
Global Food Security Index Studies and Satellite Information

Science.gov (United States)

Medina, T. A.; Ganti-Agrawal, S.; Joshi, D.; Lakhankar, T.

2017-12-01

Food yield is equal to the total crop harvest per unit cultivated area. During the elapsed time of germination and frequent harvesting, both human and climate related effects determine a country's' contribution towards global food security. Each country across the globe's annual income per capita was collected to then determine nine countries for further studies. For a location to be chosen, its income per capita needed to be considered poor, uprising or wealthy. Both physical land cover and regional climate helped categorize potential parameters thought to be studied. Once selected, Normalized Difference Vegetation Index (NDVI) data was collected for Ethiopia, Liberia, Indonesia, United States, Norway, Russia, Kuwait and Saudi Arabia over the recent 16 years for approximately every 16 days starting from early in the year 2000. Software languages such as Geographic Information System (GIS), MatLab and Excel were used to determine how population size, income and deforestation directly determines agricultural yields. Because of high maintenance requirements for large harvests when forest areas are cleared, they often have a reduction in soil quality, requiring fertilizer use to produce sufficient crop yields. Total area and vegetation index of each country is to be studied, to determine crop and deforestation percentages. To determine how deforestation impacts future income and crop yield predictions of each country studied. By using NDVI results a parameter is to be potentially found that will help define an index, to create an equation that will determine a country's annual income and ability to provide for their families and themselves.
Automatic Exchange of Information as the new global standard: the end of (offshore tax evasion) history?

OpenAIRE

Meinzer, Markus

2017-01-01

Automatic exchange of information (AEoI) for tax purposes has become the global standard for international tax cooperation in 2013. As a tool for containing offshore tax evasion, it has encountered opposition in the past and continues to be fraught with challenges. This paper recapitulates the rationale for AEoI, including estimates on the magnitudes of assets held offshore, with a specific focus on Turkish assets held in Germany (chapter 1). Subsequently, chapter 2 summarises the recent hist...
INFORMATION TECHNOLOGY CAPACITIES AND THE BORN GLOBAL COMPANIES DOI: 10.5585/riae.v8i1.1629

Directory of Open Access Journals (Sweden)

María Soledad Etchebarne López

2009-08-01

Full Text Available International business transactions have increased in recent years, triggered by the opening up of markets and the development of information technologies and communications, especially in the cases of small and medium enterprises. In this context, a new type of company is emerging: the so-called 'born global' companies. These organizations have an international profile since their birth, and have not gone through the traditional evolutionary pattern of internationalization. This article describes the theories that explain this new phenomenon, the main features of these new companies and their relationship with information technology (IT, which allows them to generate competitive advantages when entering the international market.
Facilitating genome navigation : survey sequencing and dense radiation-hybrid gene mapping

NARCIS (Netherlands)

Hitte, C; Madeoy, J; Kirkness, EF; Priat, C; Lorentzen, TD; Senger, F; Thomas, D; Derrien, T; Ramirez, C; Scott, C; Evanno, G; Pullar, B; Cadieu, E; Oza, [No Value; Lourgant, K; Jaffe, DB; Tacher, S; Dreano, S; Berkova, N; Andre, C; Deloukas, P; Fraser, C; Lindblad-Toh, K; Ostrander, EA; Galibert, F

Accurate and comprehensive sequence coverage for large genomes has been restricted to only a few species of specific interest. Lower sequence coverage (survey sequencing) of related species can yield a wealth of information about gene content and putative regulatory elements. But survey sequences
The Global Genome Biodiversity Network (GGBN) Data Standard specification

Science.gov (United States)

Droege, G.; Barker, K.; Seberg, O.; Coddington, J.; Benson, E.; Berendsohn, W. G.; Bunk, B.; Butler, C.; Cawsey, E. M.; Deck, J.; Döring, M.; Flemons, P.; Gemeinholzer, B.; Güntsch, A.; Hollowell, T.; Kelbert, P.; Kostadinov, I.; Kottmann, R.; Lawlor, R. T.; Lyal, C.; Mackenzie-Dodds, J.; Meyer, C.; Mulcahy, D.; Nussbeck, S. Y.; O'Tuama, É.; Orrell, T.; Petersen, G.; Robertson, T.; Söhngen, C.; Whitacre, J.; Wieczorek, J.; Yilmaz, P.; Zetzsche, H.; Zhang, Y.; Zhou, X.

2016-01-01

Genomic samples of non-model organisms are becoming increasingly important in a broad range of studies from developmental biology, biodiversity analyses, to conservation. Genomic sample definition, description, quality, voucher information and metadata all need to be digitized and disseminated across scientific communities. This information needs to be concise and consistent in today’s ever-increasing bioinformatic era, for complementary data aggregators to easily map databases to one another. In order to facilitate exchange of information on genomic samples and their derived data, the Global Genome Biodiversity Network (GGBN) Data Standard is intended to provide a platform based on a documented agreement to promote the efficient sharing and usage of genomic sample material and associated specimen information in a consistent way. The new data standard presented here build upon existing standards commonly used within the community extending them with the capability to exchange data on tissue, environmental and DNA sample as well as sequences. The GGBN Data Standard will reveal and democratize the hidden contents of biodiversity biobanks, for the convenience of everyone in the wider biobanking community. Technical tools exist for data providers to easily map their databases to the standard. Database URL: http://terms.tdwg.org/wiki/GGBN_Data_Standard PMID:27694206
Insights into the phylogeny of Northern Hemisphere Armillaria: Neighbor-net and Bayesian analyses of translation elongation factor 1-α gene sequences.

Science.gov (United States)

Klopfenstein, Ned B; Stewart, Jane E; Ota, Yuko; Hanna, John W; Richardson, Bryce A; Ross-Davis, Amy L; Elías-Román, Rubén D; Korhonen, Kari; Keča, Nenad; Iturritxa, Eugenia; Alvarado-Rosales, Dionicio; Solheim, Halvor; Brazee, Nicholas J; Łakomy, Piotr; Cleary, Michelle R; Hasegawa, Eri; Kikuchi, Taisei; Garza-Ocañas, Fortunato; Tsopelas, Panaghiotis; Rigling, Daniel; Prospero, Simone; Tsykun, Tetyana; Bérubé, Jean A; Stefani, Franck O P; Jafarpour, Saeideh; Antonín, Vladimír; Tomšovský, Michal; McDonald, Geral I; Woodward, Stephen; Kim, Mee-Sook

2017-01-01

Armillaria possesses several intriguing characteristics that have inspired wide interest in understanding phylogenetic relationships within and among species of this genus. Nuclear ribosomal DNA sequence-based analyses of Armillaria provide only limited information for phylogenetic studies among widely divergent taxa. More recent studies have shown that translation elongation factor 1-α (tef1) sequences are highly informative for phylogenetic analysis of Armillaria species within diverse global regions. This study used Neighbor-net and coalescence-based Bayesian analyses to examine phylogenetic relationships of newly determined and existing tef1 sequences derived from diverse Armillaria species from across the Northern Hemisphere, with Southern Hemisphere Armillaria species included for reference. Based on the Bayesian analysis of tef1 sequences, Armillaria species from the Northern Hemisphere are generally contained within the following four superclades, which are named according to the specific epithet of the most frequently cited species within the superclade: (i) Socialis/Tabescens (exannulate) superclade including Eurasian A. ectypa, North American A. socialis (A. tabescens), and Eurasian A. socialis (A. tabescens) clades; (ii) Mellea superclade including undescribed annulate North American Armillaria sp. (Mexico) and four separate clades of A. mellea (Europe and Iran, eastern Asia, and two groups from North America); (iii) Gallica superclade including Armillaria Nag E (Japan), multiple clades of A. gallica (Asia and Europe), A. calvescens (eastern North America), A. cepistipes (North America), A. altimontana (western USA), A. nabsnona (North America and Japan), and at least two A. gallica clades (North America); and (iv) Solidipes/Ostoyae superclade including two A. solidipes/ostoyae clades (North America), A. gemina (eastern USA), A. solidipes/ostoyae (Eurasia), A. cepistipes (Europe and Japan), A. sinapina (North America and Japan), and A. borealis

Genome sequence of Stachybotrys chartarum Strain 51-11

Science.gov (United States)

Stachybotrys chartarum strain 51-11 genome was sequenced by shotgun sequencing utilizing Illumina Hiseq 2000 and PacBio long read technology. Since Stachybotrys chartarum has been implicated in health impacts within water-damaged buildings, any information extracted from the geno...
Sequencing and comparing whole mitochondrial genomes ofanimals

Energy Technology Data Exchange (ETDEWEB)

Boore, Jeffrey L.; Macey, J. Robert; Medina, Monica

2005-04-22

Comparing complete animal mitochondrial genome sequences is becoming increasingly common for phylogenetic reconstruction and as a model for genome evolution. Not only are they much more informative than shorter sequences of individual genes for inferring evolutionary relatedness, but these data also provide sets of genome-level characters, such as the relative arrangements of genes, that can be especially powerful. We describe here the protocols commonly used for physically isolating mtDNA, for amplifying these by PCR or RCA, for cloning,sequencing, assembly, validation, and gene annotation, and for comparing both sequences and gene arrangements. On several topics, we offer general observations based on our experiences to date with determining and comparing complete mtDNA sequences.
Next-generation sequencing-based user-friendly platforms for drug-resistant tuberculosis diagnosis: A promise for the near future

Directory of Open Access Journals (Sweden)

David L Dolinger

2016-01-01

Full Text Available Since 2002, there has been a gradual worldwide 1.3% annual decrease in the incidence of tuberculosis (TB. This is an encouraging statistic; however, it will not achieve the World Health Organization's goal of eliminating TB by 2050, and it is being compounded by the persistent global incidence of drug-resistant tuberculosis (DR-TB acquired by transmission and by treatment pressure. One key to effectively control tuberculosis and the spread of multiresistant strains is accurate information pertaining to drug resistance and susceptibility. Next-generation sequencing (NGS has the potential to effectively change global health and the management of TB. Industry has focused primarily on using NGS for oncology diagnostics and human genomics, but the area in which NGS can rapidly impact health care is in the area of infectious disease diagnostics in low- and middle-income countries. To date, there has been a failure as a community to capitalize on the potential of NGS, especially at the reference laboratory level where it can provide actionable information pertaining to treatment options for patients. The rapid evolution of knowledge about the genetic foundations of tuberculosis drug resistance makes sequencing a versatile technology platform for providing rapid, accurate, and actionable results for treating this disease. No “plug-and-play” and “end-to-end” NGS solutions exist that provide clinically relevant sequence data from the Mycobacterium tuberculosis complex genome from primary clinical samples (e.g., sputum in high-burden country reference laboratories, which is where they are most needed. However, such a system-based solution is underdeveloped by Foundation for Innovative Diagnostics (FIND, in collaboration with partners from academia, nongovernmental organizations, and industry. The solution is modular and is designed and developed to perform targeted amplicon sequencing directly from a patient's primary sputum sample. This solution
Optimal rotation sequences for active perception

Science.gov (United States)

Nakath, David; Rachuy, Carsten; Clemens, Joachim; Schill, Kerstin

2016-05-01

One major objective of autonomous systems navigating in dynamic environments is gathering information needed for self localization, decision making, and path planning. To account for this, such systems are usually equipped with multiple types of sensors. As these sensors often have a limited field of view and a fixed orientation, the task of active perception breaks down to the problem of calculating alignment sequences which maximize the information gain regarding expected measurements. Action sequences that rotate the system according to the calculated optimal patterns then have to be generated. In this paper we present an approach for calculating these sequences for an autonomous system equipped with multiple sensors. We use a particle filter for multi- sensor fusion and state estimation. The planning task is modeled as a Markov decision process (MDP), where the system decides in each step, what actions to perform next. The optimal control policy, which provides the best action depending on the current estimated state, maximizes the expected cumulative reward. The latter is computed from the expected information gain of all sensors over time using value iteration. The algorithm is applied to a manifold representation of the joint space of rotation and time. We show the performance of the approach in a spacecraft navigation scenario where the information gain is changing over time, caused by the dynamic environment and the continuous movement of the spacecraft
PINGU: PredIction of eNzyme catalytic residues usinG seqUence information.

Directory of Open Access Journals (Sweden)

Priyadarshini P Pai

Full Text Available Identification of catalytic residues can help unveil interesting attributes of enzyme function for various therapeutic and industrial applications. Based on their biochemical roles, the number of catalytic residues and sequence lengths of enzymes vary. This article describes a prediction approach (PINGU for such a scenario. It uses models trained using physicochemical properties and evolutionary information of 650 non-redundant enzymes (2136 catalytic residues in a support vector machines architecture. Independent testing on 200 non-redundant enzymes (683 catalytic residues in predefined prediction settings, i.e., with non-catalytic per catalytic residue ranging from 1 to 30, suggested that the prediction approach was highly sensitive and specific, i.e., 80% or above, over the incremental challenges. To learn more about the discriminatory power of PINGU in real scenarios, where the prediction challenge is variable and susceptible to high false positives, the best model from independent testing was used on 60 diverse enzymes. Results suggested that PINGU was able to identify most catalytic residues and non-catalytic residues properly with 80% or above accuracy, sensitivity and specificity. The effect of false positives on precision was addressed in this study by application of predicted ligand-binding residue information as a post-processing filter. An overall improvement of 20% in F-measure and 0.138 in Correlation Coefficient with 16% enhanced precision could be achieved. On account of its encouraging performance, PINGU is hoped to have eventual applications in boosting enzyme engineering and novel drug discovery.
AIM satellite-based research bridges the unique scientific aspects of the mission to informal education programs globally

Science.gov (United States)

Robinson, D.; Maggi, B.

2003-04-01

The Education and Public Outreach (EPO) component of the satellite-based research mission "Aeronomy of Ice In the Mesosphere" (AIM) will bridge the unique scientific aspects of the mission to informal education organizations. The informal education materials developed by the EPO will utilize AIM data and educate the public about the environmental implications associated with the data. This will assist with creating a scientifically literate workforce and in developing a citizenry capable of making educated decisions related to environmental policies and laws. The objective of the AIM mission is to understand the mechanisms that cause Polar Mesospheric Clouds (PMCs) to form, how their presence affects the atmosphere, and how change in the atmosphere affects them. PMCs are sometimes known as Noctilucent Clouds (NLCs) because of their visibility during the night from appropriate locations. The phenomenon of PMCs is an observable indicator of global change, a concern to all citizens. Recent sightings of these clouds over populated regions have compelled AIM educators to expand informal education opportunities to communities worldwide. Collaborations with informal organizations include: Museums/Science Centers; NASA Sun-Earth Connection Forum; Alaska Native Ways of Knowing Project; Amateur Noctilucent Cloud Observers Organization; National Parks Education Programs; After School Science Clubs; Public Broadcasting Associations; and National Public Radio. The Native Ways of Knowing Project is an excellent example of informal collaboration with the AIM EPO. This Alaska based project will assist native peoples of the state with photographing NLCs for the EPO website. It will also aid the EPO with developing materials for informal organizations that incorporate traditional native knowledge and science, related to the sky. Another AIM collaboration that will offer citizens lasting informal education opportunities is the one established with the United States National Parks
Core Genome Multilocus Sequence Typing for Identification of Globally Distributed Clonal Groups and Differentiation of Outbreak Strains of Listeria monocytogenes.

Science.gov (United States)

Chen, Yi; Gonzalez-Escalona, Narjol; Hammack, Thomas S; Allard, Marc W; Strain, Errol A; Brown, Eric W

2016-10-15

Many listeriosis outbreaks are caused by a few globally distributed clonal groups, designated clonal complexes or epidemic clones, of Listeria monocytogenes, several of which have been defined by classic multilocus sequence typing (MLST) schemes targeting 6 to 8 housekeeping or virulence genes. We have developed and evaluated core genome MLST (cgMLST) schemes and applied them to isolates from multiple clonal groups, including those associated with 39 listeriosis outbreaks. The cgMLST clusters were congruent with MLST-defined clonal groups, which had various degrees of diversity at the whole-genome level. Notably, cgMLST could distinguish among outbreak strains and epidemiologically unrelated strains of the same clonal group, which could not be achieved using classic MLST schemes. The precise selection of cgMLST gene targets may not be critical for the general identification of clonal groups and outbreak strains. cgMLST analyses further identified outbreak strains, including those associated with recent outbreaks linked to contaminated French-style cheese, Hispanic-style cheese, stone fruit, caramel apple, ice cream, and packaged leafy green salad, as belonging to major clonal groups. We further developed lineage-specific cgMLST schemes, which can include accessory genes when core genomes do not possess sufficient diversity, and this provided additional resolution over species-specific cgMLST. Analyses of isolates from different common-source listeriosis outbreaks revealed various degrees of diversity, indicating that the numbers of allelic differences should always be combined with cgMLST clustering and epidemiological evidence to define a listeriosis outbreak. Classic multilocus sequence typing (MLST) schemes targeting internal fragments of 6 to 8 genes that define clonal complexes or epidemic clones have been widely employed to study L. monocytogenes biodiversity and its relation to pathogenicity potential and epidemiology. We demonstrated that core genome MLST
TurboFold: Iterative probabilistic estimation of secondary structures for multiple RNA sequences

Directory of Open Access Journals (Sweden)

Sharma Gaurav

2011-04-01

Full Text Available Abstract Background The prediction of secondary structure, i.e. the set of canonical base pairs between nucleotides, is a first step in developing an understanding of the function of an RNA sequence. The most accurate computational methods predict conserved structures for a set of homologous RNA sequences. These methods usually suffer from high computational complexity. In this paper, TurboFold, a novel and efficient method for secondary structure prediction for multiple RNA sequences, is presented. Results TurboFold takes, as input, a set of homologous RNA sequences and outputs estimates of the base pairing probabilities for each sequence. The base pairing probabilities for a sequence are estimated by combining intrinsic information, derived from the sequence itself via the nearest neighbor thermodynamic model, with extrinsic information, derived from the other sequences in the input set. For a given sequence, the extrinsic information is computed by using pairwise-sequence-alignment-based probabilities for co-incidence with each of the other sequences, along with estimated base pairing probabilities, from the previous iteration, for the other sequences. The extrinsic information is introduced as free energy modifications for base pairing in a partition function computation based on the nearest neighbor thermodynamic model. This process yields updated estimates of base pairing probability. The updated base pairing probabilities in turn are used to recompute extrinsic information, resulting in the overall iterative estimation procedure that defines TurboFold. TurboFold is benchmarked on a number of ncRNA datasets and compared against alternative secondary structure prediction methods. The iterative procedure in TurboFold is shown to improve estimates of base pairing probability with each iteration, though only small gains are obtained beyond three iterations. Secondary structures composed of base pairs with estimated probabilities higher than a
Acceptability of, and Information Needs Regarding, Next-Generation Sequencing in People Tested for Hereditary Cancer: A Qualitative Study.

Science.gov (United States)

Meiser, Bettina; Storey, Ben; Quinn, Veronica; Rahman, Belinda; Andrews, Lesley

2016-04-01

Next generation sequencing (NGS) for patients at risk of hereditary cancer syndromes can also identify non-cancer related mutations, as well as variants of unknown significance. This study aimed to determine what benefits and shortcomings patients perceive in relation to NGS, as well as their interest and information preferences in regards to such testing. Eligible patients had previously received inconclusive results from clinical mutation testing for cancer susceptibility. Semi-structured telephone interviews were subjected to qualitative analysis guided by the approach developed by Miles and Huberman. The majority of the 19 participants reported they would be interested in panel/genomic testing. Advantages identified included that it would enable better preparation and allow implementation of individualized preventative strategies, with few disadvantages mentioned. Almost all participants said they would want all results, not just those related to their previous diagnosis. Participants felt that a face-to-face discussion supplemented by an information booklet would be the best way to convey information and achieve informed consent. All participants wanted their information stored and reviewed in accordance with new developments. Although the findings indicate strong interest among these individuals, it seems that the consent process, and the interpretation and communication of results will be areas that will require revision to meet the needs of patients.
Globalization – Chances or Risks

Directory of Open Access Journals (Sweden)

MĂDĂLINA ANTOANETA RĂDOI

2015-05-01

Full Text Available There are for and against arguments as regards the process of globalization. But what is globalization: a concept, a reality or a state as such? We can consider that globalization reflects the natural continuity of a process that appeared a long time ago and that has evolved ever since or a new phenomenon that was generated by the speed with which new technology and information flow. Milton Friedman, a fervent supporter of globalization, gives an answer to the question “what is globalization”; according to him, “globalization is not a simple tendency or phantasy but rather an international system. It is the new system that has replaced the Cold War system and that, like the former one, has its own laws and logic, being able to directly or indirectly influence today’s politics, the environment, geopolitics and the economy of every country in the world.” (Friedman, 2000. Globalization represents: the unlimited ascend of technology, the free flow of information, the annihilation of territorial limits, the uniformity of economy, the free flow of capital, the mobility of the person, as well as a political form of organization that aims at a future global government.
Perception Enhancement using Visual Attributes in Sequence Motif Visualization

OpenAIRE

Oon, Yin; Lee, Nung; Kok, Wei

2016-01-01

Sequence logo is a well-accepted scientific method to visualize the conservation characteristics of biological sequence motifs. Previous studies found that using sequence logo graphical representation for scientific evidence reports or arguments could seriously cause biases and misinterpretation by users. This study investigates on the visual attributes performance of a sequence logo in helping users to perceive and interpret the information based on preattentive theories and Gestalt principl...
Voice over Internet Protocol (VoIP) Technology as a Global Learning Tool: Information Systems Success and Control Belief Perspectives

Science.gov (United States)

Chen, Charlie C.; Vannoy, Sandra

2013-01-01

Voice over Internet Protocol- (VoIP) enabled online learning service providers struggling with high attrition rates and low customer loyalty issues despite VoIP's high degree of system fit for online global learning applications. Effective solutions to this prevalent problem rely on the understanding of system quality, information quality, and…
Sustainable Mobility: Using a Global Energy Model to Inform Vehicle Technology Choices in a Decarbonized Economy

Directory of Open Access Journals (Sweden)

Timothy Wallington

2013-04-01

Full Text Available The reduction of CO2 emissions associated with vehicle use is an important element of a global transition to sustainable mobility and is a major long-term challenge for society. Vehicle and fuel technologies are part of a global energy system, and assessing the impact of the availability of clean energy technologies and advanced vehicle technologies on sustainable mobility is a complex task. The global energy transition (GET model accounts for interactions between the different energy sectors, and we illustrate its use to inform vehicle technology choices in a decarbonizing economy. The aim of this study is to assess how uncertainties in future vehicle technology cost, as well as how developments in other energy sectors, affect cost-effective fuel and vehicle technology choices. Given the uncertainties in future costs and efficiencies for light-duty vehicle and fuel technologies, there is no clear fuel/vehicle technology winner that can be discerned at the present time. We conclude that a portfolio approach with research and development of multiple fuel and vehicle technology pathways is the best way forward to achieve the desired result of affordable and sustainable personal mobility. The practical ramifications of this analysis are illustrated in the portfolio approach to providing sustainable mobility adopted by the Ford Motor Company.
A Unified Theoretical Framework for Cognitive Sequencing.

Science.gov (United States)

Savalia, Tejas; Shukla, Anuj; Bapi, Raju S

2016-01-01

The capacity to sequence information is central to human performance. Sequencing ability forms the foundation stone for higher order cognition related to language and goal-directed planning. Information related to the order of items, their timing, chunking and hierarchical organization are important aspects in sequencing. Past research on sequencing has emphasized two distinct and independent dichotomies: implicit vs. explicit and goal-directed vs. habits. We propose a theoretical framework unifying these two streams. Our proposal relies on brain's ability to implicitly extract statistical regularities from the stream of stimuli and with attentional engagement organizing sequences explicitly and hierarchically. Similarly, sequences that need to be assembled purposively to accomplish a goal require engagement of attentional processes. With repetition, these goal-directed plans become habits with concomitant disengagement of attention. Thus, attention and awareness play a crucial role in the implicit-to-explicit transition as well as in how goal-directed plans become automatic habits. Cortico-subcortical loops basal ganglia-frontal cortex and hippocampus-frontal cortex loops mediate the transition process. We show how the computational principles of model-free and model-based learning paradigms, along with a pivotal role for attention and awareness, offer a unifying framework for these two dichotomies. Based on this framework, we make testable predictions related to the potential influence of response-to-stimulus interval (RSI) on developing awareness in implicit learning tasks.
A Unified Theoretical Framework for Cognitive Sequencing

Directory of Open Access Journals (Sweden)

Tejas Savalia

2016-11-01

Full Text Available The capacity to sequence information is central to human performance. Sequencing ability forms the foundation stone for higher order cognition related to language and goal-directed planning. Information related to the order of items, their timing, chunking and hierarchical organization are important aspects in sequencing. Past research on sequencing has emphasized two distinct and independent dichotomies: implicit versus explicit and goal-directed versus habits. We propose a theoretical framework unifying these two streams. Our proposal relies on brain's ability to implicitly extract statistical regularities from the stream of stimuli and with attentional engagement organizing sequences explicitly and hierarchically. Similarly, sequences that need to be assembled purposively to accomplish a goal require engagement of attentional processes. With repetition, these goal-directed plans become habits with concomitant disengagement of attention. Thus attention and awareness play a crucial role in the implicit-to-explicit transition as well as in how goal-directed plans become automatic habits. Cortico-subcortical loops ─ basal ganglia-frontal cortex and hippocampus-frontal cortex loops ─ mediate the transition process. We show how the computational principles of model-free and model-based learning paradigms, along with a pivotal role for attention and awareness, offer a unifying framework for these two dichotomies. Based on this framework, we make testable predictions related to the potential influence of response-to-stimulus interval (RSI on developing awareness in implicit learning tasks.
A dated molecular phylogeny of manta and devil rays (Mobulidae) based on mitogenome and nuclear sequences

NARCIS (Netherlands)

Poortvliet, Marloes; Olsen, Jeanine; Croll, Donald A.; Bernardi, Giacomo; Newton, Kelly; Kollias, Spyros; O'Sullivan, John; Fernando, Daniel; Stevens, Guy; Galván Magaña, Felipe; Seret, Bernard; Wintner, Sabine; Hoarau, Galice

Manta and devil rays are an iconic group of globally distributed pelagic filter feeders, yet their evolutionary history remains enigmatic. We employed next generation sequencing of mitogenomes for nine of the 11 recognized species and two outgroups; as well as additional Sanger sequencing of two
Statistical approaches to use a model organism for regulatory sequences annotation of newly sequenced species.

Directory of Open Access Journals (Sweden)

Pietro Liò

Full Text Available A major goal of bioinformatics is the characterization of transcription factors and the transcriptional programs they regulate. Given the speed of genome sequencing, we would like to quickly annotate regulatory sequences in newly-sequenced genomes. In such cases, it would be helpful to predict sequence motifs by using experimental data from closely related model organism. Here we present a general algorithm that allow to identify transcription factor binding sites in one newly sequenced species by performing Bayesian regression on the annotated species. First we set the rationale of our method by applying it within the same species, then we extend it to use data available in closely related species. Finally, we generalise the method to handle the case when a certain number of experiments, from several species close to the species on which to make inference, are available. In order to show the performance of the method, we analyse three functionally related networks in the Ascomycota. Two gene network case studies are related to the G2/M phase of the Ascomycota cell cycle; the third is related to morphogenesis. We also compared the method with MatrixReduce and discuss other types of validation and tests. The first network is well known and provides a biological validation test of the method. The two cell cycle case studies, where the gene network size is conserved, demonstrate an effective utility in annotating new species sequences using all the available replicas from model species. The third case, where the gene network size varies among species, shows that the combination of information is less powerful but is still informative. Our methodology is quite general and could be extended to integrate other high-throughput data from model organisms.
RNA sequencing: current and prospective uses in metabolic research.

Science.gov (United States)

Vikman, Petter; Fadista, Joao; Oskolkov, Nikolay

2014-10-01

Previous global RNA analysis was restricted to known transcripts in species with a defined transcriptome. Next generation sequencing has transformed transcriptomics by making it possible to analyse expressed genes with an exon level resolution from any tissue in any species without any a priori knowledge of which genes that are being expressed, splice patterns or their nucleotide sequence. In addition, RNA sequencing is a more sensitive technique compared with microarrays with a larger dynamic range, and it also allows for investigation of imprinting and allele-specific expression. This can be done for a cost that is able to compete with that of a microarray, making RNA sequencing a technique available to most researchers. Therefore RNA sequencing has recently become the state of the art with regards to large-scale RNA investigations and has to a large extent replaced microarrays. The only drawback is the large data amounts produced, which together with the complexity of the data can make a researcher spend far more time on analysis than performing the actual experiment. © 2014 Society for Endocrinology.
Detection of Emerging Vaccine-Related Polioviruses by Deep Sequencing.

Science.gov (United States)

Sahoo, Malaya K; Holubar, Marisa; Huang, ChunHong; Mohamed-Hadley, Alisha; Liu, Yuanyuan; Waggoner, Jesse J; Troy, Stephanie B; Garcia-Garcia, Lourdes; Ferreyra-Reyes, Leticia; Maldonado, Yvonne; Pinsky, Benjamin A

2017-07-01

Oral poliovirus vaccine can mutate to regain neurovirulence. To date, evaluation of these mutations has been performed primarily on culture-enriched isolates by using conventional Sanger sequencing. We therefore developed a culture-independent, deep-sequencing method targeting the 5' untranslated region (UTR) and P1 genomic region to characterize vaccine-related poliovirus variants. Error analysis of the deep-sequencing method demonstrated reliable detection of poliovirus mutations at levels of vaccinated, asymptomatic children and their close contacts collected during a prospective cohort study in Veracruz, Mexico, revealed no vaccine-derived polioviruses. This was expected given that the longest duration between sequenced sample collection and the end of the most recent national immunization week was 66 days. However, we identified many low-level variants (Sabin serotypes, as well as vaccine-related viruses with multiple canonical mutations associated with phenotypic reversion present at high levels (>90%). These results suggest that monitoring emerging vaccine-related poliovirus variants by deep sequencing may aid in the poliovirus endgame and efforts to ensure global polio eradication. Copyright © 2017 Sahoo et al.
Sequence stratigraphy as a scientific enterprise: the evolution and persistence of conflicting paradigms

Science.gov (United States)

Miall, Andrew D.; Miall, Charlene E.

2001-08-01

In the 1970s, seismic stratigraphy represented a new paradigm in geological thought. The development of new techniques for analyzing seismic-reflection data constituted a "crisis," as conceptualized by T.S. Kuhn, and stimulated a revolution in stratigraphy. We analyze here a specific subset of the new ideas, that pertaining to the concept of global-eustasy and the global cycle chart published by Vail et al. [Vail, P.R., Mitchum, R.M., Jr., Todd, R.G., Widmier, J.M., Thompson, S., III, Sangree, J.B., Bubb, J.N., Hatlelid, W.G., 1977. Seismic stratigraphy and global changes of sea-level. In: Payton, C.E. (Ed.), Seismic Stratigraphy—Applications to Hydrocarbon Exploration, Am. Assoc. Pet. Geol. Mem. 26, pp. 49-212.] The global-eustasy model posed two challenges to the "normal science" of stratigraphy then underway: (1) that sequence stratigraphy, as exemplified by the global cycle chart, constitutes a superior standard of geologic time to that assembled from conventional chronostratigraphic evidence, and (2) that stratigraphic processes are dominated by the effects of eustasy, to the exclusion of other allogenic mechanisms, including tectonism. While many stratigraphers now doubt the universal validity of the model of global-eustasy, what we term the global-eustasy paradigm, a group of sequence researchers led by Vail still adheres to it, and the two conceptual approaches have evolved into two conflicting paradigms. Those who assert that there are multiple processes generating stratigraphic sequences (possibly including eustatic processes) are adherents of what we term the complexity paradigm. Followers of this paradigm argue that tests of the global cycle chart amount to little more than circular reasoning. A new body of work documenting the European sequence record was published in 1998 by de Graciansky et al. These workers largely follow the global-eustasy paradigm. Citation and textual analysis of this work indicates that they have not responded to any of the

Biophysical and structural considerations for protein sequence evolution

Directory of Open Access Journals (Sweden)

Grahnen Johan A

2011-12-01

Full Text Available Abstract Background Protein sequence evolution is constrained by the biophysics of folding and function, causing interdependence between interacting sites in the sequence. However, current site-independent models of sequence evolutions do not take this into account. Recent attempts to integrate the influence of structure and biophysics into phylogenetic models via statistical/informational approaches have not resulted in expected improvements in model performance. This suggests that further innovations are needed for progress in this field. Results Here we develop a coarse-grained physics-based model of protein folding and binding function, and compare it to a popular informational model. We find that both models violate the assumption of the native sequence being close to a thermodynamic optimum, causing directional selection away from the native state. Sampling and simulation show that the physics-based model is more specific for fold-defining interactions that vary less among residue type. The informational model diffuses further in sequence space with fewer barriers and tends to provide less support for an invariant sites model, although amino acid substitutions are generally conservative. Both approaches produce sequences with natural features like dN/dS Conclusions Simple coarse-grained models of protein folding can describe some natural features of evolving proteins but are currently not accurate enough to use in evolutionary inference. This is partly due to improper packing of the hydrophobic core. We suggest possible improvements on the representation of structure, folding energy, and binding function, as regards both native and non-native conformations, and describe a large number of possible applications for such a model.
Placental fetal stem segmentation in a sequence of histology images

Science.gov (United States)

Athavale, Prashant; Vese, Luminita A.

2012-02-01

Recent research in perinatal pathology argues that analyzing properties of the placenta may reveal important information on how certain diseases progress. One important property is the structure of the placental fetal stems. Analysis of the fetal stems in a placenta could be useful in the study and diagnosis of some diseases like autism. To study the fetal stem structure effectively, we need to automatically and accurately track fetal stems through a sequence of digitized hematoxylin and eosin (H&E) stained histology slides. There are many problems in successfully achieving this goal. A few of the problems are: large size of images, misalignment of the consecutive H&E slides, unpredictable inaccuracies of manual tracing, very complicated texture patterns of various tissue types without clear characteristics, just to name a few. In this paper we propose a novel algorithm to achieve automatic tracing of the fetal stem in a sequence of H&E images, based on an inaccurate manual segmentation of a fetal stem in one of the images. This algorithm combines global affine registration, local non-affine registration and a novel 'dynamic' version of the active contours model without edges. We first use global affine image registration of all the images based on displacement, scaling and rotation. This gives us approximate location of the corresponding fetal stem in the image that needs to be traced. We then use the affine registration algorithm "locally" near this location. At this point, we use a fast non-affine registration based on L2-similarity measure and diffusion regularization to get a better location of the fetal stem. Finally, we have to take into account inaccuracies in the initial tracing. This is achieved through a novel dynamic version of the active contours model without edges where the coefficients of the fitting terms are computed iteratively to ensure that we obtain a unique stem in the segmentation. The segmentation thus obtained can then be used as an
An information-theoretic approach to the modeling and analysis of whole-genome bisulfite sequencing data.

Science.gov (United States)

Jenkinson, Garrett; Abante, Jordi; Feinberg, Andrew P; Goutsias, John

2018-03-07

DNA methylation is a stable form of epigenetic memory used by cells to control gene expression. Whole genome bisulfite sequencing (WGBS) has emerged as a gold-standard experimental technique for studying DNA methylation by producing high resolution genome-wide methylation profiles. Statistical modeling and analysis is employed to computationally extract and quantify information from these profiles in an effort to identify regions of the genome that demonstrate crucial or aberrant epigenetic behavior. However, the performance of most currently available methods for methylation analysis is hampered by their inability to directly account for statistical dependencies between neighboring methylation sites, thus ignoring significant information available in WGBS reads. We present a powerful information-theoretic approach for genome-wide modeling and analysis of WGBS data based on the 1D Ising model of statistical physics. This approach takes into account correlations in methylation by utilizing a joint probability model that encapsulates all information available in WGBS methylation reads and produces accurate results even when applied on single WGBS samples with low coverage. Using the Shannon entropy, our approach provides a rigorous quantification of methylation stochasticity in individual WGBS samples genome-wide. Furthermore, it utilizes the Jensen-Shannon distance to evaluate differences in methylation distributions between a test and a reference sample. Differential performance assessment using simulated and real human lung normal/cancer data demonstrate a clear superiority of our approach over DSS, a recently proposed method for WGBS data analysis. Critically, these results demonstrate that marginal methods become statistically invalid when correlations are present in the data. This contribution demonstrates clear benefits and the necessity of modeling joint probability distributions of methylation using the 1D Ising model of statistical physics and of
Genomic sequencing of Pleistocene cave bears

Energy Technology Data Exchange (ETDEWEB)

Noonan, James P.; Hofreiter, Michael; Smith, Doug; Priest, JamesR.; Rohland, Nadin; Rabeder, Gernot; Krause, Johannes; Detter, J. Chris; Paabo, Svante; Rubin, Edward M.

2005-04-01

Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of {approx}1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome, the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.
A Novel Computational Method for Detecting DNA Methylation Sites with DNA Sequence Information and Physicochemical Properties.

Science.gov (United States)

Pan, Gaofeng; Jiang, Limin; Tang, Jijun; Guo, Fei

2018-02-08

DNA methylation is an important biochemical process, and it has a close connection with many types of cancer. Research about DNA methylation can help us to understand the regulation mechanism and epigenetic reprogramming. Therefore, it becomes very important to recognize the methylation sites in the DNA sequence. In the past several decades, many computational methods-especially machine learning methods-have been developed since the high-throughout sequencing technology became widely used in research and industry. In order to accurately identify whether or not a nucleotide residue is methylated under the specific DNA sequence context, we propose a novel method that overcomes the shortcomings of previous methods for predicting methylation sites. We use k -gram, multivariate mutual information, discrete wavelet transform, and pseudo amino acid composition to extract features, and train a sparse Bayesian learning model to do DNA methylation prediction. Five criteria-area under the receiver operating characteristic curve (AUC), Matthew's correlation coefficient (MCC), accuracy (ACC), sensitivity (SN), and specificity-are used to evaluate the prediction results of our method. On the benchmark dataset, we could reach 0.8632 on AUC, 0.8017 on ACC, 0.5558 on MCC, and 0.7268 on SN. Additionally, the best results on two scBS-seq profiled mouse embryonic stem cells datasets were 0.8896 and 0.9511 by AUC, respectively. When compared with other outstanding methods, our method surpassed them on the accuracy of prediction. The improvement of AUC by our method compared to other methods was at least 0.0399 . For the convenience of other researchers, our code has been uploaded to a file hosting service, and can be downloaded from: https://figshare.com/s/0697b692d802861282d3.
A Novel Computational Method for Detecting DNA Methylation Sites with DNA Sequence Information and Physicochemical Properties

Directory of Open Access Journals (Sweden)

Gaofeng Pan

2018-02-01

Full Text Available DNA methylation is an important biochemical process, and it has a close connection with many types of cancer. Research about DNA methylation can help us to understand the regulation mechanism and epigenetic reprogramming. Therefore, it becomes very important to recognize the methylation sites in the DNA sequence. In the past several decades, many computational methods—especially machine learning methods—have been developed since the high-throughout sequencing technology became widely used in research and industry. In order to accurately identify whether or not a nucleotide residue is methylated under the specific DNA sequence context, we propose a novel method that overcomes the shortcomings of previous methods for predicting methylation sites. We use k-gram, multivariate mutual information, discrete wavelet transform, and pseudo amino acid composition to extract features, and train a sparse Bayesian learning model to do DNA methylation prediction. Five criteria—area under the receiver operating characteristic curve (AUC, Matthew’s correlation coefficient (MCC, accuracy (ACC, sensitivity (SN, and specificity—are used to evaluate the prediction results of our method. On the benchmark dataset, we could reach 0.8632 on AUC, 0.8017 on ACC, 0.5558 on MCC, and 0.7268 on SN. Additionally, the best results on two scBS-seq profiled mouse embryonic stem cells datasets were 0.8896 and 0.9511 by AUC, respectively. When compared with other outstanding methods, our method surpassed them on the accuracy of prediction. The improvement of AUC by our method compared to other methods was at least 0.0399 . For the convenience of other researchers, our code has been uploaded to a file hosting service, and can be downloaded from: https://figshare.com/s/0697b692d802861282d3.
Nanopore Sequencing as a Rapidly Deployable Ebola Outbreak Tool.

Science.gov (United States)

Hoenen, Thomas; Groseth, Allison; Rosenke, Kyle; Fischer, Robert J; Hoenen, Andreas; Judson, Seth D; Martellaro, Cynthia; Falzarano, Darryl; Marzi, Andrea; Squires, R Burke; Wollenberg, Kurt R; de Wit, Emmie; Prescott, Joseph; Safronetz, David; van Doremalen, Neeltje; Bushmaker, Trenton; Feldmann, Friederike; McNally, Kristin; Bolay, Fatorma K; Fields, Barry; Sealy, Tara; Rayfield, Mark; Nichol, Stuart T; Zoon, Kathryn C; Massaquoi, Moses; Munster, Vincent J; Feldmann, Heinz

2016-02-01

Rapid sequencing of RNA/DNA from pathogen samples obtained during disease outbreaks provides critical scientific and public health information. However, challenges exist for exporting samples to laboratories or establishing conventional sequencers in remote outbreak regions. We successfully used a novel, pocket-sized nanopore sequencer at a field diagnostic laboratory in Liberia during the current Ebola virus outbreak.
Comprehensive effective and efficient global public health surveillance

Directory of Open Access Journals (Sweden)

McNabb Scott JN

2010-12-01

Full Text Available Abstract At a crossroads, global public health surveillance exists in a fragmented state. Slow to detect, register, confirm, and analyze cases of public health significance, provide feedback, and communicate timely and useful information to stakeholders, global surveillance is neither maximally effective nor optimally efficient. Stakeholders lack a globa surveillance consensus policy and strategy; officials face inadequate training and scarce resources. Three movements now set the stage for transformation of surveillance: 1 adoption by Member States of the World Health Organization (WHO of the revised International Health Regulations (IHR[2005]; 2 maturation of information sciences and the penetration of information technologies to distal parts of the globe; and 3 consensus that the security and public health communities have overlapping interests and a mutual benefit in supporting public health functions. For these to enhance surveillance competencies, eight prerequisites should be in place: politics, policies, priorities, perspectives, procedures, practices, preparation, and payers. To achieve comprehensive, global surveillance, disparities in technical, logistic, governance, and financial capacities must be addressed. Challenges to closing these gaps include the lack of trust and transparency; perceived benefit at various levels; global governance to address data power and control; and specified financial support from globa partners. We propose an end-state perspective for comprehensive, effective and efficient global, multiple-hazard public health surveillance and describe a way forward to achieve it. This end-state is universal, global access to interoperable public health information when it’s needed, where it’s needed. This vision mitigates the tension between two fundamental human rights: first, the right to privacy, confidentiality, and security of personal health information combined with the right of sovereign, national entities
Comprehensive effective and efficient global public health surveillance.

Science.gov (United States)

McNabb, Scott J N

2010-12-03

At a crossroads, global public health surveillance exists in a fragmented state. Slow to detect, register, confirm, and analyze cases of public health significance, provide feedback, and communicate timely and useful information to stakeholders, global surveillance is neither maximally effective nor optimally efficient. Stakeholders lack a globa surveillance consensus policy and strategy; officials face inadequate training and scarce resources.Three movements now set the stage for transformation of surveillance: 1) adoption by Member States of the World Health Organization (WHO) of the revised International Health Regulations (IHR[2005]); 2) maturation of information sciences and the penetration of information technologies to distal parts of the globe; and 3) consensus that the security and public health communities have overlapping interests and a mutual benefit in supporting public health functions. For these to enhance surveillance competencies, eight prerequisites should be in place: politics, policies, priorities, perspectives, procedures, practices, preparation, and payers.To achieve comprehensive, global surveillance, disparities in technical, logistic, governance, and financial capacities must be addressed. Challenges to closing these gaps include the lack of trust and transparency; perceived benefit at various levels; global governance to address data power and control; and specified financial support from globa partners.We propose an end-state perspective for comprehensive, effective and efficient global, multiple-hazard public health surveillance and describe a way forward to achieve it. This end-state is universal, global access to interoperable public health information when it's needed, where it's needed. This vision mitigates the tension between two fundamental human rights: first, the right to privacy, confidentiality, and security of personal health information combined with the right of sovereign, national entities to the ownership and stewardship
The critical role of acute flaccid paralysis surveillance in the Global Polio Eradication Initiative.

Science.gov (United States)

Tangermann, Rudolf H; Lamoureux, Christine; Tallis, Graham; Goel, Ajay

2017-05-01

Acute flaccid paralysis (AFP) surveillance is a key strategy used by the Global Polio Eradication Initiative (GPEI) to measure progress towards reaching the global eradication goal. Supported by a global polio laboratory network, AFP surveillance is conducted in 179 of 194 WHO member states. Active surveillance visits to priority health facilities are used to assure all children polio laboratories. The quality of AFP surveillance is regularly monitored with standardized surveillance quality indicators. In highest risk countries and areas, the sensitivity of AFP surveillance is enhanced by environmental surveillance (testing of sewage samples). Genetic sequencing of detected poliovirus isolates yields programmatically important information on polio transmission pathways. AFP surveillance is one of the most valuable assets of the GPEI, with the potential to serve as a platform to build integrated disease surveillance systems. Continued support to maintain AFP surveillance systems will be essential, to reliably monitor the completion of global polio eradication, and to assure that a key resource for building surveillance capacity is transitioned post-eradication to support other health priorities. © The Author 2017. Published by Oxford University Press on behalf of Royal Society of Tropical Medicine and Hygiene. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Complete chloroplast genome sequence of a major allogamous forage species, perennial ryegrass (Lolium perenne L.).

Science.gov (United States)

Diekmann, Kerstin; Hodkinson, Trevor R; Wolfe, Kenneth H; van den Bekerom, Rob; Dix, Philip J; Barth, Susanne

2009-06-01

Lolium perenne L. (perennial ryegrass) is globally one of the most important forage and grassland crops. We sequenced the chloroplast (cp) genome of Lolium perenne cultivar Cashel. The L. perenne cp genome is 135 282 bp with a typical quadripartite structure. It contains genes for 76 unique proteins, 30 tRNAs and four rRNAs. As in other grasses, the genes accD, ycf1 and ycf2 are absent. The genome is of average size within its subfamily Pooideae and of medium size within the Poaceae. Genome size differences are mainly due to length variations in non-coding regions. However, considerable length differences of 1-27 codons in comparison of L. perenne to other Poaceae and 1-68 codons among all Poaceae were also detected. Within the cp genome of this outcrossing cultivar, 10 insertion/deletion polymorphisms and 40 single nucleotide polymorphisms were detected. Two of the polymorphisms involve tiny inversions within hairpin structures. By comparing the genome sequence with RT-PCR products of transcripts for 33 genes, 31 mRNA editing sites were identified, five of them unique to Lolium. The cp genome sequence of L. perenne is available under Accession number AM777385 at the European Molecular Biology Laboratory, National Center for Biotechnology Information and DNA DataBank of Japan.
A qualitative content analysis of global health engagements in Peacekeeping and Stability Operations Institute's stability operations lessons learned and information management system.

Science.gov (United States)

Nang, Roberto N; Monahan, Felicia; Diehl, Glendon B; French, Daniel

2015-04-01

Many institutions collect reports in databases to make important lessons-learned available to their members. The Uniformed Services University of the Health Sciences collaborated with the Peacekeeping and Stability Operations Institute to conduct a descriptive and qualitative analysis of global health engagements (GHEs) contained in the Stability Operations Lessons Learned and Information Management System (SOLLIMS). This study used a summative qualitative content analysis approach involving six steps: (1) a comprehensive search; (2) two-stage reading and screening process to identify first-hand, health-related records; (3) qualitative and quantitative data analysis using MAXQDA, a software program; (4) a word cloud to illustrate word frequencies and interrelationships; (5) coding of individual themes and validation of the coding scheme; and (6) identification of relationships in the data and overarching lessons-learned. The individual codes with the most number of text segments coded included: planning, personnel, interorganizational coordination, communication/information sharing, and resources/supplies. When compared to the Department of Defense's (DoD's) evolving GHE principles and capabilities, the SOLLIMS coding scheme appeared to align well with the list of GHE capabilities developed by the Department of Defense Global Health Working Group. The results of this study will inform practitioners of global health and encourage additional qualitative analysis of other lessons-learned databases. Reprint & Copyright © 2015 Association of Military Surgeons of the U.S.
Application of wavelet analysis in determining the periodicity of global warming

Science.gov (United States)

Feng, Xiao

2018-04-01

In the last two decades of the last century, the global average temperature has risen by 0.48 ° C over 100 years ago. Since then, global warming has become a hot topic. Global warming will have complex and potential impacts on humans and the Earth. However, the negative impacts far outweigh the positive impacts. The most obvious external manifestation of global warming is temperature. Therefore, this study uses wavelet analysis study the characteristics of temperature time series, solve the periodicity of the sequence, find out the trend of temperature change and predict the extent of global warming in the future, so as to take the necessary precautionary measures.
A Window Into Clinical Next-Generation Sequencing-Based Oncology Testing Practices.

Science.gov (United States)

Nagarajan, Rakesh; Bartley, Angela N; Bridge, Julia A; Jennings, Lawrence J; Kamel-Reid, Suzanne; Kim, Annette; Lazar, Alexander J; Lindeman, Neal I; Moncur, Joel; Rai, Alex J; Routbort, Mark J; Vasalos, Patricia; Merker, Jason D

2017-12-01

- Detection of acquired variants in cancer is a paradigm of precision medicine, yet little has been reported about clinical laboratory practices across a broad range of laboratories. - To use College of American Pathologists proficiency testing survey results to report on the results from surveys on next-generation sequencing-based oncology testing practices. - College of American Pathologists proficiency testing survey results from more than 250 laboratories currently performing molecular oncology testing were used to determine laboratory trends in next-generation sequencing-based oncology testing. - These presented data provide key information about the number of laboratories that currently offer or are planning to offer next-generation sequencing-based oncology testing. Furthermore, we present data from 60 laboratories performing next-generation sequencing-based oncology testing regarding specimen requirements and assay characteristics. The findings indicate that most laboratories are performing tumor-only targeted sequencing to detect single-nucleotide variants and small insertions and deletions, using desktop sequencers and predesigned commercial kits. Despite these trends, a diversity of approaches to testing exists. - This information should be useful to further inform a variety of topics, including national discussions involving clinical laboratory quality systems, regulation and oversight of next-generation sequencing-based oncology testing, and precision oncology efforts in a data-driven manner.
Global Warming: Implications for Library and Information Professionals

African Journals Online (AJOL)

The major finding is that global warming is an issue that cannot be resolved overnight or with any one policy. It is an intergenerational problem which needs to be addressed by ensuring that how people live on this planet takes climate change seriously into account. The major recommendation is that library buildings should ...
Drug pricing and reimbursement information management: processes and decision making in the global economy.

Science.gov (United States)

Tsourougiannis, Dimitrios

2017-01-01

Background : Cost-containment initiatives are re-shaping the pharmaceutical business environment and affecting market access as well as pricing and reimbursement decisions. Effective price management procedures are too complex to accomplish manually. Prior to February 2013, price management within Astellas Pharma Europe Ltd was done manually using an Excel database. The system was labour intensive, slow to update, and prone to error. An innovative web-based pricing information management system was developed to address the shortcomings of the previous system. Development : A secure web-based system for submitting, reviewing and approving pricing requests was designed to: track all pricing applications and approval status; update approved pricing information automatically; provide fixed and customizable reports of pricing information; collect pricing and reimbursement rules from each country; validate pricing and reimbursement rules monthly. Several sequential phases of development emphasized planning, time schedules, target dates, budgets and implementation of the entire system. A test system was used to pilot the electronic (e)-pricing system with three affiliates (four users) in February 2013. Outcomes : The web-based system was introduced in March 2013, currently has about 227 active users globally and comprises more than 1000 presentations of 150 products. The overall benefits of switching from a manual to an e-pricing system were immediate and highly visible in terms of efficiency, transparency, reliability and compliance. Conclusions : The e-pricing system has improved the efficiency, reliability, compliance, transparency and ease of access to multinational drug pricing and approval information.
An HMM posterior decoder for sequence feature prediction that includes homology information

DEFF Research Database (Denmark)

Käll, Lukas; Krogh, Anders Stærmose; Sonnhammer, Erik L. L.

2005-01-01

Motivation: When predicting sequence features like transmembrane topology, signal peptides, coil-coil structures, protein secondary structure or genes, extra support can be gained from homologs. Results: We present here a general hidden Markov model (HMM) decoding algorithm that combines probabil......Motivation: When predicting sequence features like transmembrane topology, signal peptides, coil-coil structures, protein secondary structure or genes, extra support can be gained from homologs. Results: We present here a general hidden Markov model (HMM) decoding algorithm that combines......://phobius.cgb.ki.se/poly.html . An implementation of the algorithm is available on request from the authors....
Understanding Information Technology Investment Decision-Making in the Context of Hotel Global Distribution Systems: a Multiple-Case Study

OpenAIRE

Connolly, Daniel J.

1999-01-01

UNDERSTANDING INFORMATION TECHNOLOGY INVESTMENT DECISION-MAKING IN THE CONTEXT OF HOTEL GLOBAL DISTRIBUTION SYSTEMS: A MULTIPLE-CASE STUDY by Daniel J. Connolly Dr. Michael D. Olsen, Chair Department of Hospitality and Tourism Management ABSTRACT This study investigates what three large, multinational hospitality companies do in practice when evaluating and making IT investment decisions. This study was launched in an attempt to 1) learn more about ...
Global DNA methylation analysis using methyl-sensitive amplification polymorphism (MSAP).

Science.gov (United States)

Yaish, Mahmoud W; Peng, Mingsheng; Rothstein, Steven J

2014-01-01

DNA methylation is a crucial epigenetic process which helps control gene transcription activity in eukaryotes. Information regarding the methylation status of a regulatory sequence of a particular gene provides important knowledge of this transcriptional control. DNA methylation can be detected using several methods, including sodium bisulfite sequencing and restriction digestion using methylation-sensitive endonucleases. Methyl-Sensitive Amplification Polymorphism (MSAP) is a technique used to study the global DNA methylation status of an organism and hence to distinguish between two individuals based on the DNA methylation status determined by the differential digestion pattern. Therefore, this technique is a useful method for DNA methylation mapping and positional cloning of differentially methylated genes. In this technique, genomic DNA is first digested with a methylation-sensitive restriction enzyme such as HpaII, and then the DNA fragments are ligated to adaptors in order to facilitate their amplification. Digestion using a methylation-insensitive isoschizomer of HpaII, MspI is used in a parallel digestion reaction as a loading control in the experiment. Subsequently, these fragments are selectively amplified by fluorescently labeled primers. PCR products from different individuals are compared, and once an interesting polymorphic locus is recognized, the desired DNA fragment can be isolated from a denaturing polyacrylamide gel, sequenced and identified based on DNA sequence similarity to other sequences available in the database. We will use analysis of met1, ddm1, and atmbd9 mutants and wild-type plants treated with a cytidine analogue, 5-azaC, or zebularine to demonstrate how to assess the genetic modulation of DNA methylation in Arabidopsis. It should be noted that despite the fact that MSAP is a reliable technique used to fish for polymorphic methylated loci, its power is limited to the restriction recognition sites of the enzymes used in the genomic
On the long-lasting sequences of coral reef terraces from SE Sulawesi (Indonesia): Distribution, formation, and global significance

Science.gov (United States)

Pedoja, Kevin; Husson, Laurent; Bezos, Antoine; Pastier, Anne-Morwenn; Imran, Andy Muhammad; Arias-Ruiz, Camilo; Sarr, Anta-Clarisse; Elliot, Mary; Pons-Branchu, Edwige; Nexer, Maëlle; Regard, Vincent; Hafidz, Abdul; Robert, Xavier; Benoit, Laurent; Delcaillau, Bernard; Authemayou, Christine; Dumoulin, Caroline; Choblet, Gaël

2018-05-01

Many islands of the eastern Indonesian Archipelago exhibit Late Cenozoic sequences of coral reef terraces. In SE Sulawesi, on the Tukang Besi and Buton archipelagos, we identified 23 islands bearing such sequences. Remote sensing imagery and field mapping combined to U/Th and 14C dating enable to establish a chronologic framework of the reef terrace sequences from Wangi-Wangi, Buton as well as on the neighbouring, smaller islands of Ular, Siumpu and Kadatua. We identified the terraces from the last interglacial maximum (MIS 5e) at elevations lower than 20 m except on W Kadatua where it is raised at 34 ± 5 m. Such elevations yield low to moderate Upper Pleistocene uplift rates (<0.3 mm yr-1). On SE Buton Island, a sequence culminates at 650 m and includes at least 40 undated strandlines. Next to this exceptional sequence, on the Sampolawa Peninsula, 18 strandlines culminate at 430 m. Dated samples at the base of this sequence (<40 m) yield mean Middle Pleistocene uplift rates of 0.14 ± 0.09 mm yr-1. Extrapolation of these uplift rates compared to the geological setting suggests that the sequences of the Sampolawa Peninsula provide a record of sea-level high-stands for the last 3.8 ± 0.6 Ma. The sequences on SE Buton Island therefore constitute the best preserved long-lasting geomorphic record of Plio-Quaternary sea-level stands worldwide.

Application of next generation sequencing in clinical microbiology and infection prevention

NARCIS (Netherlands)

Deurenberg, Ruud H.; Bathoorn, Erik; Chlebowicz, Monika A.; Couto, Natacha; Ferdous, Mithila; Garcia-Cobos, Silvia; Kooistra-Smid, Anna M. D.; Raangs, Erwin C.; Rosema, Sigrid; Veloo, Alida C. M.; Zhou, Kai; Friedrich, Alexander W.; Rossen, John W. A.

2017-01-01

Current molecular diagnostics of human pathogens provide limited information that is often not sufficient for outbreak and transmission investigation. Next generation sequencing (NGS) determines the DNA sequence of a complete bacterial genome in a single sequence run, and from these data,
Globalization – Chances or Risks

OpenAIRE

MĂDĂLINA ANTOANETA RĂDOI; ALEXANDRU OLTEANU

2015-01-01

There are for and against arguments as regards the process of globalization. But what is globalization: a concept, a reality or a state as such? We can consider that globalization reflects the natural continuity of a process that appeared a long time ago and that has evolved ever since or a new phenomenon that was generated by the speed with which new technology and information flow. Milton Friedman, a fervent supporter of globalization, gives an answer to the question “what is globalization”...
Sequencing Genetics Information: Integrating Data into Information Literacy for Undergraduate Biology Students

Science.gov (United States)

MacMillan, Don

2010-01-01

This case study describes an information literacy lab for an undergraduate biology course that leads students through a range of resources to discover aspects of genetic information. The lab provides over 560 students per semester with the opportunity for hands-on exploration of resources in steps that simulate the pathways of higher-level…
Polymorphism Sequence - JSNP | LSDB Archive [Life Science Database Archive metadata

Lifescience Database Archive (English)

Full Text Available List Contact us JSNP Polymorphism Sequence Data detail Data name Polymorphism Sequence DOI 10.18908/lsdba.nb...dc00114-001 Description of data contents Information on polymorphisms (SNPs and insertions/deletions) and th...se Name database name JSNP_SNP: single nucleotide polymorphism JSNP_InsDel_IND: insertion/deletion JSNP_InsD...ved allele observed 3' Flanking Sequence 3' flanking sequence Offset in Flanking Sequence position of the polymorphism...uence Accession No. accession No. of the sequence for polymorphism screening Offset in Record position of the polymorphism
Overlapping genomic sequences: a treasure trove of single-nucleotide polymorphisms.

Science.gov (United States)

Taillon-Miller, P; Gu, Z; Li, Q; Hillier, L; Kwok, P Y

1998-07-01

An efficient strategy to develop a dense set of single-nucleotide polymorphism (SNP) markers is to take advantage of the human genome sequencing effort currently under way. Our approach is based on the fact that bacterial artificial chromosomes (BACs) and P1-based artificial chromosomes (PACs) used in long-range sequencing projects come from diploid libraries. If the overlapping clones sequenced are from different lineages, one is comparing the sequences from 2 homologous chromosomes in the overlapping region. We have analyzed in detail every SNP identified while sequencing three sets of overlapping clones found on chromosome 5p15.2, 7q21-7q22, and 13q12-13q13. In the 200.6 kb of DNA sequence analyzed in these overlaps, 153 SNPs were identified. Computer analysis for repetitive elements and suitability for STS development yielded 44 STSs containing 68 SNPs for further study. All 68 SNPs were confirmed to be present in at least one of the three (Caucasian, African-American, Hispanic) populations studied. Furthermore, 42 of the SNPs tested (62%) were informative in at least one population, 32 (47%) were informative in two or more populations, and 23 (34%) were informative in all three populations. These results clearly indicate that developing SNP markers from overlapping genomic sequence is highly efficient and cost effective, requiring only the two simple steps of developing STSs around the known SNPs and characterizing them in the appropriate populations.
Effort in Multitasking: Local and Global Assessment of Effort.

Science.gov (United States)

Kiesel, Andrea; Dignath, David

2017-01-01

When performing multiple tasks in succession, self-organization of task order might be superior compared to external-controlled task schedules, because self-organization allows optimizing processing modes and thus reduces switch costs, and it increases commitment to task goals. However, self-organization is an additional executive control process that is not required if task order is externally specified and as such it is considered as time-consuming and effortful. To compare self-organized and externally controlled task scheduling, we suggest assessing global subjective and objectives measures of effort in addition to local performance measures. In our new experimental approach, we combined characteristics of dual tasking settings and task switching settings and compared local and global measures of effort in a condition with free choice of task sequence and a condition with cued task sequence. In a multi-tasking environment, participants chose the task order while the task requirement of the not-yet-performed task remained the same. This task preview allowed participants to work on the previously non-chosen items in parallel and resulted in faster responses and fewer errors in task switch trials than in task repetition trials. The free-choice group profited more from this task preview than the cued group when considering local performance measures. Nevertheless, the free-choice group invested more effort than the cued group when considering global measures. Thus, self-organization in task scheduling seems to be effortful even in conditions in which it is beneficiary for task processing. In a second experiment, we reduced the possibility of task preview for the not-yet-performed tasks in order to hinder efficient self-organization. Here neither local nor global measures revealed substantial differences between the free-choice and a cued task sequence condition. Based on the results of both experiments, we suggest that global assessment of effort in addition to
The roles of information technology in global chain supply: a multiple case study of multinational companies of China

Science.gov (United States)

He, Mao; Duan, Wanchun

2007-12-01

Nowadays many Chinese companies have being becoming more and more international. Therefore, these Chinese companies have to face global supply chains rather than the former domestic ones. The use of information technology (IT) is considered a prerequisite for the effective control of today's complex global supply chains. Based on empirical data from 10 multinational companies of China, this paper presents a classification of the ways in which companies use IT in SCM, and examines the drivers for these different utilization types. According to the findings of this research, the purposes of using of IT in SCM can be divided into 1) transaction processing, 2) supply chain planning and collaboration, and 3) order tracking and delivery coordination. The findings further suggest that the drivers between these three uses of IT in SCM differ.
Information Assurance Security in the Information Environment

CERN Document Server

Blyth, Andrew

2006-01-01

Intended for IT managers and assets protection professionals, this work aims to bridge the gap between information security, information systems security and information warfare. It covers topics such as the role of the corporate security officer; Corporate cybercrime; Electronic commerce and the global marketplace; Cryptography; and, more.
Genome sequencing - the ultimate answer to global real time genotyping and surveillance?

DEFF Research Database (Denmark)

Hendriksen, Rene S.

2013-01-01

organised to discuss the possibility of using WGS as diagnostic tool on a global scale. These meetings were attended by scientists and policy makers from around the world. The general conclusion of these meetings was that the technology exists and that the spread in the application should be linked...
31 CFR 594.305 - Information or informational materials.

Science.gov (United States)

2010-07-01

... (Continued) OFFICE OF FOREIGN ASSETS CONTROL, DEPARTMENT OF THE TREASURY GLOBAL TERRORISM SANCTIONS REGULATIONS General Definitions § 594.305 Information or informational materials. (a) For purposes of this...
Globalization of healthcare.

Science.gov (United States)

2012-05-01

Globalization-the increasing transnational circulation of money, goods, people, ideas, and information worldwide-is generally recognized as one of the most powerful forces shaping our current and future history. How is it affecting healthcare, and in that context, what is the purpose and significance of Global Advances in Health and Medicine (GAHM), publisher of this journal? Our goal is not homogenization but rather to provide an opportunity for integration, convergence, and collaboration across cultures. By respecting and conserving the richness and diversity of each new medicine, we embrace globalization. Globalization is of course not new; it began in the Renaissance and particularly with the 15th- and 16th-century voyages of exploration by Columbus, Magellan, and others. Since the beginning of time, there have been interactions and exchanges among different peoples and cultures. However, the current magnitude of globalization is unprecedented and yet still expanding rapidly.
Comprehensive global amino acid sequence analysis of PB1F2 protein of influenza A H5N1 viruses and the influenza A virus subtypes responsible for the 20th‐century pandemics

Science.gov (United States)

Pasricha, Gunisha; Mishra, Akhilesh C.; Chakrabarti, Alok K.

2012-01-01

Please cite this paper as: Pasricha et al. (2012) Comprehensive global amino acid sequence analysis of PB1F2 protein of influenza A H5N1 viruses and the Influenza A virus subtypes responsible for the 20th‐century pandemics. Influenza and Other Respiratory Viruses 7(4), 497–505. Background PB1F2 is the 11th protein of influenza A virus translated from +1 alternate reading frame of PB1 gene. Since the discovery, varying sizes and functions of the PB1F2 protein of influenza A viruses have been reported. Selection of PB1 gene segment in the pandemics, variable size and pleiotropic effect of PB1F2 intrigued us to analyze amino acid sequences of this protein in various influenza A viruses. Methods Amino acid sequences for PB1F2 protein of influenza A H5N1, H1N1, H2N2, and H3N2 subtypes were obtained from Influenza Research Database. Multiple sequence alignments of the PB1F2 protein sequences of the aforementioned subtypes were used to determine the size, variable and conserved domains and to perform mutational analysis. Results Analysis showed that 96·4% of the H5N1 influenza viruses harbored full‐length PB1F2 protein. Except for the 2009 pandemic H1N1 virus, all the subtypes of the 20th‐century pandemic influenza viruses contained full‐length PB1F2 protein. Through the years, PB1F2 protein of the H1N1 and H3N2 viruses has undergone much variation. PB1F2 protein sequences of H5N1 viruses showed both human‐ and avian host‐specific conserved domains. Global database of PB1F2 protein revealed that N66S mutation was present only in 3·8% of the H5N1 strains. We found a novel mutation, N84S in the PB1F2 protein of 9·35% of the highly pathogenic avian influenza H5N1 influenza viruses. Conclusions Varying sizes and mutations of the PB1F2 protein in different influenza A virus subtypes with pandemic potential were obtained. There was genetic divergence of the protein in various hosts which highlighted the host‐specific evolution of the virus
The role of new information technology meeting the global need and gap of education in pediatric surgery.

Science.gov (United States)

Ure, Benno; Zoeller, Christoph; Lacher, Martin

2015-06-01

Traditionally, pediatric surgical education consisted of exposure to patients, textbooks, lectures, team-based education, congresses, and workshops. Over the last decades, however, new information technology (IT) and the internet revolutionized the sharing of information and communication. IT has become relevant in particular for the younger generation of pediatric surgeons. Today, gaps in children's health and the quality of pediatric surgical education persist between countries and regions. Advances in health care are not shared equitably. The use of IT for resource libraries, teleconferences, virtual symposiums, and telementoring has great potential in closing this gap and meeting the global needs for pediatric surgical education. This article focuses on the potential role of IT in this respect. Copyright © 2015 Elsevier Inc. All rights reserved.
Targeted assembly of short sequence reads.

Directory of Open Access Journals (Sweden)

René L Warren

Full Text Available As next-generation sequence (NGS production continues to increase, analysis is becoming a significant bottleneck. However, in situations where information is required only for specific sequence variants, it is not necessary to assemble or align whole genome data sets in their entirety. Rather, NGS data sets can be mined for the presence of sequence variants of interest by localized assembly, which is a faster, easier, and more accurate approach. We present TASR, a streamlined assembler that interrogates very large NGS data sets for the presence of specific variants by only considering reads within the sequence space of input target sequences provided by the user. The NGS data set is searched for reads with an exact match to all possible short words within the target sequence, and these reads are then assembled stringently to generate a consensus of the target and flanking sequence. Typically, variants of a particular locus are provided as different target sequences, and the presence of the variant in the data set being interrogated is revealed by a successful assembly outcome. However, TASR can also be used to find unknown sequences that flank a given target. We demonstrate that TASR has utility in finding or confirming genomic mutations, polymorphisms, fusions and integration events. Targeted assembly is a powerful method for interrogating large data sets for the presence of sequence variants of interest. TASR is a fast, flexible and easy to use tool for targeted assembly.
Organizing, exploring, and analyzing antibody sequence data: the case for relational-database managers.

Science.gov (United States)

Owens, John

2009-01-01

Technological advances in the acquisition of DNA and protein sequence information and the resulting onrush of data can quickly overwhelm the scientist unprepared for the volume of information that must be evaluated and carefully dissected to discover its significance. Few laboratories have the luxury of dedicated personnel to organize, analyze, or consistently record a mix of arriving sequence data. A methodology based on a modern relational-database manager is presented that is both a natural storage vessel for antibody sequence information and a conduit for organizing and exploring sequence data and accompanying annotation text. The expertise necessary to implement such a plan is equal to that required by electronic word processors or spreadsheet applications. Antibody sequence projects maintained as independent databases are selectively unified by the relational-database manager into larger database families that contribute to local analyses, reports, interactive HTML pages, or exported to facilities dedicated to sophisticated sequence analysis techniques. Database files are transposable among current versions of Microsoft, Macintosh, and UNIX operating systems.
An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries

Science.gov (United States)

Habermann, Bianca; Bebin, Anne-Gaelle; Herklotz, Stephan; Volkmer, Michael; Eckelt, Kay; Pehlke, Kerstin; Epperlein, Hans Henning; Schackert, Hans Konrad; Wiebe, Glenis; Tanaka, Elly M

2004-01-01

Background The ambystomatid salamander, Ambystoma mexicanum (axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for A. mexicanum. Results Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians. Conclusions Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online. PMID:15345051
The RNA world, automatic sequences and oncogenetics

Energy Technology Data Exchange (ETDEWEB)

Tahir Shah, K

1993-04-01

We construct a model of the RNA world in terms of naturally evolving nucleotide sequences assuming only Crick-Watson base pairing and self-cleaving/splicing capability. These sequences have the following properties. (1) They are recognizable by an automation (or automata). That is, to each k-sequence, there exist a k-automation which accepts, recognizes or generates the k-sequence. These are known as automatic sequences. Fibonacci and Morse-Thue sequences are the most natural outcome of pre-biotic chemical conditions. (2) Infinite (resp. large) sequences are self-similar (resp. nearly self-similar) under certain rewrite rules and consequently give rise to fractal (resp.fractal-like) structures. Computationally, such sequences can also be generated by their corresponding deterministic parallel re-write system, known as a DOL system. The self-similar sequences are fixed points of their respective rewrite rules. Some of these automatic sequences have the capability that they can read or ``accept`` other sequences while others can detect errors and trigger error-correcting mechanisms. They can be enlarged and have block and/or palindrome structure. Linear recurring sequences such as Fibonacci sequence are simply Feed-back Shift Registers, a well know model of information processing machines. We show that a mutation of any rewrite rule can cause a combinatorial explosion of error and relates this to oncogenetical behavior. On the other hand, a mutation of sequences that are not rewrite rules, leads to normal evolutionary change. Known experimental results support our hypothesis. (author). Refs.
The RNA world, automatic sequences and oncogenetics

International Nuclear Information System (INIS)

Tahir Shah, K.

1993-04-01

We construct a model of the RNA world in terms of naturally evolving nucleotide sequences assuming only Crick-Watson base pairing and self-cleaving/splicing capability. These sequences have the following properties. 1) They are recognizable by an automation (or automata). That is, to each k-sequence, there exist a k-automation which accepts, recognizes or generates the k-sequence. These are known as automatic sequences. Fibonacci and Morse-Thue sequences are the most natural outcome of pre-biotic chemical conditions. 2) Infinite (resp. large) sequences are self-similar (resp. nearly self-similar) under certain rewrite rules and consequently give rise to fractal (resp.fractal-like) structures. Computationally, such sequences can also be generated by their corresponding deterministic parallel re-write system, known as a DOL system. The self-similar sequences are fixed points of their respective rewrite rules. Some of these automatic sequences have the capability that they can read or 'accept' other sequences while others can detect errors and trigger error-correcting mechanisms. They can be enlarged and have block and/or palindrome structure. Linear recurring sequences such as Fibonacci sequence are simply Feed-back Shift Registers, a well know model of information processing machines. We show that a mutation of any rewrite rule can cause a combinatorial explosion of error and relates this to oncogenetical behavior. On the other hand, a mutation of sequences that are not rewrite rules, leads to normal evolutionary change. Known experimental results support our hypothesis. (author). Refs
A global warming forum: Scientific, economic, and legal overview

International Nuclear Information System (INIS)

Geyer, R.A.

1993-01-01

A Global Warming Forum covers in detail five general subject areas aimed at providing first, the scientific background and technical information available on global warming and second, a study and evaluation of the role of economic, legal, and political considerations in global warming. The five general topic areas discussed are the following: (1) The role of geophysical and geoengineering methods to solve problems related to global climatic change; (2) the role of oceanographic and geochemical methods to provide evidence for global climatic change; (3) the global assessment of greenhouse gas production including the need for additional information; (4) natural resource management needed to provide long-term global energy and agricultural uses; (5) legal, policy, and educational considerations required to properly evaluate global warming proposals
Evolutionary history of Phakopsora pachyrhizi (the Asian soybean rust in Brazil based on nucleotide sequences of the internal transcribed spacer region of the nuclear ribosomal DNA

Directory of Open Access Journals (Sweden)

Maíra C. M. Freire

2008-01-01

Full Text Available Phakopsora pachyrhizi has dispersed globally and brought severe economic losses to soybean growers. The fungus has been established in Brazil since 2002 and is found nationwide. To gather information on the temporal and spatial patterns of genetic variation in P. pachyrhizi , we sequenced the nuclear internal transcribed spacer regions (ITS1 and ITS2. Total genomic DNA was extracted using either lyophilized urediniospores or lesions removed from infected leaves sampled from 26 soybean fields in Brazil and one field in South Africa. Cloning prior to sequencing was necessary because direct sequencing of PCR amplicons gave partially unreadable electrophoretograms with peak displacements suggestive of multiple sequences with length polymorphism. Sequences were determined from four clones per field. ITS sequences from African or Asian isolates available from the GenBank were included in the analyses. Independent sequence alignments of the ITS1 and ITS2 datasets identified 27 and 19 ribotypes, respectively. Molecular phylogeographic analyses revealed that ribotypes of widespread distribution in Brazil displayed characteristics of ancestrality and were shared with Africa and Asia, while ribotypes of rare occurrence in Brazil were indigenous. The results suggest P. pachyrhizi found in Brazil as originating from multiple, independent long-distance dispersal events.

Complete Genome Sequences of Four Isolates of Plutella xylostella Granulovirus

OpenAIRE

Spence, Robert J.; Noune, Christopher; Hauxwell, Caroline

2016-01-01

Granuloviruses are widespread pathogens of Plutella xylostella L. (diamondback moth) and potential biopesticides for control of this global insect pest. We report the complete genomes of four Plutella xylostella granulovirus isolates from China, Malaysia, and Taiwan exhibiting pairs of noncoding, homologous repeat regions with significant sequence variation but equivalent length.
Formatt: Correcting protein multiple structural alignments by incorporating sequence alignment

Directory of Open Access Journals (Sweden)

Daniels Noah M

2012-10-01

Full Text Available Abstract Background The quality of multiple protein structure alignments are usually computed and assessed based on geometric functions of the coordinates of the backbone atoms from the protein chains. These purely geometric methods do not utilize directly protein sequence similarity, and in fact, determining the proper way to incorporate sequence similarity measures into the construction and assessment of protein multiple structure alignments has proved surprisingly difficult. Results We present Formatt, a multiple structure alignment based on the Matt purely geometric multiple structure alignment program, that also takes into account sequence similarity when constructing alignments. We show that Formatt outperforms Matt and other popular structure alignment programs on the popular HOMSTRAD benchmark. For the SABMark twilight zone benchmark set that captures more remote homology, Formatt and Matt outperform other programs; depending on choice of embedded sequence aligner, Formatt produces either better sequence and structural alignments with a smaller core size than Matt, or similarly sized alignments with better sequence similarity, for a small cost in average RMSD. Conclusions Considering sequence information as well as purely geometric information seems to improve quality of multiple structure alignments, though defining what constitutes the best alignment when sequence and structural measures would suggest different alignments remains a difficult open question.
Globalization on Trial: The Human Condition and the Information ...

International Development Research Centre (IDRC) Digital Library (Canada)

What is the human condition at the dawning of the global age? ... scholars,and students in the social sciences and, particularly, the humanities; donors, ... affecting the nature of human civilization, and with the interaction between Islamic and ...
Sequencing and Characterization of the Invasive Sycamore Lace Bug Corythucha ciliata (Hemiptera: Tingidae) Transcriptome

Science.gov (United States)

Qu, Cheng; Fu, Ningning; Xu, Yihua

2016-01-01

The sycamore lace bug, Corythucha ciliata (Hemiptera: Tingidae), is an invasive forestry pest rapidly expanding in many countries. This pest poses a considerable threat to the urban forestry ecosystem, especially to Platanus spp. However, its molecular biology and biochemistry are poorly understood. This study reports the first C. ciliata transcriptome, encompassing three different life stages (Nymphs, adults female (AF) and adults male (AM)). In total, 26.53 GB of clean data and 60,879 unigenes were obtained from three RNA-seq libraries. These unigenes were annotated and classified by Nr (NCBI non-redundant protein sequences), Nt (NCBI non-redundant nucleotide sequences), Pfam (Protein family), KOG/COG (Clusters of Orthologous Groups of proteins), Swiss-Prot (A manually annotated and reviewed protein sequence database), and KO (KEGG Ortholog database). After all pairwise comparisons between these three different samples, a large number of differentially expressed genes were revealed. The dramatic differences in global gene expression profiles were found between distinct life stages (nymphs and AF, nymphs and AM) and sex difference (AF and AM), with some of the significantly differentially expressed genes (DEGs) being related to metamorphosis, digestion, immune and sex difference. The different express of unigenes were validated through quantitative Real-Time PCR (qRT-PCR) for 16 randomly selected unigenes. In addition, 17,462 potential simple sequence repeat molecular markers were identified in these transcriptome resources. These comprehensive C. ciliata transcriptomic information can be utilized to promote the development of environmentally friendly methodologies to disrupt the processes of metamorphosis, digestion, immune and sex differences. PMID:27494615
Analysis of the Main Access Municipal Project Free and Free Internet in Public Squares: Digital Inclusion in the Present Corporate Information Globalized

Directory of Open Access Journals (Sweden)

Anderson Nogueira Oliveira

2015-12-01

Full Text Available The present study has as its theme the role of municipalities in the current global information society. So it has the general objective analysis on the free access to the internet in public places as a means of digital inclusion, with such spaces known as digital o hotspots squares. In this case we will present concepts, definitions and brief historical development of the objects of study of this research, namely, globalization, the information society and digital inclusion. We emphasize that this research will analyze recent data on internet access in Brazil, and will check the key municipal projects freely and free internet access in public squares. For this research we use the hypothetical-deductive method by the methodology of analysis of books, scientific papers and official data by renamed institutions to present a scientifically valid conclusion.
Global Collaborative STEM Education

Science.gov (United States)

Meabh Kelly, Susan; Smith, Walter

2016-04-01

Global Collaborative STEM Education, as the name suggests, simultaneously supports two sets of knowledge and skills. The first set is STEM -- science, technology, engineering and math. The other set of content knowledge and skills is that of global collaboration. Successful global partnerships require awareness of one's own culture, the biases embedded within that culture, as well as developing awareness of the collaborators' culture. Workforce skills fostered include open-mindedness, perseverance when faced with obstacles, and resourceful use of technological "bridges" to facilitate and sustain communication. In respect for the 2016 GIFT Workshop focus, Global Collaborative STEM Education projects dedicated to astronomy research will be presented. The projects represent different benchmarks within the Global Collaborative STEM Education continuum, culminating in an astronomy research experience that fully reflects how the global STEM workforce collaborates. To facilitate wider engagement in Global Collaborative STEM Education, project summaries, classroom resources and contact information for established international collaborative astronomy research projects will be disseminated.
KNOWLEDGE AND INFORMATION – NEW FACTORS OF PRODUCTION IN THE CONTEXT OF GLOBALIZATION

Directory of Open Access Journals (Sweden)

Mirela Alina COCALIA (CRĂCIUN

2015-02-01

Full Text Available The present article has as a starting point the phenomenon of globalization, so debated worldwide today. Along this work, we have tried offer a departure point, motivated of what the phenomenon of globalization means in economical context. Thus, we debate problems of major interest like: acceptances of the word “globalization”, multiple influences exerted by globalization over the proper nations and the way in which the specific economical and geographic area is marked by the phenomenon of globalization in the same time.
Concept, Components and Promotion of Global Citizenship

Directory of Open Access Journals (Sweden)

Mojtaba Hemmati

2017-04-01

Full Text Available The term "citizenship" refers to an identity between a person and a city, state or nation. When combined with the term "global", it typically defines a person who places their identity with a "global community" above their identity as a citizen of a particular nation or place. The idea is that one’s identity transcends geography or political borders and that responsibilities or rights are or can be derived from membership in a broader class: "humanity". The message of Global citizenship is that the core social, political, economic and environmental realities of the world today should be addressed at all levels - by individuals, civil society organizations, communities and nation states - through a global lens. The lack of a global democratic government that is accountable and responsible against citizens in the face of global challenges, demonstrate the ineffectiveness and lack of effectiveness of the world existing structures. Therefore, to supplement the existing structures, global citizenship is performative and citizen-oriented. Citizens through information and communication networks participate in solving global issues, including environmental problems, human rights, peace and global poverty. This type of citizenship is promoted thorough information technology, environmental, multicultural and human rights education.
Global ethics and principlism.

Science.gov (United States)

Gordon, John-Stewart

2011-09-01

This article examines the special relation between common morality and particular moralities in the four-principles approach and its use for global ethics. It is argued that the special dialectical relation between common morality and particular moralities is the key to bridging the gap between ethical universalism and relativism. The four-principles approach is a good model for a global bioethics by virtue of its ability to mediate successfully between universal demands and cultural diversity. The principle of autonomy (i.e., the idea of individual informed consent), however, does need to be revised so as to make it compatible with alternatives such as family- or community-informed consent. The upshot is that the contribution of the four-principles approach to global ethics lies in the so-called dialectical process and its power to deal with cross-cultural issues against the background of universal demands by joining them together.
Realise : reconstruction of reality from image sequences

NARCIS (Netherlands)

Leymarie, F.; de la Fortelle, A.; Koenderink, Jan J.; Kappers, A. M L; Stavridi, M.; van Ginneken, B.; Muller, S.; Krake, S.; Faugeras, O.; Robert, L.; Gauclin, C.; Laveau, S.; Zeller, C.; Anon,

1996-01-01

REALISE has for principal goals to extract from sequences of images, acquired with a moving camera, information necessary for determining the 3D (CAD-like) structure of a real-life scene together with information about the radiometric signatures of surfaces bounding the extracted 3D objects (e.g.
A priori Considerations When Conducting High-Throughput Amplicon-Based Sequence Analysis

Directory of Open Access Journals (Sweden)

Aditi Sengupta

2016-03-01

Full Text Available Amplicon-based sequencing strategies that include 16S rRNA and functional genes, alongside “meta-omics” analyses of communities of microorganisms, have allowed researchers to pose questions and find answers to “who” is present in the environment and “what” they are doing. Next-generation sequencing approaches that aid microbial ecology studies of agricultural systems are fast gaining popularity among agronomy, crop, soil, and environmental science researchers. Given the rapid development of these high-throughput sequencing techniques, researchers with no prior experience will desire information about the best practices that can be used before actually starting high-throughput amplicon-based sequence analyses. We have outlined items that need to be carefully considered in experimental design, sampling, basic bioinformatics, sequencing of mock communities and negative controls, acquisition of metadata, and in standardization of reaction conditions as per experimental requirements. Not all considerations mentioned here may pertain to a particular study. The overall goal is to inform researchers about considerations that must be taken into account when conducting high-throughput microbial DNA sequencing and sequences analysis.
TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

Science.gov (United States)

Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

2018-04-11

Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions
Infections are a global issue: Infection addresses global issues

NARCIS (Netherlands)

Grobusch, M. P.; Calleri, G.; Bogner, J. R.

2012-01-01

Infections are of unifying global concern, despite regional differences in disease epidemiology, clinical appearance and the instruments to tackle them. The primary aim of Infection is "to be a forum for the presentation and discussion of clinically relevant information on infectious diseasesaEuro
The Global Emergency Observation and Warning System

Science.gov (United States)

Bukley, Angelia P.; Mulqueen, John A.

1994-01-01

Based on an extensive characterization of natural hazards, and an evaluation of their impacts on humanity, a set of functional technical requirements for a global warning and relief system was developed. Since no technological breakthroughs are required to implement a global system capable of performing the functions required to provide sufficient information for prevention, preparedness, warning, and relief from natural disaster effects, a system is proposed which would combine the elements of remote sensing, data processing, information distribution, and communications support on a global scale for disaster mitigation.
The PAZAR database of gene regulatory information coupled to the ORCA toolkit for the study of regulatory sequences

Science.gov (United States)

Portales-Casamar, Elodie; Arenillas, David; Lim, Jonathan; Swanson, Magdalena I.; Jiang, Steven; McCallum, Anthony; Kirov, Stefan; Wasserman, Wyeth W.

2009-01-01

The PAZAR database unites independently created and maintained data collections of transcription factor and regulatory sequence annotation. The flexible PAZAR schema permits the representation of diverse information derived from experiments ranging from biochemical protein–DNA binding to cellular reporter gene assays. Data collections can be made available to the public, or restricted to specific system users. The data ‘boutiques’ within the shopping-mall-inspired system facilitate the analysis of genomics data and the creation of predictive models of gene regulation. Since its initial release, PAZAR has grown in terms of data, features and through the addition of an associated package of software tools called the ORCA toolkit (ORCAtk). ORCAtk allows users to rapidly develop analyses based on the information stored in the PAZAR system. PAZAR is available at http://www.pazar.info. ORCAtk can be accessed through convenient buttons located in the PAZAR pages or via our website at http://www.cisreg.ca/ORCAtk. PMID:18971253
Using sequence similarity networks for visualization of relationships across diverse protein superfamilies.

Directory of Open Access Journals (Sweden)

Holly J Atkinson

Full Text Available The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new protein sequences--requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity--GPCRs and kinases from humans, and the crotonase superfamily of enzymes--we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships.
Using sequence similarity networks for visualization of relationships across diverse protein superfamilies.

Science.gov (United States)

Atkinson, Holly J; Morris, John H; Ferrin, Thomas E; Babbitt, Patricia C

2009-01-01

The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new protein sequences--requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity--GPCRs and kinases from humans, and the crotonase superfamily of enzymes--we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships.
Globalization and its methodological discontents: Contextualizing globalization through the study of HIV/AIDS

Science.gov (United States)

2011-01-01

There remains considerable discontent between globalization scholars about how to conceptualize its meaning and in regards to epistemological and methodological questions concerning how we can come to understand how these processes ultimately operate, intersect and transform our lives. This article argues that to better understand what globalization is and how it affects issues such as global health, we must take a differentiating approach, which focuses on how the multiple processes of globalization are encountered and informed by different social groups and with how these encounters are experienced within particular contexts. The article examines the heuristic properties of qualitative field research as a means to help better understand how the intersections of globalization are manifested within particular locations. To do so, the article focuses on three recent case studies conducted on globalization and HIV/AIDS and explores how these cases can help us to understand the contextual permutations involved within the processes of globalization. PMID:21861895
Globalization and its methodological discontents: Contextualizing globalization through the study of HIV/AIDS

Directory of Open Access Journals (Sweden)

Labonté Ronald

2011-08-01

Full Text Available Abstract There remains considerable discontent between globalization scholars about how to conceptualize its meaning and in regards to epistemological and methodological questions concerning how we can come to understand how these processes ultimately operate, intersect and transform our lives. This article argues that to better understand what globalization is and how it affects issues such as global health, we must take a differentiating approach, which focuses on how the multiple processes of globalization are encountered and informed by different social groups and with how these encounters are experienced within particular contexts. The article examines the heuristic properties of qualitative field research as a means to help better understand how the intersections of globalization are manifested within particular locations. To do so, the article focuses on three recent case studies conducted on globalization and HIV/AIDS and explores how these cases can help us to understand the contextual permutations involved within the processes of globalization.
DNA Polymerases Drive DNA Sequencing-by-Synthesis Technologies: Both Past and Present

Directory of Open Access Journals (Sweden)

Cheng-Yao eChen

2014-06-01

Full Text Available Next-generation sequencing (NGS technologies have revolutionized modern biological and biomedical research. The engines responsible for this innovation are DNA polymerases; they catalyze the biochemical reaction for deriving template sequence information. In fact, DNA polymerase has been a cornerstone of DNA sequencing from the very beginning. E. coli DNA polymerase I proteolytic (Klenow fragment was originally utilized in Sanger's dideoxy chain terminating DNA sequencing chemistry. From these humble beginnings followed an explosion of organism-specific, genome sequence information accessible via public database. Family A/B DNA polymerases from mesophilic/thermophilic bacteria/archaea were modified and tested in today's standard capillary electrophoresis (CE and NGS sequencing platforms. These enzymes were selected for their efficient incorporation of bulky dye-terminator and reversible dye-terminator nucleotides respectively. Third generation, real-time single molecule sequencing platform requires slightly different enzyme properties. Enterobacterial phage ⱷ29 DNA polymerase copies long stretches of DNA and possesses a unique capability to efficiently incorporate terminal phosphate-labeled nucleoside polyphosphates. Furthermore, ⱷ29 enzyme has also been utilized in emerging DNA sequencing technologies including nanopore-, and protein-transistor-based sequencing. DNA polymerase is, and will continue to be, a crucial component of sequencing technologies.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.