progradational regressive highstand: Topics by WorldWideScience.org

Sample records for progradational regressive highstand

Megaquakes, prograde surface waves and urban evolution

Science.gov (United States)

Lomnitz, C.; Castaños, H.

2013-05-01

Cities grow according to evolutionary principles. They move away from soft-ground conditions and avoid vulnerable types of structures. A megaquake generates prograde surface waves that produce unexpected damage in modern buildings. The examples (Figs. 1 and 2) were taken from the 1985 Mexico City and the 2010 Concepción, Chile megaquakes. About 400 structures built under supervision according to modern building codes were destroyed in the Mexican earthquake. All were sited on soft ground. A Rayleigh wave will cause surface particles to move as ellipses in a vertical plane. Building codes assume that this motion will be retrograde as on a homogeneous elastic halfspace, but soft soils are intermediate materials between a solid and a liquid. When Poisson's ratio tends to ν→0.5 the particle motion turns prograde as it would on a homogeneous fluid halfspace. Building codes assume that the tilt of the ground is not in phase with the acceleration but we show that structures on soft ground tilt into the direction of the horizontal ground acceleration. The combined effect of gravity and acceleration may destabilize a structure when it is in resonance with its eigenfrequency. Castaños, H. and C. Lomnitz, 2013. Charles Darwin and the 1835 Chile earthquake. Seismol. Res. Lett., 84, 19-23. Lomnitz, C., 1990. Mexico 1985: the case for gravity waves. Geophys. J. Int., 102, 569-572. Malischewsky, P.G. et al., 2008. The domain of existence of prograde Rayleigh-wave particle motion. Wave Motion 45, 556-564.; Figure 1 1985 Mexico megaquake--overturned 15-story apartment building in Mexico City ; Figure 2 2010 Chile megaquake Overturned 15-story R-C apartment building in Concepción
A Holocene progradation record from Okains Bay, Banks Peninsula, Canterbury, New Zealand

International Nuclear Information System (INIS)

Stephenson, W.; Shulmeister, J.

1999-01-01

Fifty-eight distinct ridges are preserved on the Holocene progradation plain in Okains Bay, Banks Peninsula, Canterbury. Of these, 48 represent beach berm and foredune complexes and the remaining 10 are transverse dune ridges. Periods of rapid coastal progradation are marked by multiple beach berm preservation, whereas intervening periods of lower sediment accumulation result in a stable coastline and transverse dune formation. Infilling of the bay began following sea-level stabilisation in the mid Holocene. The fill is dominantly fine sand, which is derived from sediment carried around Banks Peninsula in the Southland Current and washed into Okains Bay by wave action. Variations in the progradation rate are therefore proxy indicators of coastal erosion in the Canterbury Bight. We demonstrate that there is little progradational fill preserved between c. 6500 and 2000 yr BP. This implies significant changes in sediment delivery to the Southland Current within the last 2000 yr, which we attribute to increased coastal erosion in South Canterbury. We speculate that this increasing erosion resulted from increased wave energy regimes, which in turn may relate to increasing Southern Hemisphere seasonality following the precessional cycle. (author). 32 refs., 4 figs., 3 tabs
Delta progradation in Greenland driven by increasing glacial mass loss

DEFF Research Database (Denmark)

Bendixen, Mette; Iversen, Lars Lonsmann; Bjork, Anders Anker

2017-01-01

imagery. We find that delta progradation was driven by high freshwater runoff from the Greenland Ice Sheet coinciding with periods of open water. Progradation was controlled by the local initial environmental conditions (that is, accumulated air temperatures above 0 degrees C per year, freshwater runoff...... of erosion and accretion along the large deltas of the main rivers in the Arctic5-7. Our results improve the understanding of Arctic coastal evolution in a changing climate, and reveal the impacts on coastal areas of increasing ice mass loss and the associated freshwater runoff and lengthening of open-water...
Integrating millennial and interdecadal shoreline changes: Morpho-sedimentary investigation of two prograded barriers in southeastern Australia

Science.gov (United States)

Oliver, T. S. N.; Tamura, T.; Hudson, J. P.; Woodroffe, C. D.

2017-07-01

Prograded barriers are distinctive coastal landforms preserving the position of past shorelines as low relief, shore-parallel ridges composed of beach sediments and commonly adorned with variable amounts of dune sand. Prograded barriers have been valued as coastal archives which contain palaeoenvironmental information, however integrating the millennial timescale geological history of barriers with observed inter-decadal modern beach processes has proved difficult. Technologies such as airborne LiDAR, ground penetrating radar (GPR) and optically stimulated luminescence dating (OSL) were utilised at Boydtown and Wonboyn, in southeastern Australia, and combined with previously reported radiocarbon dates and offshore seismic and sedimentological data to reconstruct the morpho-sedimentary history of prograded barrier systems. These technologies enabled reconstruction of geological timescale processes integrated with an inter-decadal model of ridge formation explaining the GPR-imaged subsurface character of the barriers. Both the Boydtown and Wonboyn barriers began prograding 7500-8000 years ago when sea level attained at or near present height along this coastline and continued prograding until the present-day with an initially slower rate of shoreline advancement. Sources of sediment for progradation appear to be the inner shelf and shoreface with a large shelf sand body likely contributing to progradation at Wonboyn. The Towamba River seems to have delivered sediment to Twofold Bay during flood events after transitioning to a mature estuarine system sometime after 4000 cal. yr BP. Some of this material appears to have been reworked onto the Boydtown barrier, increasing the rate of progradation in the seaward 50% of the barrier deposited over the past 1500 years. The GPR imaged beachfaces are shown to have similar geometry to beach profiles following recent storm events and a model of ridge formation involving cut and fill of the beachface, and dune building in the
The magnitude of a mid-Holocene sea-level highstand in the Strait of Makassar

NARCIS (Netherlands)

Mann, T.; Rovere, A.; Schöne, T.; Klicpera, A.; Stocchi, P.; Lukman, M.; Westphal, H.

2016-01-01

Knowledge on the timing andmagnitude of past sea-level changes is essential to understandmodern and futuresea-level variability.Holocene sea-level data fromliterature on thewest coast of Sulawesi, central Indonesia, suggestthat this region experienced two relative sea-level highstands over the last
Delta progradation and Neoglaciation, Laguna Parón, Cordillera Blanca, Peru

Science.gov (United States)

Seltzer, Geoffrey O.; Rodbell, Donald T.

2005-10-01

The history of Holocene glaciation serves as an important record of glacier mass balance and, therefore, of climatic change. The moraine record of Holocene glaciation in the tropical Andes, however, is fragmentary and poorly dated. In contrast, increases in the rate of accumulation of inorganic sediment in glacier-fed lakes have been linked to periods of Neoglaciation in many mountain regions. The interpretation of such a record of Neoglaciation from sediment cores in glacier-fed lakes in the tropical Andes can provide the continuity and chronologic control that is lacking in the existing moraine record. Unusual exposures of glacial lacustrine sediment in the Cordillera Blanca, Peru, provide a rare opportunity to assess the link between climatic change, glaciation, and lacustrine sedimentation.Intentional lowering of water levels in Laguna Parón (9°S, 77°44 W, 4200 m a.s.l.) in 1985 resulted in the incision and exposure of at least 20 m of deltaic deposits at the eastern end of the lake. Three deltaic units can be identified: horizontal topset beds, steeply dipping and deformed foreset beds, and horizontally laminated fine-grained sediment. Six radiocarbon ages ranging from 1800 +/- 210 to 465 +/- 95 14C yr BP on wood indicate that the average rate of delta progradation in the late Holocene has been approximately 290 m per 1000 yr. The lake formed during deglaciation at least 10 000 yr ago and if such a rate of progradation of the delta had prevailed over the entire Holocene, then the delta would be at least three times as extensive as it is today. Thus the rate of delta progradation has varied significantly over the Holocene. We suggest that the rate of delta progradation was at least three times greater when glaciers were in advanced positions. These positions are clearly delimited by Neoglacial moraines, which are within 1-2 km of the exposures studied and within 1 km of modern ice limits. The most recent increase in the rate of delta progradation is
Formation of Tidally Induced Bars in Galactic Flybys: Prograde versus Retrograde Encounters

Science.gov (United States)

Łokas, Ewa L.

2018-04-01

Bars in disk galaxies can be formed by interactions with other systems, including those of comparable mass. It has long been established that the effect of such interactions on galaxy morphology depends strongly on the orbital configuration, in particular the orientation of the intrinsic spin of the galactic disk with respect to its orbital angular momentum. Prograde encounters modify the morphology strongly, including the formation of tidally induced bars, while retrograde flybys should have little effect on morphology. Recent works on the subject reached conflicting conclusions, one using the impulse approximation and claiming no dependence on this angle in the properties of tidal bars. To resolve the controversy, we performed self-consistent N-body simulations of hyperbolic encounters between two identical Milky Way-like galaxies assuming different velocities and impact parameters, with one of the galaxies on a prograde and the other on a retrograde orbit. The galaxies were initially composed of an exponential stellar disk and an NFW dark halo, and they were stable against bar formation in isolation for 3 Gyr. We find that strong tidally induced bars form only in galaxies on prograde orbits. For smaller impact parameters and lower relative velocities, the bars are stronger and have lower pattern speeds. Stronger bars undergo extended periods of buckling instability that thicken their vertical structure. The encounters also lead to the formation of two-armed spirals with strength inversely proportional to the strength of the bars. We conclude that proper modeling of prograde and retrograde encounters cannot rely on the simplest impulse approximation.
Sequence stratigraphy in a mixed carbonate-silicilastic depositional system (Middle Miocene; Styrian Basin, Austria)

Science.gov (United States)

Friebe, J. Georg

1993-07-01

The mixed carbonate-siliciclastic Weißenegg (Allo-) Formation records three depositional sequences corresponding approximately to the TB 2.3, TB 2.4 and TB 2.5 global cycles. Sea-level fluctuations were of the order of at least 30 m. Siliciclastic lowstand systems tracts comprise lignite deposits, reworked basement and tidal siltstones (above a tectonically enhanced sequence boundary) as well as coastal sand bars. Coastal sands of the transgressive systems tract contain distinct layers of well cemented nodules. They are interpreted as the first stage in hardground formation and record superimposed minor sea-level fluctuations. Coral patch reefs and rhodolith platforms developed during transgressive phases and were subsequently drowned and/or suffocated by siliciclastics during early highstand. Shallowing upwards siliciclastic parasequences, each terminated by a bank of rhodolith limestone, form the (late) highstand systems tract. The limestone beds record superimposed fourth-order transgressive pulses. Occasionally a carbonate highstand wedge developed. Lowstand carbonate shedding occurred where the top of a platform which suffered incipient drowning during highstand was near sealevel again during the following lowstand. Late highstand delta progradation is common.
MIDDLE TRIASSIC PLATFORM AND BASIN EVOLUTION OF THE SOUTHERN BAKONY MOUNTAINS (TRANSDANUBIAN RANGE, HUNGARY

Directory of Open Access Journals (Sweden)

TAMÁS BUDAI

2006-11-01

Full Text Available Middle Triassic history of the Southern Bakony Mts. is outlined on the base of horizontal and vertical facies changes of the formations. During the Pelsonian (Balatonicus Chron the evolution of the basins and platforms was determined basically by synsedimentary tectonics. The Felsõörs basin of the Balaton Highland opened due to the block-faulting of the Bithynian carbonate ramp (Megyehegy Dolomite. Above the drowning blocks „halfgraben” basins were formed (Felsõörs Formation, while isolated platforms developed on the uplifted ones in the middle part of the Balaton Highland and on the Veszprém plateau (Tagyon Formation. Due to the relative sea-level fall in the early Illyrian, the platforms became subaerially exposed and karstified. As a consequence of the late Illyrian tectonic subsidence (manifested by neptunian dykes the central platform of the Balaton Highland has been drowned (Camunum Subchron. On the contrary, the Anisian platform of the Veszprém plateau was totally flooded only during the latest Illyrian (Reitzi Subchron due to eustatic sea-level rise. It was followed by a short highstand period (Secedensis Chron, characterised by the first progradation of the Budaörs platform on the Veszprém plateau and highstand shedding in the basins and on the submarine high (Vászoly Limestone in the centre of the Balaton Highland basin. Due to the following rapid sea-level rise, carbonate sedimentation continued in eupelagic basin from the Fassanian (Buchenstein Formation. At the beginning of the late Longobardian highstand period (Regoledanus Chron the Budaörs platform intensively prograded from the Veszprém plateau to the southwest, causing highstand shedding in the Balaton Highland basin (Füred Limestone.
Silurian deltaic progradation, Tassili n'Ajjer plateau, south-eastern Algeria: Sedimentology, ichnology and sequence stratigraphy

Science.gov (United States)

Djouder, Hocine; Lüning, Sebastian; Da Silva, Anne-Christine; Abdallah, Hussein; Boulvain, Frédéric

2018-06-01

The economic potential for unconventional shale oil and gas production in the Silurian of the Berkine - Ghadames and Illizi basins (BGI) in south-eastern Algeria has been recently confirmed through exploration drilling. The aim of the present paper attempts a better understanding of the Intra-Tassilian depression within the entire Silurian of the Tassili n'Ajjer plateau. The continuous deposits of the Silurian are exposed at the southern margin of the prolific BGI basins, in the Tassili n'Ajjer plateau, offering the chance to understand the sedimentology, ichnology, and to present a detailed sequence stratigraphy framework for the region. The 410 m-thick clastic Silurian sedimentary strata are subdivided into three formations in the context of sequence stratigraphy, namely: (i) the Oued Imihrou Fm. (Llandoverian) overlain by (ii) the Atafaïtafa Fm. (late Llandoverian to Wenlockian), and (iii) the Oued Tifernine Fm. (late Wenlockian to Pridolian). These can be also distinguished across the entire investigated area and laterally traceable over kilometers. Clear cyclic stacking patterns are identified within the four studied sections showing progressively a general trend of thickening- and coarsening-upward, over a complete 2nd-order megasequence (SIL-1 MS). This transgressive-regressive succession suggests deltaic progradation, shallowing and basin infilling as evidenced by numerous diagnostic sedimentary features and trace fossils, largely from eastern-to western-Tassili plateau. Indeed, the wealth of outcrop data in the Silurian siliciclastic succession enables us to distinct thirteen facies (facies A-M), ranging from shallow-to marginal-marine facies, and in turn, grouped into six facies associations (FA1-FA6). The lowermost part of the succession, which is the most prolific sources of hydrocarbons in North Africa, consists of thick organic-rich graptolite-yielding black 'hot' shales and 'lean' shales with sparse bioturbation with small Thalassinoides belonging
NEXT GENERATION OF TELESCOPES OR DYNAMICS REQUIRED TO DETERMINE IF EXO-MOONS HAVE PROGRADE OR RETROGRADE ORBITS

International Nuclear Information System (INIS)

Lewis, Karen M.; Fujii, Yuka

2014-01-01

We survey the methods proposed in the literature for detecting moons of extrasolar planets in terms of their ability to distinguish between prograde and retrograde moon orbits, an important tracer of the moon formation channel. We find that most moon detection methods, in particular, sensitive methods for detecting moons of transiting planets, cannot observationally distinguishing prograde and retrograde moon orbits. The prograde and retrograde cases can only be distinguished where the dynamical evolution of the orbit due to, e.g., three body effects is detectable, where one of the two cases is dynamically unstable, or where new observational facilities, which can implement a technique capable of differentiating the two cases, come online. In particular, directly imaged planets are promising targets because repeated spectral and photometric measurements, which are required to determine moon orbit direction, could also be conducted with the primary interest of characterizing the planet itself
Timing and magnitude of the Caribbean mid-Holocene highstand

Science.gov (United States)

Ashe, E.; Khan, N.; Horton, B.; Brocard, G. Y.; Dutton, A.; Engelhart, S. E.; Kopp, R. E.; Hill, D. F.; Peltier, W. R.; Scatena, F. N.

2015-12-01

We present a database of published and new relative sea-level (RSL) data for the past 13 ka, which constrains the Holocene sea-level histories of the Caribbean coast of Central and South America (Florida Keys, USA to Guyana) and the Bahamas and Greater and Lesser Antilles islands. Our evaluation of mangrove peat and Acropora palmata sea-level indicators from geological investigations provides 503 sea-level index points and 242 limiting dates. We subdivide the database into 21 regions based on the availability of data, tectonic setting, and distance from the former Laurentide ice sheet. Most index points (75%) and limiting dates (90%) are <8 ka, although there is an unusual temporal distribution with the greatest amount of the data (~28%) occurring between 6-8 ka. We reassess and screen radiocarbon and U/Th ages of mangrove peat and coral data. We use the stratigraphic position (overburden thickness) of index points account for sediment compaction, and use the paleotidal model of Hill et al. (2011) to account for Holocene changes in paleotidal range. A noisy-input Gaussian process regression model calculates that the rates of RSL change were highest during the early Holocene (3-8 mm/yr) and have decreased over time (< 2 mm/yr), which is related to the reduction of ice equivalent meltwater input and collapse of the proglacial forebulge during the Holocene. The sea-level reconstructions demonstrate that RSL did not exceed the present height (0 m) during the Holocene in the majority of locations, with the exception of a small highstand (<2 m) on the northern coast of South America along the Orinoco Delta and Suriname/Guyana located furthest away from the former Laurentide Ice Sheet. The different sea-level histories are an ongoing isostatic response to deglaciation of the Laurentide Ice Sheet and suggest subsidence resulting from collapse of the proglacial forebulge reaches further south than previously considered.
Morphodynamics of prograding beaches: A synthesis of seasonal- to century-scale observations of the Columbia River littoral cell

Science.gov (United States)

Ruggiero, Peter; Kaminsky, George; Gelfenbaum, Guy R.; Cohn, Nicholas

2016-01-01

Findings from nearly two decades of research focused on the Columbia River littoral cell (CRLC), a set of rapidly prograding coastal barriers and strand-plains in the U.S. Pacific Northwest, are synthesized to investigate the morphodynamics associated with prograding beaches. Due to a large sediment supply from the Columbia River, the CRLC is the only extensive stretch of shoreline on the U.S. west coast to have advanced significantly seaward during the late Holocene. Since the last Cascadia Subduction Zone (CSZ) earthquake in 1700, with associated co-seismic subsidence and tsunami, much of the CRLC has prograded hundreds of meters. However, the rates of progradation, and the processes most responsible for sediment accumulation, vary depending on time scale and the morphological unit in question. Remarkably, the 20th and early 21st century shoreline change rates were more than double the late prehistoric rates that include recovery from the last major CSZ event, most likely due to an increase in sediment supply resulting from inlet jetty construction. In some locations detailed beach morphology monitoring reveals that at interannual- to decadal-scale the upper shoreface aggraded about 2 cm/yr, subtidal sandbars migrated offshore and decayed while intertidal bars migrated onshore and welded to the shoreline, the shoreline prograded about 4 m/yr, and 1 to 2 new foredune ridges were generated. A detailed meso-scale sediment budget analysis in one location within the littoral cell shows that approximately 100 m3/m/yr accumulated between − 12 m (seaward limit of data) and + 9 m (crest of landward-most foredune). Gradients in alongshore sediment transport, net onshore-directed cross-shore sediment transport within the surf zone, and cross-shore feeding from a shoreface out of equilibrium with forcing conditions are each partially responsible for the significant rates of sediment supplied to the beaches and dunes of the CRLC during the observational period. Direct
EVIDENCE FOR LADINIAN (MIDDLE TRIASSIC PLATFORM PROGRADATION IN THE GYULAKESZI AREA, TAPOLCA BASIN, WESTERN HUNGARY: MICROFACIES ANALYSIS AND BIOSTRATIGRAPHY

Directory of Open Access Journals (Sweden)

ZSOLT RÓBERT NAGY

2014-07-01

Full Text Available A shallowing-upward carbonate sequence was studied from the outcrop at Gyulakeszi, Tapolca Basin (western Hungary, and it is interpreted as a Middle Triassic (Curionii or younger platform progradation. Two lithostratigraphic units are distinguished. Microfacies analysis and micropaleontological investigation conducted on the red nodular, cherty limestone (Vászoly and Buchenstein formations suggest that the lower unit was deposited during the Reitzi and the Secedensis ammonoid zones. The overlying white platform limestone (upper unit is typical of a prograding platform and includes gravity-driven deposits at the base followed by periplatform facies deposited in shallow marine warm waters around the fair-weather wave base. The section at Gyulakeszi was unaffected by fabric-destructive dolomitization, which is uncharacteristic of similar platform facies in the Balaton Highland. Isopachous and radiaxial fibrous calcite cement found in the grainstone and boundstone facies are indicative of early lithification and diagenesis in the marine phreatic zone. “Evinospongiae”-type cement is described for the first time from the Balaton Highland and it is similar to the outer platform cements published previously from the Alps (Italy and Austria. The progradation could have advanced over the pelagic limestones as early as the Curionii zone, which is an undocumented event in the Veszprém Plateau. Similar event, however, is well known from the Western Dolomites, where aggradation was followed by intense progradation during the Gredleri and Archelaus ammonoid zones. The length of this progradation event at Gyulakeszi, however, is ambiguous since proven Ladinian (Longobardian rocks are not exposed in the study area and were not penetrated by boreholes in the Tapolca Basin.
Un littoral sableux en progradation : le lido entre Leucate et Port-la-Nouvelle (Aude, Golfe du Lion, France

Directory of Open Access Journals (Sweden)

Jean-Pierre Larue

2009-11-01

Full Text Available L'étude multichronique de photographies aériennes révèle que le lido entre Leucate et Port-la-Nouvelle (Aude a progradé d'environ 15 % en largeur, entre 1952 et 2008. L'analyse sédimentologique permet de montrer que cette progradation exceptionnelle en période d'élévation du niveau marin est due à la présence de barres pré-littorales volumineuses et bien alimentées par la dérive littorale et le transport éolien effectué par les vents de terre. Cependant, du fait de la montée actuelle du niveau marin (2,5 à 3 mm/an et malgré la poursuite de l'accrétion, le lido subit des inondations de plus en plus fréquentes entre le cordon actuel et l'ancien cordon romain.A kinematic study of vertical aerial photos taken between 1952 and 2008 reveals that the Leucate-Port-la-Nouvelle lido (Aude has prograded of about 15 % in width. A sedimentological analysis allows us to explain this accretion caused by drift and wind which supply abundant nearshore bars. In spite of this progradation, frequent floodings, favoured by sea level rise (2.5 to 3 mm-1.year, occur between the present coastal bar and the Roman barrier.
The Jurassic of Denmark and Greenland: An offshore transgressive–regressive mudstone-dominated succession from the Sinemurian of Skåne, Sweden

Directory of Open Access Journals (Sweden)

Surlyk, Finn

2003-10-01

Full Text Available A Sinemurian mudstone-dominated succession was exposed until recently in the Gantofta quarry in Skåne, southern Sweden. The deposits are placed in the Döshult and Pankarp Members of the Sinemurian–Aalenian Rya Formation. Similar facies of the same age are widespread in the Danish Basin where they constitute the F-Ib unit (F-I member of the Fjerritslev Formation. The Gantofta succession thus represents the easternmost extension of the environment characteristic of the Fjerritslev Formation and is essentially the only locality where it has been possible tostudy the facies of this formation in outcrop. Sedimentation seems to have taken place under relatively quiet tectonic conditions except for the possible fault-control of the basin margin. Thelower part of the Gantofta section is of Early and early Late Sinemurian age. It represents the upper part of the Döshult Member and consists of muddy, lower shoreface sandstones, abruptlyoverlain by dark, bioturbated, fossiliferous mudstones with thin storm siltstones and sandstones. They are overlain by the Upper Sinemurian Pankarp Member which comprises red-brown, restricted marine calcareous mudstones with an upwards increasing number of storm siltstones and sandstones reflecting general shallowing and shoreline progradation.The succession spans the greater part of two simple sequences with a distal sequence boundary located at the boundary between the Döshult Member and the Pankarp Member. The exposed part of the lower sequence includes a thick transgressive systems tract and a very thin highstand systems tract. The upper sequence is represented by an undifferentiated transgressive and highstand systems tract. An Early Sinemurian sea-level rise, a late Early Sinemurian highstand, an early Late Sinemurian fall and a Late Sinemurian minor rise and a major fall are recognised. Nearby boreholes show evidence for an end-Sinemurian – Early Pliensbachian major rise. This evolution corresponds well with
Retrograde versus Prograde Models of Accreting Black Holes

Directory of Open Access Journals (Sweden)

David Garofalo

2013-01-01

Full Text Available There is a general consensus that magnetic fields, accretion disks, and rotating black holes are instrumental in the generation of the most powerful sources of energy in the known universe. Nonetheless, because magnetized accretion onto rotating black holes involves both the complications of nonlinear magnetohydrodynamics that currently cannot fully be treated numerically, and uncertainties about the origin of magnetic fields that at present are part of the input, the space of possible solutions remains less constrained. Consequently, the literature still bears witness to the proliferation of rather different black hole engine models. But the accumulated wealth of observational data is now sufficient to meaningfully distinguish between them. It is in this light that this critical paper compares the recent retrograde framework with standard “spin paradigm” prograde models.
Controls on facies and sequence stratigraphy of an upper Miocene carbonate ramp and platform, Melilla basin, NE Morocco

Science.gov (United States)

Cunningham, K.J.; Collins, Luke S.

2002-01-01

Upwelling of cool seawater, paleoceanographic circulation, paleoclimate, local tectonics and relative sea-level change controlled the lithofacies and sequence stratigraphy of a carbonate ramp and overlying platform that are part of a temporally well constrained carbonate complex in the Melilla basin, northeastern Morocco. At Melilla, from oldest to youngest, a third-order depositional sequence within the carbonate complex contains (1) a retrogradational, transgressive, warm temperate-type rhodalgal ramp; (2) an early highstand, progradational, bioclastic platform composed mainly of a temperate-type, bivalve-rich molechfor facies; and (3) late highstand, progradational to downstepping, subtropical/tropical-type chlorozoan fringing Porites reefs. The change from rhodalgal ramp to molechfor platform occurred at 7.0??0.14 Ma near the Tortonian/Messinian boundary. During a late stage in the development of the bioclastic platform a transition from temperate-type molechfor facies to subtropical/tropical-type chlorozoan facies occurred and is bracketed by chron 3An.2n (??? 6.3-6.6 Ma). Comparison to a well-dated carbonate complex in southeastern Spain at Cabo de Gata suggests that upwelling of cool seawater influenced production of temperate-type limestone within the ramp and platform at Melilla during postulated late Tortonian-early Messinian subtropical/tropical paleoclimatic conditions in the western Paleo-Mediterranean region. The upwelling of cool seawater across the bioclastic platform at Melilla could be related to the beginning of 'siphoning' of deep, cold Atlantic waters into the Paleo-Mediterranean Sea at 7.17 Ma. The facies change within the bioclastic platform from molechfor to chlorozoan facies may be coincident with a reduction of the siphoning of Atlantic waters and the end of upwelling at Melilla during chron 3An.2n. The ramp contains one retrogradational parasequence and the bioclastic platform three progradational parasequences. Minor erosional surfaces
Highstand shelf fans: The role of buoyancy reversal in the deposition of a new type of shelf sand body

Science.gov (United States)

Steel, Elisabeth; Simms, Alexander R.; Warrick, Jonathan; Yokoyama, Yusuke

2016-01-01

Although sea-level highstands are typically associated with sediment-starved continental shelves, high sea level does not hinder major river floods. Turbidity currents generated by plunging of sediment-laden rivers at the fluvial-marine interface, known as hyperpycnal flows, allow for cross-shelf transport of suspended sand beyond the coastline. Hyperpycnal flows in southern California have deposited six subaqueous fans on the shelf of the northern Santa Barbara Channel in the Holocene. Using eight cores and nine grab samples, we describe the deposits, age, and stratigraphic architecture of two fans in the Santa Barbara Channel. Fan lobes have up to 3 m of relief and are composed of multiple hyperpycnite beds ∼5 cm to 40 cm thick. Deposit architecture and geometry suggest the hyperpycnal flows became positively buoyant and lifted off the seabed, resulting in well-sorted, structureless, elongate sand lobes. Contrary to conventional sequence stratigraphic models, the presence of these features on the continental shelf suggests that active-margin shelves may locally develop high-quality reservoir sand bodies during sea-level highstands, and that such shelves need not be solely the site of sediment bypass. These deposits may provide a Quaternary analogue to many well-sorted sand bodies in the rock record that are interpreted as turbidites but lack typical Bouma-type features.
Early to Middle Holocene sea level fluctuation, coastal progradation and the Neolithic occupation in the Yaojiang Valley of southern Hangzhou Bay, Eastern China

Science.gov (United States)

Liu, Yan; Sun, Qianli; Fan, Daidu; Dai, Bin; Ma, Fuwei; Xu, Lichen; Chen, Jing; Chen, Zhongyuan

2018-06-01

The Yaojiang Valley (YJV) of southern Hangzhou Bay was the birthplace of the well-known Hemudu Culture (HC), one of the representatives of Neolithic civilization in eastern China. To explore the magnitude of natural environmental effects on the HC trajectory, the palaeo-embayment setting of the YJV was studied in detail for the first time in terms of 3D Holocene strata supported by a series of new radiocarbon-dated cores. The results indicated that the local relative sea level rose rapidly during the Early Holocene in the YJV, reached its maximum flooding surface ca. 7900 cal yr BP, and then remained stable ca. 7900-7600 cal yr BP. Thereupon, an estuary stretching inland was first formed by marine transgression, and then, it was transformed to an alluvial-coastal plain by regressive progradation. The alluvial plain was initiated in the foothills and then spread towards the valley centre after sea level stabilization ca. 7600 cal yr BP. Accompanying these natural environmental changes, the earliest arrivals of foragers in the valley occurred no later than ca. 7000 cal yr BP. They engaged in rice farming and fostered the HC for approximately two millennia from ca. 7000-5000 cal yr BP as more lands developed from coastal progradation. The rise and development of the HC are closely associated with the sea level-induced landscape changes in the YJV in the Early-Middle Holocene, but the enigmatic exodus of the HC people after ca. 5000 cal yr BP is still contentious and possibly linked with the rapid waterlogging and deterioration of this setting in such a low-lying coastal plain as well as with associated social reasons.

Prograding coastal facies associations in the Vryheid formation (Permian) at Effingham quarries near Durban, South Africa

Science.gov (United States)

Tavener-Smith, R.

1982-05-01

This paper describes and interprets a flat-lying, sandstone—siltstone sequence 70 m thick in three disused quarries. The beds comprise the lowest part of the Vryheid Formation (middle Ecca) in the Durban vicinity. The sequence is conveniently divisible into two parts: the Lower Division constitutes a prograding beach barrier association, while the upper one represents a back barrier lagoonal complex. Fourteen sedimentary facies are described and interpreted to represent a range of depositional environments including open water shelf silts, sandy shoreface and littoral deposits, organic-rich muds and peats of lagoonal origin, a tidal inlet, washover fans and a fluvial channel sand. Among the conclusions reached are that the local middle Ecca coastline extended in a northwest to southeast direction and that progradation was towards the southwest; that the coastline was microtidal and that stormy conditions were common with prevalent palaeowinds from the northwest. The absence of invertebrate body fossils in these strata is attributed to penecontemporaneous solution of shelly remains. This is the first time that a coastal sequence has been identified on the southeast margin of the Main Karoo Basin of South Africa
Ground Penetrating Radar (GPR) facies delineated shallow sedimentary records along a recently prograding coastal barrier adjoining the Bay of Bengal: Paradeep, Odisha, India

Science.gov (United States)

Layek, M. K.; Sengupta, P.; Mukherjee, A.

2017-12-01

Sea-level fluctuations, triggered by progradation of beach or marine regression, can be of various time-scales. The fluctuating history of a shoreline along a coastal barrier can be identified from the sedimentary features of accretion or erosion. The necessity of the understanding of the complex barrier dynamics and subsurface along the Paradeep coast (in the state of Odisha, India), adjacent to the Bay of Bengal, has been growing since the number of the harbor industrial projects and the inhabitants of this major port city of India increases. In this study area, high resolution ground penetrating radar (GPR) survey and its interpretation by GPR facies analysis, which considers the pattern/set of reflected electromagnetic signals, has proved to be a useful method for shallow-subsurface (up to 8 m) imaging. In order to perform this task, a GPR system with 200 MHZ antenna was employed to survey along (17 profiles) and across (21 profiles) the microtidal coastal barrier of Paradeep. The shapes and sizes of the accretional and erosional features like beach-ridge deposits, washover deposits, channel-and-fill, and scour-and-fill are delineated on the radargram after processing by Radan7® software. The internal geometry of the beach ridge is mapped accurately after the radar facies analysis which suggests the longshore drift of sediments from the nearby river mouths of Mahanadi, Devi and their tributaries. This GPR facies analysis revealed the existence of two types of palaeo-tidal channels of the study area - (a) larger channels which are perpendicular to the shoreline having channel width of about 400 m with maximum depth of 4.5 m from the surface and (b) smaller channels (width up to 60 m) which flow parallel to the shoreline. In case of Paradeep coastal barrier, seaward-dipping beach progradational facies is positioned within oblique erosional surfaces (13°-36°) below the horizontal erosional surface or facies boundary. This lead to delineate the cycles of erosion
Climatic-eustatic control of Holocene nearshore parasequence development, southeastern Texas coast

Science.gov (United States)

Morton, Robert A.; Kindinger, Jack G.; Flocks, James G.; Stewart, Laura B.

1999-01-01

Sediment cores, seismic profiles, radiocarbon dates, and faunal assemblages were used to interpret the depositional setting and geological evolution of the southeastern Texas coast during the last glacio-eustatic cycle. Discrete lithofacies and biofacies zones in the ebb-dominated Sabine Lake estuary and adjacent chenier plain record alternating periods of rapid marine flooding and gradual shoaling related to linked climatic/eustatic fluctuations. Monospecific zones of the mollusks Rangia cuneata and Crassostrea virginica, respectively, indicate high fresh water outflow followed by invasion of marine water, whereas intervening organic-rich zones record bayhead delta deposition. High-frequency parasequence stacking patterns within the valley fill and across the adjacent interfluve reflect an initial rapid rise in sea level about 9 ka that flooded abandoned alluvial terraces and caused onlap of Holocene marsh in the incised valley. The rapid rise was followed by slowly rising and oscillating sea level that filled the deepest portions of the incised valleys with fluvially dominated estuarine deposits, and then a maximum highstand (+1 m msl) about 5 ka that flooded the former subaerial coastal plain between the incised valleys and constructed the highest beach ridges. Between 3.5 and 1.5 ka, sea level oscillated and gradually fell, causing a forced regression and rapid progradation of both the chenier plain and accretionary barrier islands. The only significant sands in the valley fill are (1) falling-stage and lowstand-fluvial sediments between the basal sequence boundary and transgressive surface unconformity, and (2) highstand beach-ridge sediments of the chenier plain.
Factors controlling late Cenozoic continental margin growth from the Ebro Delta to the western Mediterranean deep sea

Science.gov (United States)

Nelson, C.H.; Maldonado, A.

1990-01-01

The Ebro continental margin sedimentation system originated with a Messinian fluvial system. This system eroded both a major subaerial canyon cutting the margin southeastward from the present Ebro Delta and an axial valley that drained northeastward down Valencia Trough. Post-Messinian submergence of this topography and the Pliocene regime of high sea levels resulted in a marine hemipelagic drape over the margin. Late Pliocene to Pleistocene glacial climatic cycles, drainagebasin deforestation, and sea-level lowstands combined to increase sediment supply, cause the margin to prograde, and create a regime of lowstand sediment-gravity flows in the deeper margin. The depositional patterns of regressive, transgressive and highstand sea-level regimes suggest that location of the sediment source near the present Ebro Delta throughout the late Cenozoic, southward current advection of sediment, and greater subsidence in the southern margin combined to cause generally asymmetric progradation of the margin to the southeast. Thicker, less stable deposits filling the Messinian subaerial canyon underwent multiple retrograde failures, eroded wide gullied canyons and formed unchanneled base-of-slope sediment aprons in the central margin area; other margin areas to the north and south developed a series of channel-levee complexes. On the basin floor, the formation of Valencia Valley over the Messinian subaerial valley and earlier faults led to draining of about 20% of the Ebro Pleistocene sediment from channel-levee complexes through the valley to prograde Valencia Fan as much as 500 km northeast of the margin. Thus, the Ebro margin has two growth directions, mainly southeastward during higher sea levels, and eastward to northeastward during lower sea levels. The northeastward draining of turbidity currents has produced unusually thin and widely dispersed turbidite systems compared to those on ponded basin floors. During the past few centuries, man's impact has exceeded natural
Asymmetric Effects of Subaerial and Subaqueous Basement Slopes on Self-Similar Morphology of Prograding Deltas

Science.gov (United States)

Lai, Steven Yueh Jen; Hsiao, Yung-Tai; Wu, Fu-Chun

2017-12-01

Deltas form over basements of various slope configurations. While the morphodynamics of prograding deltas over single-slope basements have been studied previously, our understanding of delta progradation over segmented basements is still limited. Here we use experimental and analytical approaches to investigate the deltaic morphologies developing over two-slope basements with unequal subaerial and subaqueous slopes. For each case considered, the scaled profiles of the evolving delta collapse to a single profile for constant water and sediment influxes, allowing us to use the analytical self-similar profiles to investigate the individual effects of subaerial/subaqueous slopes. Individually varying the subaerial/subaqueous slopes exerts asymmetric effects on the morphologies. Increasing the subaerial slope advances the entire delta; increasing the subaqueous slope advances the upstream boundary of the topset yet causes the downstream boundary to retreat. The delta front exhibits a first-retreat-then-advance migrating trend with increasing subaqueous slope. A decrease in subaerial topset length is always accompanied by an increase in subaqueous volume fraction, no matter which segment is steepened. Applications are presented for estimating shoreline retreat caused by steepening of basement slopes, and estimating subaqueous volume and delta front using the observed topset length. The results may have implications for real-world delta systems subjected to upstream tectonic uplift and/or downstream subsidence. Both scenarios would exhibit reduced topset lengths, which are indicative of the accompanied increases in subaqueous volume and signal tectonic uplift and/or subsidence that are at play. We highlight herein the importance of geometric controls on partitioning of sediment between subaerial and subaqueous delta components.
Widespread Lake Highstands in the Southernmost Andean Altiplano during Heinrich Event 1: Implications for the South American Summer Monsoon

Science.gov (United States)

Chen, C. Y.; McGee, D.; Quade, J.

2014-12-01

Speleothem-based oxygen isotope records provide strong evidence of anti-phased behavior of the northern and southern hemisphere summer monsoons during Heinrich events, but we lack rigorous constraints on the amount of wetting or drying occurring in monsoon regions. Studies centered on shoreline deposits of closed-basin lakes are well suited for establishing such quantitative controls on water balance changes by providing unequivocal evidence of lake volume variations. Here we present new dating constraints on the highstands of several high-altitude (3800-4350 m) paleolakes in the southern Andean Altiplano, an outlying arid region of the Atacama Desert stretching across the Chilean-Bolivian-Argentinian border east of the Andes (20-25°S). These lakes once occupied the closed basins where only phreatic playas, dry salars, and shallow ponds exist today. Initial U-Th dating of massive shoreline tufas reveals that these deposits are dateable to within ±150 to 300 yrs due to high U concentrations and low initial Th content (as indicated by high 230Th/232Th). Our U-Th and 14C dates show that lake highstands predominantly occur between 18.5 and 14.5 kyrs BP, coinciding with Heinrich Event 1 (HE1) and the expansion of other nearby lakes, such as Lake Titicaca. Because of their (1) location at the modern-day southwestern edge of the summer monsoon, (2) intact shoreline preservation, and (3) precise age control, these lakes may uniquely enable us to reconstruct the evolution of water balance (P-E) changes associated with HE1. Hydrologic modeling constrained by temperature estimates provided by local glacial records is used to provide bounds for past precipitation changes. We also examine North Atlantic cooling as the mechanism for these changes by comparing a compilation of S. American lake level records with various hosing experiments and transient climate simulations at HE1. Our results lend us confidence in expanding our U-Th work to other shoreline tufas in the
Sedimentary architecture and chronostratigraphy of a late Quaternary incised-valley fill: A case study of the late Middle and Late Pleistocene Rhine system in the Netherlands

Science.gov (United States)

Peeters, J.; Busschers, F. S.; Stouthamer, E.; Bosch, J. H. A.; Van den Berg, M. W.; Wallinga, J.; Versendaal, A. J.; Bunnik, F. P. M.; Middelkoop, H.

2016-01-01

This paper describes the sedimentary architecture, chronostratigraphy and palaeogeography of the late Middle and Late Pleistocene (Marine Isotope Stage/MIS 6-2) incised Rhine-valley fill in the central Netherlands based on six geological transects, luminescence dating, biostratigraphical data and a 3D geological model. The incised-valley fill consists of a ca. 50 m thick and 10-20 km wide sand-dominated succession and includes a well-developed sequence dating from the Last Interglacial: known as the Eemian in northwest Europe. The lower part of the valley fill contains coarse-grained fluvio-glacial and fluvial Rhine sediments that were deposited under Late Saalian (MIS 6) cold-climatic periglacial conditions and during the transition into the warm Eemian interglacial (MIS 5e-d). This unit is overlain by fine-grained fresh-water flood-basin deposits, which are transgressed by a fine-grained estuarine unit that formed during marine high-stand. This ca. 10 m thick sequence reflects gradual drowning of the Eemian interglacial fluvial Rhine system and transformation into an estuary due to relative sea-level rise. The chronological data suggests a delay in timing of regional Eemian interglacial transgression and sea-level high-stand of several thousand years, when compared to eustatic sea-level. As a result of this glacio-isostatic controlled delay, formation of the interglacial lower deltaic system took only place for a relative short period of time: progradation was therefore limited. During the cooler Weichselian Early Glacial period (MIS 5d-a) deposition of deltaic sediments continued and extensive westward progradation of the Rhine system occurred. Major parts of the Eemian and Weichselian Early Glacial deposits were eroded and buried as a result of sea-level lowering and climate cooling during the early Middle Weichselian (MIS 4-3). Near complete sedimentary preservation occurred along the margins of the incised valley allowing the detailed reconstruction presented
Storm-related sedimentation influenced by coastal configuration in the stratigraphic record of a tectonically active shelf (Upper Pleistocene Le Castella terrace, Italy)

Science.gov (United States)

Nalin, Ronald; Massari, Francesco

2018-03-01

Analysis of patterns of coastal circulation and sediment dispersal is an essential step for the study of controlling factors influencing the long-term dynamics of coastal systems. Modern settings offer the possibility to monitor relevant parameters over relatively short time spans. However, geological examples complement this perspective by providing a time-averaged record where longer trends and stratigraphically significant processes can be evaluated. This study investigates the shallow marine deposits of Le Castella terrace (Upper Pleistocene, southern Italy) to document how patterns of circulation influenced by coastline configuration can affect the preserved millennial-scale depositional record of a progradational shoreline system. The regressive portion of the Le Castella terrace deposits, developed during a relative sea-level highstand and falling stage, consists of a progradational wedge mainly composed of redistributed skeletal particles of a coeval shallow water carbonate factory. Preservation of the morphology of the paleocoastline and abundant current-related sedimentary structures allow reconstruction of the predominant sediment dispersal dynamics responsible for the formation of this sedimentary wedge. Facies and paleocurrent analysis indicate offshore and alongshore sediment transport modes, consistent with coastal circulation driven by storms normally incident to the shoreline and a sharp change in coastline orientation. This coastal inflection influenced circulation patterns causing flow separation and eddy formation in the lee of the curved coastline. Syndepositional tectonic deformation also affected the architecture of the preserved deposits, controlling the nucleation and development of a clinostratified body and determining localized lateral stratigraphic variability. This study illustrates how transient but recurrent circulation patterns associated with changes in coastal orientation and related to high-energy storm events can leave a
Late Pleistocene sequence architecture on the geostrophic current-dominated southwest margin of the Ulleung Basin, East Sea

Science.gov (United States)

Choi, Dong-Lim; Shin, Dong-Hyeok; Kum, Byung-Cheol; Jang, Seok; Cho, Jin-Hyung; Jou, Hyeong-Tae; Jang, Nam-Do

2018-06-01

High-resolution multichannel seismic data were collected to identify depositional sequences on the southwestern shelf of the Ulleung Basin, where a unidirectional ocean current is dominant at water depths exceeding 130 m. Four aggradational stratigraphic sequences with a 100,000-year cycle were recognized since marine isotope stage (MIS) 10. These sequences consist only of lowstand systems tracts (LSTs) and falling-stage systems tracts (FSSTs). Prograding wedge-shaped deposits are present in the LSTs near the shelf break. Oblique progradational clinoforms of forced regressive deposits are present in the FSSTs on the outer continental shelf. Each FSST has non-uniform forced regressional stratal geometries, reflecting that the origins of sediments in each depositional sequence changed when sea level was falling. Slump deposits are characteristically developed in the upper layer of the FSSTs, and this was used as evidence to distinguish the sequence boundaries. The subsidence rates around the shelf break reached as much as 0.6 mm/year since MIS 10, which contributed to the well-preserved depositional sequence. During the Quaternary sea-level change, the water depth in the Korea Strait declined and the intensity of the Tsushima Current flowing near the bottom of the inner continental shelf increased. This resulted in greater erosion of sediments that were delivered to the outer continental shelf, which was the main cause of sediment deposition on the deep, low-angled outer shelf. Therefore, a depositional sequence formation model that consists of only FSSTs and LSTs, excluding highstand systems tracts (HSTs) and transgressive systems tracts (TSTs), best explains the depositional sequence beneath this shelf margin dominated by a geostrophic current.
Sequence stratigraphic interpretation of parts of Anambra Basin, Nigeria using geophysical well logs and biostratigraphic data

Science.gov (United States)

Anakwuba, E. K.; Ajaegwu, N. E.; Ejeke, C. F.; Onyekwelu, C. U.; Chinwuko, A. I.

2018-03-01

The Anambra basin constitutes the southeastern lower portion of the Benue Trough, which is a large structural depression that is divided into lower, middle and upper parts; and is one of the least studied inland sedimentary basins in Nigeria. Sequence stratigraphic interpretation had been carried out in parts of the Anambra Basin using data from three wells (Alo-1 Igbariam-1 and Ajire-1). Geophysical well logs and biostratigraphic data were integrated in order to identify key bounding surfaces, subdivide the sediment packages, correlate sand continuity and interpret the environment of deposition in the fields. Biostratigraphic interpretation, using foraminifera and plankton population and diversity, reveals five maximum flooding surfaces (MFS) in the fields. Five sequence boundaries (SB) were also identified using the well log analysis. Four 3rd order genetic sequences bounded by maximum flooding surfaces (MFS-1 to MFS-6) were identified in the areas; four complete sequences and one incomplete sequence were identified in both Alo-1 and Igbariam-1 wells while Ajire-1 has an no complete sequence. The identified system tracts delineated comprises Lowstand Systems Tracts (progradational to aggradational to retrogradational packages), Transgressive Systems Tracts (retrogradational packages) and Highstand Systems Tracts (aggradational to progradational packages) in each well. The sand continuity across the fields reveal sands S1 to S5 where S1 is present in Ajire-1 well and Igbariam-1 well but not in Alo-1 well. The sands S4 to S5 run across the three fields at different depths. The formations penetrated by the wells starting from the base are; Nkporo Formation (Campanian), Mamu Formation (Late Campanian to Early Maastrichtian), Ajali Sandstone (Maastrichtian), Nsukka Formation (Late Maastrichtian to Early Palaeocene), Imo Formation (Palaeocene) and Nanka Sand (Eocene). The environments of deposition revealed are from coastal to bathyal. The sands of lowstand system
Records of Coastal Change within a Progradational, Wave-Dominated Barrier Island: Morphostratigraphic Framework of the Southern Recurved Spit of Assateague Island, VA

Science.gov (United States)

Shawler, J. L.; Seminack, C.; DeMarco, K. R.; Hein, C. J.; Petruny, L. M.

2017-12-01

Although generally retrogradational in nature, barrier islands commonly contain progradational segments which may preserve records of past coastal dynamics and environmental changes which affected their formation. In particular, recurved-spit ridges may record former shoreline positions on the surface, while in their stratigraphic architecture contain evidence of the processes influencing spit growth. This study uses topographic mapping and nearly 40 km of ground-penetrating radar (GPR) transects to investigate the pre-historic (ca. 1000-1850 C.E.) and historic elongation of Assateague Island, VA (USA) and affiliated progradation of Chincoteague Island. These data uncovered three previously unknown former tidal inlets which have no discernible surface signatures. GPR data further reveal southerly migration (up to 95 m) and closure of these tidal inlets. In addition, GPR data indicates the apparent overprinting of multiple inlets, suggesting later reoccupation of former channels. Seaward-dipping clinoforms (5-15°) indicate that, following inlet closure, the island widened and elongated through beach-ridge growth, proceeded by the development of aeolian foredune ridges. In particular, two large (5 m elevation, 150 m wide) ridges, bounded by smaller (1-3 m elevation, 20-50 m wide) ridge sets, comprise the relict recurved-spit of Assateague Island. This contrasts with the adjacent beach-ridge plain of Chincoteague Island, where surface morphology is characterized by more spatially uniform ridges (1-2 m high, 50-100 m wide). Thus, despite sharing similar internal structure as imaged in GPR, the formational processes associated with these two systems differ: the large, widely-spaced ridges of Assateague are likely indicative of punctuated progradation possibly associated with sediment pulses or complex inlet dynamics, whereas Chincoteague Island may have been built in a semi-protected environment through sediment delivered by inlet bypassing and local longshore
Anatomy of extremely thin marine sequences landward of a passive-margin hinge zone: Neogene Calvert Cliffs succession, Maryland, U.S.A.

Energy Technology Data Exchange (ETDEWEB)

Kidwell, S.M. [Univ. of Chicago, IL (United States). Dept. of Geophysical Sciences

1997-03-01

Detailed examination of Neogene strata in cliffs 25--35 m high along the western shore of Chesapeake Bay, Maryland, reveals the complexity of the surviving record of siliciclastic sequences {approximately}150 km inland of the structural hinge zone of the Atlantic passive margin. Previous study of the lower to middle Miocene Calvert (Plum Point Member) and Choptank Formations documented a series of third-order sequences 7--10 m thick in which lowstand deposits are entirely lacking, transgressive tracts comprise a mosaic of condensed bioclastic facies, and regressive (highstand) tracts are present but partially truncated by the next sequence boundary; smaller-scale (fourth-order) cyclic units could not be resolved. Together, these sequences constitute the transgressive and early highstand tracts of a larger (second-order Miocene) composite sequence. The present paper documents stratigraphic relations higher in the Calvert Cliffs succession, including the upper Miocene St. Marys Formation, which represents late highstand marine deposits of the Miocene second-order sequence, and younger Neogene fluvial and tidal-inlet deposits representing incised-valley deposits of the succeeding second-order cycle. The St. Marys Formation consists of a series of tabular units 2--5 m thick, each with an exclusively transgressive array of facies and bounded by stranding surfaces of abrupt shallowing. These units, which are opposite to the flooding-surface-bounded regressive facies arrays of model parasequences, are best characterized as shaved sequences in which only the transgressive tract survives, and are stacked into larger transgressive, highstand, and forced-regression sets.
Holocene evolution of the western Orinoco Delta, Venezuela

Science.gov (United States)

Aslan, A.; White, W.A.; Warne, A.G.; Guevara, E.H.

2003-01-01

The pristine nature of the Orinoco Delta of eastern Venezuela provides unique opportunities to study the geologic processes and environments of a major tropical delta. Remote-sensing images, shallow cores, and radiocarbon-dating of organic remains form the basis for describing deltaic environments and interpreting the Holocene history of the delta. The Orinoco Delta can be subdivided into two major sectors. The southeast sector is dominated by the Rio Grande-the principal distributary-and complex networks of anastomosing fluvial and tidal channels. The abundance of siliciclastic deposits suggests that fluvial processes such as over-bank flooding strongly influence this part of the delta. In contrast, the northwest sector is represented by few major distributaries, and overbank sedimentation is less widespread relative to the southeast sector. Peat is abundant and occurs in herbaceous and forested swamps that are individually up to 200 km2 in area. Northwest-directed littoral currents transport large volumes of suspended sediment and produce prominent mudcapes along the northwest coast. Mapping of surface sediments, vegetation, and major landforms identified four principal geomorphic systems within the western delta plain: (1) distributary channels, (2) interdistributary flood basins, (3) fluvial-marine transitional environments, and (4) marine-influenced coastal environments. Coring and radiocarbon dating of deltaic deposits show that the northern delta shoreline has prograded 20-30 km during the late Holocene sea-level highstand. Progradation has been accomplished by a combination of distributary avulsion and mudcape progradation. This style of deltaic progradation differs markedly from other deltas such as the Mississippi where distributary avulsion leads to coastal land loss, rather than shoreline progradation. The key difference is that the Orinoco Delta coastal zone receives prodigious amounts of sediment from northwest-moving littoral currents that transport
Shoreline deposits and diagenesis resulting from two Late Pleistocene highstands near +5 and +6 metres, Durban, South Africa

Science.gov (United States)

Cooper, J.A.G.; Flores, R.M.

1991-01-01

In exposures of Pleistocene rocks on the east coast of South Africa, eight sedimentary facies were distinguished on the basis of petrology, grain size, internal structures and field relationships. These are interpreted as deposits of surf zone, breaker zone, swash zone, backbeach, boulder beach and dune environments. Three phases of deposition and diagenesis are recognized. As a result of the stabilising effect of pre-existing coastal facies, the deposits from successive sea level stands are stacked vertically in a narrow coast-normal strip. Early cementation prevented erosion of the deposits during subsequent transgressions. Deposition of subsequent facies took place on an existing coastal dune (Facies 1). A terrace was cut into this dune at a sea level 4.5 to 5 m above present. At this sea level, clastic shoreline sediments were deposited which make up the main sedimentary sequence exposed (Facies 2-7). The steep swash zone, coarse grain size, and comparison with modern conditions in the study area indicate clastic deposition on a high-energy, wave-dominated, microtidal coastline. Vertical stacking of progressively shallower water facies indicates progradation associated with slightly regressive conditions, prior to stranding of the succession above sea level. During a subsequent transgression to 5.5 or 6 m above present sea level, a second terrace was cut across the existing facies, which by then were partly lithified. A boulder beach (Facies 8) deposited on this terrace is indicative of high wave energy and a rocky coastline, formed by existing cemented coastal facies. Comparison with dated deposits from other parts of the South African coast suggest a Late Pleistocene age for Facies 2-8. Deposition was terminated by subsequent regression and continuing low sea levels during the remainder of the Pleistocene. Cementation of the facies took place almost entirely by carbonate precipitation. The presence of isopachous fibrous cements suggests early cementation of
Slope and basinal deposits adjacent to isolated carbonate platforms in the Indian Ocean: Sedimentology, geomorphology, and a new 1.2 Ma record of highstand shedding

Science.gov (United States)

Counts, J. W.; Jorry, S.; Jouet, G.

2017-12-01

Newly analyzed bathymetric, seismic, and core data from carbonate-topped seamounts in the Mozambique Channel reveals a variety of depositional processes and products operating on platform slopes and adjacent basins. Mass transport complexes (including turbidites and debrites), leveed channel systems with basin-floor fans, and contourites are imaged in high resolution in both seafloor maps and cross-section, and show both differences and similarities compared with platform slopes in the Bahamas and elsewhere. In some, though not all, platforms, increased sedimentation can be observed on the leeward margins, and slope rugosity may be asymmetric with respect to prevailing wind direction. Deposition is also controlled by glacial-interglacial cycles; cores taken from the lower slopes (3000+ m water depth) of carbonate platforms reveal a causative relationship between sea level and aragonite export to the deep ocean. δ18O isotopes from planktonic and benthic foraminifera of two 27-meter cores, reveal a high-resolution, continuous depositional record of carbonate sediment dating back to 1.2 Ma. Sea level rise, as determined by correlation with the LR04 benthic stack, is coincident with increased aragonite flux from platform tops. Gravity flow deposits are also affected by platform flooding—the frequency of turbidite/debrite deposits on pinnacle slopes increases during highstand, although such deposits are also present during glacial episodes. The results reported here are the first record of highstand shedding in the southern Indian Ocean, and provide the longest Quaternary sediment record to date in the region, including the Mid-Brunhes transition (MIS 11) that serves as an analog for the current climate conditions. In addition, this is the first study to describe sedimentation on the slopes of these platforms, providing an important point of comparison that has the potential to influence source-to-sink carbonate facies models.
Early-to-middle Holocene sea-level fluctuations, coastal progradation and the Neolithic occupations in Yaojiang valley of southern Hangzhou bay, eastern China

Science.gov (United States)

Liu, Y.; Sun, Q.; Fan, D.; Chen, Z.

2017-12-01

The formation of Holocene coast in eastern China provided material base for the development of Neolithic civilizations. The coastal Yaojiang valley of south Hangzhou bay was one of the examples where the well-known Neolithic Hemudu Culture (HC) of Eastern China initiated. Here, we studied the early-to-middle Holocene environment changes in relation to sea-level fluctuations on the basis of a serial of sediment cores based on a set of new Accelerator Mass Spectrometry radiocarbon (AMS 14C) chronology. The result indicated that relative sea-level rose rapidly in the Yaojiang valley at the early Holocene, reaching its maximum at ca. 8000-7800 cal yr BP and then decelerated at ca. 7800-7500 cal yr BP. The alluvial plain in Yaojiang valley began to form at the foothills first and then grew towards the valley center accompanying with the sea-level stabilization after ca. 7500 cal yr BP. This progressive progradation of alluvial plain would attract the early arrivals of foragers to dwell at the foothills to engaging in rice farming after ca.7000 cal yr BP and starting the epic Hemudu Culture. The HC people then move down to the valley center as more land became available thanks to sediment aggregation and progradation. The rise and development of HC were closely associated with the sea-level induced landscape changes in Yaojiang valley at the early-middle Holocene, and the unstable hydraulic condition in the valley after 5000 cal yr BP could be accountable for the cultural termination.
Progade PT path, prograde fluid flow, metasomatism and hydrous melting in the Osor high-grade HT-LP complex (Catalan Coastal Ranges-CCR, NE Iberia).

Science.gov (United States)

Reche, Joan; Martínez, Francisco; Leoz, Gisela

2015-04-01

Fast thermal pulses related to HT-LP metamorphism may imply dehydration reaction overstepping, higher than normal fluid production rates, quick local increases in Pfluid and common situations of Pfluid >> Plitostatic and surpassing locally the tensile stresses. This ambient would be favorable to transient hydrofracturing and fluid flow even if the ongoing HT-LP event develops on dominantly ductile crustal levels. In inner zones where temperatures are high enough, hydrous melting and melt migration would be favored as well. Such movement of fluids and melts would tend to be sustained if non-hydrostatic stresses are active during heating, and would be favored in high strain domains such high-T shear zones or along foliation planes. In such scenario, local metasomatic processes and mass-transfer phenomena are expected to occur along these high strain zones and so distributed along tectonic anisotropies. A variety of features found in high T Garnet - biotite-sillimanite±cordierite±plagioclase±K-feldspar±quartz metapelitic gneisses from the Osor Complex (Guilleries massif, CCR), testify from this kind of processes operating in the lower crustal section, at the amphibolite to granulite transition zone during a prograde Variscan HT-LP thermal pulse. Such features include: syn-D2 quartz veining, leucogranitoid (leucotonalite, trondhjemitic) lenses sub parallel to S2 dominant foliation, fibrolite-rich foliation planes and prograde sub-idiomorphic garnet developing preferentially near fluid migration channels (quartz veins) or near melt lenses.
Seismic stratigraphy and late Quaternary shelf history, south-central Monterey Bay, California

Science.gov (United States)

Chin, J.L.; Clifton, H.E.; Mullins, H.T.

1988-01-01

The south-central Monterey Bay shelf is a high-energy, wave-dominated, tectonically active coastal region on the central California continental margin. A prominent feature of this shelf is a sediment lobe off the mouth of the Salinas River that has surface expression. High-resolution seismic-reflection profiles reveal that an angular unconformity (Quaternary?) underlies the entire shelf and separates undeformed strata above it from deformed strata below it. The Salinas River lobe is a convex bulge on the shelf covering an area of approximately 72 km2 in water depths from 10 to 90 m. It reaches a maximum thickness of 35 m about 2.5 km seaward of the river mouth and thins in all directions away from this point. Adjacent shelf areas are characterized by only a thin (2 to 5 m thick) and uniform veneer of sediment. Acoustic stratigraphy of the lobe is complex and is characterized by at least three unconformity-bounded depositional sequences. Acoustically, these sequences are relatively well bedded. Acoustic foresets occur within the intermediate sequence and dip seaward at 0.7?? to 2.0??. Comparison with sedimentary sequences in uplifted onshore Pleistocene marine-terrace deposits of the Monterey Bay area, which were presumably formed in a similar setting under similar processes, suggests that a general interpretation can be formulated for seismic stratigraphic patterns. Depositional sequences are interpreted to represent shallowing-upwards progradational sequences of marine to nonmarine coastal deposits formed during interglacial highstands and/or during early stages of falling sea level. Acoustic foresets within the intermediate sequence are evidence of seaward progradation. Acoustic unconformities that separate depositional sequences are interpreted as having formed largely by shoreface planation and may be the only record of the intervening transgressions. The internal stratigraphy of the Salinas River lobe thus suggests that at least several late Quaternary
Sequence stratigraphy from ``spot'' outcrops—example from a carbonate-dominated setting: Devonian-Carboniferous transition, Dinant synclinorium (Belgium)

Science.gov (United States)

Van Steenwinkel, M.

1990-12-01

The application of sequence stratigraphy appears to be perfectly valid in isolated "spot" outcrops, which usually are badly exposed and located in areas of strong tectonic overprint. The case study presented here is in a carbonate-dominated setting at the northern edge of the Cornwall-Rhenish basin, during latest Devonian and earliest Carboniferous times. The approach taken involved the analysis of facies and vertical facies evolutions, in combination with detailed biostratigraphy. The subsequent deduction of deepening and shallowing events, and the understanding of the sedimentary response to these events in different palaeogeographical positions, are the basis for lateral correlation of facies sequences. A similar trend in the changes of deepenings and shallowings throughout the area, regardless of lateral facies differentiation, indicates a relationship between bathymetric changes and relative sea-level changes, that is independent of local differences in the sedimentation or subsidence rates. The studied stratigraphic interval, with its apparently "abrupt" changes in facies or lithology, is the sedimentary response to six, gradually changing trends in the rate of relative sea-level fall and rise. End-Devonian, offshore highstand deposits (1) are capped by an unconformity surface or by a surface of submarine erosion and nondeposition, representing a sequence boundary (2). The overlying lowstand unit consists of a basal (3) and an upper (4) part. The basal part contains an oolitic and crinoidal sand shoals, associated with a transgressive lag deposit in offshore position; the upper, main part is a prograding wedge of shallow-marine crinoidal sands, intercalated with increasingly deeper-water shaly facies (higher-order fluctuation). This lowstand unit was deposited during the initial, slow phase of relative sea-level rise. It levelled the topographic relief before the final drowning during the subsequent rapid (near the rise inflection point) relative sea
Cyclone trends constrain monsoon variability during late Oligocene sea level highstands (Kachchh Basin, NW India)

Science.gov (United States)

Reuter, M.; Piller, W. E.; Harzhauser, M.; Kroh, A.

2013-09-01

Climate change has an unknown impact on tropical cyclones and the Asian monsoon. Herein we present a sequence of fossil shell beds from the shallow-marine Maniyara Fort Formation (Kachcch Basin) as a recorder of tropical cyclone activity along the NW Indian coast during the late Oligocene warming period (~ 27-24 Ma). Proxy data providing information about the atmospheric circulation dynamics over the Indian subcontinent at this time are important since it corresponds to a major climate reorganization in Asia that ends up with the establishment of the modern Asian monsoon system at the Oligocene-Miocene boundary. The vast shell concentrations are comprised of a mixture of parautochthonous and allochthonous assemblages indicating storm-generated sediment transport from deeper to shallow water during third-order sea level highstands. Three distinct skeletal assemblages were distinguished, each recording a relative storm wave base. (1) A shallow storm wave base is shown by nearshore molluscs, reef corals and Clypeaster echinoids; (2) an intermediate storm wave base depth is indicated by lepidocyclinid foraminifers, Eupatagus echinoids and corallinacean algae; and (3) a deep storm wave base is represented by an Amussiopecten bivalve-Schizaster echinoid assemblage. These wave base depth estimates were used for the reconstruction of long-term tropical storm intensity during the late Oligocene. The development and intensification of cyclones over the recent Arabian Sea is primarily limited by the atmospheric monsoon circulation and strength of the associated vertical wind shear. Therefore, since the topographic boundary conditions for the Indian monsoon already existed in the late Oligocene, the reconstructed long-term cyclone trends were interpreted to reflect monsoon variability during the initiation of the Asian monsoon system. Our results imply an active monsoon over the Eastern Tethys at ~ 26 Ma followed by a period of monsoon weakening during the peak of the late

Testing cosmic dose rate models for ESR: Dating corals and molluscs on San Salvador, Bahamas

International Nuclear Information System (INIS)

Deely, A.E.; Blackwell, B.A.B.; Mylroie, J.E.; Carew, J.L.; Blickstein, J.I.B.; Skinner, A.R.

2011-01-01

Sealevel curves are best developed on tectonically stable coastlines, like San Salvador, where eolianites preserve transgressive and regressive phases associated with Quaternary high seastands, while reef facies mark the highstands. At 11 locations around San Salvador, terrestrial molluscs (Cerion) from the eolianites, lagoonal bivalves (Codakia), and corals from the highstand deposits were dated by ESR. Volumetrically averaged sedimentary dose rates were calculated from sedimentary geochemistry and time-averaged cosmic dose rates from each sample's current and past geologic contexts. Rice Bay Formation corals dated at 3.9 ± 0.3 to 7.1 ± 0.4 ka (OIS 1). Minimum ages for the Cockburn Town Member's regressive phase ranged from 49 ± 6 to 75 ± 8 ka, correlating with OIS 3-4. Codakia dates showed that an OIS 5a sealevel approached modern levels at 91-78 ka. In situ corals from the Cockburn Town Reef averaged from 127 ± 6 to 138 ± 10 ka, correlating well with OIS 5e. Ages from the Reef's rubble zones hint that some coral reefs grew as early as OIS 7, but were likely reworked during OIS 5. San Salvador preserves deposits from three mid to late Quaternary highstands above, and as many as three that closely approach, modern sealevel.
Testing cosmic dose rate models for ESR: Dating corals and molluscs on San Salvador, Bahamas

Energy Technology Data Exchange (ETDEWEB)

Deely, A.E. [RFK Science Research Institute, Glenwood Landing, NY, 11547-0866 (United States); Blackwell, B.A.B., E-mail: bonnie.a.b.blackwell@williams.edu [RFK Science Research Institute, Glenwood Landing, NY, 11547-0866 (United States); Dept. of Chemistry, Williams College, Williamstown MA, 01267-2692 (United States); Mylroie, J.E. [Dept. of Geosciences, Mississippi State University, MS, 39762-5448 (United States); Carew, J.L. [Dept. of Geology and Environmental Geosciences, College of Charleston, Charleston, SC 29424 (United States); Blickstein, J.I.B. [RFK Science Research Institute, Glenwood Landing, NY, 11547-0866 (United States); Skinner, A.R. [RFK Science Research Institute, Glenwood Landing, NY, 11547-0866 (United States); Dept. of Chemistry, Williams College, Williamstown MA, 01267-2692 (United States)

2011-09-15

Sealevel curves are best developed on tectonically stable coastlines, like San Salvador, where eolianites preserve transgressive and regressive phases associated with Quaternary high seastands, while reef facies mark the highstands. At 11 locations around San Salvador, terrestrial molluscs (Cerion) from the eolianites, lagoonal bivalves (Codakia), and corals from the highstand deposits were dated by ESR. Volumetrically averaged sedimentary dose rates were calculated from sedimentary geochemistry and time-averaged cosmic dose rates from each sample's current and past geologic contexts. Rice Bay Formation corals dated at 3.9 {+-} 0.3 to 7.1 {+-} 0.4 ka (OIS 1). Minimum ages for the Cockburn Town Member's regressive phase ranged from 49 {+-} 6 to 75 {+-} 8 ka, correlating with OIS 3-4. Codakia dates showed that an OIS 5a sealevel approached modern levels at 91-78 ka. In situ corals from the Cockburn Town Reef averaged from 127 {+-} 6 to 138 {+-} 10 ka, correlating well with OIS 5e. Ages from the Reef's rubble zones hint that some coral reefs grew as early as OIS 7, but were likely reworked during OIS 5. San Salvador preserves deposits from three mid to late Quaternary highstands above, and as many as three that closely approach, modern sealevel.
Milankovitch cyclicity in modern continental margins: stratigraphic cycles in terrigenous shelf settings; El registro de la ciclicidad de Milankovitch en margenes continentales actuales: ciclos estratigraficos en plataformas terrigenas

Energy Technology Data Exchange (ETDEWEB)

Lobo, F. J.; Ridente, D.

2013-06-01

We present a synthesis of the sedimentary responses to Late Quaternary Milankovitch-type sea-level cycles (100 and 20 kyr periodicities) as a basis for our investigations into the patterns and concepts of composite sequences in shallow-shelf settings. We describe the record of both 100 and 20 kyr cycles as documented worldwide and discuss the pattern of composite cyclicity mainly on the basis of previously published data from the Adriatic Sea and Gulf of Cadiz margins. Cycles of 100 kyr are those most frequently documented in Quaternary margins; they occur in the form of unconformity-bounded depositional sequences dominated by fairly uniform pro gradational-regressive units and more variable, though less well developed, transgressive deposits. Sequence boundaries correspond to prominent polygenic (regressive-transgressive) erosional surfaces that bear witness to considerable transgressive reworking of the original sub-aerial unconformity. Although the progradational units making up the greater part of these sequences have usually been interpreted as a record of a falling sea-level stage, recent evidence is pointing towards a more complex stratigraphic picture, including a distinction between relative highstand and lowstand deposits. The 20-kyr stratigraphic motifs show greater variation compared to that displayed by the more common 100-kyr sequences, particularly in the basic structure of systems tracts and the nature of bounding surfaces. The two case studies described here, the Adriatic Sea and Gulf of Cadiz margins, highlight the fact that, concomitantly with an increase in frequencies of cycles and sequences, sediment supply and the dynamics of their dispersal significantly affected the stratigraphic response to the main controlling factor, which was sea-level, thus determining the variety of expression in the 20 kyr cycles. (Author)
Interaction of tectonic and depositional processes that control the evolution of the Iberian Gulf of Cadiz margin

Science.gov (United States)

Maldonado, A.; Nelson, C.H.

1999-01-01

This study provides an integrated view of the growth patterns and factors that controlled the evolution of the Gulf of Cadiz continental margin based on studies of the tectonic, sedimentologic and oceanographic history of the area. Seven sedimentary regimes are identified, but there are more extensive descriptions of the late Cenozoic regimes because of the larger data base. The regimes of the Mesozoic passive margin include carbonate platforms, which become mixed calcareous-terrigenous deposits during the Late Cretaceous-early Tertiary. The Oligocene and Early Miocene terrigenous regimes developed, in contrast, over the active and transcurrent margins near the African-Iberian plate boundary. The top of the Gulf of Cadiz olistostrome, emplaced in the Late Miocene, is used as a key horizon to define the 'post-orogenic' depositional regimes. The Late Miocene progradational margin regime is characterized by a large terrigenous sediment supply to the margin and coincides with the closing of the Miocene Atlantic-Mediterranean gateways. The terrigenous drift depositional regime of the Early Pliocene resulted from the occurrence of high eustatic sea level and the characteristics of the Mediterranean outflow currents that developed after the opening of the Strait of Gibraltar. The Late Pliocene and Quaternary regimes are dominated by sequences of deposits related to cycles of high and low sea levels. Deposition of shelf-margin deltas and slope wedges correlate with regressive and low sea level regimes caused by eustasy and subsidence. During the highstand regimes of the Holocene, inner shelf prograding deltas and deep-water sediment drifts were developed under the influence of the Atlantic inflow and Mediterranean outflow currents, respectively. A modern human cultural regime began 2000 years ago with the Roman occupation of Iberia; human cultural effects on sedimentary regimes may have equalled natural factors such as climate change. Interplay of tectonic and
Seismic analysis of clinoform depositional sequences and shelf-margin trajectories in Lower Cretaceous (Albian) strata, Alaska North Slope

Science.gov (United States)

Houseknecht, D.W.; Bird, K.J.; Schenk, C.J.

2009-01-01

Lower Cretaceous strata beneath the Alaska North Slope include clinoform depositional sequences that filled the western Colville foreland basin and overstepped the Beaufort rift shoulder. Analysis of Albian clinoform sequences with two-dimensional (2D) seismic data resulted in the recognition of seismic facies inferred to represent lowstand, transgressive and highstand systems tracts. These are stacked to produce shelf-margin trajectories that appear in low-resolution seismic data to alternate between aggradational and progradational. Higher-resolution seismic data reveal shelf-margin trajectories that are more complex, particularly in net-aggradational areas, where three patterns commonly are observed: (1) a negative (downward) step across the sequence boundary followed by mostly aggradation in the lowstand systems tract (LST), (2) a positive (upward) step across the sequence boundary followed by mostly progradation in the LST and (3) an upward backstep across a mass-failure d??collement. These different shelf-margin trajectories are interpreted as (1) fall of relative sea level below the shelf edge, (2) fall of relative sea level to above the shelf edge and (3) mass-failure removal of shelf-margin sediment. Lowstand shelf margins mapped using these criteria are oriented north-south in the foreland basin, indicating longitudinal filling from west to east. The shelf margins turn westward in the north, where the clinoform depositional system overstepped the rift shoulder, and turn eastward in the south, suggesting progradation of depositional systems from the ancestral Brooks Range into the foredeep. Lowstand shelf-margin orientations are consistently perpendicular to clinoform-foreset-dip directions. Although the Albian clinoform sequences of the Alaska North Slope are generally similar in stratal geometry to clinoform sequences elsewhere, they are significantly thicker. Clinoform-sequence thickness ranges from 600-1000 m in the north to 1700-2000 m in the south
Facies architecture and high resolution sequence stratigraphy of an aeolian, fluvial and shallow marine system in the Pennsylvanian Piauí Formation, Parnaíba Basin, Brazil

Science.gov (United States)

Vieira, Lucas Valadares; Scherer, Claiton Marlon dos Santos

2017-07-01

The Pennsylvanian Piauí Formation records the deposition of aeolian, fluvial and shallow marine systems accumulated in the cratonic sag Parnaíba basin. Characterization of the facies associations and sequence stratigraphic framework was done by detailed description and logging of outcrops. Six facies associations were recognized: aeolian dunes and interdunes, aeolian sandsheets, fluvial channels, tidally-influenced fluvial channels, shoreface and shoreface-shelf transition. Through correlation of stratigraphic surfaces, the facies associations were organized in system tracts, which formed eight high frequency depositional sequences, bounded by subaerial unconformities. These sequences are composed of a lowstand system tract (LST), that is aeolian-dominated or fluvial-dominated, a transgressive system tract (TST) that is formed by tidally-influenced fluvial channels and/or shoreface and shoreface-shelf transition deposits with retrogradational stacking, and a highstand system tract (HST), which is formed by shoreface-shelf transition and shoreface deposits with progradational stacking. Two low frequency cycles were determined by observing the stacking of the high frequency cycles. The Lower Sequence is characterized by aeolian deposits of the LST and an aggradational base followed by a progressive transgression, defining a general TST. The Upper Sequence is characterized by fluvial deposits and interfluve pedogenesis concurring with the aeolian deposits of the LST and records a subtle regression followed by transgression. The main control on sedimentation in the Piauí Formation was glacioeustasy, which was responsible for the changes in relative sea level. Even though, climate changes were associated with glacioeustatic phases and influenced the aeolian and fluvial deposition.
Early Permian transgressive-regressive cycles: Sequence stratigraphic reappraisal of the coal-bearing Barakar Formation, Raniganj Basin, India

Science.gov (United States)

Bhattacharya, Biplab; Bhattacharjee, Joyeeta; Bandyopadhyay, Sandip; Banerjee, Sudipto; Adhikari, Kalyan

2018-03-01

The present research is an attempt to assess the Barakar Formation of the Raniganj Gondwana Basin, India, in the frame of fluvio-marine (estuarine) depositional systems using sequence stratigraphic elements. Analysis of predominant facies associations signify deposition in three sub-environments: (i) a river-dominated bay-head delta zone in the inner estuary, with transition from braided fluvial channels (FA-B1) to tide-affected meandering fluvial channels and flood plains (FA-B2) in the basal part of the succession; (ii) a mixed energy central basin zone, which consists of transitional fluvio-tidal channels (FA-B2), tidal flats, associated with tidal channels and bars (FA-B3) in the middle-upper part of the succession; and (iii) a wave-dominated outer estuary (coastal) zone (FA-B4 with FA-B3) in the upper part of the succession. Stacked progradational (P1, P2)-retrogradational (R1, R2) successions attest to one major base level fluctuation, leading to distinct transgressive-regressive (T-R) cycles with development of initial falling stage systems tract (FSST), followed by lowstand systems tract (LST) and successive transgressive systems tracts (TST-1 and TST-2). Shift in the depositional regime from regressive to transgressive estuarine system in the early Permian Barakar Formation is attributed to change in accommodation space caused by mutual interactions of (i) base level fluctuations in response to climatic amelioration and (ii) basinal tectonisms (exhumation/sagging) related to post-glacial isostatic adjustments in the riftogenic Gondwana basins.
Differentiating regressed melanoma from regressed lichenoid keratosis.

Science.gov (United States)

Chan, Aegean H; Shulman, Kenneth J; Lee, Bonnie A

2017-04-01

Distinguishing regressed lichen planus-like keratosis (LPLK) from regressed melanoma can be difficult on histopathologic examination, potentially resulting in mismanagement of patients. We aimed to identify histopathologic features by which regressed melanoma can be differentiated from regressed LPLK. Twenty actively inflamed LPLK, 12 LPLK with regression and 15 melanomas with regression were compared and evaluated by hematoxylin and eosin staining as well as Melan-A, microphthalmia transcription factor (MiTF) and cytokeratin (AE1/AE3) immunostaining. (1) A total of 40% of regressed melanomas showed complete or near complete loss of melanocytes within the epidermis with Melan-A and MiTF immunostaining, while 8% of regressed LPLK exhibited this finding. (2) Necrotic keratinocytes were seen in the epidermis in 33% regressed melanomas as opposed to all of the regressed LPLK. (3) A dense infiltrate of melanophages in the papillary dermis was seen in 40% of regressed melanomas, a feature not seen in regressed LPLK. In summary, our findings suggest that a complete or near complete loss of melanocytes within the epidermis strongly favors a regressed melanoma over a regressed LPLK. In addition, necrotic epidermal keratinocytes and the presence of a dense band-like distribution of dermal melanophages can be helpful in differentiating these lesions. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Mud depocenters on continental shelves—appearance, initiation times, and growth dynamics

Science.gov (United States)

Hanebuth, Till J. J.; Lantzsch, Hendrik; Nizou, Jean

2015-12-01

Mud accumulates on continental shelves under a variety of environmental conditions and results in a diverse formation of mud depocenters (MDCs). Their three-dimensional architectures have been in the focus of several recent studies. Due to some terminological confusion concerning MDCs, the present study sets out to define eight individual MDC types in terms of surface sediment distribution and internal geometry. Under conditions of substantial sediment supply, prodeltas (distal zones off river deltas; triangular sheets), subaqueous deltas (disconnected from deltas by strong normal-to-shore currents; wedge-like clinoforms), and mud patches (scattered distribution) and mud blankets (widespread covers) are formed. Forced by hydrodynamic conditions, mud belts in the strict sense (detached from source; elongated bodies), and shallow-water contourite drifts (detached from source; growing normal to prevailing current direction; triangular clinoforms) develop. Controlled by local morphology, mud entrapments (in depressions, behind morphological steps) and mud wedges (triangular clinoforms growing in flow direction) are deposited. Shelf mud deposition took place (1) during early outer-shelf drowning (~14 ka), (2) after inner-shelf inundation to maximum flooding (9.5-6.5 ka), and (3) in sub-recent times (near the fluvial source, (2) uni-directional, extending along advective current transport paths, and (3) progradational, forming clinoforms that grow either parallel or normal to the bottom current direction. Classical mud belts may be initiated around defined nuclei, the remote sites of which are determined by seafloor morphology rather than the location of the source. From a stratigraphic perspective, mud depocenters coincide with sea-level highstand-related, shelf-wide condensed sections. They often show a conformable succession from transgressive to highstand systems tract stages.
Retro-regression--another important multivariate regression improvement.

Science.gov (United States)

Randić, M

2001-01-01

We review the serious problem associated with instabilities of the coefficients of regression equations, referred to as the MRA (multivariate regression analysis) "nightmare of the first kind". This is manifested when in a stepwise regression a descriptor is included or excluded from a regression. The consequence is an unpredictable change of the coefficients of the descriptors that remain in the regression equation. We follow with consideration of an even more serious problem, referred to as the MRA "nightmare of the second kind", arising when optimal descriptors are selected from a large pool of descriptors. This process typically causes at different steps of the stepwise regression a replacement of several previously used descriptors by new ones. We describe a procedure that resolves these difficulties. The approach is illustrated on boiling points of nonanes which are considered (1) by using an ordered connectivity basis; (2) by using an ordering resulting from application of greedy algorithm; and (3) by using an ordering derived from an exhaustive search for optimal descriptors. A novel variant of multiple regression analysis, called retro-regression (RR), is outlined showing how it resolves the ambiguities associated with both "nightmares" of the first and the second kind of MRA.
Modified Regression Correlation Coefficient for Poisson Regression Model

Science.gov (United States)

Kaengthong, Nattacha; Domthong, Uthumporn

2017-09-01

This study gives attention to indicators in predictive power of the Generalized Linear Model (GLM) which are widely used; however, often having some restrictions. We are interested in regression correlation coefficient for a Poisson regression model. This is a measure of predictive power, and defined by the relationship between the dependent variable (Y) and the expected value of the dependent variable given the independent variables [E(Y|X)] for the Poisson regression model. The dependent variable is distributed as Poisson. The purpose of this research was modifying regression correlation coefficient for Poisson regression model. We also compare the proposed modified regression correlation coefficient with the traditional regression correlation coefficient in the case of two or more independent variables, and having multicollinearity in independent variables. The result shows that the proposed regression correlation coefficient is better than the traditional regression correlation coefficient based on Bias and the Root Mean Square Error (RMSE).
The Changing Face of Plio-Pleistocene Reef Margins: Results of the Dominican Republic Drilling Project (DRDP)

Science.gov (United States)

Klaus, J.; McNeill, D. F.; Díaz, V.; Swart, P. K.; Pourmand, A.; Grasmueck, M.; Eberli, G. P.

2013-12-01

Fringing reef margins of the Caribbean display a characteristic zonation in which Acropora palmata dominates shallow high-energy reef crests and Acropora cervicornis calmer fore-reef slopes and backreef lagoons. The dominance of acroporids across this zonation has been attributed to growth rates 5-100 times faster than other corals. However, the dominance and high accretion potential of acroporid reefs has a relatively recent geologic origin. Caribbean reefs changed profoundly in taxonomic composition, diversity, and dominance structure during late Pliocene and Pleistocene climatic change. These changes coincide with protracted climatic deterioration and cooling between 2.0 to 0.8 Ma, and the onset of high amplitude sea-level fluctuations ~400 ka. The Dominican Republic Drilling Project (DRDP) was initiated to determine how climate change and global high-amplitude sea level changes influenced depositional patterns in Pliocene to Recent reef systems of the Caribbean. A transect of 7 core borings (~700 m total depth) were collected along a transect of the southern coast of the DR in conjunction with over 20 km of ground penetrating radar (GPR) lines. New age constraints based on U/Th geochronometry and radiogenic Sr isotopes, combined with depositional lithofacies, faunal indicators, stable isotope profiles and GPR data have allowed us to correlate between wells and define the internal anatomy and stratal geometry of the individual reef sigmoids and sigmoid sets. The stacking of these sigmoid-shaped reefs produce lateral progradation of approximately 15 km with geometries that generally follow the highstand systems tract model of Pomar and Ward (1994). Based on existing age models eccentricity (high amplitude 100 kyr) sigmoids display increased aggradation and progradation potential compared to reef cycles driven by obliquity (41 kyr).
Sequence stratigraphy in the middle Ordovician shale successions, mid-east Korea: Stratigraphic variations and preservation potential of organic matter within a sequence stratigraphic framework

Science.gov (United States)

Byun, Uk Hwan; Lee, Hyun Suk; Kwon, Yi Kyun

2018-02-01

The Jigunsan Formation is the middle Ordovician shale-dominated transgressive succession in the Taebaeksan Basin, located in the eastern margin of the North China platform. The total organic carbon (TOC) content and some geochemical properties of the succession exhibit a stratigraphically distinct distribution pattern. The pattern was closely associated with the redox conditions related to decomposition, bulk sedimentation rate (dilution), and productivity. To explain the distinct distribution pattern, this study attempted to construct a high-resolution sequence stratigraphic framework for the Jigunsan Formation. The shale-dominated Jigunsan Formation comprises a lower layer of dark gray shale, deposited during transgression, and an upper layer of greenish gray siltstone, deposited during highstand and falling stage systems tracts. The concept of a back-stepped carbonate platform is adopted to distinguish early and late transgressive systems tracts (early and late TST) in this study, whereas the highstand systems tracts and falling stage systems tracts can be divided by changes in stacking patterns from aggradation to progradation. The late TST would be initiated on a rapidly back-stepping surface of sediments and, just above the surface, exhibits a high peak in TOC content, followed by a gradually upward decrease. This trend of TOC distribution in the late TST continues to the maximum flooding surface (MFS). The perplexing TOC distribution pattern within the late TST most likely resulted from both a gradual reduction in productivity during the late TST and a gradual increase in dilution effect near the MFS interval. The reduced production of organic matter primarily incurred decreasing TOC content toward the MFS when the productivity was mainly governed by benthic biota because planktonic organisms were not widespread in the Ordovician. Results of this study will help improve the understanding of the source rock distribution in mixed carbonate
Dual Regression

OpenAIRE

Spady, Richard; Stouli, Sami

2012-01-01

We propose dual regression as an alternative to the quantile regression process for the global estimation of conditional distribution functions under minimal assumptions. Dual regression provides all the interpretational power of the quantile regression process while avoiding the need for repairing the intersecting conditional quantile surfaces that quantile regression often produces in practice. Our approach introduces a mathematical programming characterization of conditional distribution f...
Regression: A Bibliography.

Science.gov (United States)

Pedrini, D. T.; Pedrini, Bonnie C.

Regression, another mechanism studied by Sigmund Freud, has had much research, e.g., hypnotic regression, frustration regression, schizophrenic regression, and infra-human-animal regression (often directly related to fixation). Many investigators worked with hypnotic age regression, which has a long history, going back to Russian reflexologists.…
Prograde and retrograde growth of monazite in migmatites: An example from the Nagercoil Block, southern India

Directory of Open Access Journals (Sweden)

Tim E. Johnson

2015-05-01

Full Text Available Data from a migmatised metapelite raft enclosed within charnockite provide quantitative constraints on the pressure–temperature–time (P–T–t evolution of the Nagercoil Block at the southernmost tip of peninsular India. An inferred peak metamorphic assemblage of garnet, K-feldspar, sillimanite, plagioclase, magnetite, ilmenite, spinel and melt is consistent with peak metamorphic pressures of 6–8 kbar and temperatures in excess of 900 °C. Subsequent growth of cordierite and biotite record high-temperature retrograde decompression to around 5 kbar and 800 °C. SHRIMP U–Pb dating of magmatic zircon cores suggests that the sedimentary protoliths were in part derived from felsic igneous rocks with Palaeoproterozoic crystallisation ages. New growth of metamorphic zircon on the rims of detrital grains constrains the onset of melt crystallisation, and the minimum age of the metamorphic peak, to around 560 Ma. The data suggest two stages of monazite growth. The first generation of REE-enriched monazite grew during partial melting along the prograde path at around 570 Ma via the incongruent breakdown of apatite. Relatively REE-depleted rims, which have a pronounced negative europium anomaly, grew during melt crystallisation along the retrograde path at around 535 Ma. Our data show the rocks remained at suprasolidus temperatures for at least 35 million years and probably much longer, supporting a long-lived high-grade metamorphic history. The metamorphic conditions, timing and duration of the implied clockwise P–T–t path are similar to that previously established for other regions in peninsular India during the Ediacaran to Cambrian assembly of that part of the Gondwanan supercontinent.
Quantitative allochem compositional analysis of Lochkovian-Pragian boundary sections in the Prague Basin (Czech Republic)

Science.gov (United States)

Weinerová, Hedvika; Hron, Karel; Bábek, Ondřej; Šimíček, Daniel; Hladil, Jindřich

2017-06-01

Quantitative allochem compositional trends across the Lochkovian-Pragian boundary Event were examined at three sections recording the proximal to more distal carbonate ramp environment of the Prague Basin. Multivariate statistical methods (principal component analysis, correspondence analysis, cluster analysis) of point-counted thin section data were used to reconstruct facies stacking patterns and sea-level history. Both the closed-nature allochem percentages and their centred log-ratio (clr) coordinates were used. Both these approaches allow for distinguishing of lowstand, transgressive and highstand system tracts within the Praha Formation, which show gradual transition from crinoid-dominated facies deposited above the storm wave base to dacryoconarid-dominated facies of deep-water environment below the storm wave base. Quantitative compositional data also indicate progradative-retrogradative trends in the macrolithologically monotonous shallow-water succession and enable its stratigraphic correlation with successions from deeper-water environments. Generally, the stratigraphic trends of the clr data are more sensitive to subtle changes in allochem composition in comparison to the results based on raw data. A heterozoan-dominated allochem association in shallow-water environments of the Praha Formation supports the carbonate ramp environment assumed by previous authors.
Advanced statistics: linear regression, part I: simple linear regression.

Science.gov (United States)

Marill, Keith A

2004-01-01

Simple linear regression is a mathematical technique used to model the relationship between a single independent predictor variable and a single dependent outcome variable. In this, the first of a two-part series exploring concepts in linear regression analysis, the four fundamental assumptions and the mechanics of simple linear regression are reviewed. The most common technique used to derive the regression line, the method of least squares, is described. The reader will be acquainted with other important concepts in simple linear regression, including: variable transformations, dummy variables, relationship to inference testing, and leverage. Simplified clinical examples with small datasets and graphic models are used to illustrate the points. This will provide a foundation for the second article in this series: a discussion of multiple linear regression, in which there are multiple predictor variables.
Sedimentary architecture of the Holocene mud deposit off the southern Shandong Peninsula in the Yellow Sea

Science.gov (United States)

Qiu, Jiandong; Liu, Jian; Xu, Hong; Zhou, Liangyong

2018-01-01

Newly acquired high-resolution seismic profiles reveal a nearshore and an offshore mud depocenter offthe southern Shandong Peninsula in the Yellow Sea. The nearshore depocenter is distributed in bands along the south coast of Shandong Peninsula. The offshore depocenter is part of the distal subaqueous deltaic lobe, which deposited around the southeastern tip of the Shandong Peninsula. Between the two depocenters is a linear depression. The mud deposits directly overlie the postglacial transgressive surface and can be divided into lower and upper units by the Holocene maximum flooding surface. The nearshore and offshore units display different seismic structures. The lower unit of the nearshore deposit exhibits basal onlap, whereas the upper unit is characterized by progradation. The lower and upper units of the offshore deposit display distinct acoustic features. The lower unit has low-angle aggradation with internal reflectors generally dipping seaward and truncated by the Holocene maximum flooding surface, whereas the upper unit is characterized by aggradation and progradation landward rather than seaward. Results of geochemistry analysis of QDZ03 sediments and mineral analysis of WHZK01 sediments suggest that the nearshore deposit and the lower unit of the offshore deposit are derived from the proximal coastal sediments of the Shandong Peninsula and the Huanghe (Yellow) River sediments. The upper unit of the offshore deposit is mainly Huanghe River-derived. The lower unit of the mud deposit represents a post-glacial transgressive system tract according to dates of core QDZ03, and the upper unit represents a highstand system tract from middle Holocene to the present. These results will be of great significance to further understanding of the transportation of the Huanghe River sediments into the Yellow Sea and the spatial distribution of the subaqueous delta.
Ordinary least square regression, orthogonal regression, geometric mean regression and their applications in aerosol science

International Nuclear Information System (INIS)

Leng Ling; Zhang Tianyi; Kleinman, Lawrence; Zhu Wei

2007-01-01

Regression analysis, especially the ordinary least squares method which assumes that errors are confined to the dependent variable, has seen a fair share of its applications in aerosol science. The ordinary least squares approach, however, could be problematic due to the fact that atmospheric data often does not lend itself to calling one variable independent and the other dependent. Errors often exist for both measurements. In this work, we examine two regression approaches available to accommodate this situation. They are orthogonal regression and geometric mean regression. Comparisons are made theoretically as well as numerically through an aerosol study examining whether the ratio of organic aerosol to CO would change with age

Petroleum system elements within the Late Cretaceous and Early Paleogene sediments of Nigeria's inland basins: An integrated sequence stratigraphic approach

Science.gov (United States)

Dim, Chidozie Izuchukwu Princeton; Onuoha, K. Mosto; Okeugo, Chukwudike Gabriel; Ozumba, Bertram Maduka

2017-06-01

Sequence stratigraphic studies have been carried out using subsurface well and 2D seismic data in the Late Cretaceous and Early Paleogene sediments of Anambra and proximal onshore section of Niger Delta Basin in the Southeastern Nigeria. The aim was to establish the stratigraphic framework for better understanding of the reservoir, source and seal rock presence and distribution in the basin. Thirteen stratigraphic bounding surfaces (consisting of six maximum flooding surfaces - MFSs and seven sequence boundaries - SBs) were recognized and calibrated using a newly modified chronostratigraphic chart. Stratigraphic surfaces were matched with corresponding foraminiferal and palynological biozones, aiding correlation across wells in this study. Well log sequence stratigraphic correlation reveals that stratal packages within the basin are segmented into six depositional sequences occurring from Late Cretaceous to Early Paleogene age. Generated gross depositional environment maps at various MFSs show that sediment packages deposited within shelfal to deep marine settings, reflect continuous rise and fall of sea levels within a regressive cycle. Each of these sequences consist of three system tracts (lowstand system tract - LST, transgressive system tract - TST and highstand system tract - HST) that are associated with mainly progradational and retrogradational sediment stacking patterns. Well correlation reveals that the sand and shale units of the LSTs, HSTs and TSTs, that constitute the reservoir and source/seal packages respectively are laterally continuous and thicken basinwards, due to structural influences. Result from interpretation of seismic section reveals the presence of hanging wall, footwall, horst block and collapsed crest structures. These structural features generally aid migration and offer entrapment mechanism for hydrocarbon accumulation. The combination of these reservoirs, sources, seals and trap elements form a good petroleum system that is viable
Polynomial regression analysis and significance test of the regression function

International Nuclear Information System (INIS)

Gao Zhengming; Zhao Juan; He Shengping

2012-01-01

In order to analyze the decay heating power of a certain radioactive isotope per kilogram with polynomial regression method, the paper firstly demonstrated the broad usage of polynomial function and deduced its parameters with ordinary least squares estimate. Then significance test method of polynomial regression function is derived considering the similarity between the polynomial regression model and the multivariable linear regression model. Finally, polynomial regression analysis and significance test of the polynomial function are done to the decay heating power of the iso tope per kilogram in accord with the authors' real work. (authors)
Reduced Rank Regression

DEFF Research Database (Denmark)

Johansen, Søren

2008-01-01

The reduced rank regression model is a multivariate regression model with a coefficient matrix with reduced rank. The reduced rank regression algorithm is an estimation procedure, which estimates the reduced rank regression model. It is related to canonical correlations and involves calculating...
Quantile Regression Methods

DEFF Research Database (Denmark)

Fitzenberger, Bernd; Wilke, Ralf Andreas

2015-01-01

if the mean regression model does not. We provide a short informal introduction into the principle of quantile regression which includes an illustrative application from empirical labor market research. This is followed by briefly sketching the underlying statistical model for linear quantile regression based......Quantile regression is emerging as a popular statistical approach, which complements the estimation of conditional mean models. While the latter only focuses on one aspect of the conditional distribution of the dependent variable, the mean, quantile regression provides more detailed insights...... by modeling conditional quantiles. Quantile regression can therefore detect whether the partial effect of a regressor on the conditional quantiles is the same for all quantiles or differs across quantiles. Quantile regression can provide evidence for a statistical relationship between two variables even...
The Upper Devonian deposits in the northern part of Leon (Cantabrian Mountains, Northwestern Spain)

NARCIS (Netherlands)

Loevezijn, van G.B.S.; Raven, J.G.M.

1983-01-01

During the Late Devonian, deposition in the Cantabrian Mountains was largely controlled by movements along faults. By way of intermitting subsidence of the area south of the Sabero-Gordón line and the connected progradation of the coast during the Frasnian and early Famennian, three regressive
A Monte Carlo simulation study comparing linear regression, beta regression, variable-dispersion beta regression and fractional logit regression at recovering average difference measures in a two sample design.

Science.gov (United States)

Meaney, Christopher; Moineddin, Rahim

2014-01-24

In biomedical research, response variables are often encountered which have bounded support on the open unit interval--(0,1). Traditionally, researchers have attempted to estimate covariate effects on these types of response data using linear regression. Alternative modelling strategies may include: beta regression, variable-dispersion beta regression, and fractional logit regression models. This study employs a Monte Carlo simulation design to compare the statistical properties of the linear regression model to that of the more novel beta regression, variable-dispersion beta regression, and fractional logit regression models. In the Monte Carlo experiment we assume a simple two sample design. We assume observations are realizations of independent draws from their respective probability models. The randomly simulated draws from the various probability models are chosen to emulate average proportion/percentage/rate differences of pre-specified magnitudes. Following simulation of the experimental data we estimate average proportion/percentage/rate differences. We compare the estimators in terms of bias, variance, type-1 error and power. Estimates of Monte Carlo error associated with these quantities are provided. If response data are beta distributed with constant dispersion parameters across the two samples, then all models are unbiased and have reasonable type-1 error rates and power profiles. If the response data in the two samples have different dispersion parameters, then the simple beta regression model is biased. When the sample size is small (N0 = N1 = 25) linear regression has superior type-1 error rates compared to the other models. Small sample type-1 error rates can be improved in beta regression models using bias correction/reduction methods. In the power experiments, variable-dispersion beta regression and fractional logit regression models have slightly elevated power compared to linear regression models. Similar results were observed if the
Regression Phalanxes

OpenAIRE

Zhang, Hongyang; Welch, William J.; Zamar, Ruben H.

2017-01-01

Tomal et al. (2015) introduced the notion of "phalanxes" in the context of rare-class detection in two-class classification problems. A phalanx is a subset of features that work well for classification tasks. In this paper, we propose a different class of phalanxes for application in regression settings. We define a "Regression Phalanx" - a subset of features that work well together for prediction. We propose a novel algorithm which automatically chooses Regression Phalanxes from high-dimensi...
Advanced statistics: linear regression, part II: multiple linear regression.

Science.gov (United States)

Marill, Keith A

2004-01-01

The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.
Boosted beta regression.

Directory of Open Access Journals (Sweden)

Matthias Schmid

Full Text Available Regression analysis with a bounded outcome is a common problem in applied statistics. Typical examples include regression models for percentage outcomes and the analysis of ratings that are measured on a bounded scale. In this paper, we consider beta regression, which is a generalization of logit models to situations where the response is continuous on the interval (0,1. Consequently, beta regression is a convenient tool for analyzing percentage responses. The classical approach to fit a beta regression model is to use maximum likelihood estimation with subsequent AIC-based variable selection. As an alternative to this established - yet unstable - approach, we propose a new estimation technique called boosted beta regression. With boosted beta regression estimation and variable selection can be carried out simultaneously in a highly efficient way. Additionally, both the mean and the variance of a percentage response can be modeled using flexible nonlinear covariate effects. As a consequence, the new method accounts for common problems such as overdispersion and non-binomial variance structures.
Regression to Causality : Regression-style presentation influences causal attribution

DEFF Research Database (Denmark)

Bordacconi, Mats Joe; Larsen, Martin Vinæs

2014-01-01

of equivalent results presented as either regression models or as a test of two sample means. Our experiment shows that the subjects who were presented with results as estimates from a regression model were more inclined to interpret these results causally. Our experiment implies that scholars using regression...... models – one of the primary vehicles for analyzing statistical results in political science – encourage causal interpretation. Specifically, we demonstrate that presenting observational results in a regression model, rather than as a simple comparison of means, makes causal interpretation of the results...... more likely. Our experiment drew on a sample of 235 university students from three different social science degree programs (political science, sociology and economics), all of whom had received substantial training in statistics. The subjects were asked to compare and evaluate the validity...
Sequence stratigraphy of the Upper Cambrian (Furongian; Jiangshanian and Sunwaptan) Tunnel City Group, Upper Mississippi Valley: Transgressing assumptions of cratonic flooding

Science.gov (United States)

Eoff, Jennifer D.

2014-01-01

New data from detailed measured sections permit comprehensive analysis of the sequence framework of the Furongian (Upper Cambrian; Jiangshanian and Sunwaptan stages) Tunnel City Group (Lone Rock Formation and Mazomanie Formation) of Wisconsin and Minnesota. The sequence-stratigraphic architecture of the lower part of the Sunwaptan Stage at the base of the Tunnel City Group, at the contact between the Wonewoc Formation and Lone Rock Formation, records the first part of complex polyphase flooding (Sauk III) of the Laurentian craton, at a scale smaller than most events recorded by global sea-level curves. Flat-pebble conglomerate and glauconite document transgressive ravinement and development of a condensed section when creation of accommodation exceeded its consumption by sedimentation. Thinly-bedded, fossiliferous sandstone represents the most distal setting during earliest highstand. Subsequent deposition of sandstone characterized by hummocky or trough cross-stratification records progradational pulses of shallower, storm- and wave-dominated environments across the craton before final flooding of Sauk III commenced with carbonate deposition during the middle part of the Sunwaptan Stage. Comparison of early Sunwaptan flooding of the inner Laurentian craton to published interpretations from other parts of North America suggests that Sauk III was not a single, long-term accommodation event as previously proposed.
Seismic evidence for the preservation of several stacked Pleistocene coastal barrier/lagoon systems on the Gulf of Valencia continental shelf (western Mediterranean)

Science.gov (United States)

Albarracín, Silvia; Alcántara-Carrió, Javier; Barranco, Andrés; Sánchez García, María José; Fontán Bouzas, Ángela; Rey Salgado, Jorge

2013-04-01

The focus of this study is the analysis of coastal sand barriers and associated coastal lagoons on the inner continental shelf of the Gulf of Valencia (western Mediterranean), based on two W-E seismic profiles recorded seaward of the Albufera de Valencia coastal lagoon. Seismic facies identified include a number of coastal sand barriers with landward lagoons draped by contemporary continental shelf deposits. The barrier systems have been grouped into two sedimentary systems tracts, the older one corresponding to a prograding/aggrading highstand systems tract involving at least four paleo-coastal sand barrier/lagoon systems, followed landward by a transgressive systems tract comprising three such systems. All the systems have been allocated a Tyrrhenian age, the formation of individual barrier systems having been associated with successive sea-level stillstands, and their present-day position being explained by the very high regional subsidence rate. In summary, this study demonstrates that the Quaternary stratigraphic record of the Gulf of Valencia inner continental shelf is composed of littoral sand facies, in particular coastal sand barrier and lagoon deposits. These findings are in agreement with corresponding observations on other continental shelves of the western Mediterranean, showing that the formation of coastal sand barriers was a characteristic feature of this region during the Quaternary.
Regression analysis with categorized regression calibrated exposure: some interesting findings

Directory of Open Access Journals (Sweden)

Hjartåker Anette

2006-07-01

Full Text Available Abstract Background Regression calibration as a method for handling measurement error is becoming increasingly well-known and used in epidemiologic research. However, the standard version of the method is not appropriate for exposure analyzed on a categorical (e.g. quintile scale, an approach commonly used in epidemiologic studies. A tempting solution could then be to use the predicted continuous exposure obtained through the regression calibration method and treat it as an approximation to the true exposure, that is, include the categorized calibrated exposure in the main regression analysis. Methods We use semi-analytical calculations and simulations to evaluate the performance of the proposed approach compared to the naive approach of not correcting for measurement error, in situations where analyses are performed on quintile scale and when incorporating the original scale into the categorical variables, respectively. We also present analyses of real data, containing measures of folate intake and depression, from the Norwegian Women and Cancer study (NOWAC. Results In cases where extra information is available through replicated measurements and not validation data, regression calibration does not maintain important qualities of the true exposure distribution, thus estimates of variance and percentiles can be severely biased. We show that the outlined approach maintains much, in some cases all, of the misclassification found in the observed exposure. For that reason, regression analysis with the corrected variable included on a categorical scale is still biased. In some cases the corrected estimates are analytically equal to those obtained by the naive approach. Regression calibration is however vastly superior to the naive method when applying the medians of each category in the analysis. Conclusion Regression calibration in its most well-known form is not appropriate for measurement error correction when the exposure is analyzed on a
Knickpoint formation, rapid propagation, and landscape response following coastal cliff retreat at last-interglacial sea-level highstand: Kaua'i, Hawai'i

Science.gov (United States)

Lamb, Michael; Mackey, Ben; Scheingross, Joel; Farley, Ken

2013-04-01

The propagation of knickpoints through a landscape is recognized as a highly efficient mechanism of channel incision, and exerts a first-order control in communicating changes in base level throughout a landscape. However, few settings allow reconstruction of the long-term rate of knickpoint retreat. Here, we use cosmogenic 3He exposure dating of olivine within basalt to document the retreat rate of a waterfall in Ka'ula'ula Valley, a small catchment on the Na Pali coast of Kaua'i, Hawai'i. We constrained the exposure age of 18 features (in-channel boulders, stable boulders on terraces, and in-channel bedrock) along the length of the channel that allow us to discriminate between models of knickpoint propagation. Cosmogenic exposure ages are oldest near the coast (120 ka) and systematically decrease with upstream distance towards the waterfall (waterfall has migrated 4 km up valley over the past 120 ka at an average rate of 33 mm/yr. Steady-state vertical erosion appears to dominate upstream of the waterfall, where the channel has incised ~100 m into the original surface of the shield volcano. Our results indicate the lateral rate of knickpoint retreat exceeds rates of vertical channel incision by three orders of magnitude, and that knickpoints may be the primary driver of relief generation in Hawaiian catchments. Submarine landslides have been proposed as the cause of knickpoints in Kaua'i streams; however, the bathymetry off the northwest Kaua'i coast lacks evidence for large submarine flank collapse. Alternatively, we propose substantial cliff erosion during the last interglacial sea-level highstand generated a waterfall at the coast, which has subsequently propagated inland. Superimposing Kaua'i's subsidence history and Pleistocene sea level fluctuations indicate that the only time waves could have eroded cliffs at Ka'ula'ula Valley's entrance over the past 1.5 Ma was during the last interglacial, ~130-120 ka. Knickpoint generation during sea level high stands
Time-adaptive quantile regression

DEFF Research Database (Denmark)

Møller, Jan Kloppenborg; Nielsen, Henrik Aalborg; Madsen, Henrik

2008-01-01

and an updating procedure are combined into a new algorithm for time-adaptive quantile regression, which generates new solutions on the basis of the old solution, leading to savings in computation time. The suggested algorithm is tested against a static quantile regression model on a data set with wind power......An algorithm for time-adaptive quantile regression is presented. The algorithm is based on the simplex algorithm, and the linear optimization formulation of the quantile regression problem is given. The observations have been split to allow a direct use of the simplex algorithm. The simplex method...... production, where the models combine splines and quantile regression. The comparison indicates superior performance for the time-adaptive quantile regression in all the performance parameters considered....
Sequence-Stratigraphic Analysis of the Regional Observation Monitoring Program (ROMP) 29A Test Corehole and Its Relation to Carbonate Porosity and Regional Transmissivity in the Floridan Aquifer System, Highlands County, Florida

Science.gov (United States)

Ward, W. C.; Cunningham, K.J.; Renken, R.A.; Wacker, M.A.; Carlson, J.I.

2003-01-01

An analysis was made to describe and interpret the lithology of a part of the Upper Floridan aquifer penetrated by the Regional Observation Monitoring Program (ROMP) 29A test corehole in Highlands County, Florida. This information was integrated into a one-dimensional hydrostratigraphic model that delineates candidate flow zones and confining units in the context of sequence stratigraphy. Results from this test corehole will serve as a starting point to build a robust three-dimensional sequence-stratigraphic framework of the Floridan aquifer system. The ROMP 29A test corehole penetrated the Avon Park Formation, Ocala Limestone, Suwannee Limestone, and Hawthorn Group of middle Eocene to Pliocene age. The part of the Avon Park Formation penetrated in the ROMP 29A test corehole contains two composite depositional sequences. A transgressive systems tract and a highstand systems tract were interpreted for the upper composite sequence; however, only a highstand systems tract was interpreted for the lower composite sequence of the deeper Avon Park stratigraphic section. The composite depositional sequences are composed of at least five high-frequency depositional sequences. These sequences contain high-frequency cycle sets that are an amalgamation of vertically stacked high-frequency cycles. Three types of high-frequency cycles have been identified in the Avon Park Formation: peritidal, shallow subtidal, and deeper subtidal high-frequency cycles. The vertical distribution of carbonate-rock diffuse flow zones within the Avon Park Formation is heterogeneous. Porous vuggy intervals are less than 10 feet, and most are much thinner. The volumetric arrangement of the diffuse flow zones shows that most occur in the highstand systems tract of the lower composite sequence of the Avon Park Formation as compared to the upper composite sequence, which contains both a backstepping transgressive systems tract and a prograding highstand systems tract. Although the porous and permeable
Regression analysis by example

CERN Document Server

Chatterjee, Samprit

2012-01-01

Praise for the Fourth Edition: ""This book is . . . an excellent source of examples for regression analysis. It has been and still is readily readable and understandable."" -Journal of the American Statistical Association Regression analysis is a conceptually simple method for investigating relationships among variables. Carrying out a successful application of regression analysis, however, requires a balance of theoretical results, empirical rules, and subjective judgment. Regression Analysis by Example, Fifth Edition has been expanded
Applied logistic regression

CERN Document Server

Hosmer, David W; Sturdivant, Rodney X

2013-01-01

A new edition of the definitive guide to logistic regression modeling for health science and other applications This thoroughly expanded Third Edition provides an easily accessible introduction to the logistic regression (LR) model and highlights the power of this model by examining the relationship between a dichotomous outcome and a set of covariables. Applied Logistic Regression, Third Edition emphasizes applications in the health sciences and handpicks topics that best suit the use of modern statistical software. The book provides readers with state-of-
Normalization Ridge Regression in Practice I: Comparisons Between Ordinary Least Squares, Ridge Regression and Normalization Ridge Regression.

Science.gov (United States)

Bulcock, J. W.

The problem of model estimation when the data are collinear was examined. Though the ridge regression (RR) outperforms ordinary least squares (OLS) regression in the presence of acute multicollinearity, it is not a problem free technique for reducing the variance of the estimates. It is a stochastic procedure when it should be nonstochastic and it…
Vector regression introduced

Directory of Open Access Journals (Sweden)

Mok Tik

2014-06-01

Full Text Available This study formulates regression of vector data that will enable statistical analysis of various geodetic phenomena such as, polar motion, ocean currents, typhoon/hurricane tracking, crustal deformations, and precursory earthquake signals. The observed vector variable of an event (dependent vector variable is expressed as a function of a number of hypothesized phenomena realized also as vector variables (independent vector variables and/or scalar variables that are likely to impact the dependent vector variable. The proposed representation has the unique property of solving the coefficients of independent vector variables (explanatory variables also as vectors, hence it supersedes multivariate multiple regression models, in which the unknown coefficients are scalar quantities. For the solution, complex numbers are used to rep- resent vector information, and the method of least squares is deployed to estimate the vector model parameters after transforming the complex vector regression model into a real vector regression model through isomorphism. Various operational statistics for testing the predictive significance of the estimated vector parameter coefficients are also derived. A simple numerical example demonstrates the use of the proposed vector regression analysis in modeling typhoon paths.

Applied linear regression

CERN Document Server

Weisberg, Sanford

2013-01-01

Praise for the Third Edition ""...this is an excellent book which could easily be used as a course text...""-International Statistical Institute The Fourth Edition of Applied Linear Regression provides a thorough update of the basic theory and methodology of linear regression modeling. Demonstrating the practical applications of linear regression analysis techniques, the Fourth Edition uses interesting, real-world exercises and examples. Stressing central concepts such as model building, understanding parameters, assessing fit and reliability, and drawing conclusions, the new edition illus
Understanding poisson regression.

Science.gov (United States)

Hayat, Matthew J; Higgins, Melinda

2014-04-01

Nurse investigators often collect study data in the form of counts. Traditional methods of data analysis have historically approached analysis of count data either as if the count data were continuous and normally distributed or with dichotomization of the counts into the categories of occurred or did not occur. These outdated methods for analyzing count data have been replaced with more appropriate statistical methods that make use of the Poisson probability distribution, which is useful for analyzing count data. The purpose of this article is to provide an overview of the Poisson distribution and its use in Poisson regression. Assumption violations for the standard Poisson regression model are addressed with alternative approaches, including addition of an overdispersion parameter or negative binomial regression. An illustrative example is presented with an application from the ENSPIRE study, and regression modeling of comorbidity data is included for illustrative purposes. Copyright 2014, SLACK Incorporated.
Alternative Methods of Regression

CERN Document Server

Birkes, David

2011-01-01

Of related interest. Nonlinear Regression Analysis and its Applications Douglas M. Bates and Donald G. Watts ".an extraordinary presentation of concepts and methods concerning the use and analysis of nonlinear regression models.highly recommend[ed].for anyone needing to use and/or understand issues concerning the analysis of nonlinear regression models." --Technometrics This book provides a balance between theory and practice supported by extensive displays of instructive geometrical constructs. Numerous in-depth case studies illustrate the use of nonlinear regression analysis--with all data s
Introduction to regression graphics

CERN Document Server

Cook, R Dennis

2009-01-01

Covers the use of dynamic and interactive computer graphics in linear regression analysis, focusing on analytical graphics. Features new techniques like plot rotation. The authors have composed their own regression code, using Xlisp-Stat language called R-code, which is a nearly complete system for linear regression analysis and can be utilized as the main computer program in a linear regression course. The accompanying disks, for both Macintosh and Windows computers, contain the R-code and Xlisp-Stat. An Instructor's Manual presenting detailed solutions to all the problems in the book is ava
Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis.

Science.gov (United States)

Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon

2015-01-01

Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended.
Geochronology and paleoenvironment of pluvial Harper Lake, Mojave Desert, California, USA

Science.gov (United States)

Garcia, Anna L.; Knott, Jeffrey R.; Mahan, Shannon; Bright, Jordan

2014-01-01

Accurate reconstruction of the paleo-Mojave River and pluvial lake (Harper, Manix, Cronese, and Mojave) system of southern California is critical to understanding paleoclimate and the North American polar jet stream position over the last 500 ka. Previous studies inferred a polar jet stream south of 35°N at 18 ka and at ~ 40°N at 17–14 ka. Highstand sediments of Harper Lake, the upstream-most pluvial lake along the Mojave River, have yielded uncalibrated radiocarbon ages ranging from 24,000 to > 30,000 14C yr BP. Based on geologic mapping, radiocarbon and optically stimulated luminescence dating, we infer a ~ 45–40 ka age for the Harper Lake highstand sediments. Combining the Harper Lake highstand with other Great Basin pluvial lake/spring and marine climate records, we infer that the North American polar jet stream was south of 35°N about 45–40 ka, but shifted to 40°N by ~ 35 ka. Ostracodes (Limnocythere ceriotuberosa) from Harper Lake highstand sediments are consistent with an alkaline lake environment that received seasonal inflow from the Mojave River, thus confirming the lake was fed by the Mojave River. The ~ 45–40 ka highstand at Harper Lake coincides with a shallowing interval at downstream Lake Manix.
New allocyclic dimensions in a prograding carbonate bank: Evidence for eustatic, tectonic, and paleoceanographic control (late Neogene, Bahamas)

Science.gov (United States)

Lidz, B.H.; McNeill, D.F.

1997-01-01

The deep-sea record, examined recently for the first time in a shallow-depocenter setting, has unveiled remarkable evidence for new sedimentary components and allocyclic complexity in a large, well-studied carbonate bank, the western Great Bahama Bank. The evidence is a composite foraminiferal signature - Paleocene to early Miocene (allogenic or reworked) and late Miocene to late Pliocene (host) planktic taxa, and redeposited middle Miocene shallow benthic faunas. Ages of the oldest and youngest planktic groups range from ??? 66 to ??? 2 Ma. The reworked and redeposited taxa are a proxy for significant sediment components that otherwise have no lithofacies or seismic resolution. The composite signature, reinforced by a distinctive distribution of the reworked and redeposited faunas, documents a much more complex late Neogene depositional system than previously known. The system is more than progradational. The source sequences that supplied the constituent bank-margin grains formed at different water depths and over hundreds of kilometers and tens of millions of years apart. New evidence from the literature and from data obtained during Ocean Drilling Program (OOP) Leg 166 in the Santaren Channel (Bahamas) support early interpretations based on the composite fossil record and provide valuable new dimensions to regional allocyclicity. The middle Miocene taxa were confined to the lower part of the section by the latest Miocene-earliest Pliocene(?) lowstand of sea level. An orderly occurrence of the allogenic taxa is unique to the global reworked geologic record and appears to have been controlled by a combination of Paleogene-early Neogene tectonics at the source, eustatic changes, and late Neogene current activity at the source and across the bank. The allogenic taxa expand the spatial and temporal range of information in the northern Bahamas by nearly an order of magnitude. In essence, some of the major processes active in the region during ??? 64 m.y. of the
Regression and regression analysis time series prediction modeling on climate data of quetta, pakistan

International Nuclear Information System (INIS)

Jafri, Y.Z.; Kamal, L.

2007-01-01

Various statistical techniques was used on five-year data from 1998-2002 of average humidity, rainfall, maximum and minimum temperatures, respectively. The relationships to regression analysis time series (RATS) were developed for determining the overall trend of these climate parameters on the basis of which forecast models can be corrected and modified. We computed the coefficient of determination as a measure of goodness of fit, to our polynomial regression analysis time series (PRATS). The correlation to multiple linear regression (MLR) and multiple linear regression analysis time series (MLRATS) were also developed for deciphering the interdependence of weather parameters. Spearman's rand correlation and Goldfeld-Quandt test were used to check the uniformity or non-uniformity of variances in our fit to polynomial regression (PR). The Breusch-Pagan test was applied to MLR and MLRATS, respectively which yielded homoscedasticity. We also employed Bartlett's test for homogeneity of variances on a five-year data of rainfall and humidity, respectively which showed that the variances in rainfall data were not homogenous while in case of humidity, were homogenous. Our results on regression and regression analysis time series show the best fit to prediction modeling on climatic data of Quetta, Pakistan. (author)
Linear regression in astronomy. I

Science.gov (United States)

Isobe, Takashi; Feigelson, Eric D.; Akritas, Michael G.; Babu, Gutti Jogesh

1990-01-01

Five methods for obtaining linear regression fits to bivariate data with unknown or insignificant measurement errors are discussed: ordinary least-squares (OLS) regression of Y on X, OLS regression of X on Y, the bisector of the two OLS lines, orthogonal regression, and 'reduced major-axis' regression. These methods have been used by various researchers in observational astronomy, most importantly in cosmic distance scale applications. Formulas for calculating the slope and intercept coefficients and their uncertainties are given for all the methods, including a new general form of the OLS variance estimates. The accuracy of the formulas was confirmed using numerical simulations. The applicability of the procedures is discussed with respect to their mathematical properties, the nature of the astronomical data under consideration, and the scientific purpose of the regression. It is found that, for problems needing symmetrical treatment of the variables, the OLS bisector performs significantly better than orthogonal or reduced major-axis regression.
Logic regression and its extensions.

Science.gov (United States)

Schwender, Holger; Ruczinski, Ingo

2010-01-01

Logic regression is an adaptive classification and regression procedure, initially developed to reveal interacting single nucleotide polymorphisms (SNPs) in genetic association studies. In general, this approach can be used in any setting with binary predictors, when the interaction of these covariates is of primary interest. Logic regression searches for Boolean (logic) combinations of binary variables that best explain the variability in the outcome variable, and thus, reveals variables and interactions that are associated with the response and/or have predictive capabilities. The logic expressions are embedded in a generalized linear regression framework, and thus, logic regression can handle a variety of outcome types, such as binary responses in case-control studies, numeric responses, and time-to-event data. In this chapter, we provide an introduction to the logic regression methodology, list some applications in public health and medicine, and summarize some of the direct extensions and modifications of logic regression that have been proposed in the literature. Copyright © 2010 Elsevier Inc. All rights reserved.
Tumor regression patterns in retinoblastoma

International Nuclear Information System (INIS)

Zafar, S.N.; Siddique, S.N.; Zaheer, N.

2016-01-01

To observe the types of tumor regression after treatment, and identify the common pattern of regression in our patients. Study Design: Descriptive study. Place and Duration of Study: Department of Pediatric Ophthalmology and Strabismus, Al-Shifa Trust Eye Hospital, Rawalpindi, Pakistan, from October 2011 to October 2014. Methodology: Children with unilateral and bilateral retinoblastoma were included in the study. Patients were referred to Pakistan Institute of Medical Sciences, Islamabad, for chemotherapy. After every cycle of chemotherapy, dilated funds examination under anesthesia was performed to record response of the treatment. Regression patterns were recorded on RetCam II. Results: Seventy-four tumors were included in the study. Out of 74 tumors, 3 were ICRB group A tumors, 43 were ICRB group B tumors, 14 tumors belonged to ICRB group C, and remaining 14 were ICRB group D tumors. Type IV regression was seen in 39.1% (n=29) tumors, type II in 29.7% (n=22), type III in 25.6% (n=19), and type I in 5.4% (n=4). All group A tumors (100%) showed type IV regression. Seventeen (39.5%) group B tumors showed type IV regression. In group C, 5 tumors (35.7%) showed type II regression and 5 tumors (35.7%) showed type IV regression. In group D, 6 tumors (42.9%) regressed to type II non-calcified remnants. Conclusion: The response and success of the focal and systemic treatment, as judged by the appearance of different patterns of tumor regression, varies with the ICRB grouping of the tumor. (author)
Late Pleistocene sea-level changes recorded in tidal and fluvial deposits from Itaubal Formation, onshore portion of the Foz do Amazonas Basin, Brazil

Directory of Open Access Journals (Sweden)

Isaac Salém Alves Azevedo Bezerra

Full Text Available ABSTRACTThe Pleistocene deposits exposed in the Amapá Coastal Plain (onshore portion of the Foz do Amazonas Basin, northeastern South America were previously interpreted as Miocene in age. In this work, they were named as "Itaubal Formation" and were included in the quaternary coastal history of Amazonia. The study, through facies and stratigraphic analyses in combination with optically stimulated luminescence (single and multiple aliquot regeneration, allowed interpreting this unit as Late Pleistocene tidal and fluvial deposits. The Itaubal Formation, which unconformably overlies strongly weathered basement rocks of the Guianas Shield, was subdivided into two progradational units, separated by an unconformity related to sea-level fall, here named as Lower and Upper Units. The Lower Unit yielded ages between 120,600 (± 12,000 and 70,850 (± 6,700 years BP and consists of subtidal flat, tide-influenced meandering stream and floodplain deposits, during highstand conditions. The Upper Unit spans between 69,150 (± 7,200 and 58,150 (± 6,800 years BP and is characterized by braided fluvial deposits incised in the Lower Unit, related to base-level fall; lowstand conditions remained until 23,500 (± 3,000 years BP. The studied region was likely exposed during the Last Glacial Maximum and then during Holocene, covered by tidal deposits influenced by the Amazon River.
Combining Alphas via Bounded Regression

Directory of Open Access Journals (Sweden)

Zura Kakushadze

2015-11-01

Full Text Available We give an explicit algorithm and source code for combining alpha streams via bounded regression. In practical applications, typically, there is insufficient history to compute a sample covariance matrix (SCM for a large number of alphas. To compute alpha allocation weights, one then resorts to (weighted regression over SCM principal components. Regression often produces alpha weights with insufficient diversification and/or skewed distribution against, e.g., turnover. This can be rectified by imposing bounds on alpha weights within the regression procedure. Bounded regression can also be applied to stock and other asset portfolio construction. We discuss illustrative examples.
riskRegression

DEFF Research Database (Denmark)

Ozenne, Brice; Sørensen, Anne Lyngholm; Scheike, Thomas

2017-01-01

In the presence of competing risks a prediction of the time-dynamic absolute risk of an event can be based on cause-specific Cox regression models for the event and the competing risks (Benichou and Gail, 1990). We present computationally fast and memory optimized C++ functions with an R interface...... for predicting the covariate specific absolute risks, their confidence intervals, and their confidence bands based on right censored time to event data. We provide explicit formulas for our implementation of the estimator of the (stratified) baseline hazard function in the presence of tied event times. As a by...... functionals. The software presented here is implemented in the riskRegression package....
Regression in autistic spectrum disorders.

Science.gov (United States)

Stefanatos, Gerry A

2008-12-01

A significant proportion of children diagnosed with Autistic Spectrum Disorder experience a developmental regression characterized by a loss of previously-acquired skills. This may involve a loss of speech or social responsitivity, but often entails both. This paper critically reviews the phenomena of regression in autistic spectrum disorders, highlighting the characteristics of regression, age of onset, temporal course, and long-term outcome. Important considerations for diagnosis are discussed and multiple etiological factors currently hypothesized to underlie the phenomenon are reviewed. It is argued that regressive autistic spectrum disorders can be conceptualized on a spectrum with other regressive disorders that may share common pathophysiological features. The implications of this viewpoint are discussed.
Understanding logistic regression analysis

OpenAIRE

Sperandei, Sandro

2014-01-01

Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using ex...
Refining the model of barrier island formation along a paraglacial coast in the Gulf of Maine

Science.gov (United States)

Hein, Christopher J.; FitzGerald, Duncan M.; Carruthers, Emily A.; Stone, Byron D.; Barnhardt, Walter A.; Gontz, Allen M.

2012-01-01

Details of the internal architecture and local geochronology of Plum Island, the longest barrier in the Gulf of Maine, have refined our understanding of barrier island formation in paraglacial settings. Ground-penetrating radar and shallow-seismic profiles coupled with sediment cores and radiocarbon dates provide an 8000-year evolutionary history of this barrier system in response to changes in sediment sources and supply rates as well as variability in the rate of sea-level change. The barrier sequence overlies tills of Wisconsinan and Illinoian glaciations as well as late Pleistocene glaciomarine clay deposited during the post-glacial sea-level highstand at approximately 17 ka. Holocene sediment began accumulating at the site of Plum Island at 7–8 ka, in the form of coarse fluvial channel-lag deposits related to the 50-m wide erosional channel of the Parker River that carved into underlying glaciomarine deposits during a lower stand of sea level. Plum Island had first developed in its modern location by ca. 3.6 ka through onshore migration and vertical accretion of reworked regressive and lowstand deposits. The prevalence of southerly, seaward-dipping layers indicates that greater than 60% of the barrier lithosome developed in its modern location through southerly spit progradation, consistent with a dominantly longshore transport system driven by northeast storms. Thinner sequences of northerly, landward-dipping clinoforms represent the northern recurve of the prograding spit. A 5–6-m-thick inlet-fill sequence was identified overlying the lower stand fluvial deposit; its stratigraphy captures events of channel migration, ebb-delta breaching, onshore bar migration, channel shoaling and inlet infilling associated with the migration and eventual closure of the inlet. This inlet had a maximum cross-sectional area of 2800 m2 and was active around 3.5–3.6 ka. Discovery of this inlet suggests that the tidal prism was once larger than at present. Bay infilling
Linear regression in astronomy. II

Science.gov (United States)

Feigelson, Eric D.; Babu, Gutti J.

1992-01-01

A wide variety of least-squares linear regression procedures used in observational astronomy, particularly investigations of the cosmic distance scale, are presented and discussed. The classes of linear models considered are (1) unweighted regression lines, with bootstrap and jackknife resampling; (2) regression solutions when measurement error, in one or both variables, dominates the scatter; (3) methods to apply a calibration line to new data; (4) truncated regression models, which apply to flux-limited data sets; and (5) censored regression models, which apply when nondetections are present. For the calibration problem we develop two new procedures: a formula for the intercept offset between two parallel data sets, which propagates slope errors from one regression to the other; and a generalization of the Working-Hotelling confidence bands to nonstandard least-squares lines. They can provide improved error analysis for Faber-Jackson, Tully-Fisher, and similar cosmic distance scale relations.
A Matlab program for stepwise regression

Directory of Open Access Journals (Sweden)

Yanhong Qi

2016-03-01

Full Text Available The stepwise linear regression is a multi-variable regression for identifying statistically significant variables in the linear regression equation. In present study, we presented the Matlab program of stepwise regression.
Quantile regression theory and applications

CERN Document Server

Davino, Cristina; Vistocco, Domenico

2013-01-01

A guide to the implementation and interpretation of Quantile Regression models This book explores the theory and numerous applications of quantile regression, offering empirical data analysis as well as the software tools to implement the methods. The main focus of this book is to provide the reader with a comprehensivedescription of the main issues concerning quantile regression; these include basic modeling, geometrical interpretation, estimation and inference for quantile regression, as well as issues on validity of the model, diagnostic tools. Each methodological aspect is explored and

Fungible weights in logistic regression.

Science.gov (United States)

Jones, Jeff A; Waller, Niels G

2016-06-01

In this article we develop methods for assessing parameter sensitivity in logistic regression models. To set the stage for this work, we first review Waller's (2008) equations for computing fungible weights in linear regression. Next, we describe 2 methods for computing fungible weights in logistic regression. To demonstrate the utility of these methods, we compute fungible logistic regression weights using data from the Centers for Disease Control and Prevention's (2010) Youth Risk Behavior Surveillance Survey, and we illustrate how these alternate weights can be used to evaluate parameter sensitivity. To make our work accessible to the research community, we provide R code (R Core Team, 2015) that will generate both kinds of fungible logistic regression weights. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Principal component regression analysis with SPSS.

Science.gov (United States)

Liu, R X; Kuang, J; Gong, Q; Hou, X L

2003-06-01

The paper introduces all indices of multicollinearity diagnoses, the basic principle of principal component regression and determination of 'best' equation method. The paper uses an example to describe how to do principal component regression analysis with SPSS 10.0: including all calculating processes of the principal component regression and all operations of linear regression, factor analysis, descriptives, compute variable and bivariate correlations procedures in SPSS 10.0. The principal component regression analysis can be used to overcome disturbance of the multicollinearity. The simplified, speeded up and accurate statistical effect is reached through the principal component regression analysis with SPSS.
Logistic regression models

CERN Document Server

Hilbe, Joseph M

2009-01-01

This book really does cover everything you ever wanted to know about logistic regression … with updates available on the author's website. Hilbe, a former national athletics champion, philosopher, and expert in astronomy, is a master at explaining statistical concepts and methods. Readers familiar with his other expository work will know what to expect-great clarity.The book provides considerable detail about all facets of logistic regression. No step of an argument is omitted so that the book will meet the needs of the reader who likes to see everything spelt out, while a person familiar with some of the topics has the option to skip "obvious" sections. The material has been thoroughly road-tested through classroom and web-based teaching. … The focus is on helping the reader to learn and understand logistic regression. The audience is not just students meeting the topic for the first time, but also experienced users. I believe the book really does meet the author's goal … .-Annette J. Dobson, Biometric...
Logistic regression applied to natural hazards: rare event logistic regression with replications

Science.gov (United States)

Guns, M.; Vanacker, V.

2012-06-01

Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.
Understanding logistic regression analysis.

Science.gov (United States)

Sperandei, Sandro

2014-01-01

Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using examples to make it as simple as possible. After definition of the technique, the basic interpretation of the results is highlighted and then some special issues are discussed.
First report of garnet corundum rocks from southern India: Implications for prograde high-pressure (eclogite-facies?) metamorphism

Science.gov (United States)

Shimpo, Makoto; Tsunogae, Toshiaki; Santosh, M.

2006-02-01

We report here for the first time the occurrence of garnet and corundum in Mg-Al-rich rocks at Sevitturangampatti (Namakkal district) in the Palghat-Cauvery Shear Zone System (PCSS), southern India. The rocks contain several rare mineral assemblages such as garnet-corundum-sillimanite-cordierite-sapphirine-spinel-Mg-rich staurolite, garnet-corundum-sodic gedrite-cordierite-sillimanite/kyanite, garnet-Mg-rich staurolite-sillimanite/kyanite, sodic gedrite-Mg-rich staurolite-corundum-sapphirine, biotite-corundum-sapphirine and sodic gedrite-sapphirine-spinel-cordierite. Both garnet and corundum in these rocks occur as coarse-grained (1 mm to 10 cm) porphyroblasts in the matrix of sillimanite, cordierite and gedrite. Kyanite is common as inclusions in garnet, but matrix aluminosilicates are mainly sillimanite. The presence of rare garnet + corundum, which has so far been reported from kimberlite xenoliths, aluminous eclogites and ultrahigh-pressure metamorphic rocks as well as in high-pressure experiments, suggests that the assemblage is an indicator of an unusually high-pressure event, which has not been recorded in previous studies from southern India. Phase analysis of quartz-absent MAS system also suggests high-pressure stability of the assemblage. The inference of high pressure metamorphism is also supported by the presence of Mg-rich [Mg/(Fe + Mg) = 0.51] staurolite, which has been reported from high-pressure rocks, included from cores of coarse-grained garnet and gedrite. Porphyroblastic occurrence of garnet + corundum as well as staurolite and kyanite inclusions suggests that the area underwent prograde high-pressure metamorphism, probably in the eclogite field. The rocks subsequently underwent continuous heating at 940 to 990 °C, suggesting ultrahigh-temperature (UHT) metamorphism along a clockwise trajectory. Sapphirine + cordierite and spinel + cordierite symplectites between garnet and sillimanite suggest near isothermal decompression after the peak event
Minimax Regression Quantiles

DEFF Research Database (Denmark)

Bache, Stefan Holst

A new and alternative quantile regression estimator is developed and it is shown that the estimator is root n-consistent and asymptotically normal. The estimator is based on a minimax ‘deviance function’ and has asymptotically equivalent properties to the usual quantile regression estimator. It is......, however, a different and therefore new estimator. It allows for both linear- and nonlinear model specifications. A simple algorithm for computing the estimates is proposed. It seems to work quite well in practice but whether it has theoretical justification is still an open question....
Regression with Sparse Approximations of Data

DEFF Research Database (Denmark)

Noorzad, Pardis; Sturm, Bob L.

2012-01-01

We propose sparse approximation weighted regression (SPARROW), a method for local estimation of the regression function that uses sparse approximation with a dictionary of measurements. SPARROW estimates the regression function at a point with a linear combination of a few regressands selected...... by a sparse approximation of the point in terms of the regressors. We show SPARROW can be considered a variant of \\(k\\)-nearest neighbors regression (\\(k\\)-NNR), and more generally, local polynomial kernel regression. Unlike \\(k\\)-NNR, however, SPARROW can adapt the number of regressors to use based...
Logistic regression applied to natural hazards: rare event logistic regression with replications

Directory of Open Access Journals (Sweden)

M. Guns

2012-06-01

Full Text Available Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.
A simple approach to power and sample size calculations in logistic regression and Cox regression models.

Science.gov (United States)

Vaeth, Michael; Skovlund, Eva

2004-06-15

For a given regression problem it is possible to identify a suitably defined equivalent two-sample problem such that the power or sample size obtained for the two-sample problem also applies to the regression problem. For a standard linear regression model the equivalent two-sample problem is easily identified, but for generalized linear models and for Cox regression models the situation is more complicated. An approximately equivalent two-sample problem may, however, also be identified here. In particular, we show that for logistic regression and Cox regression models the equivalent two-sample problem is obtained by selecting two equally sized samples for which the parameters differ by a value equal to the slope times twice the standard deviation of the independent variable and further requiring that the overall expected number of events is unchanged. In a simulation study we examine the validity of this approach to power calculations in logistic regression and Cox regression models. Several different covariate distributions are considered for selected values of the overall response probability and a range of alternatives. For the Cox regression model we consider both constant and non-constant hazard rates. The results show that in general the approach is remarkably accurate even in relatively small samples. Some discrepancies are, however, found in small samples with few events and a highly skewed covariate distribution. Comparison with results based on alternative methods for logistic regression models with a single continuous covariate indicates that the proposed method is at least as good as its competitors. The method is easy to implement and therefore provides a simple way to extend the range of problems that can be covered by the usual formulas for power and sample size determination. Copyright 2004 John Wiley & Sons, Ltd.
Post-processing through linear regression

Science.gov (United States)

van Schaeybroeck, B.; Vannitsem, S.

2011-03-01

Various post-processing techniques are compared for both deterministic and ensemble forecasts, all based on linear regression between forecast data and observations. In order to evaluate the quality of the regression methods, three criteria are proposed, related to the effective correction of forecast error, the optimal variability of the corrected forecast and multicollinearity. The regression schemes under consideration include the ordinary least-square (OLS) method, a new time-dependent Tikhonov regularization (TDTR) method, the total least-square method, a new geometric-mean regression (GM), a recently introduced error-in-variables (EVMOS) method and, finally, a "best member" OLS method. The advantages and drawbacks of each method are clarified. These techniques are applied in the context of the 63 Lorenz system, whose model version is affected by both initial condition and model errors. For short forecast lead times, the number and choice of predictors plays an important role. Contrarily to the other techniques, GM degrades when the number of predictors increases. At intermediate lead times, linear regression is unable to provide corrections to the forecast and can sometimes degrade the performance (GM and the best member OLS with noise). At long lead times the regression schemes (EVMOS, TDTR) which yield the correct variability and the largest correlation between ensemble error and spread, should be preferred.
Regression modeling methods, theory, and computation with SAS

CERN Document Server

Panik, Michael

2009-01-01

Regression Modeling: Methods, Theory, and Computation with SAS provides an introduction to a diverse assortment of regression techniques using SAS to solve a wide variety of regression problems. The author fully documents the SAS programs and thoroughly explains the output produced by the programs.The text presents the popular ordinary least squares (OLS) approach before introducing many alternative regression methods. It covers nonparametric regression, logistic regression (including Poisson regression), Bayesian regression, robust regression, fuzzy regression, random coefficients regression,
Better Autologistic Regression

Directory of Open Access Journals (Sweden)

Mark A. Wolters

2017-11-01

Full Text Available Autologistic regression is an important probability model for dichotomous random variables observed along with covariate information. It has been used in various fields for analyzing binary data possessing spatial or network structure. The model can be viewed as an extension of the autologistic model (also known as the Ising model, quadratic exponential binary distribution, or Boltzmann machine to include covariates. It can also be viewed as an extension of logistic regression to handle responses that are not independent. Not all authors use exactly the same form of the autologistic regression model. Variations of the model differ in two respects. First, the variable coding—the two numbers used to represent the two possible states of the variables—might differ. Common coding choices are (zero, one and (minus one, plus one. Second, the model might appear in either of two algebraic forms: a standard form, or a recently proposed centered form. Little attention has been paid to the effect of these differences, and the literature shows ambiguity about their importance. It is shown here that changes to either coding or centering in fact produce distinct, non-nested probability models. Theoretical results, numerical studies, and analysis of an ecological data set all show that the differences among the models can be large and practically significant. Understanding the nature of the differences and making appropriate modeling choices can lead to significantly improved autologistic regression analyses. The results strongly suggest that the standard model with plus/minus coding, which we call the symmetric autologistic model, is the most natural choice among the autologistic variants.
Semiparametric regression during 2003–2007

KAUST Repository

Ruppert, David; Wand, M.P.; Carroll, Raymond J.

2009-01-01

Semiparametric regression is a fusion between parametric regression and nonparametric regression that integrates low-rank penalized splines, mixed model and hierarchical Bayesian methodology – thus allowing more streamlined handling of longitudinal and spatial correlation. We review progress in the field over the five-year period between 2003 and 2007. We find semiparametric regression to be a vibrant field with substantial involvement and activity, continual enhancement and widespread application.
Unbalanced Regressions and the Predictive Equation

DEFF Research Database (Denmark)

Osterrieder, Daniela; Ventosa-Santaulària, Daniel; Vera-Valdés, J. Eduardo

Predictive return regressions with persistent regressors are typically plagued by (asymptotically) biased/inconsistent estimates of the slope, non-standard or potentially even spurious statistical inference, and regression unbalancedness. We alleviate the problem of unbalancedness in the theoreti......Predictive return regressions with persistent regressors are typically plagued by (asymptotically) biased/inconsistent estimates of the slope, non-standard or potentially even spurious statistical inference, and regression unbalancedness. We alleviate the problem of unbalancedness...
Comparison of multinomial logistic regression and logistic regression: which is more efficient in allocating land use?

Science.gov (United States)

Lin, Yingzhi; Deng, Xiangzheng; Li, Xing; Ma, Enjun

2014-12-01

Spatially explicit simulation of land use change is the basis for estimating the effects of land use and cover change on energy fluxes, ecology and the environment. At the pixel level, logistic regression is one of the most common approaches used in spatially explicit land use allocation models to determine the relationship between land use and its causal factors in driving land use change, and thereby to evaluate land use suitability. However, these models have a drawback in that they do not determine/allocate land use based on the direct relationship between land use change and its driving factors. Consequently, a multinomial logistic regression method was introduced to address this flaw, and thereby, judge the suitability of a type of land use in any given pixel in a case study area of the Jiangxi Province, China. A comparison of the two regression methods indicated that the proportion of correctly allocated pixels using multinomial logistic regression was 92.98%, which was 8.47% higher than that obtained using logistic regression. Paired t-test results also showed that pixels were more clearly distinguished by multinomial logistic regression than by logistic regression. In conclusion, multinomial logistic regression is a more efficient and accurate method for the spatial allocation of land use changes. The application of this method in future land use change studies may improve the accuracy of predicting the effects of land use and cover change on energy fluxes, ecology, and environment.
Interpretation of commonly used statistical regression models.

Science.gov (United States)

Kasza, Jessica; Wolfe, Rory

2014-01-01

A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.
Linear regression

CERN Document Server

Olive, David J

2017-01-01

This text covers both multiple linear regression and some experimental design models. The text uses the response plot to visualize the model and to detect outliers, does not assume that the error distribution has a known parametric distribution, develops prediction intervals that work when the error distribution is unknown, suggests bootstrap hypothesis tests that may be useful for inference after variable selection, and develops prediction regions and large sample theory for the multivariate linear regression model that has m response variables. A relationship between multivariate prediction regions and confidence regions provides a simple way to bootstrap confidence regions. These confidence regions often provide a practical method for testing hypotheses. There is also a chapter on generalized linear models and generalized additive models. There are many R functions to produce response and residual plots, to simulate prediction intervals and hypothesis tests, to detect outliers, and to choose response trans...
Regression modeling of ground-water flow

Science.gov (United States)

Cooley, R.L.; Naff, R.L.

1985-01-01

Nonlinear multiple regression methods are developed to model and analyze groundwater flow systems. Complete descriptions of regression methodology as applied to groundwater flow models allow scientists and engineers engaged in flow modeling to apply the methods to a wide range of problems. Organization of the text proceeds from an introduction that discusses the general topic of groundwater flow modeling, to a review of basic statistics necessary to properly apply regression techniques, and then to the main topic: exposition and use of linear and nonlinear regression to model groundwater flow. Statistical procedures are given to analyze and use the regression models. A number of exercises and answers are included to exercise the student on nearly all the methods that are presented for modeling and statistical analysis. Three computer programs implement the more complex methods. These three are a general two-dimensional, steady-state regression model for flow in an anisotropic, heterogeneous porous medium, a program to calculate a measure of model nonlinearity with respect to the regression parameters, and a program to analyze model errors in computed dependent variables such as hydraulic head. (USGS)
Post-processing through linear regression

Directory of Open Access Journals (Sweden)

B. Van Schaeybroeck

2011-03-01

Full Text Available Various post-processing techniques are compared for both deterministic and ensemble forecasts, all based on linear regression between forecast data and observations. In order to evaluate the quality of the regression methods, three criteria are proposed, related to the effective correction of forecast error, the optimal variability of the corrected forecast and multicollinearity. The regression schemes under consideration include the ordinary least-square (OLS method, a new time-dependent Tikhonov regularization (TDTR method, the total least-square method, a new geometric-mean regression (GM, a recently introduced error-in-variables (EVMOS method and, finally, a "best member" OLS method. The advantages and drawbacks of each method are clarified.

These techniques are applied in the context of the 63 Lorenz system, whose model version is affected by both initial condition and model errors. For short forecast lead times, the number and choice of predictors plays an important role. Contrarily to the other techniques, GM degrades when the number of predictors increases. At intermediate lead times, linear regression is unable to provide corrections to the forecast and can sometimes degrade the performance (GM and the best member OLS with noise. At long lead times the regression schemes (EVMOS, TDTR which yield the correct variability and the largest correlation between ensemble error and spread, should be preferred.

A comparison of random forest regression and multiple linear regression for prediction in neuroscience.

Science.gov (United States)

Smith, Paul F; Ganesh, Siva; Liu, Ping

2013-10-30

Regression is a common statistical tool for prediction in neuroscience. However, linear regression is by far the most common form of regression used, with regression trees receiving comparatively little attention. In this study, the results of conventional multiple linear regression (MLR) were compared with those of random forest regression (RFR), in the prediction of the concentrations of 9 neurochemicals in the vestibular nucleus complex and cerebellum that are part of the l-arginine biochemical pathway (agmatine, putrescine, spermidine, spermine, l-arginine, l-ornithine, l-citrulline, glutamate and γ-aminobutyric acid (GABA)). The R(2) values for the MLRs were higher than the proportion of variance explained values for the RFRs: 6/9 of them were ≥ 0.70 compared to 4/9 for RFRs. Even the variables that had the lowest R(2) values for the MLRs, e.g. ornithine (0.50) and glutamate (0.61), had much lower proportion of variance explained values for the RFRs (0.27 and 0.49, respectively). The RSE values for the MLRs were lower than those for the RFRs in all but two cases. In general, MLRs seemed to be superior to the RFRs in terms of predictive value and error. In the case of this data set, MLR appeared to be superior to RFR in terms of its explanatory value and error. This result suggests that MLR may have advantages over RFR for prediction in neuroscience with this kind of data set, but that RFR can still have good predictive value in some cases. Copyright © 2013 Elsevier B.V. All rights reserved.
Logistic regression applied to natural hazards: rare event logistic regression with replications

OpenAIRE

Guns, M.; Vanacker, Veerle

2012-01-01

Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logisti...
A Seemingly Unrelated Poisson Regression Model

OpenAIRE

King, Gary

1989-01-01

This article introduces a new estimator for the analysis of two contemporaneously correlated endogenous event count variables. This seemingly unrelated Poisson regression model (SUPREME) estimator combines the efficiencies created by single equation Poisson regression model estimators and insights from "seemingly unrelated" linear regression models.
Recursive Algorithm For Linear Regression

Science.gov (United States)

Varanasi, S. V.

1988-01-01

Order of model determined easily. Linear-regression algorithhm includes recursive equations for coefficients of model of increased order. Algorithm eliminates duplicative calculations, facilitates search for minimum order of linear-regression model fitting set of data satisfactory.
Applied regression analysis a research tool

CERN Document Server

Pantula, Sastry; Dickey, David

1998-01-01

Least squares estimation, when used appropriately, is a powerful research tool. A deeper understanding of the regression concepts is essential for achieving optimal benefits from a least squares analysis. This book builds on the fundamentals of statistical methods and provides appropriate concepts that will allow a scientist to use least squares as an effective research tool. Applied Regression Analysis is aimed at the scientist who wishes to gain a working knowledge of regression analysis. The basic purpose of this book is to develop an understanding of least squares and related statistical methods without becoming excessively mathematical. It is the outgrowth of more than 30 years of consulting experience with scientists and many years of teaching an applied regression course to graduate students. Applied Regression Analysis serves as an excellent text for a service course on regression for non-statisticians and as a reference for researchers. It also provides a bridge between a two-semester introduction to...
Standards for Standardized Logistic Regression Coefficients

Science.gov (United States)

Menard, Scott

2011-01-01

Standardized coefficients in logistic regression analysis have the same utility as standardized coefficients in linear regression analysis. Although there has been no consensus on the best way to construct standardized logistic regression coefficients, there is now sufficient evidence to suggest a single best approach to the construction of a…
[Application of negative binomial regression and modified Poisson regression in the research of risk factors for injury frequency].

Science.gov (United States)

Cao, Qingqing; Wu, Zhenqiang; Sun, Ying; Wang, Tiezhu; Han, Tengwei; Gu, Chaomei; Sun, Yehuan

2011-11-01

To Eexplore the application of negative binomial regression and modified Poisson regression analysis in analyzing the influential factors for injury frequency and the risk factors leading to the increase of injury frequency. 2917 primary and secondary school students were selected from Hefei by cluster random sampling method and surveyed by questionnaire. The data on the count event-based injuries used to fitted modified Poisson regression and negative binomial regression model. The risk factors incurring the increase of unintentional injury frequency for juvenile students was explored, so as to probe the efficiency of these two models in studying the influential factors for injury frequency. The Poisson model existed over-dispersion (P Poisson regression and negative binomial regression model, was fitted better. respectively. Both showed that male gender, younger age, father working outside of the hometown, the level of the guardian being above junior high school and smoking might be the results of higher injury frequencies. On a tendency of clustered frequency data on injury event, both the modified Poisson regression analysis and negative binomial regression analysis can be used. However, based on our data, the modified Poisson regression fitted better and this model could give a more accurate interpretation of relevant factors affecting the frequency of injury.
Logistic regression for dichotomized counts.

Science.gov (United States)

Preisser, John S; Das, Kalyan; Benecha, Habtamu; Stamm, John W

2016-12-01

Sometimes there is interest in a dichotomized outcome indicating whether a count variable is positive or zero. Under this scenario, the application of ordinary logistic regression may result in efficiency loss, which is quantifiable under an assumed model for the counts. In such situations, a shared-parameter hurdle model is investigated for more efficient estimation of regression parameters relating to overall effects of covariates on the dichotomous outcome, while handling count data with many zeroes. One model part provides a logistic regression containing marginal log odds ratio effects of primary interest, while an ancillary model part describes the mean count of a Poisson or negative binomial process in terms of nuisance regression parameters. Asymptotic efficiency of the logistic model parameter estimators of the two-part models is evaluated with respect to ordinary logistic regression. Simulations are used to assess the properties of the models with respect to power and Type I error, the latter investigated under both misspecified and correctly specified models. The methods are applied to data from a randomized clinical trial of three toothpaste formulations to prevent incident dental caries in a large population of Scottish schoolchildren. © The Author(s) 2014.
Bayesian ARTMAP for regression.

Science.gov (United States)

Sasu, L M; Andonie, R

2013-10-01

Bayesian ARTMAP (BA) is a recently introduced neural architecture which uses a combination of Fuzzy ARTMAP competitive learning and Bayesian learning. Training is generally performed online, in a single-epoch. During training, BA creates input data clusters as Gaussian categories, and also infers the conditional probabilities between input patterns and categories, and between categories and classes. During prediction, BA uses Bayesian posterior probability estimation. So far, BA was used only for classification. The goal of this paper is to analyze the efficiency of BA for regression problems. Our contributions are: (i) we generalize the BA algorithm using the clustering functionality of both ART modules, and name it BA for Regression (BAR); (ii) we prove that BAR is a universal approximator with the best approximation property. In other words, BAR approximates arbitrarily well any continuous function (universal approximation) and, for every given continuous function, there is one in the set of BAR approximators situated at minimum distance (best approximation); (iii) we experimentally compare the online trained BAR with several neural models, on the following standard regression benchmarks: CPU Computer Hardware, Boston Housing, Wisconsin Breast Cancer, and Communities and Crime. Our results show that BAR is an appropriate tool for regression tasks, both for theoretical and practical reasons. Copyright © 2013 Elsevier Ltd. All rights reserved.
Mechanisms of neuroblastoma regression

Science.gov (United States)

Brodeur, Garrett M.; Bagatell, Rochelle

2014-01-01

Recent genomic and biological studies of neuroblastoma have shed light on the dramatic heterogeneity in the clinical behaviour of this disease, which spans from spontaneous regression or differentiation in some patients, to relentless disease progression in others, despite intensive multimodality therapy. This evidence also suggests several possible mechanisms to explain the phenomena of spontaneous regression in neuroblastomas, including neurotrophin deprivation, humoral or cellular immunity, loss of telomerase activity and alterations in epigenetic regulation. A better understanding of the mechanisms of spontaneous regression might help to identify optimal therapeutic approaches for patients with these tumours. Currently, the most druggable mechanism is the delayed activation of developmentally programmed cell death regulated by the tropomyosin receptor kinase A pathway. Indeed, targeted therapy aimed at inhibiting neurotrophin receptors might be used in lieu of conventional chemotherapy or radiation in infants with biologically favourable tumours that require treatment. Alternative approaches consist of breaking immune tolerance to tumour antigens or activating neurotrophin receptor pathways to induce neuronal differentiation. These approaches are likely to be most effective against biologically favourable tumours, but they might also provide insights into treatment of biologically unfavourable tumours. We describe the different mechanisms of spontaneous neuroblastoma regression and the consequent therapeutic approaches. PMID:25331179
Using the Ridge Regression Procedures to Estimate the Multiple Linear Regression Coefficients

Science.gov (United States)

Gorgees, HazimMansoor; Mahdi, FatimahAssim

2018-05-01

This article concerns with comparing the performance of different types of ordinary ridge regression estimators that have been already proposed to estimate the regression parameters when the near exact linear relationships among the explanatory variables is presented. For this situations we employ the data obtained from tagi gas filling company during the period (2008-2010). The main result we reached is that the method based on the condition number performs better than other methods since it has smaller mean square error (MSE) than the other stated methods.
Multicollinearity and Regression Analysis

Science.gov (United States)

Daoud, Jamal I.

2017-12-01

In regression analysis it is obvious to have a correlation between the response and predictor(s), but having correlation among predictors is something undesired. The number of predictors included in the regression model depends on many factors among which, historical data, experience, etc. At the end selection of most important predictors is something objective due to the researcher. Multicollinearity is a phenomena when two or more predictors are correlated, if this happens, the standard error of the coefficients will increase [8]. Increased standard errors means that the coefficients for some or all independent variables may be found to be significantly different from In other words, by overinflating the standard errors, multicollinearity makes some variables statistically insignificant when they should be significant. In this paper we focus on the multicollinearity, reasons and consequences on the reliability of the regression model.
Panel Smooth Transition Regression Models

DEFF Research Database (Denmark)

González, Andrés; Terasvirta, Timo; Dijk, Dick van

We introduce the panel smooth transition regression model. This new model is intended for characterizing heterogeneous panels, allowing the regression coefficients to vary both across individuals and over time. Specifically, heterogeneity is allowed for by assuming that these coefficients are bou...
Credit Scoring Problem Based on Regression Analysis

OpenAIRE

Khassawneh, Bashar Suhil Jad Allah

2014-01-01

ABSTRACT: This thesis provides an explanatory introduction to the regression models of data mining and contains basic definitions of key terms in the linear, multiple and logistic regression models. Meanwhile, the aim of this study is to illustrate fitting models for the credit scoring problem using simple linear, multiple linear and logistic regression models and also to analyze the found model functions by statistical tools. Keywords: Data mining, linear regression, logistic regression....
Unbalanced Regressions and the Predictive Equation

DEFF Research Database (Denmark)

Osterrieder, Daniela; Ventosa-Santaulària, Daniel; Vera-Valdés, J. Eduardo

Predictive return regressions with persistent regressors are typically plagued by (asymptotically) biased/inconsistent estimates of the slope, non-standard or potentially even spurious statistical inference, and regression unbalancedness. We alleviate the problem of unbalancedness in the theoreti......Predictive return regressions with persistent regressors are typically plagued by (asymptotically) biased/inconsistent estimates of the slope, non-standard or potentially even spurious statistical inference, and regression unbalancedness. We alleviate the problem of unbalancedness...... in the theoretical predictive equation by suggesting a data generating process, where returns are generated as linear functions of a lagged latent I(0) risk process. The observed predictor is a function of this latent I(0) process, but it is corrupted by a fractionally integrated noise. Such a process may arise due...... to aggregation or unexpected level shifts. In this setup, the practitioner estimates a misspecified, unbalanced, and endogenous predictive regression. We show that the OLS estimate of this regression is inconsistent, but standard inference is possible. To obtain a consistent slope estimate, we then suggest...
[From clinical judgment to linear regression model.

Science.gov (United States)

Palacios-Cruz, Lino; Pérez, Marcela; Rivas-Ruiz, Rodolfo; Talavera, Juan O

2013-01-01

When we think about mathematical models, such as linear regression model, we think that these terms are only used by those engaged in research, a notion that is far from the truth. Legendre described the first mathematical model in 1805, and Galton introduced the formal term in 1886. Linear regression is one of the most commonly used regression models in clinical practice. It is useful to predict or show the relationship between two or more variables as long as the dependent variable is quantitative and has normal distribution. Stated in another way, the regression is used to predict a measure based on the knowledge of at least one other variable. Linear regression has as it's first objective to determine the slope or inclination of the regression line: Y = a + bx, where "a" is the intercept or regression constant and it is equivalent to "Y" value when "X" equals 0 and "b" (also called slope) indicates the increase or decrease that occurs when the variable "x" increases or decreases in one unit. In the regression line, "b" is called regression coefficient. The coefficient of determination (R 2 ) indicates the importance of independent variables in the outcome.
Autistic Regression

Science.gov (United States)

Matson, Johnny L.; Kozlowski, Alison M.

2010-01-01

Autistic regression is one of the many mysteries in the developmental course of autism and pervasive developmental disorders not otherwise specified (PDD-NOS). Various definitions of this phenomenon have been used, further clouding the study of the topic. Despite this problem, some efforts at establishing prevalence have been made. The purpose of…
Ridge regression estimator: combining unbiased and ordinary ridge regression methods of estimation

Directory of Open Access Journals (Sweden)

Sharad Damodar Gore

2009-10-01

Full Text Available Statistical literature has several methods for coping with multicollinearity. This paper introduces a new shrinkage estimator, called modified unbiased ridge (MUR. This estimator is obtained from unbiased ridge regression (URR in the same way that ordinary ridge regression (ORR is obtained from ordinary least squares (OLS. Properties of MUR are derived. Results on its matrix mean squared error (MMSE are obtained. MUR is compared with ORR and URR in terms of MMSE. These results are illustrated with an example based on data generated by Hoerl and Kennard (1975.
Timing of lake-level changes for a deep last-glacial Lake Missoula: optical dating of the Garden Gulch area, Montana, USA

DEFF Research Database (Denmark)

Smith, Larry N.; Sohbati, Reza; Buylaert, Jan-Pieter

2018-01-01

Glaciolacustrine sediments in the Clark Fork River valley at Garden Gulch, near Drummond, Montana, USA record highstand positions of the ice-dammed glacial Lake Missoula and repeated subaerial exposure. During these highstands the lake was at greater than 65% of its recognized maximum capacity......-level fluctuation, occurred over time scales of decades to ∼2 ka. Bioturbated sandy slopewash dated at 10.6 ± 0.9 ka and 11.9 ± 1.2 ka unconformably overlies the upper glaciolacustrine deposits. The uppermost sediments, above the glaciolacustrine section, are younger than the Glacier Peak tephra (13.7-13.4 cal ka B...... the lake's highstand position due to ice-dam failure likely led to scour in the downstream portions of the glacial Lake Missoula basin and megafloods in the Channeled Scabland....
Discriminative Elastic-Net Regularized Linear Regression.

Science.gov (United States)

Zhang, Zheng; Lai, Zhihui; Xu, Yong; Shao, Ling; Wu, Jian; Xie, Guo-Sen

2017-03-01

In this paper, we aim at learning compact and discriminative linear regression models. Linear regression has been widely used in different problems. However, most of the existing linear regression methods exploit the conventional zero-one matrix as the regression targets, which greatly narrows the flexibility of the regression model. Another major limitation of these methods is that the learned projection matrix fails to precisely project the image features to the target space due to their weak discriminative capability. To this end, we present an elastic-net regularized linear regression (ENLR) framework, and develop two robust linear regression models which possess the following special characteristics. First, our methods exploit two particular strategies to enlarge the margins of different classes by relaxing the strict binary targets into a more feasible variable matrix. Second, a robust elastic-net regularization of singular values is introduced to enhance the compactness and effectiveness of the learned projection matrix. Third, the resulting optimization problem of ENLR has a closed-form solution in each iteration, which can be solved efficiently. Finally, rather than directly exploiting the projection matrix for recognition, our methods employ the transformed features as the new discriminate representations to make final image classification. Compared with the traditional linear regression model and some of its variants, our method is much more accurate in image classification. Extensive experiments conducted on publicly available data sets well demonstrate that the proposed framework can outperform the state-of-the-art methods. The MATLAB codes of our methods can be available at http://www.yongxu.org/lunwen.html.

Categorical regression dose-response modeling

Science.gov (United States)

The goal of this training is to provide participants with training on the use of the U.S. EPA’s Categorical Regression soft¬ware (CatReg) and its application to risk assessment. Categorical regression fits mathematical models to toxicity data that have been assigned ord...
Abstract Expression Grammar Symbolic Regression

Science.gov (United States)

Korns, Michael F.

This chapter examines the use of Abstract Expression Grammars to perform the entire Symbolic Regression process without the use of Genetic Programming per se. The techniques explored produce a symbolic regression engine which has absolutely no bloat, which allows total user control of the search space and output formulas, which is faster, and more accurate than the engines produced in our previous papers using Genetic Programming. The genome is an all vector structure with four chromosomes plus additional epigenetic and constraint vectors, allowing total user control of the search space and the final output formulas. A combination of specialized compiler techniques, genetic algorithms, particle swarm, aged layered populations, plus discrete and continuous differential evolution are used to produce an improved symbolic regression sytem. Nine base test cases, from the literature, are used to test the improvement in speed and accuracy. The improved results indicate that these techniques move us a big step closer toward future industrial strength symbolic regression systems.
Comparison of Classical Linear Regression and Orthogonal Regression According to the Sum of Squares Perpendicular Distances

OpenAIRE

KELEŞ, Taliha; ALTUN, Murat

2016-01-01

Regression analysis is a statistical technique for investigating and modeling the relationship between variables. The purpose of this study was the trivial presentation of the equation for orthogonal regression (OR) and the comparison of classical linear regression (CLR) and OR techniques with respect to the sum of squared perpendicular distances. For that purpose, the analyses were shown by an example. It was found that the sum of squared perpendicular distances of OR is smaller. Thus, it wa...
Pathological assessment of liver fibrosis regression

Directory of Open Access Journals (Sweden)

WANG Bingqiong

2017-03-01

Full Text Available Hepatic fibrosis is the common pathological outcome of chronic hepatic diseases. An accurate assessment of fibrosis degree provides an important reference for a definite diagnosis of diseases, treatment decision-making, treatment outcome monitoring, and prognostic evaluation. At present, many clinical studies have proven that regression of hepatic fibrosis and early-stage liver cirrhosis can be achieved by effective treatment, and a correct evaluation of fibrosis regression has become a hot topic in clinical research. Liver biopsy has long been regarded as the gold standard for the assessment of hepatic fibrosis, and thus it plays an important role in the evaluation of fibrosis regression. This article reviews the clinical application of current pathological staging systems in the evaluation of fibrosis regression from the perspectives of semi-quantitative scoring system, quantitative approach, and qualitative approach, in order to propose a better pathological evaluation system for the assessment of fibrosis regression.
Logistic Regression: Concept and Application

Science.gov (United States)

Cokluk, Omay

2010-01-01

The main focus of logistic regression analysis is classification of individuals in different groups. The aim of the present study is to explain basic concepts and processes of binary logistic regression analysis intended to determine the combination of independent variables which best explain the membership in certain groups called dichotomous…
Predictors of course in obsessive-compulsive disorder: logistic regression versus Cox regression for recurrent events.

Science.gov (United States)

Kempe, P T; van Oppen, P; de Haan, E; Twisk, J W R; Sluis, A; Smit, J H; van Dyck, R; van Balkom, A J L M

2007-09-01

Two methods for predicting remissions in obsessive-compulsive disorder (OCD) treatment are evaluated. Y-BOCS measurements of 88 patients with a primary OCD (DSM-III-R) diagnosis were performed over a 16-week treatment period, and during three follow-ups. Remission at any measurement was defined as a Y-BOCS score lower than thirteen combined with a reduction of seven points when compared with baseline. Logistic regression models were compared with a Cox regression for recurrent events model. Logistic regression yielded different models at different evaluation times. The recurrent events model remained stable when fewer measurements were used. Higher baseline levels of neuroticism and more severe OCD symptoms were associated with a lower chance of remission, early age of onset and more depressive symptoms with a higher chance. Choice of outcome time affects logistic regression prediction models. Recurrent events analysis uses all information on remissions and relapses. Short- and long-term predictors for OCD remission show overlap.
Sparse reduced-rank regression with covariance estimation

KAUST Repository

Chen, Lisha

2014-12-08

Improving the predicting performance of the multiple response regression compared with separate linear regressions is a challenging question. On the one hand, it is desirable to seek model parsimony when facing a large number of parameters. On the other hand, for certain applications it is necessary to take into account the general covariance structure for the errors of the regression model. We assume a reduced-rank regression model and work with the likelihood function with general error covariance to achieve both objectives. In addition we propose to select relevant variables for reduced-rank regression by using a sparsity-inducing penalty, and to estimate the error covariance matrix simultaneously by using a similar penalty on the precision matrix. We develop a numerical algorithm to solve the penalized regression problem. In a simulation study and real data analysis, the new method is compared with two recent methods for multivariate regression and exhibits competitive performance in prediction and variable selection.
Sparse reduced-rank regression with covariance estimation

KAUST Repository

Chen, Lisha; Huang, Jianhua Z.

2014-01-01

Improving the predicting performance of the multiple response regression compared with separate linear regressions is a challenging question. On the one hand, it is desirable to seek model parsimony when facing a large number of parameters. On the other hand, for certain applications it is necessary to take into account the general covariance structure for the errors of the regression model. We assume a reduced-rank regression model and work with the likelihood function with general error covariance to achieve both objectives. In addition we propose to select relevant variables for reduced-rank regression by using a sparsity-inducing penalty, and to estimate the error covariance matrix simultaneously by using a similar penalty on the precision matrix. We develop a numerical algorithm to solve the penalized regression problem. In a simulation study and real data analysis, the new method is compared with two recent methods for multivariate regression and exhibits competitive performance in prediction and variable selection.
Regression models of reactor diagnostic signals

International Nuclear Information System (INIS)

Vavrin, J.

1989-01-01

The application is described of an autoregression model as the simplest regression model of diagnostic signals in experimental analysis of diagnostic systems, in in-service monitoring of normal and anomalous conditions and their diagnostics. The method of diagnostics is described using a regression type diagnostic data base and regression spectral diagnostics. The diagnostics is described of neutron noise signals from anomalous modes in the experimental fuel assembly of a reactor. (author)
Regression and Sparse Regression Methods for Viscosity Estimation of Acid Milk From it’s Sls Features

DEFF Research Database (Denmark)

Sharifzadeh, Sara; Skytte, Jacob Lercke; Nielsen, Otto Højager Attermann

2012-01-01

Statistical solutions find wide spread use in food and medicine quality control. We investigate the effect of different regression and sparse regression methods for a viscosity estimation problem using the spectro-temporal features from new Sub-Surface Laser Scattering (SLS) vision system. From...... with sparse LAR, lasso and Elastic Net (EN) sparse regression methods. Due to the inconsistent measurement condition, Locally Weighted Scatter plot Smoothing (Loess) has been employed to alleviate the undesired variation in the estimated viscosity. The experimental results of applying different methods show...
Testing discontinuities in nonparametric regression

KAUST Repository

Dai, Wenlin

2017-01-19

In nonparametric regression, it is often needed to detect whether there are jump discontinuities in the mean function. In this paper, we revisit the difference-based method in [13 H.-G. Müller and U. Stadtmüller, Discontinuous versus smooth regression, Ann. Stat. 27 (1999), pp. 299–337. doi: 10.1214/aos/1018031100
Testing discontinuities in nonparametric regression

KAUST Repository

Dai, Wenlin; Zhou, Yuejin; Tong, Tiejun

2017-01-01

In nonparametric regression, it is often needed to detect whether there are jump discontinuities in the mean function. In this paper, we revisit the difference-based method in [13 H.-G. Müller and U. Stadtmüller, Discontinuous versus smooth regression, Ann. Stat. 27 (1999), pp. 299–337. doi: 10.1214/aos/1018031100
On Solving Lq-Penalized Regressions

Directory of Open Access Journals (Sweden)

Tracy Zhou Wu

2007-01-01

Full Text Available Lq-penalized regression arises in multidimensional statistical modelling where all or part of the regression coefficients are penalized to achieve both accuracy and parsimony of statistical models. There is often substantial computational difficulty except for the quadratic penalty case. The difficulty is partly due to the nonsmoothness of the objective function inherited from the use of the absolute value. We propose a new solution method for the general Lq-penalized regression problem based on space transformation and thus efficient optimization algorithms. The new method has immediate applications in statistics, notably in penalized spline smoothing problems. In particular, the LASSO problem is shown to be polynomial time solvable. Numerical studies show promise of our approach.
Boosted regression trees, multivariate adaptive regression splines and their two-step combinations with multiple linear regression or partial least squares to predict blood-brain barrier passage: a case study.

Science.gov (United States)

Deconinck, E; Zhang, M H; Petitet, F; Dubus, E; Ijjaali, I; Coomans, D; Vander Heyden, Y

2008-02-18

The use of some unconventional non-linear modeling techniques, i.e. classification and regression trees and multivariate adaptive regression splines-based methods, was explored to model the blood-brain barrier (BBB) passage of drugs and drug-like molecules. The data set contains BBB passage values for 299 structural and pharmacological diverse drugs, originating from a structured knowledge-based database. Models were built using boosted regression trees (BRT) and multivariate adaptive regression splines (MARS), as well as their respective combinations with stepwise multiple linear regression (MLR) and partial least squares (PLS) regression in two-step approaches. The best models were obtained using combinations of MARS with either stepwise MLR or PLS. It could be concluded that the use of combinations of a linear with a non-linear modeling technique results in some improved properties compared to the individual linear and non-linear models and that, when the use of such a combination is appropriate, combinations using MARS as non-linear technique should be preferred over those with BRT, due to some serious drawbacks of the BRT approaches.
Testing Heteroscedasticity in Robust Regression

Czech Academy of Sciences Publication Activity Database

Kalina, Jan

2011-01-01

Roč. 1, č. 4 (2011), s. 25-28 ISSN 2045-3345 Grant - others:GA ČR(CZ) GA402/09/0557 Institutional research plan: CEZ:AV0Z10300504 Keywords : robust regression * heteroscedasticity * regression quantiles * diagnostics Subject RIV: BB - Applied Statistics , Operational Research http://www.researchjournals.co.uk/documents/Vol4/06%20Kalina.pdf
Spontaneous regression of a congenital melanocytic nevus

Directory of Open Access Journals (Sweden)

Amiya Kumar Nath

2011-01-01

Full Text Available Congenital melanocytic nevus (CMN may rarely regress which may also be associated with a halo or vitiligo. We describe a 10-year-old girl who presented with CMN on the left leg since birth, which recently started to regress spontaneously with associated depigmentation in the lesion and at a distant site. Dermoscopy performed at different sites of the regressing lesion demonstrated loss of epidermal pigments first followed by loss of dermal pigments. Histopathology and Masson-Fontana stain demonstrated lymphocytic infiltration and loss of pigment production in the regressing area. Immunohistochemistry staining (S100 and HMB-45, however, showed that nevus cells were present in the regressing areas.
Regression Analysis by Example. 5th Edition

Science.gov (United States)

Chatterjee, Samprit; Hadi, Ali S.

2012-01-01

Regression analysis is a conceptually simple method for investigating relationships among variables. Carrying out a successful application of regression analysis, however, requires a balance of theoretical results, empirical rules, and subjective judgment. "Regression Analysis by Example, Fifth Edition" has been expanded and thoroughly…
Gaussian process regression analysis for functional data

CERN Document Server

Shi, Jian Qing

2011-01-01

Gaussian Process Regression Analysis for Functional Data presents nonparametric statistical methods for functional regression analysis, specifically the methods based on a Gaussian process prior in a functional space. The authors focus on problems involving functional response variables and mixed covariates of functional and scalar variables.Covering the basics of Gaussian process regression, the first several chapters discuss functional data analysis, theoretical aspects based on the asymptotic properties of Gaussian process regression models, and new methodological developments for high dime
Is past life regression therapy ethical?

Science.gov (United States)

Andrade, Gabriel

2017-01-01

Past life regression therapy is used by some physicians in cases with some mental diseases. Anxiety disorders, mood disorders, and gender dysphoria have all been treated using life regression therapy by some doctors on the assumption that they reflect problems in past lives. Although it is not supported by psychiatric associations, few medical associations have actually condemned it as unethical. In this article, I argue that past life regression therapy is unethical for two basic reasons. First, it is not evidence-based. Past life regression is based on the reincarnation hypothesis, but this hypothesis is not supported by evidence, and in fact, it faces some insurmountable conceptual problems. If patients are not fully informed about these problems, they cannot provide an informed consent, and hence, the principle of autonomy is violated. Second, past life regression therapy has the great risk of implanting false memories in patients, and thus, causing significant harm. This is a violation of the principle of non-malfeasance, which is surely the most important principle in medical ethics.
Regression Models for Market-Shares

DEFF Research Database (Denmark)

Birch, Kristina; Olsen, Jørgen Kai; Tjur, Tue

2005-01-01

On the background of a data set of weekly sales and prices for three brands of coffee, this paper discusses various regression models and their relation to the multiplicative competitive-interaction model (the MCI model, see Cooper 1988, 1993) for market-shares. Emphasis is put on the interpretat......On the background of a data set of weekly sales and prices for three brands of coffee, this paper discusses various regression models and their relation to the multiplicative competitive-interaction model (the MCI model, see Cooper 1988, 1993) for market-shares. Emphasis is put...... on the interpretation of the parameters in relation to models for the total sales based on discrete choice models.Key words and phrases. MCI model, discrete choice model, market-shares, price elasitcity, regression model....

Detection of epistatic effects with logic regression and a classical linear regression model.

Science.gov (United States)

Malina, Magdalena; Ickstadt, Katja; Schwender, Holger; Posch, Martin; Bogdan, Małgorzata

2014-02-01

To locate multiple interacting quantitative trait loci (QTL) influencing a trait of interest within experimental populations, usually methods as the Cockerham's model are applied. Within this framework, interactions are understood as the part of the joined effect of several genes which cannot be explained as the sum of their additive effects. However, if a change in the phenotype (as disease) is caused by Boolean combinations of genotypes of several QTLs, this Cockerham's approach is often not capable to identify them properly. To detect such interactions more efficiently, we propose a logic regression framework. Even though with the logic regression approach a larger number of models has to be considered (requiring more stringent multiple testing correction) the efficient representation of higher order logic interactions in logic regression models leads to a significant increase of power to detect such interactions as compared to a Cockerham's approach. The increase in power is demonstrated analytically for a simple two-way interaction model and illustrated in more complex settings with simulation study and real data analysis.
Micropaleontologic record of Quaternary paleoenvironments in the Central Albemarle Embayment, North Carolina, U.S.A.

Science.gov (United States)

Culver, Stephen J.; Farrell, Kathleen M.; Mallinson, David J.; Willard, Debra A.; Horton, Benjamin P.; Riggs, Stanley R.; Thieler, E. Robert; Wehmiller, John F.; Parham, Peter; Snyder, Scott W.; Hillier, Caroline

2011-01-01

through time of Quaternary sediments reflects the eastward progradational geometry of the continental shelf.The preservation potential of marginal marine deposits (barrier island, shoreface, backbarrier deposits) is not high, except in topographic lows associated with late Pleistocene paleovalleys and inlets because the current interglacial highstand has not yet reached its highest level. Given the documented increase in rate of relative sea-level rise in this region, shallow marine conditions are likely to return to the central Albemarle Embayment in the near future. ?? 2011 Elsevier B.V.
Poisson Mixture Regression Models for Heart Disease Prediction.

Science.gov (United States)

Mufudza, Chipo; Erol, Hamza

2016-01-01

Early heart disease control can be achieved by high disease prediction and diagnosis efficiency. This paper focuses on the use of model based clustering techniques to predict and diagnose heart disease via Poisson mixture regression models. Analysis and application of Poisson mixture regression models is here addressed under two different classes: standard and concomitant variable mixture regression models. Results show that a two-component concomitant variable Poisson mixture regression model predicts heart disease better than both the standard Poisson mixture regression model and the ordinary general linear Poisson regression model due to its low Bayesian Information Criteria value. Furthermore, a Zero Inflated Poisson Mixture Regression model turned out to be the best model for heart prediction over all models as it both clusters individuals into high or low risk category and predicts rate to heart disease componentwise given clusters available. It is deduced that heart disease prediction can be effectively done by identifying the major risks componentwise using Poisson mixture regression model.
Poisson Mixture Regression Models for Heart Disease Prediction

Science.gov (United States)

Erol, Hamza

2016-01-01

Early heart disease control can be achieved by high disease prediction and diagnosis efficiency. This paper focuses on the use of model based clustering techniques to predict and diagnose heart disease via Poisson mixture regression models. Analysis and application of Poisson mixture regression models is here addressed under two different classes: standard and concomitant variable mixture regression models. Results show that a two-component concomitant variable Poisson mixture regression model predicts heart disease better than both the standard Poisson mixture regression model and the ordinary general linear Poisson regression model due to its low Bayesian Information Criteria value. Furthermore, a Zero Inflated Poisson Mixture Regression model turned out to be the best model for heart prediction over all models as it both clusters individuals into high or low risk category and predicts rate to heart disease componentwise given clusters available. It is deduced that heart disease prediction can be effectively done by identifying the major risks componentwise using Poisson mixture regression model. PMID:27999611
Regression analysis using dependent Polya trees.

Science.gov (United States)

Schörgendorfer, Angela; Branscum, Adam J

2013-11-30

Many commonly used models for linear regression analysis force overly simplistic shape and scale constraints on the residual structure of data. We propose a semiparametric Bayesian model for regression analysis that produces data-driven inference by using a new type of dependent Polya tree prior to model arbitrary residual distributions that are allowed to evolve across increasing levels of an ordinal covariate (e.g., time, in repeated measurement studies). By modeling residual distributions at consecutive covariate levels or time points using separate, but dependent Polya tree priors, distributional information is pooled while allowing for broad pliability to accommodate many types of changing residual distributions. We can use the proposed dependent residual structure in a wide range of regression settings, including fixed-effects and mixed-effects linear and nonlinear models for cross-sectional, prospective, and repeated measurement data. A simulation study illustrates the flexibility of our novel semiparametric regression model to accurately capture evolving residual distributions. In an application to immune development data on immunoglobulin G antibodies in children, our new model outperforms several contemporary semiparametric regression models based on a predictive model selection criterion. Copyright © 2013 John Wiley & Sons, Ltd.
Applied Regression Modeling A Business Approach

CERN Document Server

Pardoe, Iain

2012-01-01

An applied and concise treatment of statistical regression techniques for business students and professionals who have little or no background in calculusRegression analysis is an invaluable statistical methodology in business settings and is vital to model the relationship between a response variable and one or more predictor variables, as well as the prediction of a response value given values of the predictors. In view of the inherent uncertainty of business processes, such as the volatility of consumer spending and the presence of market uncertainty, business professionals use regression a
Regression of environmental noise in LIGO data

International Nuclear Information System (INIS)

Tiwari, V; Klimenko, S; Mitselmakher, G; Necula, V; Drago, M; Prodi, G; Frolov, V; Yakushin, I; Re, V; Salemi, F; Vedovato, G

2015-01-01

We address the problem of noise regression in the output of gravitational-wave (GW) interferometers, using data from the physical environmental monitors (PEM). The objective of the regression analysis is to predict environmental noise in the GW channel from the PEM measurements. One of the most promising regression methods is based on the construction of Wiener–Kolmogorov (WK) filters. Using this method, the seismic noise cancellation from the LIGO GW channel has already been performed. In the presented approach the WK method has been extended, incorporating banks of Wiener filters in the time–frequency domain, multi-channel analysis and regulation schemes, which greatly enhance the versatility of the regression analysis. Also we present the first results on regression of the bi-coherent noise in the LIGO data. (paper)
ORBITAL DEPENDENCE OF GALAXY PROPERTIES IN SATELLITE SYSTEMS OF GALAXIES

International Nuclear Information System (INIS)

Hwang, Ho Seong; Park, Changbom

2010-01-01

We study the dependence of satellite galaxy properties on the distance to the host galaxy and the orbital motion (prograde and retrograde orbits) using the Sloan Digital Sky Survey (SDSS) data. From SDSS Data Release 7, we find 3515 isolated satellite systems of galaxies at z -1 . It is found that the radial distribution of early-type satellites in prograde orbit is strongly concentrated toward the host while that of retrograde ones shows much less concentration. We also find the orbital speed of late-type satellites in prograde orbit increases as the projected distance to the host (R) decreases while the speed decreases for those in retrograde orbit. At R less than 0.1 times the host virial radius (R vir,host ), the orbital speed decreases in both prograde and retrograde orbit cases. Prograde satellites are on average fainter than retrograde satellites for both early and late morphological types. The u - r color becomes redder as R decreases for both prograde and retrograde orbit late-type satellites. The differences between prograde and retrograde orbit satellite galaxies may be attributed to their different origin or the different strength of physical processes that they have experienced through hydrodynamic interactions with their host galaxies.
Forecasting with Dynamic Regression Models

CERN Document Server

Pankratz, Alan

2012-01-01

One of the most widely used tools in statistical forecasting, single equation regression models is examined here. A companion to the author's earlier work, Forecasting with Univariate Box-Jenkins Models: Concepts and Cases, the present text pulls together recent time series ideas and gives special attention to possible intertemporal patterns, distributed lag responses of output to input series and the auto correlation patterns of regression disturbance. It also includes six case studies.
Estimating Loess Plateau Average Annual Precipitation with Multiple Linear Regression Kriging and Geographically Weighted Regression Kriging

Directory of Open Access Journals (Sweden)

Qiutong Jin

2016-06-01

Full Text Available Estimating the spatial distribution of precipitation is an important and challenging task in hydrology, climatology, ecology, and environmental science. In order to generate a highly accurate distribution map of average annual precipitation for the Loess Plateau in China, multiple linear regression Kriging (MLRK and geographically weighted regression Kriging (GWRK methods were employed using precipitation data from the period 1980–2010 from 435 meteorological stations. The predictors in regression Kriging were selected by stepwise regression analysis from many auxiliary environmental factors, such as elevation (DEM, normalized difference vegetation index (NDVI, solar radiation, slope, and aspect. All predictor distribution maps had a 500 m spatial resolution. Validation precipitation data from 130 hydrometeorological stations were used to assess the prediction accuracies of the MLRK and GWRK approaches. Results showed that both prediction maps with a 500 m spatial resolution interpolated by MLRK and GWRK had a high accuracy and captured detailed spatial distribution data; however, MLRK produced a lower prediction error and a higher variance explanation than GWRK, although the differences were small, in contrast to conclusions from similar studies.
High-resolution sedimentological and subsidence analysis of the Late Neogene, Pannonian Basin, Hungary

Science.gov (United States)

Juhasz, E.; Muller, P.; Toth-Makk, A.; Hamor, T.; Farkas-Bulla, J.; Suto-Szentai, M.; Phillips, R.L.; Ricketts, B.

1996-01-01

Detailed sedimentological and paleontological analyses were carried out on more than 13,000 m of core from ten boreholes in the Late Neogene sediments of the Pannonian Basin, Hungary. These data provide the basis for determining the character of high-order depositional cycles and their stacking patterns. In the Late Neogene sediments of the Pannonian Basin there are two third-order sequences: the Late Miocene and the Pliocene ones. The Miocene sequence shows a regressive, upward-coarsening trend. There are four distinguishable sedimentary units in this sequence: the basal transgressive, the lower aggradational, the progradational and the upper aggradational units. The Pliocene sequence is also of aggradational character. The progradation does not coincide in time in the wells within the basin. The character of the relative water-level curves is similar throughout the basin but shows only very faint similarity to the sea-level curve. Therefore, it is unlikely that eustasy played any significant role in the pattern of basin filling. Rather, the dominant controls were the rapidly changing basin subsidence and high sedimentation rates, together with possible climatic factors.
Gibrat’s law and quantile regressions

DEFF Research Database (Denmark)

Distante, Roberta; Petrella, Ivan; Santoro, Emiliano

2017-01-01

The nexus between firm growth, size and age in U.S. manufacturing is examined through the lens of quantile regression models. This methodology allows us to overcome serious shortcomings entailed by linear regression models employed by much of the existing literature, unveiling a number of important...
ON REGRESSION REPRESENTATIONS OF STOCHASTIC-PROCESSES

NARCIS (Netherlands)

RUSCHENDORF, L; DEVALK, [No Value

We construct a.s. nonlinear regression representations of general stochastic processes (X(n))n is-an-element-of N. As a consequence we obtain in particular special regression representations of Markov chains and of certain m-dependent sequences. For m-dependent sequences we obtain a constructive
Introduction to the use of regression models in epidemiology.

Science.gov (United States)

Bender, Ralf

2009-01-01

Regression modeling is one of the most important statistical techniques used in analytical epidemiology. By means of regression models the effect of one or several explanatory variables (e.g., exposures, subject characteristics, risk factors) on a response variable such as mortality or cancer can be investigated. From multiple regression models, adjusted effect estimates can be obtained that take the effect of potential confounders into account. Regression methods can be applied in all epidemiologic study designs so that they represent a universal tool for data analysis in epidemiology. Different kinds of regression models have been developed in dependence on the measurement scale of the response variable and the study design. The most important methods are linear regression for continuous outcomes, logistic regression for binary outcomes, Cox regression for time-to-event data, and Poisson regression for frequencies and rates. This chapter provides a nontechnical introduction to these regression models with illustrating examples from cancer research.
From Rasch scores to regression

DEFF Research Database (Denmark)

Christensen, Karl Bang

2006-01-01

Rasch models provide a framework for measurement and modelling latent variables. Having measured a latent variable in a population a comparison of groups will often be of interest. For this purpose the use of observed raw scores will often be inadequate because these lack interval scale propertie....... This paper compares two approaches to group comparison: linear regression models using estimated person locations as outcome variables and latent regression models based on the distribution of the score....
Producing The New Regressive Left

DEFF Research Database (Denmark)

Crone, Christine

members, this thesis investigates a growing political trend and ideological discourse in the Arab world that I have called The New Regressive Left. On the premise that a media outlet can function as a forum for ideology production, the thesis argues that an analysis of this material can help to trace...... the contexture of The New Regressive Left. If the first part of the thesis lays out the theoretical approach and draws the contextual framework, through an exploration of the surrounding Arab media-and ideoscapes, the second part is an analytical investigation of the discourse that permeates the programmes aired...... becomes clear from the analytical chapters is the emergence of the new cross-ideological alliance of The New Regressive Left. This emerging coalition between Shia Muslims, religious minorities, parts of the Arab Left, secular cultural producers, and the remnants of the political,strategic resistance...
Mixture of Regression Models with Single-Index

OpenAIRE

Xiang, Sijia; Yao, Weixin

2016-01-01

In this article, we propose a class of semiparametric mixture regression models with single-index. We argue that many recently proposed semiparametric/nonparametric mixture regression models can be considered special cases of the proposed model. However, unlike existing semiparametric mixture regression models, the new pro- posed model can easily incorporate multivariate predictors into the nonparametric components. Backfitting estimates and the corresponding algorithms have been proposed for...
Local bilinear multiple-output quantile/depth regression

Czech Academy of Sciences Publication Activity Database

Hallin, M.; Lu, Z.; Paindaveine, D.; Šiman, Miroslav

2015-01-01

Roč. 21, č. 3 (2015), s. 1435-1466 ISSN 1350-7265 R&D Projects: GA MŠk(CZ) 1M06047 Institutional support: RVO:67985556 Keywords : conditional depth * growth chart * halfspace depth * local bilinear regression * multivariate quantile * quantile regression * regression depth Subject RIV: BA - General Mathematics Impact factor: 1.372, year: 2015 http://library.utia.cas.cz/separaty/2015/SI/siman-0446857.pdf
Do clinical and translational science graduate students understand linear regression? Development and early validation of the REGRESS quiz.

Science.gov (United States)

Enders, Felicity

2013-12-01

Although regression is widely used for reading and publishing in the medical literature, no instruments were previously available to assess students' understanding. The goal of this study was to design and assess such an instrument for graduate students in Clinical and Translational Science and Public Health. A 27-item REsearch on Global Regression Expectations in StatisticS (REGRESS) quiz was developed through an iterative process. Consenting students taking a course on linear regression in a Clinical and Translational Science program completed the quiz pre- and postcourse. Student results were compared to practicing statisticians with a master's or doctoral degree in statistics or a closely related field. Fifty-two students responded precourse, 59 postcourse , and 22 practicing statisticians completed the quiz. The mean (SD) score was 9.3 (4.3) for students precourse and 19.0 (3.5) postcourse (P REGRESS quiz was internally reliable (Cronbach's alpha 0.89). The initial validation is quite promising with statistically significant and meaningful differences across time and study populations. Further work is needed to validate the quiz across multiple institutions. © 2013 Wiley Periodicals, Inc.
The MIDAS Touch: Mixed Data Sampling Regression Models

OpenAIRE

Ghysels, Eric; Santa-Clara, Pedro; Valkanov, Rossen

2004-01-01

We introduce Mixed Data Sampling (henceforth MIDAS) regression models. The regressions involve time series data sampled at different frequencies. Technically speaking MIDAS models specify conditional expectations as a distributed lag of regressors recorded at some higher sampling frequencies. We examine the asymptotic properties of MIDAS regression estimation and compare it with traditional distributed lag models. MIDAS regressions have wide applicability in macroeconomics and ï¿½nance.

Suppression Situations in Multiple Linear Regression

Science.gov (United States)

Shieh, Gwowen

2006-01-01

This article proposes alternative expressions for the two most prevailing definitions of suppression without resorting to the standardized regression modeling. The formulation provides a simple basis for the examination of their relationship. For the two-predictor regression, the author demonstrates that the previous results in the literature are…
Significance testing in ridge regression for genetic data

Directory of Open Access Journals (Sweden)

De Iorio Maria

2011-09-01

Full Text Available Abstract Background Technological developments have increased the feasibility of large scale genetic association studies. Densely typed genetic markers are obtained using SNP arrays, next-generation sequencing technologies and imputation. However, SNPs typed using these methods can be highly correlated due to linkage disequilibrium among them, and standard multiple regression techniques fail with these data sets due to their high dimensionality and correlation structure. There has been increasing interest in using penalised regression in the analysis of high dimensional data. Ridge regression is one such penalised regression technique which does not perform variable selection, instead estimating a regression coefficient for each predictor variable. It is therefore desirable to obtain an estimate of the significance of each ridge regression coefficient. Results We develop and evaluate a test of significance for ridge regression coefficients. Using simulation studies, we demonstrate that the performance of the test is comparable to that of a permutation test, with the advantage of a much-reduced computational cost. We introduce the p-value trace, a plot of the negative logarithm of the p-values of ridge regression coefficients with increasing shrinkage parameter, which enables the visualisation of the change in p-value of the regression coefficients with increasing penalisation. We apply the proposed method to a lung cancer case-control data set from EPIC, the European Prospective Investigation into Cancer and Nutrition. Conclusions The proposed test is a useful alternative to a permutation test for the estimation of the significance of ridge regression coefficients, at a much-reduced computational cost. The p-value trace is an informative graphical tool for evaluating the results of a test of significance of ridge regression coefficients as the shrinkage parameter increases, and the proposed test makes its production computationally feasible.
Regression calibration with more surrogates than mismeasured variables

KAUST Repository

Kipnis, Victor

2012-06-29

In a recent paper (Weller EA, Milton DK, Eisen EA, Spiegelman D. Regression calibration for logistic regression with multiple surrogates for one exposure. Journal of Statistical Planning and Inference 2007; 137: 449-461), the authors discussed fitting logistic regression models when a scalar main explanatory variable is measured with error by several surrogates, that is, a situation with more surrogates than variables measured with error. They compared two methods of adjusting for measurement error using a regression calibration approximate model as if it were exact. One is the standard regression calibration approach consisting of substituting an estimated conditional expectation of the true covariate given observed data in the logistic regression. The other is a novel two-stage approach when the logistic regression is fitted to multiple surrogates, and then a linear combination of estimated slopes is formed as the estimate of interest. Applying estimated asymptotic variances for both methods in a single data set with some sensitivity analysis, the authors asserted superiority of their two-stage approach. We investigate this claim in some detail. A troubling aspect of the proposed two-stage method is that, unlike standard regression calibration and a natural form of maximum likelihood, the resulting estimates are not invariant to reparameterization of nuisance parameters in the model. We show, however, that, under the regression calibration approximation, the two-stage method is asymptotically equivalent to a maximum likelihood formulation, and is therefore in theory superior to standard regression calibration. However, our extensive finite-sample simulations in the practically important parameter space where the regression calibration model provides a good approximation failed to uncover such superiority of the two-stage method. We also discuss extensions to different data structures.
Regression calibration with more surrogates than mismeasured variables

KAUST Repository

Kipnis, Victor; Midthune, Douglas; Freedman, Laurence S.; Carroll, Raymond J.

2012-01-01

In a recent paper (Weller EA, Milton DK, Eisen EA, Spiegelman D. Regression calibration for logistic regression with multiple surrogates for one exposure. Journal of Statistical Planning and Inference 2007; 137: 449-461), the authors discussed fitting logistic regression models when a scalar main explanatory variable is measured with error by several surrogates, that is, a situation with more surrogates than variables measured with error. They compared two methods of adjusting for measurement error using a regression calibration approximate model as if it were exact. One is the standard regression calibration approach consisting of substituting an estimated conditional expectation of the true covariate given observed data in the logistic regression. The other is a novel two-stage approach when the logistic regression is fitted to multiple surrogates, and then a linear combination of estimated slopes is formed as the estimate of interest. Applying estimated asymptotic variances for both methods in a single data set with some sensitivity analysis, the authors asserted superiority of their two-stage approach. We investigate this claim in some detail. A troubling aspect of the proposed two-stage method is that, unlike standard regression calibration and a natural form of maximum likelihood, the resulting estimates are not invariant to reparameterization of nuisance parameters in the model. We show, however, that, under the regression calibration approximation, the two-stage method is asymptotically equivalent to a maximum likelihood formulation, and is therefore in theory superior to standard regression calibration. However, our extensive finite-sample simulations in the practically important parameter space where the regression calibration model provides a good approximation failed to uncover such superiority of the two-stage method. We also discuss extensions to different data structures.
Few crystal balls are crystal clear : eyeballing regression

International Nuclear Information System (INIS)

Wittebrood, R.T.

1998-01-01

The theory of regression and statistical analysis as it applies to reservoir analysis was discussed. It was argued that regression lines are not always the final truth. It was suggested that regression lines and eyeballed lines are often equally accurate. The many conditions that must be fulfilled to calculate a proper regression were discussed. Mentioned among these conditions were the distribution of the data, hidden variables, knowledge of how the data was obtained, the need for causal correlation of the variables, and knowledge of the manner in which the regression results are going to be used. 1 tab., 13 figs
Regression methods for medical research

CERN Document Server

Tai, Bee Choo

2013-01-01

Regression Methods for Medical Research provides medical researchers with the skills they need to critically read and interpret research using more advanced statistical methods. The statistical requirements of interpreting and publishing in medical journals, together with rapid changes in science and technology, increasingly demands an understanding of more complex and sophisticated analytic procedures.The text explains the application of statistical models to a wide variety of practical medical investigative studies and clinical trials. Regression methods are used to appropriately answer the
Should metacognition be measured by logistic regression?

Science.gov (United States)

Rausch, Manuel; Zehetleitner, Michael

2017-03-01

Are logistic regression slopes suitable to quantify metacognitive sensitivity, i.e. the efficiency with which subjective reports differentiate between correct and incorrect task responses? We analytically show that logistic regression slopes are independent from rating criteria in one specific model of metacognition, which assumes (i) that rating decisions are based on sensory evidence generated independently of the sensory evidence used for primary task responses and (ii) that the distributions of evidence are logistic. Given a hierarchical model of metacognition, logistic regression slopes depend on rating criteria. According to all considered models, regression slopes depend on the primary task criterion. A reanalysis of previous data revealed that massive numbers of trials are required to distinguish between hierarchical and independent models with tolerable accuracy. It is argued that researchers who wish to use logistic regression as measure of metacognitive sensitivity need to control the primary task criterion and rating criteria. Copyright © 2017 Elsevier Inc. All rights reserved.
BOX-COX REGRESSION METHOD IN TIME SCALING

Directory of Open Access Journals (Sweden)

ATİLLA GÖKTAŞ

2013-06-01

Full Text Available Box-Cox regression method with λj, for j = 1, 2, ..., k, power transformation can be used when dependent variable and error term of the linear regression model do not satisfy the continuity and normality assumptions. The situation obtaining the smallest mean square error when optimum power λj, transformation for j = 1, 2, ..., k, of Y has been discussed. Box-Cox regression method is especially appropriate to adjust existence skewness or heteroscedasticity of error terms for a nonlinear functional relationship between dependent and explanatory variables. In this study, the advantage and disadvantage use of Box-Cox regression method have been discussed in differentiation and differantial analysis of time scale concept.
Gaussian Process Regression Model in Spatial Logistic Regression

Science.gov (United States)

Sofro, A.; Oktaviarina, A.

2018-01-01

Spatial analysis has developed very quickly in the last decade. One of the favorite approaches is based on the neighbourhood of the region. Unfortunately, there are some limitations such as difficulty in prediction. Therefore, we offer Gaussian process regression (GPR) to accommodate the issue. In this paper, we will focus on spatial modeling with GPR for binomial data with logit link function. The performance of the model will be investigated. We will discuss the inference of how to estimate the parameters and hyper-parameters and to predict as well. Furthermore, simulation studies will be explained in the last section.
Regression Analysis and the Sociological Imagination

Science.gov (United States)

De Maio, Fernando

2014-01-01

Regression analysis is an important aspect of most introductory statistics courses in sociology but is often presented in contexts divorced from the central concerns that bring students into the discipline. Consequently, we present five lesson ideas that emerge from a regression analysis of income inequality and mortality in the USA and Canada.
An Additive-Multiplicative Cox-Aalen Regression Model

DEFF Research Database (Denmark)

Scheike, Thomas H.; Zhang, Mei-Jie

2002-01-01

Aalen model; additive risk model; counting processes; Cox regression; survival analysis; time-varying effects......Aalen model; additive risk model; counting processes; Cox regression; survival analysis; time-varying effects...
Model-based Quantile Regression for Discrete Data

KAUST Repository

Padellini, Tullia

2018-04-10

Quantile regression is a class of methods voted to the modelling of conditional quantiles. In a Bayesian framework quantile regression has typically been carried out exploiting the Asymmetric Laplace Distribution as a working likelihood. Despite the fact that this leads to a proper posterior for the regression coefficients, the resulting posterior variance is however affected by an unidentifiable parameter, hence any inferential procedure beside point estimation is unreliable. We propose a model-based approach for quantile regression that considers quantiles of the generating distribution directly, and thus allows for a proper uncertainty quantification. We then create a link between quantile regression and generalised linear models by mapping the quantiles to the parameter of the response variable, and we exploit it to fit the model with R-INLA. We extend it also in the case of discrete responses, where there is no 1-to-1 relationship between quantiles and distribution\\'s parameter, by introducing continuous generalisations of the most common discrete variables (Poisson, Binomial and Negative Binomial) to be exploited in the fitting.
riskRegression

DEFF Research Database (Denmark)

Ozenne, Brice; Sørensen, Anne Lyngholm; Scheike, Thomas

2017-01-01

In the presence of competing risks a prediction of the time-dynamic absolute risk of an event can be based on cause-specific Cox regression models for the event and the competing risks (Benichou and Gail, 1990). We present computationally fast and memory optimized C++ functions with an R interface......-product we obtain fast access to the baseline hazards (compared to survival::basehaz()) and predictions of survival probabilities, their confidence intervals and confidence bands. Confidence intervals and confidence bands are based on point-wise asymptotic expansions of the corresponding statistical...
FACIES PARTITIONING AND SEQUENCE STRATIGRAPHY OF A MIXED SILICICLASTIC-CARBONATE RAMP STACK IN THE GELASIAN OF SICILY (S ITALY: A POTENTIAL MODEL FOR ICEHOUSE, DISTALLY-STEEPENED HETEROZOAN RAMPS

Directory of Open Access Journals (Sweden)

FRANCESCO MASSARI

2012-11-01

Full Text Available The Gelasian succession of the Capodarso area (Enna-Caltanissetta basin, Sicily, Italy consists of an offlapping stack of cycles composed of siliciclastic units passing to carbonate heterozoan, clino-stratified wedges, developed from a growing positive tectonic structure. Identification of a number of facies tracts, based on sedimentary facies, biofacies and taphofacies, provided important information about the differentiation and characterisation of systems tracts and key stratal surfaces of sequence stratigraphy. The bulk of carbonate wedges are interpreted as representing the rapid falling-stage progradation of distally steepened ramps. The inferred highest rate of carbonate production during forced regressions was concomitant with active downramp resedimentation by storm-driven downwelling flows, leading to storing of most carbonate sediment on the ramp slope as clino-beds of the prograding bodies. Comparison of the Capodarso ramps with other icehouse carbonate ramps, with particular regard to the Mediterranean Plio-Pleistocene, provides clues for defining some common features. These are inferred to include: (1 brief, rapid episodes of progradation concomitant with orbitally-forced sea-level changes, resulting in limited ramp width; (2 preferential fostering of growth and downramp resedimentation of heterozoan carbonates during glacial hemicycles marked by enhanced atmospheric and marine circulation; (3 building out from positive features of entirely submerged distally-steepened ramps with storm-wave-graded profile and distinctive clinoforms; (4 ramp stacks generally consisting of mixed clastic-carbonate sequences showing an ordered spectrum of distinct frequencies; (5 rapid, continuous changes in environmental parameters, leading to the short-lived persistence of faunal communities, climax communities generally having insufficient time to form.
Real estate value prediction using multivariate regression models

Science.gov (United States)

Manjula, R.; Jain, Shubham; Srivastava, Sharad; Rajiv Kher, Pranav

2017-11-01

The real estate market is one of the most competitive in terms of pricing and the same tends to vary significantly based on a lot of factors, hence it becomes one of the prime fields to apply the concepts of machine learning to optimize and predict the prices with high accuracy. Therefore in this paper, we present various important features to use while predicting housing prices with good accuracy. We have described regression models, using various features to have lower Residual Sum of Squares error. While using features in a regression model some feature engineering is required for better prediction. Often a set of features (multiple regressions) or polynomial regression (applying a various set of powers in the features) is used for making better model fit. For these models are expected to be susceptible towards over fitting ridge regression is used to reduce it. This paper thus directs to the best application of regression models in addition to other techniques to optimize the result.
Computing multiple-output regression quantile regions

Czech Academy of Sciences Publication Activity Database

Paindaveine, D.; Šiman, Miroslav

2012-01-01

Roč. 56, č. 4 (2012), s. 840-853 ISSN 0167-9473 R&D Projects: GA MŠk(CZ) 1M06047 Institutional research plan: CEZ:AV0Z10750506 Keywords : halfspace depth * multiple-output regression * parametric linear programming * quantile regression Subject RIV: BA - General Mathematics Impact factor: 1.304, year: 2012 http://library.utia.cas.cz/separaty/2012/SI/siman-0376413.pdf
Preface to Berk's "Regression Analysis: A Constructive Critique"

OpenAIRE

de Leeuw, Jan

2003-01-01

It is pleasure to write a preface for the book ”Regression Analysis” of my fellow series editor Dick Berk. And it is a pleasure in particular because the book is about regression analysis, the most popular and the most fundamental technique in applied statistics. And because it is critical of the way regression analysis is used in the sciences, in particular in the social and behavioral sciences. Although the book can be read as an introduction to regression analysis, it can also be read as a...
Five cases of caudal regression with an aberrant abdominal umbilical artery: Further support for a caudal regression-sirenomelia spectrum.

Science.gov (United States)

Duesterhoeft, Sara M; Ernst, Linda M; Siebert, Joseph R; Kapur, Raj P

2007-12-15

Sirenomelia and caudal regression have sparked centuries of interest and recent debate regarding their classification and pathogenetic relationship. Specific anomalies are common to both conditions, but aside from fusion of the lower extremities, an aberrant abdominal umbilical artery ("persistent vitelline artery") has been invoked as the chief anatomic finding that distinguishes sirenomelia from caudal regression. This observation is important from a pathogenetic viewpoint, in that diversion of blood away from the caudal portion of the embryo through the abdominal umbilical artery ("vascular steal") has been proposed as the primary mechanism leading to sirenomelia. In contrast, caudal regression is hypothesized to arise from primary deficiency of caudal mesoderm. We present five cases of caudal regression that exhibit an aberrant abdominal umbilical artery similar to that typically associated with sirenomelia. Review of the literature identified four similar cases. Collectively, the series lends support for a caudal regression-sirenomelia spectrum with a common pathogenetic basis and suggests that abnormal umbilical arterial anatomy may be the consequence, rather than the cause, of deficient caudal mesoderm. (c) 2007 Wiley-Liss, Inc.
Model-based Quantile Regression for Discrete Data

KAUST Repository

Padellini, Tullia; Rue, Haavard

2018-01-01

Quantile regression is a class of methods voted to the modelling of conditional quantiles. In a Bayesian framework quantile regression has typically been carried out exploiting the Asymmetric Laplace Distribution as a working likelihood. Despite
Linear Regression Analysis

CERN Document Server

Seber, George A F

2012-01-01

Concise, mathematically clear, and comprehensive treatment of the subject.* Expanded coverage of diagnostics and methods of model fitting.* Requires no specialized knowledge beyond a good grasp of matrix algebra and some acquaintance with straight-line regression and simple analysis of variance models.* More than 200 problems throughout the book plus outline solutions for the exercises.* This revision has been extensively class-tested.

Moderation analysis using a two-level regression model.

Science.gov (United States)

Yuan, Ke-Hai; Cheng, Ying; Maxwell, Scott

2014-10-01

Moderation analysis is widely used in social and behavioral research. The most commonly used model for moderation analysis is moderated multiple regression (MMR) in which the explanatory variables of the regression model include product terms, and the model is typically estimated by least squares (LS). This paper argues for a two-level regression model in which the regression coefficients of a criterion variable on predictors are further regressed on moderator variables. An algorithm for estimating the parameters of the two-level model by normal-distribution-based maximum likelihood (NML) is developed. Formulas for the standard errors (SEs) of the parameter estimates are provided and studied. Results indicate that, when heteroscedasticity exists, NML with the two-level model gives more efficient and more accurate parameter estimates than the LS analysis of the MMR model. When error variances are homoscedastic, NML with the two-level model leads to essentially the same results as LS with the MMR model. Most importantly, the two-level regression model permits estimating the percentage of variance of each regression coefficient that is due to moderator variables. When applied to data from General Social Surveys 1991, NML with the two-level model identified a significant moderation effect of race on the regression of job prestige on years of education while LS with the MMR model did not. An R package is also developed and documented to facilitate the application of the two-level model.
Independent contrasts and PGLS regression estimators are equivalent.

Science.gov (United States)

Blomberg, Simon P; Lefevre, James G; Wells, Jessie A; Waterhouse, Mary

2012-05-01

We prove that the slope parameter of the ordinary least squares regression of phylogenetically independent contrasts (PICs) conducted through the origin is identical to the slope parameter of the method of generalized least squares (GLSs) regression under a Brownian motion model of evolution. This equivalence has several implications: 1. Understanding the structure of the linear model for GLS regression provides insight into when and why phylogeny is important in comparative studies. 2. The limitations of the PIC regression analysis are the same as the limitations of the GLS model. In particular, phylogenetic covariance applies only to the response variable in the regression and the explanatory variable should be regarded as fixed. Calculation of PICs for explanatory variables should be treated as a mathematical idiosyncrasy of the PIC regression algorithm. 3. Since the GLS estimator is the best linear unbiased estimator (BLUE), the slope parameter estimated using PICs is also BLUE. 4. If the slope is estimated using different branch lengths for the explanatory and response variables in the PIC algorithm, the estimator is no longer the BLUE, so this is not recommended. Finally, we discuss whether or not and how to accommodate phylogenetic covariance in regression analyses, particularly in relation to the problem of phylogenetic uncertainty. This discussion is from both frequentist and Bayesian perspectives.
Methods for identifying SNP interactions: a review on variations of Logic Regression, Random Forest and Bayesian logistic regression.

Science.gov (United States)

Chen, Carla Chia-Ming; Schwender, Holger; Keith, Jonathan; Nunkesser, Robin; Mengersen, Kerrie; Macrossan, Paula

2011-01-01

Due to advancements in computational ability, enhanced technology and a reduction in the price of genotyping, more data are being generated for understanding genetic associations with diseases and disorders. However, with the availability of large data sets comes the inherent challenges of new methods of statistical analysis and modeling. Considering a complex phenotype may be the effect of a combination of multiple loci, various statistical methods have been developed for identifying genetic epistasis effects. Among these methods, logic regression (LR) is an intriguing approach incorporating tree-like structures. Various methods have built on the original LR to improve different aspects of the model. In this study, we review four variations of LR, namely Logic Feature Selection, Monte Carlo Logic Regression, Genetic Programming for Association Studies, and Modified Logic Regression-Gene Expression Programming, and investigate the performance of each method using simulated and real genotype data. We contrast these with another tree-like approach, namely Random Forests, and a Bayesian logistic regression with stochastic search variable selection.
Demonstration of a Fiber Optic Regression Probe

Science.gov (United States)

Korman, Valentin; Polzin, Kurt A.

2010-01-01

The capability to provide localized, real-time monitoring of material regression rates in various applications has the potential to provide a new stream of data for development testing of various components and systems, as well as serving as a monitoring tool in flight applications. These applications include, but are not limited to, the regression of a combusting solid fuel surface, the ablation of the throat in a chemical rocket or the heat shield of an aeroshell, and the monitoring of erosion in long-life plasma thrusters. The rate of regression in the first application is very fast, while the second and third are increasingly slower. A recent fundamental sensor development effort has led to a novel regression, erosion, and ablation sensor technology (REAST). The REAST sensor allows for measurement of real-time surface erosion rates at a discrete surface location. The sensor is optical, using two different, co-located fiber-optics to perform the regression measurement. The disparate optical transmission properties of the two fiber-optics makes it possible to measure the regression rate by monitoring the relative light attenuation through the fibers. As the fibers regress along with the parent material in which they are embedded, the relative light intensities through the two fibers changes, providing a measure of the regression rate. The optical nature of the system makes it relatively easy to use in a variety of harsh, high temperature environments, and it is also unaffected by the presence of electric and magnetic fields. In addition, the sensor could be used to perform optical spectroscopy on the light emitted by a process and collected by fibers, giving localized measurements of various properties. The capability to perform an in-situ measurement of material regression rates is useful in addressing a variety of physical issues in various applications. An in-situ measurement allows for real-time data regarding the erosion rates, providing a quick method for
Caudal regression syndrome : a case report

International Nuclear Information System (INIS)

Lee, Eun Joo; Kim, Hi Hye; Kim, Hyung Sik; Park, So Young; Han, Hye Young; Lee, Kwang Hun

1998-01-01

Caudal regression syndrome is a rare congenital anomaly, which results from a developmental failure of the caudal mesoderm during the fetal period. We present a case of caudal regression syndrome composed of a spectrum of anomalies including sirenomelia, dysplasia of the lower lumbar vertebrae, sacrum, coccyx and pelvic bones,genitourinary and anorectal anomalies, and dysplasia of the lung, as seen during infantography and MR imaging
Caudal regression syndrome : a case report

Energy Technology Data Exchange (ETDEWEB)

Lee, Eun Joo; Kim, Hi Hye; Kim, Hyung Sik; Park, So Young; Han, Hye Young; Lee, Kwang Hun [Chungang Gil Hospital, Incheon (Korea, Republic of)

1998-07-01

Caudal regression syndrome is a rare congenital anomaly, which results from a developmental failure of the caudal mesoderm during the fetal period. We present a case of caudal regression syndrome composed of a spectrum of anomalies including sirenomelia, dysplasia of the lower lumbar vertebrae, sacrum, coccyx and pelvic bones,genitourinary and anorectal anomalies, and dysplasia of the lung, as seen during infantography and MR imaging.
Correlation and simple linear regression.

Science.gov (United States)

Zou, Kelly H; Tuncali, Kemal; Silverman, Stuart G

2003-06-01

In this tutorial article, the concepts of correlation and regression are reviewed and demonstrated. The authors review and compare two correlation coefficients, the Pearson correlation coefficient and the Spearman rho, for measuring linear and nonlinear relationships between two continuous variables. In the case of measuring the linear relationship between a predictor and an outcome variable, simple linear regression analysis is conducted. These statistical concepts are illustrated by using a data set from published literature to assess a computed tomography-guided interventional technique. These statistical methods are important for exploring the relationships between variables and can be applied to many radiologic studies.
bayesQR: A Bayesian Approach to Quantile Regression

Directory of Open Access Journals (Sweden)

Dries F. Benoit

2017-01-01

Full Text Available After its introduction by Koenker and Basset (1978, quantile regression has become an important and popular tool to investigate the conditional response distribution in regression. The R package bayesQR contains a number of routines to estimate quantile regression parameters using a Bayesian approach based on the asymmetric Laplace distribution. The package contains functions for the typical quantile regression with continuous dependent variable, but also supports quantile regression for binary dependent variables. For both types of dependent variables, an approach to variable selection using the adaptive lasso approach is provided. For the binary quantile regression model, the package also contains a routine that calculates the fitted probabilities for each vector of predictors. In addition, functions for summarizing the results, creating traceplots, posterior histograms and drawing quantile plots are included. This paper starts with a brief overview of the theoretical background of the models used in the bayesQR package. The main part of this paper discusses the computational problems that arise in the implementation of the procedure and illustrates the usefulness of the package through selected examples.
Multivariate Linear Regression and CART Regression Analysis of TBM Performance at Abu Hamour Phase-I Tunnel

Science.gov (United States)

Jakubowski, J.; Stypulkowski, J. B.; Bernardeau, F. G.

2017-12-01

The first phase of the Abu Hamour drainage and storm tunnel was completed in early 2017. The 9.5 km long, 3.7 m diameter tunnel was excavated with two Earth Pressure Balance (EPB) Tunnel Boring Machines from Herrenknecht. TBM operation processes were monitored and recorded by Data Acquisition and Evaluation System. The authors coupled collected TBM drive data with available information on rock mass properties, cleansed, completed with secondary variables and aggregated by weeks and shifts. Correlations and descriptive statistics charts were examined. Multivariate Linear Regression and CART regression tree models linking TBM penetration rate (PR), penetration per revolution (PPR) and field penetration index (FPI) with TBM operational and geotechnical characteristics were performed for the conditions of the weak/soft rock of Doha. Both regression methods are interpretable and the data were screened with different computational approaches allowing enriched insight. The primary goal of the analysis was to investigate empirical relations between multiple explanatory and responding variables, to search for best subsets of explanatory variables and to evaluate the strength of linear and non-linear relations. For each of the penetration indices, a predictive model coupling both regression methods was built and validated. The resultant models appeared to be stronger than constituent ones and indicated an opportunity for more accurate and robust TBM performance predictions.
Background stratified Poisson regression analysis of cohort data.

Science.gov (United States)

Richardson, David B; Langholz, Bryan

2012-03-01

Background stratified Poisson regression is an approach that has been used in the analysis of data derived from a variety of epidemiologically important studies of radiation-exposed populations, including uranium miners, nuclear industry workers, and atomic bomb survivors. We describe a novel approach to fit Poisson regression models that adjust for a set of covariates through background stratification while directly estimating the radiation-disease association of primary interest. The approach makes use of an expression for the Poisson likelihood that treats the coefficients for stratum-specific indicator variables as 'nuisance' variables and avoids the need to explicitly estimate the coefficients for these stratum-specific parameters. Log-linear models, as well as other general relative rate models, are accommodated. This approach is illustrated using data from the Life Span Study of Japanese atomic bomb survivors and data from a study of underground uranium miners. The point estimate and confidence interval obtained from this 'conditional' regression approach are identical to the values obtained using unconditional Poisson regression with model terms for each background stratum. Moreover, it is shown that the proposed approach allows estimation of background stratified Poisson regression models of non-standard form, such as models that parameterize latency effects, as well as regression models in which the number of strata is large, thereby overcoming the limitations of previously available statistical software for fitting background stratified Poisson regression models.
Variable importance in latent variable regression models

NARCIS (Netherlands)

Kvalheim, O.M.; Arneberg, R.; Bleie, O.; Rajalahti, T.; Smilde, A.K.; Westerhuis, J.A.

2014-01-01

The quality and practical usefulness of a regression model are a function of both interpretability and prediction performance. This work presents some new graphical tools for improved interpretation of latent variable regression models that can also assist in improved algorithms for variable
Regression: The Apple Does Not Fall Far From the Tree.

Science.gov (United States)

Vetter, Thomas R; Schober, Patrick

2018-05-15

Researchers and clinicians are frequently interested in either: (1) assessing whether there is a relationship or association between 2 or more variables and quantifying this association; or (2) determining whether 1 or more variables can predict another variable. The strength of such an association is mainly described by the correlation. However, regression analysis and regression models can be used not only to identify whether there is a significant relationship or association between variables but also to generate estimations of such a predictive relationship between variables. This basic statistical tutorial discusses the fundamental concepts and techniques related to the most common types of regression analysis and modeling, including simple linear regression, multiple regression, logistic regression, ordinal regression, and Poisson regression, as well as the common yet often underrecognized phenomenon of regression toward the mean. The various types of regression analysis are powerful statistical techniques, which when appropriately applied, can allow for the valid interpretation of complex, multifactorial data. Regression analysis and models can assess whether there is a relationship or association between 2 or more observed variables and estimate the strength of this association, as well as determine whether 1 or more variables can predict another variable. Regression is thus being applied more commonly in anesthesia, perioperative, critical care, and pain research. However, it is crucial to note that regression can identify plausible risk factors; it does not prove causation (a definitive cause and effect relationship). The results of a regression analysis instead identify independent (predictor) variable(s) associated with the dependent (outcome) variable. As with other statistical methods, applying regression requires that certain assumptions be met, which can be tested with specific diagnostics.
Multiple regression and beyond an introduction to multiple regression and structural equation modeling

CERN Document Server

Keith, Timothy Z

2014-01-01

Multiple Regression and Beyond offers a conceptually oriented introduction to multiple regression (MR) analysis and structural equation modeling (SEM), along with analyses that flow naturally from those methods. By focusing on the concepts and purposes of MR and related methods, rather than the derivation and calculation of formulae, this book introduces material to students more clearly, and in a less threatening way. In addition to illuminating content necessary for coursework, the accessibility of this approach means students are more likely to be able to conduct research using MR or SEM--and more likely to use the methods wisely. Covers both MR and SEM, while explaining their relevance to one another Also includes path analysis, confirmatory factor analysis, and latent growth modeling Figures and tables throughout provide examples and illustrate key concepts and techniques For additional resources, please visit: http://tzkeith.com/.
Quasi-experimental evidence on tobacco tax regressivity.

Science.gov (United States)

Koch, Steven F

2018-01-01

Tobacco taxes are known to reduce tobacco consumption and to be regressive, such that tobacco control policy may have the perverse effect of further harming the poor. However, if tobacco consumption falls faster amongst the poor than the rich, tobacco control policy can actually be progressive. We take advantage of persistent and committed tobacco control activities in South Africa to examine the household tobacco expenditure burden. For the analysis, we make use of two South African Income and Expenditure Surveys (2005/06 and 2010/11) that span a series of such tax increases and have been matched across the years, yielding 7806 matched pairs of tobacco consuming households and 4909 matched pairs of cigarette consuming households. By matching households across the surveys, we are able to examine both the regressivity of the household tobacco burden, and any change in that regressivity, and since tobacco taxes have been a consistent component of tobacco prices, our results also relate to the regressivity of tobacco taxes. Like previous research into cigarette and tobacco expenditures, we find that the tobacco burden is regressive; thus, so are tobacco taxes. However, we find that over the five-year period considered, the tobacco burden has decreased, and, most importantly, falls less heavily on the poor. Thus, the tobacco burden and the tobacco tax is less regressive in 2010/11 than in 2005/06. Thus, increased tobacco taxes can, in at least some circumstances, reduce the financial burden that tobacco places on households. Copyright © 2017 Elsevier Ltd. All rights reserved.
Polylinear regression analysis in radiochemistry

International Nuclear Information System (INIS)

Kopyrin, A.A.; Terent'eva, T.N.; Khramov, N.N.

1995-01-01

A number of radiochemical problems have been formulated in the framework of polylinear regression analysis, which permits the use of conventional mathematical methods for their solution. The authors have considered features of the use of polylinear regression analysis for estimating the contributions of various sources to the atmospheric pollution, for studying irradiated nuclear fuel, for estimating concentrations from spectral data, for measuring neutron fields of a nuclear reactor, for estimating crystal lattice parameters from X-ray diffraction patterns, for interpreting data of X-ray fluorescence analysis, for estimating complex formation constants, and for analyzing results of radiometric measurements. The problem of estimating the target parameters can be incorrect at certain properties of the system under study. The authors showed the possibility of regularization by adding a fictitious set of data open-quotes obtainedclose quotes from the orthogonal design. To estimate only a part of the parameters under consideration, the authors used incomplete rank models. In this case, it is necessary to take into account the possibility of confounding estimates. An algorithm for evaluating the degree of confounding is presented which is realized using standard software or regression analysis
Influence diagnostics in meta-regression model.

Science.gov (United States)

Shi, Lei; Zuo, ShanShan; Yu, Dalei; Zhou, Xiaohua

2017-09-01

This paper studies the influence diagnostics in meta-regression model including case deletion diagnostic and local influence analysis. We derive the subset deletion formulae for the estimation of regression coefficient and heterogeneity variance and obtain the corresponding influence measures. The DerSimonian and Laird estimation and maximum likelihood estimation methods in meta-regression are considered, respectively, to derive the results. Internal and external residual and leverage measure are defined. The local influence analysis based on case-weights perturbation scheme, responses perturbation scheme, covariate perturbation scheme, and within-variance perturbation scheme are explored. We introduce a method by simultaneous perturbing responses, covariate, and within-variance to obtain the local influence measure, which has an advantage of capable to compare the influence magnitude of influential studies from different perturbations. An example is used to illustrate the proposed methodology. Copyright © 2017 John Wiley & Sons, Ltd.
Are Deltaic Subaqueous Clinothems One-Highstand Affairs?

Science.gov (United States)

Giosan, L.; Clift, P.; Henstock, T.; Ponton, C.; Limmer, D. R.

2009-12-01

Clinothems are basic building blocks of continental shelves, whether modern or ancient. In many cases large delta-building rivers directly construct subaqueous clinothems on the shelf that are offset offshore from the delta coast. Assuming that the sediment flux to the shelf and the sediment redistributing processes are suitable for subaqueous clinothem development, the evolution of any subaqueous clinothems depends primarily on the availability of accommodation space. As the eustatic sea level varies with the volume of global ice, one primary mechanism of creating accommodation space on shelves is erosion during lowstands. We discuss here possible mechanisms for clinothems to survive erosion during lowstands by examining new data from the Indus delta shelf offshore Pakistan. Theoretical considerations based on estimates of the relative importance of wave energy vs. fluvial sediment delivery suggest that the Indus delta should develop a mid-shelf subaqueous clinothem. Instead, the Indus shelf exhibits a compound clinoform morphology. A shallow delta front clinoform extends along the entire delta coast from the shoreline to the 10-25 m water depth. New seismic data confirm that a mid-shelf clinothem developed between 30 and 90 m water depth extending over 100 km offshore east of the Indus canyon but less than 30 km west of the canyon. The advanced position of the eastern mid-shelf clinothem might reflect either a prolonged sediment delivery from the Indus River in that area compared to the shelf west of the canyon or the presence of a relict pre-Holocene mid-shelf delta.
Variable and subset selection in PLS regression

DEFF Research Database (Denmark)

Høskuldsson, Agnar

2001-01-01

The purpose of this paper is to present some useful methods for introductory analysis of variables and subsets in relation to PLS regression. We present here methods that are efficient in finding the appropriate variables or subset to use in the PLS regression. The general conclusion...... is that variable selection is important for successful analysis of chemometric data. An important aspect of the results presented is that lack of variable selection can spoil the PLS regression, and that cross-validation measures using a test set can show larger variation, when we use different subsets of X, than...
Ridge Regression Signal Processing

Science.gov (United States)

Kuhl, Mark R.

1990-01-01

The introduction of the Global Positioning System (GPS) into the National Airspace System (NAS) necessitates the development of Receiver Autonomous Integrity Monitoring (RAIM) techniques. In order to guarantee a certain level of integrity, a thorough understanding of modern estimation techniques applied to navigational problems is required. The extended Kalman filter (EKF) is derived and analyzed under poor geometry conditions. It was found that the performance of the EKF is difficult to predict, since the EKF is designed for a Gaussian environment. A novel approach is implemented which incorporates ridge regression to explain the behavior of an EKF in the presence of dynamics under poor geometry conditions. The basic principles of ridge regression theory are presented, followed by the derivation of a linearized recursive ridge estimator. Computer simulations are performed to confirm the underlying theory and to provide a comparative analysis of the EKF and the recursive ridge estimator.
Regression filter for signal resolution

International Nuclear Information System (INIS)

Matthes, W.

1975-01-01

The problem considered is that of resolving a measured pulse height spectrum of a material mixture, e.g. gamma ray spectrum, Raman spectrum, into a weighed sum of the spectra of the individual constituents. The model on which the analytical formulation is based is described. The problem reduces to that of a multiple linear regression. A stepwise linear regression procedure was constructed. The efficiency of this method was then tested by transforming the procedure in a computer programme which was used to unfold test spectra obtained by mixing some spectra, from a library of arbitrary chosen spectra, and adding a noise component. (U.K.)

Direction of Effects in Multiple Linear Regression Models.

Science.gov (United States)

Wiedermann, Wolfgang; von Eye, Alexander

2015-01-01

Previous studies analyzed asymmetric properties of the Pearson correlation coefficient using higher than second order moments. These asymmetric properties can be used to determine the direction of dependence in a linear regression setting (i.e., establish which of two variables is more likely to be on the outcome side) within the framework of cross-sectional observational data. Extant approaches are restricted to the bivariate regression case. The present contribution extends the direction of dependence methodology to a multiple linear regression setting by analyzing distributional properties of residuals of competing multiple regression models. It is shown that, under certain conditions, the third central moments of estimated regression residuals can be used to decide upon direction of effects. In addition, three different approaches for statistical inference are discussed: a combined D'Agostino normality test, a skewness difference test, and a bootstrap difference test. Type I error and power of the procedures are assessed using Monte Carlo simulations, and an empirical example is provided for illustrative purposes. In the discussion, issues concerning the quality of psychological data, possible extensions of the proposed methods to the fourth central moment of regression residuals, and potential applications are addressed.
Robust mislabel logistic regression without modeling mislabel probabilities.

Science.gov (United States)

Hung, Hung; Jou, Zhi-Yu; Huang, Su-Yun

2018-03-01

Logistic regression is among the most widely used statistical methods for linear discriminant analysis. In many applications, we only observe possibly mislabeled responses. Fitting a conventional logistic regression can then lead to biased estimation. One common resolution is to fit a mislabel logistic regression model, which takes into consideration of mislabeled responses. Another common method is to adopt a robust M-estimation by down-weighting suspected instances. In this work, we propose a new robust mislabel logistic regression based on γ-divergence. Our proposal possesses two advantageous features: (1) It does not need to model the mislabel probabilities. (2) The minimum γ-divergence estimation leads to a weighted estimating equation without the need to include any bias correction term, that is, it is automatically bias-corrected. These features make the proposed γ-logistic regression more robust in model fitting and more intuitive for model interpretation through a simple weighting scheme. Our method is also easy to implement, and two types of algorithms are included. Simulation studies and the Pima data application are presented to demonstrate the performance of γ-logistic regression. © 2017, The International Biometric Society.
A Simulation Investigation of Principal Component Regression.

Science.gov (United States)

Allen, David E.

Regression analysis is one of the more common analytic tools used by researchers. However, multicollinearity between the predictor variables can cause problems in using the results of regression analyses. Problems associated with multicollinearity include entanglement of relative influences of variables due to reduced precision of estimation,…
Hierarchical regression analysis in structural Equation Modeling

NARCIS (Netherlands)

de Jong, P.F.

1999-01-01

In a hierarchical or fixed-order regression analysis, the independent variables are entered into the regression equation in a prespecified order. Such an analysis is often performed when the extra amount of variance accounted for in a dependent variable by a specific independent variable is the main
Repeated Results Analysis for Middleware Regression Benchmarking

Czech Academy of Sciences Publication Activity Database

Bulej, Lubomír; Kalibera, T.; Tůma, P.

2005-01-01

Roč. 60, - (2005), s. 345-358 ISSN 0166-5316 R&D Projects: GA ČR GA102/03/0672 Institutional research plan: CEZ:AV0Z10300504 Keywords : middleware benchmarking * regression benchmarking * regression testing Subject RIV: JD - Computer Applications, Robotics Impact factor: 0.756, year: 2005
and Multinomial Logistic Regression

African Journals Online (AJOL)

This work presented the results of an experimental comparison of two models: Multinomial Logistic Regression (MLR) and Artificial Neural Network (ANN) for classifying students based on their academic performance. The predictive accuracy for each model was measured by their average Classification Correct Rate (CCR).
Modeling Fire Occurrence at the City Scale: A Comparison between Geographically Weighted Regression and Global Linear Regression.

Science.gov (United States)

Song, Chao; Kwan, Mei-Po; Zhu, Jiping

2017-04-08

An increasing number of fires are occurring with the rapid development of cities, resulting in increased risk for human beings and the environment. This study compares geographically weighted regression-based models, including geographically weighted regression (GWR) and geographically and temporally weighted regression (GTWR), which integrates spatial and temporal effects and global linear regression models (LM) for modeling fire risk at the city scale. The results show that the road density and the spatial distribution of enterprises have the strongest influences on fire risk, which implies that we should focus on areas where roads and enterprises are densely clustered. In addition, locations with a large number of enterprises have fewer fire ignition records, probably because of strict management and prevention measures. A changing number of significant variables across space indicate that heterogeneity mainly exists in the northern and eastern rural and suburban areas of Hefei city, where human-related facilities or road construction are only clustered in the city sub-centers. GTWR can capture small changes in the spatiotemporal heterogeneity of the variables while GWR and LM cannot. An approach that integrates space and time enables us to better understand the dynamic changes in fire risk. Thus governments can use the results to manage fire safety at the city scale.
Background stratified Poisson regression analysis of cohort data

International Nuclear Information System (INIS)

Richardson, David B.; Langholz, Bryan

2012-01-01

Background stratified Poisson regression is an approach that has been used in the analysis of data derived from a variety of epidemiologically important studies of radiation-exposed populations, including uranium miners, nuclear industry workers, and atomic bomb survivors. We describe a novel approach to fit Poisson regression models that adjust for a set of covariates through background stratification while directly estimating the radiation-disease association of primary interest. The approach makes use of an expression for the Poisson likelihood that treats the coefficients for stratum-specific indicator variables as 'nuisance' variables and avoids the need to explicitly estimate the coefficients for these stratum-specific parameters. Log-linear models, as well as other general relative rate models, are accommodated. This approach is illustrated using data from the Life Span Study of Japanese atomic bomb survivors and data from a study of underground uranium miners. The point estimate and confidence interval obtained from this 'conditional' regression approach are identical to the values obtained using unconditional Poisson regression with model terms for each background stratum. Moreover, it is shown that the proposed approach allows estimation of background stratified Poisson regression models of non-standard form, such as models that parameterize latency effects, as well as regression models in which the number of strata is large, thereby overcoming the limitations of previously available statistical software for fitting background stratified Poisson regression models. (orig.)
Regression of uveal malignant melanomas following cobalt-60 plaque. Correlates between acoustic spectrum analysis and tumor regression

International Nuclear Information System (INIS)

Coleman, D.J.; Lizzi, F.L.; Silverman, R.H.; Ellsworth, R.M.; Haik, B.G.; Abramson, D.H.; Smith, M.E.; Rondeau, M.J.

1985-01-01

Parameters derived from computer analysis of digital radio-frequency (rf) ultrasound scan data of untreated uveal malignant melanomas were examined for correlations with tumor regression following cobalt-60 plaque. Parameters included tumor height, normalized power spectrum and acoustic tissue type (ATT). Acoustic tissue type was based upon discriminant analysis of tumor power spectra, with spectra of tumors of known pathology serving as a model. Results showed ATT to be correlated with tumor regression during the first 18 months following treatment. Tumors with ATT associated with spindle cell malignant melanoma showed over twice the percentage reduction in height as those with ATT associated with mixed/epithelioid melanomas. Pre-treatment height was only weakly correlated with regression. Additionally, significant spectral changes were observed following treatment. Ultrasonic spectrum analysis thus provides a noninvasive tool for classification, prediction and monitoring of tumor response to cobalt-60 plaque
Regression away from the mean: Theory and examples.

Science.gov (United States)

Schwarz, Wolf; Reike, Dennis

2018-02-01

Using a standard repeated measures model with arbitrary true score distribution and normal error variables, we present some fundamental closed-form results which explicitly indicate the conditions under which regression effects towards (RTM) and away from the mean are expected. Specifically, we show that for skewed and bimodal distributions many or even most cases will show a regression effect that is in expectation away from the mean, or that is not just towards but actually beyond the mean. We illustrate our results in quantitative detail with typical examples from experimental and biometric applications, which exhibit a clear regression away from the mean ('egression from the mean') signature. We aim not to repeal cautionary advice against potential RTM effects, but to present a balanced view of regression effects, based on a clear identification of the conditions governing the form that regression effects take in repeated measures designs. © 2017 The British Psychological Society.
On directional multiple-output quantile regression

Czech Academy of Sciences Publication Activity Database

Paindaveine, D.; Šiman, Miroslav

2011-01-01

Roč. 102, č. 2 (2011), s. 193-212 ISSN 0047-259X R&D Projects: GA MŠk(CZ) 1M06047 Grant - others:Commision EC(BE) Fonds National de la Recherche Scientifique Institutional research plan: CEZ:AV0Z10750506 Keywords : multivariate quantile * quantile regression * multiple-output regression * halfspace depth * portfolio optimization * value-at risk Subject RIV: BA - General Mathematics Impact factor: 0.879, year: 2011 http://library.utia.cas.cz/separaty/2011/SI/siman-0364128.pdf
Bayesian logistic regression analysis

NARCIS (Netherlands)

Van Erp, H.R.N.; Van Gelder, P.H.A.J.M.

2012-01-01

In this paper we present a Bayesian logistic regression analysis. It is found that if one wishes to derive the posterior distribution of the probability of some event, then, together with the traditional Bayes Theorem and the integrating out of nuissance parameters, the Jacobian transformation is an
Examination of influential observations in penalized spline regression

Science.gov (United States)

Türkan, Semra

2013-10-01

In parametric or nonparametric regression models, the results of regression analysis are affected by some anomalous observations in the data set. Thus, detection of these observations is one of the major steps in regression analysis. These observations are precisely detected by well-known influence measures. Pena's statistic is one of them. In this study, Pena's approach is formulated for penalized spline regression in terms of ordinary residuals and leverages. The real data and artificial data are used to see illustrate the effectiveness of Pena's statistic as to Cook's distance on detecting influential observations. The results of the study clearly reveal that the proposed measure is superior to Cook's Distance to detect these observations in large data set.
Subset selection in regression

CERN Document Server

Miller, Alan

2002-01-01

Originally published in 1990, the first edition of Subset Selection in Regression filled a significant gap in the literature, and its critical and popular success has continued for more than a decade. Thoroughly revised to reflect progress in theory, methods, and computing power, the second edition promises to continue that tradition. The author has thoroughly updated each chapter, incorporated new material on recent developments, and included more examples and references. New in the Second Edition:A separate chapter on Bayesian methodsComplete revision of the chapter on estimationA major example from the field of near infrared spectroscopyMore emphasis on cross-validationGreater focus on bootstrappingStochastic algorithms for finding good subsets from large numbers of predictors when an exhaustive search is not feasible Software available on the Internet for implementing many of the algorithms presentedMore examplesSubset Selection in Regression, Second Edition remains dedicated to the techniques for fitting...
Stepwise versus Hierarchical Regression: Pros and Cons

Science.gov (United States)

Lewis, Mitzi

2007-01-01

Multiple regression is commonly used in social and behavioral data analysis. In multiple regression contexts, researchers are very often interested in determining the "best" predictors in the analysis. This focus may stem from a need to identify those predictors that are supportive of theory. Alternatively, the researcher may simply be interested…
Nonparametric Mixture of Regression Models.

Science.gov (United States)

Huang, Mian; Li, Runze; Wang, Shaoli

2013-07-01

Motivated by an analysis of US house price index data, we propose nonparametric finite mixture of regression models. We study the identifiability issue of the proposed models, and develop an estimation procedure by employing kernel regression. We further systematically study the sampling properties of the proposed estimators, and establish their asymptotic normality. A modified EM algorithm is proposed to carry out the estimation procedure. We show that our algorithm preserves the ascent property of the EM algorithm in an asymptotic sense. Monte Carlo simulations are conducted to examine the finite sample performance of the proposed estimation procedure. An empirical analysis of the US house price index data is illustrated for the proposed methodology.
Comparing Methodologies for Developing an Early Warning System: Classification and Regression Tree Model versus Logistic Regression. REL 2015-077

Science.gov (United States)

Koon, Sharon; Petscher, Yaacov

2015-01-01

The purpose of this report was to explicate the use of logistic regression and classification and regression tree (CART) analysis in the development of early warning systems. It was motivated by state education leaders' interest in maintaining high classification accuracy while simultaneously improving practitioner understanding of the rules by…
On weighted and locally polynomial directional quantile regression

Czech Academy of Sciences Publication Activity Database

Boček, Pavel; Šiman, Miroslav

2017-01-01

Roč. 32, č. 3 (2017), s. 929-946 ISSN 0943-4062 R&D Projects: GA ČR GA14-07234S Institutional support: RVO:67985556 Keywords : Quantile regression * Nonparametric regression * Nonparametric regression Subject RIV: IN - Informatics, Computer Science OBOR OECD: Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8) Impact factor: 0.434, year: 2016 http://library.utia.cas.cz/separaty/2017/SI/bocek-0458380.pdf
Regression Benchmarking: An Approach to Quality Assurance in Performance

OpenAIRE

Bulej, Lubomír

2005-01-01

The paper presents a short summary of our work in the area of regression benchmarking and its application to software development. Specially, we explain the concept of regression benchmarking, the requirements for employing regression testing in a software project, and methods used for analyzing the vast amounts of data resulting from repeated benchmarking. We present the application of regression benchmarking on a real software project and conclude with a glimpse at the challenges for the fu...
Nonlinear Regression with R

CERN Document Server

Ritz, Christian; Parmigiani, Giovanni

2009-01-01

R is a rapidly evolving lingua franca of graphical display and statistical analysis of experiments from the applied sciences. This book provides a coherent treatment of nonlinear regression with R by means of examples from a diversity of applied sciences such as biology, chemistry, engineering, medicine and toxicology.

Bounded Gaussian process regression

DEFF Research Database (Denmark)

Jensen, Bjørn Sand; Nielsen, Jens Brehm; Larsen, Jan

2013-01-01

We extend the Gaussian process (GP) framework for bounded regression by introducing two bounded likelihood functions that model the noise on the dependent variable explicitly. This is fundamentally different from the implicit noise assumption in the previously suggested warped GP framework. We...... with the proposed explicit noise-model extension....
There is No Quantum Regression Theorem

International Nuclear Information System (INIS)

Ford, G.W.; OConnell, R.F.

1996-01-01

The Onsager regression hypothesis states that the regression of fluctuations is governed by macroscopic equations describing the approach to equilibrium. It is here asserted that this hypothesis fails in the quantum case. This is shown first by explicit calculation for the example of quantum Brownian motion of an oscillator and then in general from the fluctuation-dissipation theorem. It is asserted that the correct generalization of the Onsager hypothesis is the fluctuation-dissipation theorem. copyright 1996 The American Physical Society
Two Paradoxes in Linear Regression Analysis

Science.gov (United States)

FENG, Ge; PENG, Jing; TU, Dongke; ZHENG, Julia Z.; FENG, Changyong

2016-01-01

Summary Regression is one of the favorite tools in applied statistics. However, misuse and misinterpretation of results from regression analysis are common in biomedical research. In this paper we use statistical theory and simulation studies to clarify some paradoxes around this popular statistical method. In particular, we show that a widely used model selection procedure employed in many publications in top medical journals is wrong. Formal procedures based on solid statistical theory should be used in model selection. PMID:28638214
Long-lasting Microbial Methane Release at the Aquitaine Shelf Break (Bay of Biscay): Relation with the (Plio)-Pleistocene Sedimentary Progradation of the Continental Margin

Science.gov (United States)

Dupré, S.; Michel, G.; Pierre, C.; Ruffine, L.; Scalabrin, C.; Ehrhold, A.; Loubrieu, B.; Gautier, E.; Baltzer, A.; Imbert, P.; Battani, A.; Deville, E.; Dupont, P.; Thomas, Y.; Théréau, E.

2017-12-01

The recent identification of acoustic and visual gas release in the water column at the Aquitaine Shelf (140 and 220 m water depths) led to the discovery of a 200 km2 fluid system at the seafloor with 3000 bubbling sites associated with microbial methane (Dupré et al 2014; Ruffine et al. 2017). The moderate methane fluxes (measured in situ, on average 200 mLn/min per bubbling site) contribute to the formation of small-scale sub-circular authigenic carbonate mounds (with reliefs < 1 m in height) (Pierre et al. 2017). The emitted gases have neither a genetic link with thermogenic hydrocarbons from the Parentis Basin beneath, nor are issued from gas hydrate dissociation, but originate from microbial CO2 reduction. Based on estimated thickness and growth rate of authigenic carbonates, this system has lasted for at least several tens to possibly hundreds of kyears with a volume of escaping methane reaching 3.1012 Ln per 10 kyr. Seismic evidences for gas-charged layers and fossil authigenic carbonates point to organic matter source levels within the sedimentary deposits of the Late Pleistocene progradation system. The Aquitaine Shelf fluid system highlights the edge of continental shelves as preferential areas for bio-geological processes. The GAZCOGNE project is co-funded by TOTAL and IFREMER as part of the PAMELA (Passive Margin Exploration Laboratories) scientific project. References Dupré S, Berger L, Le Bouffant N, Scalabrin C, Bourillet J-F (2014) Fluid emissions at the Aquitaine Shelf (Bay of Biscay, France): a biogenic origin or the expression of hydrocarbon leakage? Cont. Shelf Res. 88:24-33 Pierre C, Demange J, Blanc-Valleron M-M, Dupré S (2017) Authigenic carbonate mounds from active methane seeps on the southern Aquitaine Shelf (Bay of Biscay, France): Evidence for anaerobic oxidation of biogenic methane and submarine groundwater discharge during formation. Cont. Shelf Res. 133:13-25 Ruffine L, Donval J-P, Croguennec C, Bignon L, Birot D, Battani A, Bayon
Origin and evolution of Gneiss-Charnockite rocks of Dharmapuri District, Tamil Nadu, India

Science.gov (United States)

Rao, D. Rameshwar; Narayana, B. L.

1988-01-01

A low- to high-grade transition area in Dharmapuri district was investigated petrologically and geochemically. The investigation confirmed the presence of a continuous section through a former lower crust, with felsic charnockites predominating the lower part and felsic gneisses the upper part. The structure of original gneisses is preserved in charnockites and the latter show petrographic evidence for prograde metamorphism. The prograde metamorphism is of isochemical nature as revealed by the similarity of compositions of tonalitic gneisses and tonalitic charnockites. However, the depletion of LIL elements particularly Rb, caused variation in K/Rb ratios from low values (345) in the gneisses in upper part to higher values (1775) in the charnockites in the lower crust. This variation in K/Rb ratio in a north to south traverse is related to the progressive break-down of hydrous minerals under decreasing H2O and increasing CO2 fluid conditions. Metasomatism and partial melting has also taken place to a limited extent along shear planes and weak zones. During cooling the H2O circulation affected substantial auto-regression in the transition zone resulting in the formation of second generation biotite.
Meta-Modeling by Symbolic Regression and Pareto Simulated Annealing

NARCIS (Netherlands)

Stinstra, E.; Rennen, G.; Teeuwen, G.J.A.

2006-01-01

The subject of this paper is a new approach to Symbolic Regression.Other publications on Symbolic Regression use Genetic Programming.This paper describes an alternative method based on Pareto Simulated Annealing.Our method is based on linear regression for the estimation of constants.Interval
Using Regression Equations Built from Summary Data in the Psychological Assessment of the Individual Case: Extension to Multiple Regression

Science.gov (United States)

Crawford, John R.; Garthwaite, Paul H.; Denham, Annie K.; Chelune, Gordon J.

2012-01-01

Regression equations have many useful roles in psychological assessment. Moreover, there is a large reservoir of published data that could be used to build regression equations; these equations could then be employed to test a wide variety of hypotheses concerning the functioning of individual cases. This resource is currently underused because…
Mixed-effects regression models in linguistics

CERN Document Server

Heylen, Kris; Geeraerts, Dirk

2018-01-01

When data consist of grouped observations or clusters, and there is a risk that measurements within the same group are not independent, group-specific random effects can be added to a regression model in order to account for such within-group associations. Regression models that contain such group-specific random effects are called mixed-effects regression models, or simply mixed models. Mixed models are a versatile tool that can handle both balanced and unbalanced datasets and that can also be applied when several layers of grouping are present in the data; these layers can either be nested or crossed. In linguistics, as in many other fields, the use of mixed models has gained ground rapidly over the last decade. This methodological evolution enables us to build more sophisticated and arguably more realistic models, but, due to its technical complexity, also introduces new challenges. This volume brings together a number of promising new evolutions in the use of mixed models in linguistics, but also addres...
Principles of Quantile Regression and an Application

Science.gov (United States)

Chen, Fang; Chalhoub-Deville, Micheline

2014-01-01

Newer statistical procedures are typically introduced to help address the limitations of those already in practice or to deal with emerging research needs. Quantile regression (QR) is introduced in this paper as a relatively new methodology, which is intended to overcome some of the limitations of least squares mean regression (LMR). QR is more…
Functional data analysis of generalized regression quantiles

KAUST Repository

Guo, Mengmeng; Zhou, Lan; Huang, Jianhua Z.; Hä rdle, Wolfgang Karl

2013-01-01

Generalized regression quantiles, including the conditional quantiles and expectiles as special cases, are useful alternatives to the conditional means for characterizing a conditional distribution, especially when the interest lies in the tails. We develop a functional data analysis approach to jointly estimate a family of generalized regression quantiles. Our approach assumes that the generalized regression quantiles share some common features that can be summarized by a small number of principal component functions. The principal component functions are modeled as splines and are estimated by minimizing a penalized asymmetric loss measure. An iterative least asymmetrically weighted squares algorithm is developed for computation. While separate estimation of individual generalized regression quantiles usually suffers from large variability due to lack of sufficient data, by borrowing strength across data sets, our joint estimation approach significantly improves the estimation efficiency, which is demonstrated in a simulation study. The proposed method is applied to data from 159 weather stations in China to obtain the generalized quantile curves of the volatility of the temperature at these stations. © 2013 Springer Science+Business Media New York.
Functional data analysis of generalized regression quantiles

KAUST Repository

Guo, Mengmeng

2013-11-05

Generalized regression quantiles, including the conditional quantiles and expectiles as special cases, are useful alternatives to the conditional means for characterizing a conditional distribution, especially when the interest lies in the tails. We develop a functional data analysis approach to jointly estimate a family of generalized regression quantiles. Our approach assumes that the generalized regression quantiles share some common features that can be summarized by a small number of principal component functions. The principal component functions are modeled as splines and are estimated by minimizing a penalized asymmetric loss measure. An iterative least asymmetrically weighted squares algorithm is developed for computation. While separate estimation of individual generalized regression quantiles usually suffers from large variability due to lack of sufficient data, by borrowing strength across data sets, our joint estimation approach significantly improves the estimation efficiency, which is demonstrated in a simulation study. The proposed method is applied to data from 159 weather stations in China to obtain the generalized quantile curves of the volatility of the temperature at these stations. © 2013 Springer Science+Business Media New York.
Simple and multiple linear regression: sample size considerations.

Science.gov (United States)

Hanley, James A

2016-11-01

The suggested "two subjects per variable" (2SPV) rule of thumb in the Austin and Steyerberg article is a chance to bring out some long-established and quite intuitive sample size considerations for both simple and multiple linear regression. This article distinguishes two of the major uses of regression models that imply very different sample size considerations, neither served well by the 2SPV rule. The first is etiological research, which contrasts mean Y levels at differing "exposure" (X) values and thus tends to focus on a single regression coefficient, possibly adjusted for confounders. The second research genre guides clinical practice. It addresses Y levels for individuals with different covariate patterns or "profiles." It focuses on the profile-specific (mean) Y levels themselves, estimating them via linear compounds of regression coefficients and covariates. By drawing on long-established closed-form variance formulae that lie beneath the standard errors in multiple regression, and by rearranging them for heuristic purposes, one arrives at quite intuitive sample size considerations for both research genres. Copyright Â© 2016 Elsevier Inc. All rights reserved.
Removing Malmquist bias from linear regressions

Science.gov (United States)

Verter, Frances

1993-01-01

Malmquist bias is present in all astronomical surveys where sources are observed above an apparent brightness threshold. Those sources which can be detected at progressively larger distances are progressively more limited to the intrinsically luminous portion of the true distribution. This bias does not distort any of the measurements, but distorts the sample composition. We have developed the first treatment to correct for Malmquist bias in linear regressions of astronomical data. A demonstration of the corrected linear regression that is computed in four steps is presented.
The microcomputer scientific software series 2: general linear model--regression.

Science.gov (United States)

Harold M. Rauscher

1983-01-01

The general linear model regression (GLMR) program provides the microcomputer user with a sophisticated regression analysis capability. The output provides a regression ANOVA table, estimators of the regression model coefficients, their confidence intervals, confidence intervals around the predicted Y-values, residuals for plotting, a check for multicollinearity, a...
RAWS II: A MULTIPLE REGRESSION ANALYSIS PROGRAM,

Science.gov (United States)

This memorandum gives instructions for the use and operation of a revised version of RAWS, a multiple regression analysis program. The program...of preprocessed data, the directed retention of variable, listing of the matrix of the normal equations and its inverse, and the bypassing of the regression analysis to provide the input variable statistics only. (Author)
Sedimentary architecture and depositional controls of a Pliocene river-dominated delta in the semi-isolated Dacian Basin, Black Sea

Science.gov (United States)

Jorissen, Elisabeth L.; de Leeuw, Arjan; van Baak, Christiaan G. C.; Mandic, Oleg; Stoica, Marius; Abels, Hemmo A.; Krijgsman, Wout

2018-06-01

Sedimentological facies models for (semi-)isolated basins are less well developed than those for marine environments, but are critical for our understanding of both present-day and ancient deltaic sediment records in restricted depositional environments. This study considers an 835 m thick sedimentary succession of mid-Pliocene age, which accumulated in the Dacian Basin, a former embayment of the Black Sea. Detailed sedimentological and palaeontological analyses reveal a regression from distal prodelta deposits with brackish water faunas to delta-top deposits with freshwater faunas. Sediments contain frequent hyperpycnal plumes and an enrichment in terrestrial organic material, ichnofossils and in situ brackish and freshwater faunas. Deltaic progradation created thin, sharply-based sand bodies formed by multiple terminal distributary channels, covering a wide depositional area. The system experienced frequent delta-lobe switching, resulting in numerous thin parasequences. Parasequences are overlain by erosive reddish oxidized sand beds, enriched in broken, abraded brackish and freshwater shells. These beds were formed after sediment starvation, on top of abandoned delta lobes during each flooding event. A robust magnetostratigraphic time frame allowed for comparison between the observed sedimentary cyclicity and the amplitude and frequency of astronomical forcing cycles. Our results indicate that parasequence frequencies are significantly higher than the number of time equivalent astronomical cycles. This suggests that delta-lobe switching was due to autogenic processes. We consider the observed facies architecture typical for a delta prograding on a low-gradient slope into a shallow, brackish, protected, semi-isolated basin. Furthermore, in the absence of significant wave and tidal influence, sediment progradation in such a protected depositional setting shaped a delta, strongly river-dominated.
Learning a Nonnegative Sparse Graph for Linear Regression.

Science.gov (United States)

Fang, Xiaozhao; Xu, Yong; Li, Xuelong; Lai, Zhihui; Wong, Wai Keung

2015-09-01

Previous graph-based semisupervised learning (G-SSL) methods have the following drawbacks: 1) they usually predefine the graph structure and then use it to perform label prediction, which cannot guarantee an overall optimum and 2) they only focus on the label prediction or the graph structure construction but are not competent in handling new samples. To this end, a novel nonnegative sparse graph (NNSG) learning method was first proposed. Then, both the label prediction and projection learning were integrated into linear regression. Finally, the linear regression and graph structure learning were unified within the same framework to overcome these two drawbacks. Therefore, a novel method, named learning a NNSG for linear regression was presented, in which the linear regression and graph learning were simultaneously performed to guarantee an overall optimum. In the learning process, the label information can be accurately propagated via the graph structure so that the linear regression can learn a discriminative projection to better fit sample labels and accurately classify new samples. An effective algorithm was designed to solve the corresponding optimization problem with fast convergence. Furthermore, NNSG provides a unified perceptiveness for a number of graph-based learning methods and linear regression methods. The experimental results showed that NNSG can obtain very high classification accuracy and greatly outperforms conventional G-SSL methods, especially some conventional graph construction methods.
Quantile Regression With Measurement Error

KAUST Repository

Wei, Ying

2009-08-27

Regression quantiles can be substantially biased when the covariates are measured with error. In this paper we propose a new method that produces consistent linear quantile estimation in the presence of covariate measurement error. The method corrects the measurement error induced bias by constructing joint estimating equations that simultaneously hold for all the quantile levels. An iterative EM-type estimation algorithm to obtain the solutions to such joint estimation equations is provided. The finite sample performance of the proposed method is investigated in a simulation study, and compared to the standard regression calibration approach. Finally, we apply our methodology to part of the National Collaborative Perinatal Project growth data, a longitudinal study with an unusual measurement error structure. © 2009 American Statistical Association.
Mixed Frequency Data Sampling Regression Models: The R Package midasr

Directory of Open Access Journals (Sweden)

Eric Ghysels

2016-08-01

Full Text Available When modeling economic relationships it is increasingly common to encounter data sampled at different frequencies. We introduce the R package midasr which enables estimating regression models with variables sampled at different frequencies within a MIDAS regression framework put forward in work by Ghysels, Santa-Clara, and Valkanov (2002. In this article we define a general autoregressive MIDAS regression model with multiple variables of different frequencies and show how it can be specified using the familiar R formula interface and estimated using various optimization methods chosen by the researcher. We discuss how to check the validity of the estimated model both in terms of numerical convergence and statistical adequacy of a chosen regression specification, how to perform model selection based on a information criterion, how to assess forecasting accuracy of the MIDAS regression model and how to obtain a forecast aggregation of different MIDAS regression models. We illustrate the capabilities of the package with a simulated MIDAS regression model and give two empirical examples of application of MIDAS regression.
Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications

Directory of Open Access Journals (Sweden)

Guoqi Qian

2016-01-01

Full Text Available Regression clustering is a mixture of unsupervised and supervised statistical learning and data mining method which is found in a wide range of applications including artificial intelligence and neuroscience. It performs unsupervised learning when it clusters the data according to their respective unobserved regression hyperplanes. The method also performs supervised learning when it fits regression hyperplanes to the corresponding data clusters. Applying regression clustering in practice requires means of determining the underlying number of clusters in the data, finding the cluster label of each data point, and estimating the regression coefficients of the model. In this paper, we review the estimation and selection issues in regression clustering with regard to the least squares and robust statistical methods. We also provide a model selection based technique to determine the number of regression clusters underlying the data. We further develop a computing procedure for regression clustering estimation and selection. Finally, simulation studies are presented for assessing the procedure, together with analyzing a real data set on RGB cell marking in neuroscience to illustrate and interpret the method.

Short-term load forecasting with increment regression tree

Energy Technology Data Exchange (ETDEWEB)

Yang, Jingfei; Stenzel, Juergen [Darmstadt University of Techonology, Darmstadt 64283 (Germany)

2006-06-15

This paper presents a new regression tree method for short-term load forecasting. Both increment and non-increment tree are built according to the historical data to provide the data space partition and input variable selection. Support vector machine is employed to the samples of regression tree nodes for further fine regression. Results of different tree nodes are integrated through weighted average method to obtain the comprehensive forecasting result. The effectiveness of the proposed method is demonstrated through its application to an actual system. (author)
Analyzing hospitalization data: potential limitations of Poisson regression.

Science.gov (United States)

Weaver, Colin G; Ravani, Pietro; Oliver, Matthew J; Austin, Peter C; Quinn, Robert R

2015-08-01

Poisson regression is commonly used to analyze hospitalization data when outcomes are expressed as counts (e.g. number of days in hospital). However, data often violate the assumptions on which Poisson regression is based. More appropriate extensions of this model, while available, are rarely used. We compared hospitalization data between 206 patients treated with hemodialysis (HD) and 107 treated with peritoneal dialysis (PD) using Poisson regression and compared results from standard Poisson regression with those obtained using three other approaches for modeling count data: negative binomial (NB) regression, zero-inflated Poisson (ZIP) regression and zero-inflated negative binomial (ZINB) regression. We examined the appropriateness of each model and compared the results obtained with each approach. During a mean 1.9 years of follow-up, 183 of 313 patients (58%) were never hospitalized (indicating an excess of 'zeros'). The data also displayed overdispersion (variance greater than mean), violating another assumption of the Poisson model. Using four criteria, we determined that the NB and ZINB models performed best. According to these two models, patients treated with HD experienced similar hospitalization rates as those receiving PD {NB rate ratio (RR): 1.04 [bootstrapped 95% confidence interval (CI): 0.49-2.20]; ZINB summary RR: 1.21 (bootstrapped 95% CI 0.60-2.46)}. Poisson and ZIP models fit the data poorly and had much larger point estimates than the NB and ZINB models [Poisson RR: 1.93 (bootstrapped 95% CI 0.88-4.23); ZIP summary RR: 1.84 (bootstrapped 95% CI 0.88-3.84)]. We found substantially different results when modeling hospitalization data, depending on the approach used. Our results argue strongly for a sound model selection process and improved reporting around statistical methods used for modeling count data. © The Author 2015. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.
Sparse Regression by Projection and Sparse Discriminant Analysis

KAUST Repository

Qi, Xin; Luo, Ruiyan; Carroll, Raymond J.; Zhao, Hongyu

2015-01-01

predictions. We introduce a new framework, regression by projection, and its sparse version to analyze high-dimensional data. The unique nature of this framework is that the directions of the regression coefficients are inferred first, and the lengths
Assessing risk factors for periodontitis using regression

Science.gov (United States)

Lobo Pereira, J. A.; Ferreira, Maria Cristina; Oliveira, Teresa

2013-10-01

Multivariate statistical analysis is indispensable to assess the associations and interactions between different factors and the risk of periodontitis. Among others, regression analysis is a statistical technique widely used in healthcare to investigate and model the relationship between variables. In our work we study the impact of socio-demographic, medical and behavioral factors on periodontal health. Using regression, linear and logistic models, we can assess the relevance, as risk factors for periodontitis disease, of the following independent variables (IVs): Age, Gender, Diabetic Status, Education, Smoking status and Plaque Index. The multiple linear regression analysis model was built to evaluate the influence of IVs on mean Attachment Loss (AL). Thus, the regression coefficients along with respective p-values will be obtained as well as the respective p-values from the significance tests. The classification of a case (individual) adopted in the logistic model was the extent of the destruction of periodontal tissues defined by an Attachment Loss greater than or equal to 4 mm in 25% (AL≥4mm/≥25%) of sites surveyed. The association measures include the Odds Ratios together with the correspondent 95% confidence intervals.
Piecewise linear regression splines with hyperbolic covariates

International Nuclear Information System (INIS)

Cologne, John B.; Sposto, Richard

1992-09-01

Consider the problem of fitting a curve to data that exhibit a multiphase linear response with smooth transitions between phases. We propose substituting hyperbolas as covariates in piecewise linear regression splines to obtain curves that are smoothly joined. The method provides an intuitive and easy way to extend the two-phase linear hyperbolic response model of Griffiths and Miller and Watts and Bacon to accommodate more than two linear segments. The resulting regression spline with hyperbolic covariates may be fit by nonlinear regression methods to estimate the degree of curvature between adjoining linear segments. The added complexity of fitting nonlinear, as opposed to linear, regression models is not great. The extra effort is particularly worthwhile when investigators are unwilling to assume that the slope of the response changes abruptly at the join points. We can also estimate the join points (the values of the abscissas where the linear segments would intersect if extrapolated) if their number and approximate locations may be presumed known. An example using data on changing age at menarche in a cohort of Japanese women illustrates the use of the method for exploratory data analysis. (author)
Support Vector Regression Model Based on Empirical Mode Decomposition and Auto Regression for Electric Load Forecasting

Directory of Open Access Journals (Sweden)

Hong-Juan Li

2013-04-01

Full Text Available Electric load forecasting is an important issue for a power utility, associated with the management of daily operations such as energy transfer scheduling, unit commitment, and load dispatch. Inspired by strong non-linear learning capability of support vector regression (SVR, this paper presents a SVR model hybridized with the empirical mode decomposition (EMD method and auto regression (AR for electric load forecasting. The electric load data of the New South Wales (Australia market are employed for comparing the forecasting performances of different forecasting models. The results confirm the validity of the idea that the proposed model can simultaneously provide forecasting with good accuracy and interpretability.
A gentle introduction to quantile regression for ecologists

Science.gov (United States)

Cade, B.S.; Noon, B.R.

2003-01-01

Quantile regression is a way to estimate the conditional quantiles of a response variable distribution in the linear model that provides a more complete view of possible causal relationships between variables in ecological processes. Typically, all the factors that affect ecological processes are not measured and included in the statistical models used to investigate relationships between variables associated with those processes. As a consequence, there may be a weak or no predictive relationship between the mean of the response variable (y) distribution and the measured predictive factors (X). Yet there may be stronger, useful predictive relationships with other parts of the response variable distribution. This primer relates quantile regression estimates to prediction intervals in parametric error distribution regression models (eg least squares), and discusses the ordering characteristics, interval nature, sampling variation, weighting, and interpretation of the estimates for homogeneous and heterogeneous regression models.
Complex regression Doppler optical coherence tomography

Science.gov (United States)

Elahi, Sahar; Gu, Shi; Thrane, Lars; Rollins, Andrew M.; Jenkins, Michael W.

2018-04-01

We introduce a new method to measure Doppler shifts more accurately and extend the dynamic range of Doppler optical coherence tomography (OCT). The two-point estimate of the conventional Doppler method is replaced with a regression that is applied to high-density B-scans in polar coordinates. We built a high-speed OCT system using a 1.68-MHz Fourier domain mode locked laser to acquire high-density B-scans (16,000 A-lines) at high enough frame rates (˜100 fps) to accurately capture the dynamics of the beating embryonic heart. Flow phantom experiments confirm that the complex regression lowers the minimum detectable velocity from 12.25 mm / s to 374 μm / s, whereas the maximum velocity of 400 mm / s is measured without phase wrapping. Complex regression Doppler OCT also demonstrates higher accuracy and precision compared with the conventional method, particularly when signal-to-noise ratio is low. The extended dynamic range allows monitoring of blood flow over several stages of development in embryos without adjusting the imaging parameters. In addition, applying complex averaging recovers hidden features in structural images.
Image superresolution using support vector regression.

Science.gov (United States)

Ni, Karl S; Nguyen, Truong Q

2007-06-01

A thorough investigation of the application of support vector regression (SVR) to the superresolution problem is conducted through various frameworks. Prior to the study, the SVR problem is enhanced by finding the optimal kernel. This is done by formulating the kernel learning problem in SVR form as a convex optimization problem, specifically a semi-definite programming (SDP) problem. An additional constraint is added to reduce the SDP to a quadratically constrained quadratic programming (QCQP) problem. After this optimization, investigation of the relevancy of SVR to superresolution proceeds with the possibility of using a single and general support vector regression for all image content, and the results are impressive for small training sets. This idea is improved upon by observing structural properties in the discrete cosine transform (DCT) domain to aid in learning the regression. Further improvement involves a combination of classification and SVR-based techniques, extending works in resolution synthesis. This method, termed kernel resolution synthesis, uses specific regressors for isolated image content to describe the domain through a partitioned look of the vector space, thereby yielding good results.
Refractive regression after laser in situ keratomileusis.

Science.gov (United States)

Yan, Mabel K; Chang, John Sm; Chan, Tommy Cy

2018-04-26

Uncorrected refractive errors are a leading cause of visual impairment across the world. In today's society, laser in situ keratomileusis (LASIK) has become the most commonly performed surgical procedure to correct refractive errors. However, regression of the initially achieved refractive correction has been a widely observed phenomenon following LASIK since its inception more than two decades ago. Despite technological advances in laser refractive surgery and various proposed management strategies, post-LASIK regression is still frequently observed and has significant implications for the long-term visual performance and quality of life of patients. This review explores the mechanism of refractive regression after both myopic and hyperopic LASIK, predisposing risk factors and its clinical course. In addition, current preventative strategies and therapies are also reviewed. © 2018 Royal Australian and New Zealand College of Ophthalmologists.
AN APPLICATION OF FUNCTIONAL MULTIVARIATE REGRESSION MODEL TO MULTICLASS CLASSIFICATION

OpenAIRE

Krzyśko, Mirosław; Smaga, Łukasz

2017-01-01

In this paper, the scale response functional multivariate regression model is considered. By using the basis functions representation of functional predictors and regression coefficients, this model is rewritten as a multivariate regression model. This representation of the functional multivariate regression model is used for multiclass classification for multivariate functional data. Computational experiments performed on real labelled data sets demonstrate the effectiveness of the proposed ...
Robust median estimator in logisitc regression

Czech Academy of Sciences Publication Activity Database

Hobza, T.; Pardo, L.; Vajda, Igor

2008-01-01

Roč. 138, č. 12 (2008), s. 3822-3840 ISSN 0378-3758 R&D Projects: GA MŠk 1M0572 Grant - others:Instituto Nacional de Estadistica (ES) MPO FI - IM3/136; GA MŠk(CZ) MTM 2006-06872 Institutional research plan: CEZ:AV0Z10750506 Keywords : Logistic regression * Median * Robustness * Consistency and asymptotic normality * Morgenthaler * Bianco and Yohai * Croux and Hasellbroeck Subject RIV: BB - Applied Statistics, Operational Research Impact factor: 0.679, year: 2008 http://library.utia.cas.cz/separaty/2008/SI/vajda-robust%20median%20estimator%20in%20logistic%20regression.pdf
Short-term electricity prices forecasting based on support vector regression and Auto-regressive integrated moving average modeling

International Nuclear Information System (INIS)

Che Jinxing; Wang Jianzhou

2010-01-01

In this paper, we present the use of different mathematical models to forecast electricity price under deregulated power. A successful prediction tool of electricity price can help both power producers and consumers plan their bidding strategies. Inspired by that the support vector regression (SVR) model, with the ε-insensitive loss function, admits of the residual within the boundary values of ε-tube, we propose a hybrid model that combines both SVR and Auto-regressive integrated moving average (ARIMA) models to take advantage of the unique strength of SVR and ARIMA models in nonlinear and linear modeling, which is called SVRARIMA. A nonlinear analysis of the time-series indicates the convenience of nonlinear modeling, the SVR is applied to capture the nonlinear patterns. ARIMA models have been successfully applied in solving the residuals regression estimation problems. The experimental results demonstrate that the model proposed outperforms the existing neural-network approaches, the traditional ARIMA models and other hybrid models based on the root mean square error and mean absolute percentage error.
A review and comparison of Bayesian and likelihood-based inferences in beta regression and zero-or-one-inflated beta regression.

Science.gov (United States)

Liu, Fang; Eugenio, Evercita C

2018-04-01

Beta regression is an increasingly popular statistical technique in medical research for modeling of outcomes that assume values in (0, 1), such as proportions and patient reported outcomes. When outcomes take values in the intervals [0,1), (0,1], or [0,1], zero-or-one-inflated beta (zoib) regression can be used. We provide a thorough review on beta regression and zoib regression in the modeling, inferential, and computational aspects via the likelihood-based and Bayesian approaches. We demonstrate the statistical and practical importance of correctly modeling the inflation at zero/one rather than ad hoc replacing them with values close to zero/one via simulation studies; the latter approach can lead to biased estimates and invalid inferences. We show via simulation studies that the likelihood-based approach is computationally faster in general than MCMC algorithms used in the Bayesian inferences, but runs the risk of non-convergence, large biases, and sensitivity to starting values in the optimization algorithm especially with clustered/correlated data, data with sparse inflation at zero and one, and data that warrant regularization of the likelihood. The disadvantages of the regular likelihood-based approach make the Bayesian approach an attractive alternative in these cases. Software packages and tools for fitting beta and zoib regressions in both the likelihood-based and Bayesian frameworks are also reviewed.
Prediction of radiation levels in residences: A methodological comparison of CART [Classification and Regression Tree Analysis] and conventional regression

International Nuclear Information System (INIS)

Janssen, I.; Stebbings, J.H.

1990-01-01

In environmental epidemiology, trace and toxic substance concentrations frequently have very highly skewed distributions ranging over one or more orders of magnitude, and prediction by conventional regression is often poor. Classification and Regression Tree Analysis (CART) is an alternative in such contexts. To compare the techniques, two Pennsylvania data sets and three independent variables are used: house radon progeny (RnD) and gamma levels as predicted by construction characteristics in 1330 houses; and ∼200 house radon (Rn) measurements as predicted by topographic parameters. CART may identify structural variables of interest not identified by conventional regression, and vice versa, but in general the regression models are similar. CART has major advantages in dealing with other common characteristics of environmental data sets, such as missing values, continuous variables requiring transformations, and large sets of potential independent variables. CART is most useful in the identification and screening of independent variables, greatly reducing the need for cross-tabulations and nested breakdown analyses. There is no need to discard cases with missing values for the independent variables because surrogate variables are intrinsic to CART. The tree-structured approach is also independent of the scale on which the independent variables are measured, so that transformations are unnecessary. CART identifies important interactions as well as main effects. The major advantages of CART appear to be in exploring data. Once the important variables are identified, conventional regressions seem to lead to results similar but more interpretable by most audiences. 12 refs., 8 figs., 10 tabs
Geologic records of Pleistocene, Holocene and Anthropocene beach profiles?

Science.gov (United States)

Dougherty, Amy; Choi, Jeong-Heon; Dosseto, Anthony

2017-04-01

morphodynamics to interpret paleoenvironmental histories. Data from prograded barriers in North America, New Zealand and Australia are used to illustrate the potential of utilizing GPR, OSL, and LiDAR. Exploiting the fundamental link between paleo-beachfaces and past ocean levels, new sea level curves were constructed by mapping their height over time. Examples from far-field sites capture Eemian and mid-Holocene highstands with a subsequent fall indicating a non-linear nature. The geometry of paleo-beachfaces, intrinsically linked to wave-energy, were analyzed in comparison to present-day beach profile data to extract storm records. The results yielded recurrence intervals with differing coastal impacts, which indicated storm intensity increased as frequency decreased. Volumes of the barrier lithesome were quantified to provide insight on sediment supply and accommodation space over time. Findings show sand supply increased drastically starting in the mid-19th century causing a shift in foredune evolution from previous millennia. Do anomalous foredunes define Anthropocene coastal barriers in the geologic record? Global stratigraphic signatures, distinct from Holocene deposits, are needed to formally establish this 'Human' Epoch. Applying this novel methodology to the more than 300 prograded barriers around the world, including 50+ in Europe, can: 1) augment traditional proxy from ice and sediment cores to help delineate the Anthropocene, 2) determine changes in coastlines since the onset of global warming, and 3) provide insight, and input to forecasting models, needed to mitigate and manage future impacts of climate change.
Multiple linear regression analysis

Science.gov (United States)

Edwards, T. R.

1980-01-01

Program rapidly selects best-suited set of coefficients. User supplies only vectors of independent and dependent data and specifies confidence level required. Program uses stepwise statistical procedure for relating minimal set of variables to set of observations; final regression contains only most statistically significant coefficients. Program is written in FORTRAN IV for batch execution and has been implemented on NOVA 1200.
Spatial correlation in Bayesian logistic regression with misclassification

DEFF Research Database (Denmark)

Bihrmann, Kristine; Toft, Nils; Nielsen, Søren Saxmose

2014-01-01

Standard logistic regression assumes that the outcome is measured perfectly. In practice, this is often not the case, which could lead to biased estimates if not accounted for. This study presents Bayesian logistic regression with adjustment for misclassification of the outcome applied to data...
Quantile Regression With Measurement Error

KAUST Repository

Wei, Ying; Carroll, Raymond J.

2009-01-01

. The finite sample performance of the proposed method is investigated in a simulation study, and compared to the standard regression calibration approach. Finally, we apply our methodology to part of the National Collaborative Perinatal Project growth data, a
Developmental regression in autism: research and conceptual questions

Directory of Open Access Journals (Sweden)

Carolina Lampreia

2013-11-01

Full Text Available The subject of developmental regression in autism has gained importance and a growing number of studies have been conducted in recent years. It is a major issue indicating that there is not a unique form of autism onset. However the phenomenon itself and the concept of regression have been the subject of some debate: there is no consensus on the existence of regression, as there is no consensus on its definition. The aim of this paper is to review the research literature in this area and to introduce some conceptual questions about its existence and its definition.

General Nature of Multicollinearity in Multiple Regression Analysis.

Science.gov (United States)

Liu, Richard

1981-01-01

Discusses multiple regression, a very popular statistical technique in the field of education. One of the basic assumptions in regression analysis requires that independent variables in the equation should not be highly correlated. The problem of multicollinearity and some of the solutions to it are discussed. (Author)
Extensions of Morse-Smale Regression with Application to Actuarial Science

OpenAIRE

Farrelly, Colleen M.

2017-01-01

The problem of subgroups is ubiquitous in scientific research (ex. disease heterogeneity, spatial distributions in ecology...), and piecewise regression is one way to deal with this phenomenon. Morse-Smale regression offers a way to partition the regression function based on level sets of a defined function and that function's basins of attraction. This topologically-based piecewise regression algorithm has shown promise in its initial applications, but the current implementation in the liter...
A Powerful Test for Comparing Multiple Regression Functions.

Science.gov (United States)

Maity, Arnab

2012-09-01

In this article, we address the important problem of comparison of two or more population regression functions. Recently, Pardo-Fernández, Van Keilegom and González-Manteiga (2007) developed test statistics for simple nonparametric regression models: Y(ij) = θ(j)(Z(ij)) + σ(j)(Z(ij))∊(ij), based on empirical distributions of the errors in each population j = 1, … , J. In this paper, we propose a test for equality of the θ(j)(·) based on the concept of generalized likelihood ratio type statistics. We also generalize our test for other nonparametric regression setups, e.g, nonparametric logistic regression, where the loglikelihood for population j is any general smooth function [Formula: see text]. We describe a resampling procedure to obtain the critical values of the test. In addition, we present a simulation study to evaluate the performance of the proposed test and compare our results to those in Pardo-Fernández et al. (2007).
Marginal longitudinal semiparametric regression via penalized splines

KAUST Repository

Al Kadiri, M.

2010-08-01

We study the marginal longitudinal nonparametric regression problem and some of its semiparametric extensions. We point out that, while several elaborate proposals for efficient estimation have been proposed, a relative simple and straightforward one, based on penalized splines, has not. After describing our approach, we then explain how Gibbs sampling and the BUGS software can be used to achieve quick and effective implementation. Illustrations are provided for nonparametric regression and additive models.
Marginal longitudinal semiparametric regression via penalized splines

KAUST Repository

Al Kadiri, M.; Carroll, R.J.; Wand, M.P.

2010-01-01

We study the marginal longitudinal nonparametric regression problem and some of its semiparametric extensions. We point out that, while several elaborate proposals for efficient estimation have been proposed, a relative simple and straightforward one, based on penalized splines, has not. After describing our approach, we then explain how Gibbs sampling and the BUGS software can be used to achieve quick and effective implementation. Illustrations are provided for nonparametric regression and additive models.
Fuzzy multiple linear regression: A computational approach

Science.gov (United States)

Juang, C. H.; Huang, X. H.; Fleming, J. W.

1992-01-01

This paper presents a new computational approach for performing fuzzy regression. In contrast to Bardossy's approach, the new approach, while dealing with fuzzy variables, closely follows the conventional regression technique. In this approach, treatment of fuzzy input is more 'computational' than 'symbolic.' The following sections first outline the formulation of the new approach, then deal with the implementation and computational scheme, and this is followed by examples to illustrate the new procedure.
Alternative regression models to assess increase in childhood BMI

OpenAIRE

Beyerlein, Andreas; Fahrmeir, Ludwig; Mansmann, Ulrich; Toschke, André M

2008-01-01

Abstract Background Body mass index (BMI) data usually have skewed distributions, for which common statistical modeling approaches such as simple linear or logistic regression have limitations. Methods Different regression approaches to predict childhood BMI by goodness-of-fit measures and means of interpretation were compared including generalized linear models (GLMs), quantile regression and Generalized Additive Models for Location, Scale and Shape (GAMLSS). We analyzed data of 4967 childre...
Multivariate Regression Analysis and Slaughter Livestock,

Science.gov (United States)

AGRICULTURE, *ECONOMICS), (*MEAT, PRODUCTION), MULTIVARIATE ANALYSIS, REGRESSION ANALYSIS , ANIMALS, WEIGHT, COSTS, PREDICTIONS, STABILITY, MATHEMATICAL MODELS, STORAGE, BEEF, PORK, FOOD, STATISTICAL DATA, ACCURACY
Predicting Word Reading Ability: A Quantile Regression Study

Science.gov (United States)

McIlraith, Autumn L.

2018-01-01

Predictors of early word reading are well established. However, it is unclear if these predictors hold for readers across a range of word reading abilities. This study used quantile regression to investigate predictive relationships at different points in the distribution of word reading. Quantile regression analyses used preschool and…
The Use of Nonparametric Kernel Regression Methods in Econometric Production Analysis

DEFF Research Database (Denmark)

Czekaj, Tomasz Gerard

and nonparametric estimations of production functions in order to evaluate the optimal firm size. The second paper discusses the use of parametric and nonparametric regression methods to estimate panel data regression models. The third paper analyses production risk, price uncertainty, and farmers' risk preferences...... within a nonparametric panel data regression framework. The fourth paper analyses the technical efficiency of dairy farms with environmental output using nonparametric kernel regression in a semiparametric stochastic frontier analysis. The results provided in this PhD thesis show that nonparametric......This PhD thesis addresses one of the fundamental problems in applied econometric analysis, namely the econometric estimation of regression functions. The conventional approach to regression analysis is the parametric approach, which requires the researcher to specify the form of the regression...
Cenozoic global sea level, sequences, and the New Jersey transect: Results from coastal plain and continental slope drilling

Science.gov (United States)

Miller, K.G.; Mountain, Gregory S.; Browning, J.V.; Kominz, M.; Sugarman, P.J.; Christie-Blick, N.; Katz, M.E.; Wright, J.D.

1998-01-01

The New Jersey Sea Level Transect was designed to evaluate the relationships among global sea level (eustatic) change, unconformity-bounded sequences, and variations in subsidence, sediment supply, and climate on a passive continental margin. By sampling and dating Cenozoic strata from coastal plain and continental slope locations, we show that sequence boundaries correlate (within ??0.5 myr) regionally (onshore-offshore) and interregionally (New Jersey-Alabama-Bahamas), implicating a global cause. Sequence boundaries correlate with ??18O increases for at least the past 42 myr, consistent with an ice volume (glacioeustatic) control, although a causal relationship is not required because of uncertainties in ages and correlations. Evidence for a causal connection is provided by preliminary Miocene data from slope Site 904 that directly link ??18O increases with sequence boundaries. We conclude that variation in the size of ice sheets has been a primary control on the formation of sequence boundaries since ~42 Ma. We speculate that prior to this, the growth and decay of small ice sheets caused small-amplitude sea level changes (changes on mid-ocean ridges. Although our results are consistent with the general number and timing of Paleocene to middle Miocene sequences published by workers at Exxon Production Research Company, our estimates of sea level amplitudes are substantially lower than theirs. Lithofacies patterns within sequences follow repetitive, predictable patterns: (1) coastal plain sequences consist of basal transgressive sands overlain by regressive highstand silts and quartz sands; and (2) although slope lithofacies variations are subdued, reworked sediments constitute lowstand deposits, causing the strongest, most extensive seismic reflections. Despite a primary eustatic control on sequence boundaries, New Jersey sequences were also influenced by changes in tectonics, sediment supply, and climate. During the early to middle Eocene, low siliciclastic and
Logistic regression for risk factor modelling in stuttering research.

Science.gov (United States)

Reed, Phil; Wu, Yaqionq

2013-06-01

To outline the uses of logistic regression and other statistical methods for risk factor analysis in the context of research on stuttering. The principles underlying the application of a logistic regression are illustrated, and the types of questions to which such a technique has been applied in the stuttering field are outlined. The assumptions and limitations of the technique are discussed with respect to existing stuttering research, and with respect to formulating appropriate research strategies to accommodate these considerations. Finally, some alternatives to the approach are briefly discussed. The way the statistical procedures are employed are demonstrated with some hypothetical data. Research into several practical issues concerning stuttering could benefit if risk factor modelling were used. Important examples are early diagnosis, prognosis (whether a child will recover or persist) and assessment of treatment outcome. After reading this article you will: (a) Summarize the situations in which logistic regression can be applied to a range of issues about stuttering; (b) Follow the steps in performing a logistic regression analysis; (c) Describe the assumptions of the logistic regression technique and the precautions that need to be checked when it is employed; (d) Be able to summarize its advantages over other techniques like estimation of group differences and simple regression. Copyright © 2012 Elsevier Inc. All rights reserved.
Interpreting Bivariate Regression Coefficients: Going beyond the Average

Science.gov (United States)

Halcoussis, Dennis; Phillips, G. Michael

2010-01-01

Statistics, econometrics, investment analysis, and data analysis classes often review the calculation of several types of averages, including the arithmetic mean, geometric mean, harmonic mean, and various weighted averages. This note shows how each of these can be computed using a basic regression framework. By recognizing when a regression model…
Application of Negative Binomial Regression for Assessing Public ...

African Journals Online (AJOL)

Because the variance was nearly two times greater than the mean, the negative binomial regression model provided an improved fit to the data and accounted better for overdispersion than the Poisson regression model, which assumed that the mean and variance are the same. The level of education and race were found
Advanced colorectal neoplasia risk stratification by penalized logistic regression.

Science.gov (United States)

Lin, Yunzhi; Yu, Menggang; Wang, Sijian; Chappell, Richard; Imperiale, Thomas F

2016-08-01

Colorectal cancer is the second leading cause of death from cancer in the United States. To facilitate the efficiency of colorectal cancer screening, there is a need to stratify risk for colorectal cancer among the 90% of US residents who are considered "average risk." In this article, we investigate such risk stratification rules for advanced colorectal neoplasia (colorectal cancer and advanced, precancerous polyps). We use a recently completed large cohort study of subjects who underwent a first screening colonoscopy. Logistic regression models have been used in the literature to estimate the risk of advanced colorectal neoplasia based on quantifiable risk factors. However, logistic regression may be prone to overfitting and instability in variable selection. Since most of the risk factors in our study have several categories, it was tempting to collapse these categories into fewer risk groups. We propose a penalized logistic regression method that automatically and simultaneously selects variables, groups categories, and estimates their coefficients by penalizing the [Formula: see text]-norm of both the coefficients and their differences. Hence, it encourages sparsity in the categories, i.e. grouping of the categories, and sparsity in the variables, i.e. variable selection. We apply the penalized logistic regression method to our data. The important variables are selected, with close categories simultaneously grouped, by penalized regression models with and without the interactions terms. The models are validated with 10-fold cross-validation. The receiver operating characteristic curves of the penalized regression models dominate the receiver operating characteristic curve of naive logistic regressions, indicating a superior discriminative performance. © The Author(s) 2013.
Impact of multicollinearity on small sample hydrologic regression models

Science.gov (United States)

Kroll, Charles N.; Song, Peter

2013-06-01

Often hydrologic regression models are developed with ordinary least squares (OLS) procedures. The use of OLS with highly correlated explanatory variables produces multicollinearity, which creates highly sensitive parameter estimators with inflated variances and improper model selection. It is not clear how to best address multicollinearity in hydrologic regression models. Here a Monte Carlo simulation is developed to compare four techniques to address multicollinearity: OLS, OLS with variance inflation factor screening (VIF), principal component regression (PCR), and partial least squares regression (PLS). The performance of these four techniques was observed for varying sample sizes, correlation coefficients between the explanatory variables, and model error variances consistent with hydrologic regional regression models. The negative effects of multicollinearity are magnified at smaller sample sizes, higher correlations between the variables, and larger model error variances (smaller R2). The Monte Carlo simulation indicates that if the true model is known, multicollinearity is present, and the estimation and statistical testing of regression parameters are of interest, then PCR or PLS should be employed. If the model is unknown, or if the interest is solely on model predictions, is it recommended that OLS be employed since using more complicated techniques did not produce any improvement in model performance. A leave-one-out cross-validation case study was also performed using low-streamflow data sets from the eastern United States. Results indicate that OLS with stepwise selection generally produces models across study regions with varying levels of multicollinearity that are as good as biased regression techniques such as PCR and PLS.
Alternative regression models to assess increase in childhood BMI.

Science.gov (United States)

Beyerlein, Andreas; Fahrmeir, Ludwig; Mansmann, Ulrich; Toschke, André M

2008-09-08

Body mass index (BMI) data usually have skewed distributions, for which common statistical modeling approaches such as simple linear or logistic regression have limitations. Different regression approaches to predict childhood BMI by goodness-of-fit measures and means of interpretation were compared including generalized linear models (GLMs), quantile regression and Generalized Additive Models for Location, Scale and Shape (GAMLSS). We analyzed data of 4967 children participating in the school entry health examination in Bavaria, Germany, from 2001 to 2002. TV watching, meal frequency, breastfeeding, smoking in pregnancy, maternal obesity, parental social class and weight gain in the first 2 years of life were considered as risk factors for obesity. GAMLSS showed a much better fit regarding the estimation of risk factors effects on transformed and untransformed BMI data than common GLMs with respect to the generalized Akaike information criterion. In comparison with GAMLSS, quantile regression allowed for additional interpretation of prespecified distribution quantiles, such as quantiles referring to overweight or obesity. The variables TV watching, maternal BMI and weight gain in the first 2 years were directly, and meal frequency was inversely significantly associated with body composition in any model type examined. In contrast, smoking in pregnancy was not directly, and breastfeeding and parental social class were not inversely significantly associated with body composition in GLM models, but in GAMLSS and partly in quantile regression models. Risk factor specific BMI percentile curves could be estimated from GAMLSS and quantile regression models. GAMLSS and quantile regression seem to be more appropriate than common GLMs for risk factor modeling of BMI data.
Spontaneous regression of metastatic Merkel cell carcinoma.

LENUS (Irish Health Repository)

Hassan, S J

2010-01-01

Merkel cell carcinoma is a rare aggressive neuroendocrine carcinoma of the skin predominantly affecting elderly Caucasians. It has a high rate of local recurrence and regional lymph node metastases. It is associated with a poor prognosis. Complete spontaneous regression of Merkel cell carcinoma has been reported but is a poorly understood phenomenon. Here we present a case of complete spontaneous regression of metastatic Merkel cell carcinoma demonstrating a markedly different pattern of events from those previously published.
Forecasting exchange rates: a robust regression approach

OpenAIRE

Preminger, Arie; Franck, Raphael

2005-01-01

The least squares estimation method as well as other ordinary estimation method for regression models can be severely affected by a small number of outliers, thus providing poor out-of-sample forecasts. This paper suggests a robust regression approach, based on the S-estimation method, to construct forecasting models that are less sensitive to data contamination by outliers. A robust linear autoregressive (RAR) and a robust neural network (RNN) models are estimated to study the predictabil...
Nonparametric instrumental regression with non-convex constraints

International Nuclear Information System (INIS)

Grasmair, M; Scherzer, O; Vanhems, A

2013-01-01

This paper considers the nonparametric regression model with an additive error that is dependent on the explanatory variables. As is common in empirical studies in epidemiology and economics, it also supposes that valid instrumental variables are observed. A classical example in microeconomics considers the consumer demand function as a function of the price of goods and the income, both variables often considered as endogenous. In this framework, the economic theory also imposes shape restrictions on the demand function, such as integrability conditions. Motivated by this illustration in microeconomics, we study an estimator of a nonparametric constrained regression function using instrumental variables by means of Tikhonov regularization. We derive rates of convergence for the regularized model both in a deterministic and stochastic setting under the assumption that the true regression function satisfies a projected source condition including, because of the non-convexity of the imposed constraints, an additional smallness condition. (paper)

Nonparametric instrumental regression with non-convex constraints

Science.gov (United States)

Grasmair, M.; Scherzer, O.; Vanhems, A.

2013-03-01

This paper considers the nonparametric regression model with an additive error that is dependent on the explanatory variables. As is common in empirical studies in epidemiology and economics, it also supposes that valid instrumental variables are observed. A classical example in microeconomics considers the consumer demand function as a function of the price of goods and the income, both variables often considered as endogenous. In this framework, the economic theory also imposes shape restrictions on the demand function, such as integrability conditions. Motivated by this illustration in microeconomics, we study an estimator of a nonparametric constrained regression function using instrumental variables by means of Tikhonov regularization. We derive rates of convergence for the regularized model both in a deterministic and stochastic setting under the assumption that the true regression function satisfies a projected source condition including, because of the non-convexity of the imposed constraints, an additional smallness condition.
Principal component regression for crop yield estimation

CERN Document Server

Suryanarayana, T M V

2016-01-01

This book highlights the estimation of crop yield in Central Gujarat, especially with regard to the development of Multiple Regression Models and Principal Component Regression (PCR) models using climatological parameters as independent variables and crop yield as a dependent variable. It subsequently compares the multiple linear regression (MLR) and PCR results, and discusses the significance of PCR for crop yield estimation. In this context, the book also covers Principal Component Analysis (PCA), a statistical procedure used to reduce a number of correlated variables into a smaller number of uncorrelated variables called principal components (PC). This book will be helpful to the students and researchers, starting their works on climate and agriculture, mainly focussing on estimation models. The flow of chapters takes the readers in a smooth path, in understanding climate and weather and impact of climate change, and gradually proceeds towards downscaling techniques and then finally towards development of ...
Use of probabilistic weights to enhance linear regression myoelectric control.

Science.gov (United States)

Smith, Lauren H; Kuiken, Todd A; Hargrove, Levi J

2015-12-01

Clinically available prostheses for transradial amputees do not allow simultaneous myoelectric control of degrees of freedom (DOFs). Linear regression methods can provide simultaneous myoelectric control, but frequently also result in difficulty with isolating individual DOFs when desired. This study evaluated the potential of using probabilistic estimates of categories of gross prosthesis movement, which are commonly used in classification-based myoelectric control, to enhance linear regression myoelectric control. Gaussian models were fit to electromyogram (EMG) feature distributions for three movement classes at each DOF (no movement, or movement in either direction) and used to weight the output of linear regression models by the probability that the user intended the movement. Eight able-bodied and two transradial amputee subjects worked in a virtual Fitts' law task to evaluate differences in controllability between linear regression and probability-weighted regression for an intramuscular EMG-based three-DOF wrist and hand system. Real-time and offline analyses in able-bodied subjects demonstrated that probability weighting improved performance during single-DOF tasks (p linear regression control. Use of probability weights can improve the ability to isolate individual during linear regression myoelectric control, while maintaining the ability to simultaneously control multiple DOFs.
Physiologic noise regression, motion regression, and TOAST dynamic field correction in complex-valued fMRI time series.

Science.gov (United States)

Hahn, Andrew D; Rowe, Daniel B

2012-02-01

As more evidence is presented suggesting that the phase, as well as the magnitude, of functional MRI (fMRI) time series may contain important information and that there are theoretical drawbacks to modeling functional response in the magnitude alone, removing noise in the phase is becoming more important. Previous studies have shown that retrospective correction of noise from physiologic sources can remove significant phase variance and that dynamic main magnetic field correction and regression of estimated motion parameters also remove significant phase fluctuations. In this work, we investigate the performance of physiologic noise regression in a framework along with correction for dynamic main field fluctuations and motion regression. Our findings suggest that including physiologic regressors provides some benefit in terms of reduction in phase noise power, but it is small compared to the benefit of dynamic field corrections and use of estimated motion parameters as nuisance regressors. Additionally, we show that the use of all three techniques reduces phase variance substantially, removes undesirable spatial phase correlations and improves detection of the functional response in magnitude and phase. Copyright © 2011 Elsevier Inc. All rights reserved.
Testing hypotheses for differences between linear regression lines

Science.gov (United States)

Stanley J. Zarnoch

2009-01-01

Five hypotheses are identified for testing differences between simple linear regression lines. The distinctions between these hypotheses are based on a priori assumptions and illustrated with full and reduced models. The contrast approach is presented as an easy and complete method for testing for overall differences between the regressions and for making pairwise...
Targeting: Logistic Regression, Special Cases and Extensions

Directory of Open Access Journals (Sweden)

Helmut Schaeben

2014-12-01

Full Text Available Logistic regression is a classical linear model for logit-transformed conditional probabilities of a binary target variable. It recovers the true conditional probabilities if the joint distribution of predictors and the target is of log-linear form. Weights-of-evidence is an ordinary logistic regression with parameters equal to the differences of the weights of evidence if all predictor variables are discrete and conditionally independent given the target variable. The hypothesis of conditional independence can be tested in terms of log-linear models. If the assumption of conditional independence is violated, the application of weights-of-evidence does not only corrupt the predicted conditional probabilities, but also their rank transform. Logistic regression models, including the interaction terms, can account for the lack of conditional independence, appropriate interaction terms compensate exactly for violations of conditional independence. Multilayer artificial neural nets may be seen as nested regression-like models, with some sigmoidal activation function. Most often, the logistic function is used as the activation function. If the net topology, i.e., its control, is sufficiently versatile to mimic interaction terms, artificial neural nets are able to account for violations of conditional independence and yield very similar results. Weights-of-evidence cannot reasonably include interaction terms; subsequent modifications of the weights, as often suggested, cannot emulate the effect of interaction terms.
Using Dominance Analysis to Determine Predictor Importance in Logistic Regression

Science.gov (United States)

Azen, Razia; Traxel, Nicole

2009-01-01

This article proposes an extension of dominance analysis that allows researchers to determine the relative importance of predictors in logistic regression models. Criteria for choosing logistic regression R[superscript 2] analogues were determined and measures were selected that can be used to perform dominance analysis in logistic regression. A…
A logistic regression estimating function for spatial Gibbs point processes

DEFF Research Database (Denmark)

Baddeley, Adrian; Coeurjolly, Jean-François; Rubak, Ege

We propose a computationally efficient logistic regression estimating function for spatial Gibbs point processes. The sample points for the logistic regression consist of the observed point pattern together with a random pattern of dummy points. The estimating function is closely related to the p......We propose a computationally efficient logistic regression estimating function for spatial Gibbs point processes. The sample points for the logistic regression consist of the observed point pattern together with a random pattern of dummy points. The estimating function is closely related...
Acupuncture and Spontaneous Regression of a Radiculopathic Cervical Herniated Disc

Directory of Open Access Journals (Sweden)

Kim Sung-Ha

2012-06-01

Full Text Available The spontaneous regression of herniated cervical discs is not a well-established phenomenon. However, we encountered a case of a spontaneous regression of a severe radiculopathic herniated cervical disc that was treated with acupuncture, pharmacopuncture, and herb medicine. The symptoms were improved within 12 months of treatment. Magnetic resonance imaging (MRI conducted at that time revealed marked regression of the herniated disc. This case provides an additional example of spontaneous regression of a herniated cervical disc documented by MRI following non-surgical treatment.
Sparse Regression by Projection and Sparse Discriminant Analysis

KAUST Repository

Qi, Xin

2015-04-03

© 2015, © American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America. Recent years have seen active developments of various penalized regression methods, such as LASSO and elastic net, to analyze high-dimensional data. In these approaches, the direction and length of the regression coefficients are determined simultaneously. Due to the introduction of penalties, the length of the estimates can be far from being optimal for accurate predictions. We introduce a new framework, regression by projection, and its sparse version to analyze high-dimensional data. The unique nature of this framework is that the directions of the regression coefficients are inferred first, and the lengths and the tuning parameters are determined by a cross-validation procedure to achieve the largest prediction accuracy. We provide a theoretical result for simultaneous model selection consistency and parameter estimation consistency of our method in high dimension. This new framework is then generalized such that it can be applied to principal components analysis, partial least squares, and canonical correlation analysis. We also adapt this framework for discriminant analysis. Compared with the existing methods, where there is relatively little control of the dependency among the sparse components, our method can control the relationships among the components. We present efficient algorithms and related theory for solving the sparse regression by projection problem. Based on extensive simulations and real data analysis, we demonstrate that our method achieves good predictive performance and variable selection in the regression setting, and the ability to control relationships between the sparse components leads to more accurate classification. In supplementary materials available online, the details of the algorithms and theoretical proofs, and R codes for all simulation studies are provided.
Finding-equal regression method and its application in predication of U resources

International Nuclear Information System (INIS)

Cao Huimo

1995-03-01

The commonly adopted deposit model method in mineral resources predication has two main part: one is model data that show up geological mineralization law for deposit, the other is statistics predication method that accords with characters of the data namely pretty regression method. This kind of regression method may be called finding-equal regression, which is made of the linear regression and distribution finding-equal method. Because distribution finding-equal method is a data pretreatment which accords with advanced mathematical precondition for the linear regression namely equal distribution theory, and this kind of data pretreatment is possible of realization. Therefore finding-equal regression not only can overcome nonlinear limitations, that are commonly occurred in traditional linear regression or other regression and always have no solution, but also can distinguish outliers and eliminate its weak influence, which would usually appeared when Robust regression possesses outlier in independent variables. Thus this newly finding-equal regression stands the best status in all kind of regression methods. Finally, two good examples of U resource quantitative predication are provided
Leadership and regressive group processes: a pilot study.

Science.gov (United States)

Rudden, Marie G; Twemlow, Stuart; Ackerman, Steven

2008-10-01

Various perspectives on leadership within the psychoanalytic, organizational and sociobiological literature are reviewed, with particular attention to research studies in these areas. Hypotheses are offered about what makes an effective leader: her ability to structure tasks well in order to avoid destructive regressions, to make constructive use of the omnipresent regressive energies in group life, and to redirect regressions when they occur. Systematic qualitative observations of three videotaped sessions each from N = 18 medical staff work groups at an urban medical center are discussed, as is the utility of a scale, the Leadership and Group Regressions Scale (LGRS), that attempts to operationalize the hypotheses. Analyzing the tapes qualitatively, it was noteworthy that at times (in N = 6 groups), the nominal leader of the group did not prove to be the actual, working leader. Quantitatively, a significant correlation was seen between leaders' LGRS scores and the group's satisfactory completion of their quantitative goals (p = 0.007) and ability to sustain the goals (p = 0.04), when the score of the person who met criteria for group leadership was used.
On concurvity in nonlinear and nonparametric regression models

Directory of Open Access Journals (Sweden)

Sonia Amodio

2014-12-01

Full Text Available When data are affected by multicollinearity in the linear regression framework, then concurvity will be present in fitting a generalized additive model (GAM. The term concurvity describes nonlinear dependencies among the predictor variables. As collinearity results in inflated variance of the estimated regression coefficients in the linear regression model, the result of the presence of concurvity leads to instability of the estimated coefficients in GAMs. Even if the backfitting algorithm will always converge to a solution, in case of concurvity the final solution of the backfitting procedure in fitting a GAM is influenced by the starting functions. While exact concurvity is highly unlikely, approximate concurvity, the analogue of multicollinearity, is of practical concern as it can lead to upwardly biased estimates of the parameters and to underestimation of their standard errors, increasing the risk of committing type I error. We compare the existing approaches to detect concurvity, pointing out their advantages and drawbacks, using simulated and real data sets. As a result, this paper will provide a general criterion to detect concurvity in nonlinear and non parametric regression models.
Testing for marginal linear effects in quantile regression

KAUST Repository

Wang, Huixia Judy

2017-10-23

The paper develops a new marginal testing procedure to detect significant predictors that are associated with the conditional quantiles of a scalar response. The idea is to fit the marginal quantile regression on each predictor one at a time, and then to base the test on the t-statistics that are associated with the most predictive predictors. A resampling method is devised to calibrate this test statistic, which has non-regular limiting behaviour due to the selection of the most predictive variables. Asymptotic validity of the procedure is established in a general quantile regression setting in which the marginal quantile regression models can be misspecified. Even though a fixed dimension is assumed to derive the asymptotic results, the test proposed is applicable and computationally feasible for large dimensional predictors. The method is more flexible than existing marginal screening test methods based on mean regression and has the added advantage of being robust against outliers in the response. The approach is illustrated by using an application to a human immunodeficiency virus drug resistance data set.
Testing for marginal linear effects in quantile regression

KAUST Repository

Wang, Huixia Judy; McKeague, Ian W.; Qian, Min

2017-01-01

The paper develops a new marginal testing procedure to detect significant predictors that are associated with the conditional quantiles of a scalar response. The idea is to fit the marginal quantile regression on each predictor one at a time, and then to base the test on the t-statistics that are associated with the most predictive predictors. A resampling method is devised to calibrate this test statistic, which has non-regular limiting behaviour due to the selection of the most predictive variables. Asymptotic validity of the procedure is established in a general quantile regression setting in which the marginal quantile regression models can be misspecified. Even though a fixed dimension is assumed to derive the asymptotic results, the test proposed is applicable and computationally feasible for large dimensional predictors. The method is more flexible than existing marginal screening test methods based on mean regression and has the added advantage of being robust against outliers in the response. The approach is illustrated by using an application to a human immunodeficiency virus drug resistance data set.
Stellar atmospheric parameter estimation using Gaussian process regression

Science.gov (United States)

Bu, Yude; Pan, Jingchang

2015-02-01

As is well known, it is necessary to derive stellar parameters from massive amounts of spectral data automatically and efficiently. However, in traditional automatic methods such as artificial neural networks (ANNs) and kernel regression (KR), it is often difficult to optimize the algorithm structure and determine the optimal algorithm parameters. Gaussian process regression (GPR) is a recently developed method that has been proven to be capable of overcoming these difficulties. Here we apply GPR to derive stellar atmospheric parameters from spectra. Through evaluating the performance of GPR on Sloan Digital Sky Survey (SDSS) spectra, Medium resolution Isaac Newton Telescope Library of Empirical Spectra (MILES) spectra, ELODIE spectra and the spectra of member stars of galactic globular clusters, we conclude that GPR can derive stellar parameters accurately and precisely, especially when we use data preprocessed with principal component analysis (PCA). We then compare the performance of GPR with that of several widely used regression methods (ANNs, support-vector regression and KR) and find that with GPR it is easier to optimize structures and parameters and more efficient and accurate to extract atmospheric parameters.
Descriptor Learning via Supervised Manifold Regularization for Multioutput Regression.

Science.gov (United States)

Zhen, Xiantong; Yu, Mengyang; Islam, Ali; Bhaduri, Mousumi; Chan, Ian; Li, Shuo

2017-09-01

Multioutput regression has recently shown great ability to solve challenging problems in both computer vision and medical image analysis. However, due to the huge image variability and ambiguity, it is fundamentally challenging to handle the highly complex input-target relationship of multioutput regression, especially with indiscriminate high-dimensional representations. In this paper, we propose a novel supervised descriptor learning (SDL) algorithm for multioutput regression, which can establish discriminative and compact feature representations to improve the multivariate estimation performance. The SDL is formulated as generalized low-rank approximations of matrices with a supervised manifold regularization. The SDL is able to simultaneously extract discriminative features closely related to multivariate targets and remove irrelevant and redundant information by transforming raw features into a new low-dimensional space aligned to targets. The achieved discriminative while compact descriptor largely reduces the variability and ambiguity for multioutput regression, which enables more accurate and efficient multivariate estimation. We conduct extensive evaluation of the proposed SDL on both synthetic data and real-world multioutput regression tasks for both computer vision and medical image analysis. Experimental results have shown that the proposed SDL can achieve high multivariate estimation accuracy on all tasks and largely outperforms the algorithms in the state of the arts. Our method establishes a novel SDL framework for multioutput regression, which can be widely used to boost the performance in different applications.
A flexible fuzzy regression algorithm for forecasting oil consumption estimation

International Nuclear Information System (INIS)

Azadeh, A.; Khakestani, M.; Saberi, M.

2009-01-01

Oil consumption plays a vital role in socio-economic development of most countries. This study presents a flexible fuzzy regression algorithm for forecasting oil consumption based on standard economic indicators. The standard indicators are annual population, cost of crude oil import, gross domestic production (GDP) and annual oil production in the last period. The proposed algorithm uses analysis of variance (ANOVA) to select either fuzzy regression or conventional regression for future demand estimation. The significance of the proposed algorithm is three fold. First, it is flexible and identifies the best model based on the results of ANOVA and minimum absolute percentage error (MAPE), whereas previous studies consider the best fitted fuzzy regression model based on MAPE or other relative error results. Second, the proposed model may identify conventional regression as the best model for future oil consumption forecasting because of its dynamic structure, whereas previous studies assume that fuzzy regression always provide the best solutions and estimation. Third, it utilizes the most standard independent variables for the regression models. To show the applicability and superiority of the proposed flexible fuzzy regression algorithm the data for oil consumption in Canada, United States, Japan and Australia from 1990 to 2005 are used. The results show that the flexible algorithm provides accurate solution for oil consumption estimation problem. The algorithm may be used by policy makers to accurately foresee the behavior of oil consumption in various regions.
Multicollinearity in Regression Analyses Conducted in Epidemiologic Studies.

Science.gov (United States)

Vatcheva, Kristina P; Lee, MinJae; McCormick, Joseph B; Rahbar, Mohammad H

2016-04-01

The adverse impact of ignoring multicollinearity on findings and data interpretation in regression analysis is very well documented in the statistical literature. The failure to identify and report multicollinearity could result in misleading interpretations of the results. A review of epidemiological literature in PubMed from January 2004 to December 2013, illustrated the need for a greater attention to identifying and minimizing the effect of multicollinearity in analysis of data from epidemiologic studies. We used simulated datasets and real life data from the Cameron County Hispanic Cohort to demonstrate the adverse effects of multicollinearity in the regression analysis and encourage researchers to consider the diagnostic for multicollinearity as one of the steps in regression analysis.
Least-Squares Linear Regression and Schrodinger's Cat: Perspectives on the Analysis of Regression Residuals.

Science.gov (United States)

Hecht, Jeffrey B.

The analysis of regression residuals and detection of outliers are discussed, with emphasis on determining how deviant an individual data point must be to be considered an outlier and the impact that multiple suspected outlier data points have on the process of outlier determination and treatment. Only bivariate (one dependent and one independent)…

Alternative regression models to assess increase in childhood BMI

Directory of Open Access Journals (Sweden)

Mansmann Ulrich

2008-09-01

Full Text Available Abstract Background Body mass index (BMI data usually have skewed distributions, for which common statistical modeling approaches such as simple linear or logistic regression have limitations. Methods Different regression approaches to predict childhood BMI by goodness-of-fit measures and means of interpretation were compared including generalized linear models (GLMs, quantile regression and Generalized Additive Models for Location, Scale and Shape (GAMLSS. We analyzed data of 4967 children participating in the school entry health examination in Bavaria, Germany, from 2001 to 2002. TV watching, meal frequency, breastfeeding, smoking in pregnancy, maternal obesity, parental social class and weight gain in the first 2 years of life were considered as risk factors for obesity. Results GAMLSS showed a much better fit regarding the estimation of risk factors effects on transformed and untransformed BMI data than common GLMs with respect to the generalized Akaike information criterion. In comparison with GAMLSS, quantile regression allowed for additional interpretation of prespecified distribution quantiles, such as quantiles referring to overweight or obesity. The variables TV watching, maternal BMI and weight gain in the first 2 years were directly, and meal frequency was inversely significantly associated with body composition in any model type examined. In contrast, smoking in pregnancy was not directly, and breastfeeding and parental social class were not inversely significantly associated with body composition in GLM models, but in GAMLSS and partly in quantile regression models. Risk factor specific BMI percentile curves could be estimated from GAMLSS and quantile regression models. Conclusion GAMLSS and quantile regression seem to be more appropriate than common GLMs for risk factor modeling of BMI data.
Resting-state functional magnetic resonance imaging: the impact of regression analysis.

Science.gov (United States)

Yeh, Chia-Jung; Tseng, Yu-Sheng; Lin, Yi-Ru; Tsai, Shang-Yueh; Huang, Teng-Yi

2015-01-01

To investigate the impact of regression methods on resting-state functional magnetic resonance imaging (rsfMRI). During rsfMRI preprocessing, regression analysis is considered effective for reducing the interference of physiological noise on the signal time course. However, it is unclear whether the regression method benefits rsfMRI analysis. Twenty volunteers (10 men and 10 women; aged 23.4 ± 1.5 years) participated in the experiments. We used node analysis and functional connectivity mapping to assess the brain default mode network by using five combinations of regression methods. The results show that regressing the global mean plays a major role in the preprocessing steps. When a global regression method is applied, the values of functional connectivity are significantly lower (P ≤ .01) than those calculated without a global regression. This step increases inter-subject variation and produces anticorrelated brain areas. rsfMRI data processed using regression should be interpreted carefully. The significance of the anticorrelated brain areas produced by global signal removal is unclear. Copyright © 2014 by the American Society of Neuroimaging.
Enhancement of Visual Field Predictions with Pointwise Exponential Regression (PER) and Pointwise Linear Regression (PLR).

Science.gov (United States)

Morales, Esteban; de Leon, John Mark S; Abdollahi, Niloufar; Yu, Fei; Nouri-Mahdavi, Kouros; Caprioli, Joseph

2016-03-01

The study was conducted to evaluate threshold smoothing algorithms to enhance prediction of the rates of visual field (VF) worsening in glaucoma. We studied 798 patients with primary open-angle glaucoma and 6 or more years of follow-up who underwent 8 or more VF examinations. Thresholds at each VF location for the first 4 years or first half of the follow-up time (whichever was greater) were smoothed with clusters defined by the nearest neighbor (NN), Garway-Heath, Glaucoma Hemifield Test (GHT), and weighting by the correlation of rates at all other VF locations. Thresholds were regressed with a pointwise exponential regression (PER) model and a pointwise linear regression (PLR) model. Smaller root mean square error (RMSE) values of the differences between the observed and the predicted thresholds at last two follow-ups indicated better model predictions. The mean (SD) follow-up times for the smoothing and prediction phase were 5.3 (1.5) and 10.5 (3.9) years. The mean RMSE values for the PER and PLR models were unsmoothed data, 6.09 and 6.55; NN, 3.40 and 3.42; Garway-Heath, 3.47 and 3.48; GHT, 3.57 and 3.74; and correlation of rates, 3.59 and 3.64. Smoothed VF data predicted better than unsmoothed data. Nearest neighbor provided the best predictions; PER also predicted consistently more accurately than PLR. Smoothing algorithms should be used when forecasting VF results with PER or PLR. The application of smoothing algorithms on VF data can improve forecasting in VF points to assist in treatment decisions.
Regression Models for Repairable Systems

Czech Academy of Sciences Publication Activity Database

Novák, Petr

2015-01-01

Roč. 17, č. 4 (2015), s. 963-972 ISSN 1387-5841 Institutional support: RVO:67985556 Keywords : Reliability analysis * Repair models * Regression Subject RIV: BB - Applied Statistics, Operational Research Impact factor: 0.782, year: 2015 http://library.utia.cas.cz/separaty/2015/SI/novak-0450902.pdf
Survival analysis II: Cox regression

NARCIS (Netherlands)

Stel, Vianda S.; Dekker, Friedo W.; Tripepi, Giovanni; Zoccali, Carmine; Jager, Kitty J.

2011-01-01

In contrast to the Kaplan-Meier method, Cox proportional hazards regression can provide an effect estimate by quantifying the difference in survival between patient groups and can adjust for confounding effects of other variables. The purpose of this article is to explain the basic concepts of the
Spontaneous regression of metastases from malignant melanoma: a case report

DEFF Research Database (Denmark)

Kalialis, Louise V; Drzewiecki, Krzysztof T; Mohammadi, Mahin

2008-01-01

A case of a 61-year-old male with widespread metastatic melanoma is presented 5 years after complete spontaneous cure. Spontaneous regression occurred in cutaneous, pulmonary, hepatic and cerebral metastases. A review of the literature reveals seven cases of regression of cerebral metastases......; this report is the first to document complete spontaneous regression of cerebral metastases from malignant melanoma by means of computed tomography scans. Spontaneous regression is defined as the partial or complete disappearance of a malignant tumour in the absence of all treatment or in the presence...
Robust Regression and its Application in Financial Data Analysis

OpenAIRE

Mansoor Momeni; Mahmoud Dehghan Nayeri; Ali Faal Ghayoumi; Hoda Ghorbani

2010-01-01

This research is aimed to describe the application of robust regression and its advantages over the least square regression method in analyzing financial data. To do this, relationship between earning per share, book value of equity per share and share price as price model and earning per share, annual change of earning per share and return of stock as return model is discussed using both robust and least square regressions, and finally the outcomes are compared. Comparing the results from th...
Linear regression and the normality assumption.

Science.gov (United States)

Schmidt, Amand F; Finan, Chris

2017-12-16

Researchers often perform arbitrary outcome transformations to fulfill the normality assumption of a linear regression model. This commentary explains and illustrates that in large data settings, such transformations are often unnecessary, and worse may bias model estimates. Linear regression assumptions are illustrated using simulated data and an empirical example on the relation between time since type 2 diabetes diagnosis and glycated hemoglobin levels. Simulation results were evaluated on coverage; i.e., the number of times the 95% confidence interval included the true slope coefficient. Although outcome transformations bias point estimates, violations of the normality assumption in linear regression analyses do not. The normality assumption is necessary to unbiasedly estimate standard errors, and hence confidence intervals and P-values. However, in large sample sizes (e.g., where the number of observations per variable is >10) violations of this normality assumption often do not noticeably impact results. Contrary to this, assumptions on, the parametric model, absence of extreme observations, homoscedasticity, and independency of the errors, remain influential even in large sample size settings. Given that modern healthcare research typically includes thousands of subjects focusing on the normality assumption is often unnecessary, does not guarantee valid results, and worse may bias estimates due to the practice of outcome transformations. Copyright © 2017 Elsevier Inc. All rights reserved.
Elliptical multiple-output quantile regression and convex optimization

Czech Academy of Sciences Publication Activity Database

Hallin, M.; Šiman, Miroslav

2016-01-01

Roč. 109, č. 1 (2016), s. 232-237 ISSN 0167-7152 R&D Projects: GA ČR GA14-07234S Institutional support: RVO:67985556 Keywords : quantile regression * elliptical quantile * multivariate quantile * multiple-output regression Subject RIV: BA - General Mathematics Impact factor: 0.540, year: 2016 http://library.utia.cas.cz/separaty/2016/SI/siman-0458243.pdf
Spontaneous regression of retinopathy of prematurity:incidence and predictive factors

Directory of Open Access Journals (Sweden)

Rui-Hong Ju

2013-08-01

Full Text Available AIM:To evaluate the incidence of spontaneous regression of changes in the retina and vitreous in active stage of retinopathy of prematurity(ROP and identify the possible relative factors during the regression.METHODS: This was a retrospective, hospital-based study. The study consisted of 39 premature infants with mild ROP showed spontaneous regression (Group A and 17 with severe ROP who had been treated before naturally involuting (Group B from August 2008 through May 2011. Data on gender, single or multiple pregnancy, gestational age, birth weight, weight gain from birth to the sixth week of life, use of oxygen in mechanical ventilation, total duration of oxygen inhalation, surfactant given or not, need for and times of blood transfusion, 1,5,10-min Apgar score, presence of bacterial or fungal or combined infection, hyaline membrane disease (HMD, patent ductus arteriosus (PDA, duration of stay in the neonatal intensive care unit (NICU and duration of ROP were recorded.RESULTS: The incidence of spontaneous regression of ROP with stage 1 was 86.7%, and with stage 2, stage 3 was 57.1%, 5.9%, respectively. With changes in zone Ⅲ regression was detected 100%, in zoneⅡ 46.2% and in zoneⅠ 0%. The mean duration of ROP in spontaneous regression group was 5.65±3.14 weeks, lower than that of the treated ROP group (7.34±4.33 weeks, but this difference was not statistically significant (P=0.201. GA, 1min Apgar score, 5min Apgar score, duration of NICU stay, postnatal age of initial screening and oxygen therapy longer than 10 days were significant predictive factors for the spontaneous regression of ROP (P＜0.05. Retinal hemorrhage was the only independent predictive factor the spontaneous regression of ROP (OR 0.030, 95%CI 0.001-0.775, P=0.035.CONCLUSION:This study showed most stage 1 and 2 ROP and changes in zone Ⅲ can spontaneously regression in the end. Retinal hemorrhage is weakly inversely associated with the spontaneous regression.
Least Squares Adjustment: Linear and Nonlinear Weighted Regression Analysis

DEFF Research Database (Denmark)

Nielsen, Allan Aasbjerg

2007-01-01

This note primarily describes the mathematics of least squares regression analysis as it is often used in geodesy including land surveying and satellite positioning applications. In these fields regression is often termed adjustment. The note also contains a couple of typical land surveying...... and satellite positioning application examples. In these application areas we are typically interested in the parameters in the model typically 2- or 3-D positions and not in predictive modelling which is often the main concern in other regression analysis applications. Adjustment is often used to obtain...... the clock error) and to obtain estimates of the uncertainty with which the position is determined. Regression analysis is used in many other fields of application both in the natural, the technical and the social sciences. Examples may be curve fitting, calibration, establishing relationships between...
Steep microbial boundstone-dominated plaform margins

NARCIS (Netherlands)

Kenter, J.A.M.; Harris, P.M.; Della Porta, G.P.

2005-01-01

Seaward progradation of several kilometers has been documented mostly for leeward margin low-angle carbonate slope systems with a dominant platform top sediment source. However, steep and high-relief margins fronting deep basins can also prograde and as such are somewhat perplexing. Characteristics
Parameters Estimation of Geographically Weighted Ordinal Logistic Regression (GWOLR) Model

Science.gov (United States)

Zuhdi, Shaifudin; Retno Sari Saputro, Dewi; Widyaningsih, Purnami

2017-06-01

A regression model is the representation of relationship between independent variable and dependent variable. The dependent variable has categories used in the logistic regression model to calculate odds on. The logistic regression model for dependent variable has levels in the logistics regression model is ordinal. GWOLR model is an ordinal logistic regression model influenced the geographical location of the observation site. Parameters estimation in the model needed to determine the value of a population based on sample. The purpose of this research is to parameters estimation of GWOLR model using R software. Parameter estimation uses the data amount of dengue fever patients in Semarang City. Observation units used are 144 villages in Semarang City. The results of research get GWOLR model locally for each village and to know probability of number dengue fever patient categories.
Linear regression crash prediction models : issues and proposed solutions.

Science.gov (United States)

2010-05-01

The paper develops a linear regression model approach that can be applied to : crash data to predict vehicle crashes. The proposed approach involves novice data aggregation : to satisfy linear regression assumptions; namely error structure normality ...
Random regression models for detection of gene by environment interaction

Directory of Open Access Journals (Sweden)

Meuwissen Theo HE

2007-02-01

Full Text Available Abstract Two random regression models, where the effect of a putative QTL was regressed on an environmental gradient, are described. The first model estimates the correlation between intercept and slope of the random regression, while the other model restricts this correlation to 1 or -1, which is expected under a bi-allelic QTL model. The random regression models were compared to a model assuming no gene by environment interactions. The comparison was done with regards to the models ability to detect QTL, to position them accurately and to detect possible QTL by environment interactions. A simulation study based on a granddaughter design was conducted, and QTL were assumed, either by assigning an effect independent of the environment or as a linear function of a simulated environmental gradient. It was concluded that the random regression models were suitable for detection of QTL effects, in the presence and absence of interactions with environmental gradients. Fixing the correlation between intercept and slope of the random regression had a positive effect on power when the QTL effects re-ranked between environments.
Kernel regression with functional response

OpenAIRE

Ferraty, Frédéric; Laksaci, Ali; Tadj, Amel; Vieu, Philippe

2011-01-01

We consider kernel regression estimate when both the response variable and the explanatory one are functional. The rates of uniform almost complete convergence are stated as function of the small ball probability of the predictor and as function of the entropy of the set on which uniformity is obtained.
Establishment of regression dependences. Linear and nonlinear dependences

International Nuclear Information System (INIS)

Onishchenko, A.M.

1994-01-01

The main problems of determination of linear and 19 types of nonlinear regression dependences are completely discussed. It is taken into consideration that total dispersions are the sum of measurement dispersions and parameter variation dispersions themselves. Approaches to all dispersions determination are described. It is shown that the least square fit gives inconsistent estimation for industrial objects and processes. The correction methods by taking into account comparable measurement errors for both variable give an opportunity to obtain consistent estimation for the regression equation parameters. The condition of the correction technique application expediency is given. The technique for determination of nonlinear regression dependences taking into account the dependence form and comparable errors of both variables is described. 6 refs., 1 tab
Depositional evolution of the Melville Bay trough-mouth fan, NW Greenland

Science.gov (United States)

Knutz, Paul; Gregersen, Ulrik

2015-04-01

The continental margin of NW Greenland bordering northern Baffin Bay is characterized by major sediment accumulations, known as Trough-Mouth Fans (TMF). The fan depocentres represent intense sediment dispersal at the terminus of ice streams that during cold climate periods provided major drainage routes of the northern Greenland Ice Sheet into Baffin Bay. The imprint of paleo-icestreams is seen by erosional troughs crossing a >250 km broad shelf region, which caps a series of sedimentary basins containing thick Mesozoic-Tertiary strata packages. This presentation provides an overview of the seismic stratigraphic division, depositional architecture and examples of seismic facies of the Melville Bay TMF using a 5-10 km grid of industry-quality 2D seismic data (TGS). The focus will primarily be on the inception and early stage of glacial fan development. Comparing the present-day topography with the regional geology shows that the paleo-icestreams exploited the Cenozoic infill of former rift basins that are more conducive to erosion than the adjoining ridges and structural highs. The TMF sequence is constructed by a series of progradational seismic units that represent successive steps in location of ice stream terminus and associated depocenters. The slope fronts of the prograding units show abundant signatures of sediment instability and mass-wasting but evidence of along-slope current-driven processes is also recognized presumably linked to interglacial sea level high-stands. The topset of each unit is characterized by planar erosion that merges landward into hummocky positive geometries with low internal reflectivity. These features are generally interpreted as subglacial landforms, e.g. terminal moraines and ice-contact deposits, associated with grounding zone wedges. Unlike the most recent TMF units deposited in front of the present trough, the oldest glacigenic units have built out from a Neogene sediment prism that forms the core of modern shallow-water banks
Radiation regression patterns after cobalt plaque insertion for retinoblastoma

International Nuclear Information System (INIS)

Buys, R.J.; Abramson, D.H.; Ellsworth, R.M.; Haik, B.

1983-01-01

An analysis of 31 eyes of 30 patients who had been treated with cobalt plaques for retinoblastoma disclosed that a type I radiation regression pattern developed in 15 patients; type II, in one patient, and type III, in five patients. Nine patients had a regression pattern characterized by complete destruction of the tumor, the surrounding choroid, and all of the vessels in the area into which the plaque was inserted. This resulting white scar, corresponding to the sclerae only, was classified as a type IV radiation regression pattern. There was no evidence of tumor recurrence in patients with type IV regression patterns, with an average follow-up of 6.5 years, after receiving cobalt plaque therapy. Twenty-nine of these 30 patients had been unsuccessfully treated with at least one other modality (ie, light coagulation, cryotherapy, external beam radiation, or chemotherapy)
Radiation regression patterns after cobalt plaque insertion for retinoblastoma

Energy Technology Data Exchange (ETDEWEB)

Buys, R.J.; Abramson, D.H.; Ellsworth, R.M.; Haik, B.

1983-08-01

An analysis of 31 eyes of 30 patients who had been treated with cobalt plaques for retinoblastoma disclosed that a type I radiation regression pattern developed in 15 patients; type II, in one patient, and type III, in five patients. Nine patients had a regression pattern characterized by complete destruction of the tumor, the surrounding choroid, and all of the vessels in the area into which the plaque was inserted. This resulting white scar, corresponding to the sclerae only, was classified as a type IV radiation regression pattern. There was no evidence of tumor recurrence in patients with type IV regression patterns, with an average follow-up of 6.5 years, after receiving cobalt plaque therapy. Twenty-nine of these 30 patients had been unsuccessfully treated with at least one other modality (ie, light coagulation, cryotherapy, external beam radiation, or chemotherapy).

Thermal Efficiency Degradation Diagnosis Method Using Regression Model

International Nuclear Information System (INIS)

Jee, Chang Hyun; Heo, Gyun Young; Jang, Seok Won; Lee, In Cheol

2011-01-01

This paper proposes an idea for thermal efficiency degradation diagnosis in turbine cycles, which is based on turbine cycle simulation under abnormal conditions and a linear regression model. The correlation between the inputs for representing degradation conditions (normally unmeasured but intrinsic states) and the simulation outputs (normally measured but superficial states) was analyzed with the linear regression model. The regression models can inversely response an associated intrinsic state for a superficial state observed from a power plant. The diagnosis method proposed herein is classified into three processes, 1) simulations for degradation conditions to get measured states (referred as what-if method), 2) development of the linear model correlating intrinsic and superficial states, and 3) determination of an intrinsic state using the superficial states of current plant and the linear regression model (referred as inverse what-if method). The what-if method is to generate the outputs for the inputs including various root causes and/or boundary conditions whereas the inverse what-if method is the process of calculating the inverse matrix with the given superficial states, that is, component degradation modes. The method suggested in this paper was validated using the turbine cycle model for an operating power plant
Spatial stochastic regression modelling of urban land use

International Nuclear Information System (INIS)

Arshad, S H M; Jaafar, J; Abiden, M Z Z; Latif, Z A; Rasam, A R A

2014-01-01

Urbanization is very closely linked to industrialization, commercialization or overall economic growth and development. This results in innumerable benefits of the quantity and quality of the urban environment and lifestyle but on the other hand contributes to unbounded development, urban sprawl, overcrowding and decreasing standard of living. Regulation and observation of urban development activities is crucial. The understanding of urban systems that promotes urban growth are also essential for the purpose of policy making, formulating development strategies as well as development plan preparation. This study aims to compare two different stochastic regression modeling techniques for spatial structure models of urban growth in the same specific study area. Both techniques will utilize the same datasets and their results will be analyzed. The work starts by producing an urban growth model by using stochastic regression modeling techniques namely the Ordinary Least Square (OLS) and Geographically Weighted Regression (GWR). The two techniques are compared to and it is found that, GWR seems to be a more significant stochastic regression model compared to OLS, it gives a smaller AICc (Akaike's Information Corrected Criterion) value and its output is more spatially explainable
Determinants of LSIL Regression in Women from a Colombian Cohort

International Nuclear Information System (INIS)

Molano, Monica; Gonzalez, Mauricio; Gamboa, Oscar; Ortiz, Natasha; Luna, Joaquin; Hernandez, Gustavo; Posso, Hector; Murillo, Raul; Munoz, Nubia

2010-01-01

Objective: To analyze the role of Human Papillomavirus (HPV) and other risk factors in the regression of cervical lesions in women from the Bogota Cohort. Methods: 200 HPV positive women with abnormal cytology were included for regression analysis. The time of lesion regression was modeled using methods for interval censored survival time data. Median duration of total follow-up was 9 years. Results: 80 (40%) women were diagnosed with Atypical Squamous Cells of Undetermined Significance (ASCUS) or Atypical Glandular Cells of Undetermined Significance (AGUS) while 120 (60%) were diagnosed with Low Grade Squamous Intra-epithelial Lesions (LSIL). Globally, 40% of the lesions were still present at first year of follow up, while 1.5% was still present at 5 year check-up. The multivariate model showed similar regression rates for lesions in women with ASCUS/AGUS and women with LSIL (HR= 0.82, 95% CI 0.59-1.12). Women infected with HR HPV types and those with mixed infections had lower regression rates for lesions than did women infected with LR types (HR=0.526, 95% CI 0.33-0.84, for HR types and HR=0.378, 95% CI 0.20-0.69, for mixed infections). Furthermore, women over 30 years had a higher lesion regression rate than did women under 30 years (HR1.53, 95% CI 1.03-2.27). The study showed that the median time for lesion regression was 9 months while the median time for HPV clearance was 12 months. Conclusions: In the studied population, the type of infection and the age of the women are critical factors for the regression of cervical lesions.
Satellite rainfall retrieval by logistic regression

Science.gov (United States)

Chiu, Long S.

1986-01-01

The potential use of logistic regression in rainfall estimation from satellite measurements is investigated. Satellite measurements provide covariate information in terms of radiances from different remote sensors.The logistic regression technique can effectively accommodate many covariates and test their significance in the estimation. The outcome from the logistical model is the probability that the rainrate of a satellite pixel is above a certain threshold. By varying the thresholds, a rainrate histogram can be obtained, from which the mean and the variant can be estimated. A logistical model is developed and applied to rainfall data collected during GATE, using as covariates the fractional rain area and a radiance measurement which is deduced from a microwave temperature-rainrate relation. It is demonstrated that the fractional rain area is an important covariate in the model, consistent with the use of the so-called Area Time Integral in estimating total rain volume in other studies. To calibrate the logistical model, simulated rain fields generated by rainfield models with prescribed parameters are needed. A stringent test of the logistical model is its ability to recover the prescribed parameters of simulated rain fields. A rain field simulation model which preserves the fractional rain area and lognormality of rainrates as found in GATE is developed. A stochastic regression model of branching and immigration whose solutions are lognormally distributed in some asymptotic limits has also been developed.
High resolution sea-level curve for the latest Frasnian and earliest Famennian derived for high frequency sequences in the Appalachian Basin

Energy Technology Data Exchange (ETDEWEB)

Filer, J.K. (Washington and Lee Univ., Lexington, VA (United States). Dept. of Geology)

1992-01-01

Siliciclastic sequences have been mapped in the subsurface and outcrop of much of the Appalachian basin in facies ranging from shale in the basin plain to shelf sandstone. Eleven transgressive/regressive cycles have been defined in an estimated 1.5 to 2.0 Ma period in the latest Frasnian and earliest Famennian, and range in duration from about 75,000 to 400,000 years. Lithofacies maps, covering most of the basin, were prepared for each sequence. These maps show both the area of basinal black shale deposition, which defines the base of each cycle, and the areal extent of subsequent clinoform siltstone and shelf sandstone deposition in the upper portion of each cycle. The stratigraphic patterns show two stacked sets of progradational basinwide sequences. Geographic scale of the study precludes autocyclic controls of cycles. Sea-level/climate cycles, probably superimposed on longer term tectonic cycles, are the proposed cause of these observed depositional patterns. Removal of the long-term progradational trend of Upper Devonian basin filling results in a proposed eustatic sea-level curve (Johnson and others (1985)) reveals correspondence of three regressive maxima in both models. The curve presented here reveals that an ongoing process of higher frequency sea-level modification was active at this time. Higher frequency sea-level events, nested within previously interpreted lower frequency global events, are inferred to also be eustatic. Models of a biotic crises which occurs at this time should consider the implications of these high frequency sea-level cycles. The patterns observed are consistent with latest Frasnian initiation of glaciation in South America. This would be somewhat earlier than has generally been accepted.
Tax Evasion, Information Reporting, and the Regressive Bias Hypothesis

DEFF Research Database (Denmark)

Boserup, Simon Halphen; Pinje, Jori Veng

A robust prediction from the tax evasion literature is that optimal auditing induces a regressive bias in effective tax rates compared to statutory rates. If correct, this will have important distributional consequences. Nevertheless, the regressive bias hypothesis has never been tested empirically...
Multitask Quantile Regression under the Transnormal Model.

Science.gov (United States)

Fan, Jianqing; Xue, Lingzhou; Zou, Hui

2016-01-01

We consider estimating multi-task quantile regression under the transnormal model, with focus on high-dimensional setting. We derive a surprisingly simple closed-form solution through rank-based covariance regularization. In particular, we propose the rank-based ℓ 1 penalization with positive definite constraints for estimating sparse covariance matrices, and the rank-based banded Cholesky decomposition regularization for estimating banded precision matrices. By taking advantage of alternating direction method of multipliers, nearest correlation matrix projection is introduced that inherits sampling properties of the unprojected one. Our work combines strengths of quantile regression and rank-based covariance regularization to simultaneously deal with nonlinearity and nonnormality for high-dimensional regression. Furthermore, the proposed method strikes a good balance between robustness and efficiency, achieves the "oracle"-like convergence rate, and provides the provable prediction interval under the high-dimensional setting. The finite-sample performance of the proposed method is also examined. The performance of our proposed rank-based method is demonstrated in a real application to analyze the protein mass spectroscopy data.
Quality of life in breast cancer patients--a quantile regression analysis.

Science.gov (United States)

Pourhoseingholi, Mohamad Amin; Safaee, Azadeh; Moghimi-Dehkordi, Bijan; Zeighami, Bahram; Faghihzadeh, Soghrat; Tabatabaee, Hamid Reza; Pourhoseingholi, Asma

2008-01-01

Quality of life study has an important role in health care especially in chronic diseases, in clinical judgment and in medical resources supplying. Statistical tools like linear regression are widely used to assess the predictors of quality of life. But when the response is not normal the results are misleading. The aim of this study is to determine the predictors of quality of life in breast cancer patients, using quantile regression model and compare to linear regression. A cross-sectional study conducted on 119 breast cancer patients that admitted and treated in chemotherapy ward of Namazi hospital in Shiraz. We used QLQ-C30 questionnaire to assessment quality of life in these patients. A quantile regression was employed to assess the assocciated factors and the results were compared to linear regression. All analysis carried out using SAS. The mean score for the global health status for breast cancer patients was 64.92+/-11.42. Linear regression showed that only grade of tumor, occupational status, menopausal status, financial difficulties and dyspnea were statistically significant. In spite of linear regression, financial difficulties were not significant in quantile regression analysis and dyspnea was only significant for first quartile. Also emotion functioning and duration of disease statistically predicted the QOL score in the third quartile. The results have demonstrated that using quantile regression leads to better interpretation and richer inference about predictors of the breast cancer patient quality of life.
A land use regression model for ambient ultrafine particles in Montreal, Canada: A comparison of linear regression and a machine learning approach.

Science.gov (United States)

Weichenthal, Scott; Ryswyk, Keith Van; Goldstein, Alon; Bagg, Scott; Shekkarizfard, Maryam; Hatzopoulou, Marianne

2016-04-01

Existing evidence suggests that ambient ultrafine particles (UFPs) (regression model for UFPs in Montreal, Canada using mobile monitoring data collected from 414 road segments during the summer and winter months between 2011 and 2012. Two different approaches were examined for model development including standard multivariable linear regression and a machine learning approach (kernel-based regularized least squares (KRLS)) that learns the functional form of covariate impacts on ambient UFP concentrations from the data. The final models included parameters for population density, ambient temperature and wind speed, land use parameters (park space and open space), length of local roads and rail, and estimated annual average NOx emissions from traffic. The final multivariable linear regression model explained 62% of the spatial variation in ambient UFP concentrations whereas the KRLS model explained 79% of the variance. The KRLS model performed slightly better than the linear regression model when evaluated using an external dataset (R(2)=0.58 vs. 0.55) or a cross-validation procedure (R(2)=0.67 vs. 0.60). In general, our findings suggest that the KRLS approach may offer modest improvements in predictive performance compared to standard multivariable linear regression models used to estimate spatial variations in ambient UFPs. However, differences in predictive performance were not statistically significant when evaluated using the cross-validation procedure. Crown Copyright © 2015. Published by Elsevier Inc. All rights reserved.
Genetics Home Reference: caudal regression syndrome

Science.gov (United States)

... umbilical artery: Further support for a caudal regression-sirenomelia spectrum. Am J Med Genet A. 2007 Dec ... AK, Dickinson JE, Bower C. Caudal dysgenesis and sirenomelia-single centre experience suggests common pathogenic basis. Am ...
Spontaneous and complete regression of a thoracic disc herniation

International Nuclear Information System (INIS)

Coevoet, V.; Benoudiba, F.; Doyon, D.; Lignieres, C.; Said, G.

1997-01-01

Spontaneous regression of disc herniation is well known but the mechanism is not clear. Some hypotheses have been made. We present here a large thoracic disc herniation diagnosed by MRI which completely regressed one year after a medical treatment with complete amendment of symptoms. (authors)
Nonlinear Forecasting With Many Predictors Using Kernel Ridge Regression

DEFF Research Database (Denmark)

Exterkate, Peter; Groenen, Patrick J.F.; Heij, Christiaan

This paper puts forward kernel ridge regression as an approach for forecasting with many predictors that are related nonlinearly to the target variable. In kernel ridge regression, the observed predictor variables are mapped nonlinearly into a high-dimensional space, where estimation of the predi...
Beta-binomial regression and bimodal utilization.

Science.gov (United States)

Liu, Chuan-Fen; Burgess, James F; Manning, Willard G; Maciejewski, Matthew L

2013-10-01

To illustrate how the analysis of bimodal U-shaped distributed utilization can be modeled with beta-binomial regression, which is rarely used in health services research. Veterans Affairs (VA) administrative data and Medicare claims in 2001-2004 for 11,123 Medicare-eligible VA primary care users in 2000. We compared means and distributions of VA reliance (the proportion of all VA/Medicare primary care visits occurring in VA) predicted from beta-binomial, binomial, and ordinary least-squares (OLS) models. Beta-binomial model fits the bimodal distribution of VA reliance better than binomial and OLS models due to the nondependence on normality and the greater flexibility in shape parameters. Increased awareness of beta-binomial regression may help analysts apply appropriate methods to outcomes with bimodal or U-shaped distributions. © Health Research and Educational Trust.
Regression Models For Multivariate Count Data.

Science.gov (United States)

Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

2017-01-01

Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data.
Model selection in kernel ridge regression

DEFF Research Database (Denmark)

Exterkate, Peter

2013-01-01

Kernel ridge regression is a technique to perform ridge regression with a potentially infinite number of nonlinear transformations of the independent variables as regressors. This method is gaining popularity as a data-rich nonlinear forecasting tool, which is applicable in many different contexts....... The influence of the choice of kernel and the setting of tuning parameters on forecast accuracy is investigated. Several popular kernels are reviewed, including polynomial kernels, the Gaussian kernel, and the Sinc kernel. The latter two kernels are interpreted in terms of their smoothing properties......, and the tuning parameters associated to all these kernels are related to smoothness measures of the prediction function and to the signal-to-noise ratio. Based on these interpretations, guidelines are provided for selecting the tuning parameters from small grids using cross-validation. A Monte Carlo study...
Classification and regression trees

CERN Document Server

Breiman, Leo; Olshen, Richard A; Stone, Charles J

1984-01-01

The methodology used to construct tree structured rules is the focus of this monograph. Unlike many other statistical procedures, which moved from pencil and paper to calculators, this text's use of trees was unthinkable before computers. Both the practical and theoretical sides have been developed in the authors' study of tree methods. Classification and Regression Trees reflects these two sides, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.
Regression in organizational leadership.

Science.gov (United States)

Kernberg, O F

1979-02-01

The choice of good leaders is a major task for all organizations. Inforamtion regarding the prospective administrator's personality should complement questions regarding his previous experience, his general conceptual skills, his technical knowledge, and the specific skills in the area for which he is being selected. The growing psychoanalytic knowledge about the crucial importance of internal, in contrast to external, object relations, and about the mutual relationships of regression in individuals and in groups, constitutes an important practical tool for the selection of leaders.
Tutorial on Using Regression Models with Count Outcomes Using R

Directory of Open Access Journals (Sweden)

A. Alexander Beaujean

2016-02-01

Full Text Available Education researchers often study count variables, such as times a student reached a goal, discipline referrals, and absences. Most researchers that study these variables use typical regression methods (i.e., ordinary least-squares either with or without transforming the count variables. In either case, using typical regression for count data can produce parameter estimates that are biased, thus diminishing any inferences made from such data. As count-variable regression models are seldom taught in training programs, we present a tutorial to help educational researchers use such methods in their own research. We demonstrate analyzing and interpreting count data using Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial regression models. The count regression methods are introduced through an example using the number of times students skipped class. The data for this example are freely available and the R syntax used run the example analyses are included in the Appendix.
Sample size determination for logistic regression on a logit-normal distribution.

Science.gov (United States)

Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance

2017-06-01

Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination ([Formula: see text]) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for [Formula: see text] for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.
Changes in persistence, spurious regressions and the Fisher hypothesis

DEFF Research Database (Denmark)

Kruse, Robinson; Ventosa-Santaulària, Daniel; Noriega, Antonio E.

Declining inflation persistence has been documented in numerous studies. When such series are analyzed in a regression framework in conjunction with other persistent time series, spurious regressions are likely to occur. We propose to use the coefficient of determination R2 as a test statistic to...

Interpreting parameters in the logistic regression model with random effects

DEFF Research Database (Denmark)

Larsen, Klaus; Petersen, Jørgen Holm; Budtz-Jørgensen, Esben

2000-01-01

interpretation, interval odds ratio, logistic regression, median odds ratio, normally distributed random effects......interpretation, interval odds ratio, logistic regression, median odds ratio, normally distributed random effects...
Research and analyze of physical health using multiple regression analysis

Directory of Open Access Journals (Sweden)

T. S. Kyi

2014-01-01

Full Text Available This paper represents the research which is trying to create a mathematical model of the "healthy people" using the method of regression analysis. The factors are the physical parameters of the person (such as heart rate, lung capacity, blood pressure, breath holding, weight height coefficient, flexibility of the spine, muscles of the shoulder belt, abdominal muscles, squatting, etc.., and the response variable is an indicator of physical working capacity. After performing multiple regression analysis, obtained useful multiple regression models that can predict the physical performance of boys the aged of fourteen to seventeen years. This paper represents the development of regression model for the sixteen year old boys and analyzed results.
Northeast Guanabara Bay and coastal plain Holocene sedimentary evolution (Brazil: A contribution

Directory of Open Access Journals (Sweden)

Rodrigo Coutinho Abuchacra

2017-02-01

Full Text Available Sedimentological and radiocarbon investigations are part of an ongoing research on the Bay-head delta of northeast Guanabara Bay, Rio de Janeiro State. Sediment accumulation indicates that the Holocene infill of the bay-head delta started around 8.2 kyr BP and was not in pace with the eustatic sea-level rise. Sediment accumulation was faster during the transgressive phase (0.56 cm.yr-1. However, during the regressive phase, progradation driven by base-level fall was predominant over vertical sediment accumulation (0.02 cm.yr-1. Based on coring, three sedimentary units were defined: fluvial sands (U1, estuarine deposits (U2 and fluvial mud (U3.
Estimating nonlinear selection gradients using quadratic regression coefficients: double or nothing?

Science.gov (United States)

Stinchcombe, John R; Agrawal, Aneil F; Hohenlohe, Paul A; Arnold, Stevan J; Blows, Mark W

2008-09-01

The use of regression analysis has been instrumental in allowing evolutionary biologists to estimate the strength and mode of natural selection. Although directional and correlational selection gradients are equal to their corresponding regression coefficients, quadratic regression coefficients must be doubled to estimate stabilizing/disruptive selection gradients. Based on a sample of 33 papers published in Evolution between 2002 and 2007, at least 78% of papers have not doubled quadratic regression coefficients, leading to an appreciable underestimate of the strength of stabilizing and disruptive selection. Proper treatment of quadratic regression coefficients is necessary for estimation of fitness surfaces and contour plots, canonical analysis of the gamma matrix, and modeling the evolution of populations on an adaptive landscape.
The number of subjects per variable required in linear regression analyses.

Science.gov (United States)

Austin, Peter C; Steyerberg, Ewout W

2015-06-01

To determine the number of independent variables that can be included in a linear regression model. We used a series of Monte Carlo simulations to examine the impact of the number of subjects per variable (SPV) on the accuracy of estimated regression coefficients and standard errors, on the empirical coverage of estimated confidence intervals, and on the accuracy of the estimated R(2) of the fitted model. A minimum of approximately two SPV tended to result in estimation of regression coefficients with relative bias of less than 10%. Furthermore, with this minimum number of SPV, the standard errors of the regression coefficients were accurately estimated and estimated confidence intervals had approximately the advertised coverage rates. A much higher number of SPV were necessary to minimize bias in estimating the model R(2), although adjusted R(2) estimates behaved well. The bias in estimating the model R(2) statistic was inversely proportional to the magnitude of the proportion of variation explained by the population regression model. Linear regression models require only two SPV for adequate estimation of regression coefficients, standard errors, and confidence intervals. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
application of multilinear regression analysis in modeling of soil

African Journals Online (AJOL)

Windows User

Accordingly [1, 3] in their work, they applied linear regression ... (MLRA) is a statistical technique that uses several explanatory ... order to check this, they adopted bivariate correlation analysis .... groups, namely A-1 through A-7, based on their relative expected ..... Multivariate Regression in Gorgan Province North of Iran” ...
Implicit collinearity effect in linear regression: Application to basal ...

African Journals Online (AJOL)

Collinearity of predictor variables is a severe problem in the least square regression analysis. It contributes to the instability of regression coefficients and leads to a wrong prediction accuracy. Despite these problems, studies are conducted with a large number of observed and derived variables linked with a response ...
Merkel Cell Carcinoma with Spontaneous Regression: A Case Report and Immunohistochemical Study

Directory of Open Access Journals (Sweden)

Hitoshi Terui

2016-02-01

Full Text Available Merkel cell carcinoma (MCC is an aggressive neuroendocrine carcinoma that only rarely regresses spontaneously. Since little is known about the immunological mechanisms involved in the spontaneous regression of MCC, we describe a case of MCC with spontaneous regression and employed immunohistochemical staining for cytotoxic and immunosuppressive molecules to investigate possible mechanisms involved in the spontaneous regression of MCC. Interestingly, compared to conventional MCC, tumor-infiltrating lymphocytes in MCC with spontaneous regression contained higher numbers of CD8+ cells and granulysin-bearing cells and lower numbers of CD206+ cells. Our present study suggests one of the possible reasons for the spontaneous regression of MCC.
Gaussian Process Regression for WDM System Performance Prediction

DEFF Research Database (Denmark)

Wass, Jesper; Thrane, Jakob; Piels, Molly

2017-01-01

Gaussian process regression is numerically and experimentally investigated to predict the bit error rate of a 24 x 28 CiBd QPSK WDM system. The proposed method produces accurate predictions from multi-dimensional and sparse measurement data.......Gaussian process regression is numerically and experimentally investigated to predict the bit error rate of a 24 x 28 CiBd QPSK WDM system. The proposed method produces accurate predictions from multi-dimensional and sparse measurement data....
Use of probabilistic weights to enhance linear regression myoelectric control

Science.gov (United States)

Smith, Lauren H.; Kuiken, Todd A.; Hargrove, Levi J.

2015-12-01

Objective. Clinically available prostheses for transradial amputees do not allow simultaneous myoelectric control of degrees of freedom (DOFs). Linear regression methods can provide simultaneous myoelectric control, but frequently also result in difficulty with isolating individual DOFs when desired. This study evaluated the potential of using probabilistic estimates of categories of gross prosthesis movement, which are commonly used in classification-based myoelectric control, to enhance linear regression myoelectric control. Approach. Gaussian models were fit to electromyogram (EMG) feature distributions for three movement classes at each DOF (no movement, or movement in either direction) and used to weight the output of linear regression models by the probability that the user intended the movement. Eight able-bodied and two transradial amputee subjects worked in a virtual Fitts’ law task to evaluate differences in controllability between linear regression and probability-weighted regression for an intramuscular EMG-based three-DOF wrist and hand system. Main results. Real-time and offline analyses in able-bodied subjects demonstrated that probability weighting improved performance during single-DOF tasks (p < 0.05) by preventing extraneous movement at additional DOFs. Similar results were seen in experiments with two transradial amputees. Though goodness-of-fit evaluations suggested that the EMG feature distributions showed some deviations from the Gaussian, equal-covariance assumptions used in this experiment, the assumptions were sufficiently met to provide improved performance compared to linear regression control. Significance. Use of probability weights can improve the ability to isolate individual during linear regression myoelectric control, while maintaining the ability to simultaneously control multiple DOFs.
Evaluation of Linear Regression Simultaneous Myoelectric Control Using Intramuscular EMG.

Science.gov (United States)

Smith, Lauren H; Kuiken, Todd A; Hargrove, Levi J

2016-04-01

The objective of this study was to evaluate the ability of linear regression models to decode patterns of muscle coactivation from intramuscular electromyogram (EMG) and provide simultaneous myoelectric control of a virtual 3-DOF wrist/hand system. Performance was compared to the simultaneous control of conventional myoelectric prosthesis methods using intramuscular EMG (parallel dual-site control)-an approach that requires users to independently modulate individual muscles in the residual limb, which can be challenging for amputees. Linear regression control was evaluated in eight able-bodied subjects during a virtual Fitts' law task and was compared to performance of eight subjects using parallel dual-site control. An offline analysis also evaluated how different types of training data affected prediction accuracy of linear regression control. The two control systems demonstrated similar overall performance; however, the linear regression method demonstrated improved performance for targets requiring use of all three DOFs, whereas parallel dual-site control demonstrated improved performance for targets that required use of only one DOF. Subjects using linear regression control could more easily activate multiple DOFs simultaneously, but often experienced unintended movements when trying to isolate individual DOFs. Offline analyses also suggested that the method used to train linear regression systems may influence controllability. Linear regression myoelectric control using intramuscular EMG provided an alternative to parallel dual-site control for 3-DOF simultaneous control at the wrist and hand. The two methods demonstrated different strengths in controllability, highlighting the tradeoff between providing simultaneous control and the ability to isolate individual DOFs when desired.
Adaptive metric kernel regression

DEFF Research Database (Denmark)

Goutte, Cyril; Larsen, Jan

2000-01-01

Kernel smoothing is a widely used non-parametric pattern recognition technique. By nature, it suffers from the curse of dimensionality and is usually difficult to apply to high input dimensions. In this contribution, we propose an algorithm that adapts the input metric used in multivariate...... regression by minimising a cross-validation estimate of the generalisation error. This allows to automatically adjust the importance of different dimensions. The improvement in terms of modelling performance is illustrated on a variable selection task where the adaptive metric kernel clearly outperforms...
Sparse Reduced-Rank Regression for Simultaneous Dimension Reduction and Variable Selection

KAUST Repository

Chen, Lisha

2012-12-01

The reduced-rank regression is an effective method in predicting multiple response variables from the same set of predictor variables. It reduces the number of model parameters and takes advantage of interrelations between the response variables and hence improves predictive accuracy. We propose to select relevant variables for reduced-rank regression by using a sparsity-inducing penalty. We apply a group-lasso type penalty that treats each row of the matrix of the regression coefficients as a group and show that this penalty satisfies certain desirable invariance properties. We develop two numerical algorithms to solve the penalized regression problem and establish the asymptotic consistency of the proposed method. In particular, the manifold structure of the reduced-rank regression coefficient matrix is considered and studied in our theoretical analysis. In our simulation study and real data analysis, the new method is compared with several existing variable selection methods for multivariate regression and exhibits competitive performance in prediction and variable selection. © 2012 American Statistical Association.
Augmenting Data with Published Results in Bayesian Linear Regression

Science.gov (United States)

de Leeuw, Christiaan; Klugkist, Irene

2012-01-01

In most research, linear regression analyses are performed without taking into account published results (i.e., reported summary statistics) of similar previous studies. Although the prior density in Bayesian linear regression could accommodate such prior knowledge, formal models for doing so are absent from the literature. The goal of this…
Are increases in cigarette taxation regressive?

Science.gov (United States)

Borren, P; Sutton, M

1992-12-01

Using the latest published data from Tobacco Advisory Council surveys, this paper re-evaluates the question of whether or not increases in cigarette taxation are regressive in the United Kingdom. The extended data set shows no evidence of increasing price-elasticity by social class as found in a major previous study. To the contrary, there appears to be no clear pattern in the price responsiveness of smoking behaviour across different social classes. Increases in cigarette taxation, while reducing smoking levels in all groups, fall most heavily on men and women in the lowest social class. Men and women in social class five can expect to pay eight and eleven times more of a tax increase respectively, than their social class one counterparts. Taken as a proportion of relative incomes, the regressive nature of increases in cigarette taxation is even more pronounced.
An improved multiple linear regression and data analysis computer program package

Science.gov (United States)

Sidik, S. M.

1972-01-01

NEWRAP, an improved version of a previous multiple linear regression program called RAPIER, CREDUC, and CRSPLT, allows for a complete regression analysis including cross plots of the independent and dependent variables, correlation coefficients, regression coefficients, analysis of variance tables, t-statistics and their probability levels, rejection of independent variables, plots of residuals against the independent and dependent variables, and a canonical reduction of quadratic response functions useful in optimum seeking experimentation. A major improvement over RAPIER is that all regression calculations are done in double precision arithmetic.
Poisson regression for modeling count and frequency outcomes in trauma research.

Science.gov (United States)

Gagnon, David R; Doron-LaMarca, Susan; Bell, Margret; O'Farrell, Timothy J; Taft, Casey T

2008-10-01

The authors describe how the Poisson regression method for analyzing count or frequency outcome variables can be applied in trauma studies. The outcome of interest in trauma research may represent a count of the number of incidents of behavior occurring in a given time interval, such as acts of physical aggression or substance abuse. Traditional linear regression approaches assume a normally distributed outcome variable with equal variances over the range of predictor variables, and may not be optimal for modeling count outcomes. An application of Poisson regression is presented using data from a study of intimate partner aggression among male patients in an alcohol treatment program and their female partners. Results of Poisson regression and linear regression models are compared.
Bayesian Inference of a Multivariate Regression Model

Directory of Open Access Journals (Sweden)

Marick S. Sinay

2014-01-01

Full Text Available We explore Bayesian inference of a multivariate linear regression model with use of a flexible prior for the covariance structure. The commonly adopted Bayesian setup involves the conjugate prior, multivariate normal distribution for the regression coefficients and inverse Wishart specification for the covariance matrix. Here we depart from this approach and propose a novel Bayesian estimator for the covariance. A multivariate normal prior for the unique elements of the matrix logarithm of the covariance matrix is considered. Such structure allows for a richer class of prior distributions for the covariance, with respect to strength of beliefs in prior location hyperparameters, as well as the added ability, to model potential correlation amongst the covariance structure. The posterior moments of all relevant parameters of interest are calculated based upon numerical results via a Markov chain Monte Carlo procedure. The Metropolis-Hastings-within-Gibbs algorithm is invoked to account for the construction of a proposal density that closely matches the shape of the target posterior distribution. As an application of the proposed technique, we investigate a multiple regression based upon the 1980 High School and Beyond Survey.
Geographically weighted regression model on poverty indicator

Science.gov (United States)

Slamet, I.; Nugroho, N. F. T. A.; Muslich

2017-12-01

In this research, we applied geographically weighted regression (GWR) for analyzing the poverty in Central Java. We consider Gaussian Kernel as weighted function. The GWR uses the diagonal matrix resulted from calculating kernel Gaussian function as a weighted function in the regression model. The kernel weights is used to handle spatial effects on the data so that a model can be obtained for each location. The purpose of this paper is to model of poverty percentage data in Central Java province using GWR with Gaussian kernel weighted function and to determine the influencing factors in each regency/city in Central Java province. Based on the research, we obtained geographically weighted regression model with Gaussian kernel weighted function on poverty percentage data in Central Java province. We found that percentage of population working as farmers, population growth rate, percentage of households with regular sanitation, and BPJS beneficiaries are the variables that affect the percentage of poverty in Central Java province. In this research, we found the determination coefficient R2 are 68.64%. There are two categories of district which are influenced by different of significance factors.
General regression and representation model for classification.

Directory of Open Access Journals (Sweden)

Jianjun Qian

Full Text Available Recently, the regularized coding-based classification methods (e.g. SRC and CRC show a great potential for pattern classification. However, most existing coding methods assume that the representation residuals are uncorrelated. In real-world applications, this assumption does not hold. In this paper, we take account of the correlations of the representation residuals and develop a general regression and representation model (GRR for classification. GRR not only has advantages of CRC, but also takes full use of the prior information (e.g. the correlations between representation residuals and representation coefficients and the specific information (weight matrix of image pixels to enhance the classification performance. GRR uses the generalized Tikhonov regularization and K Nearest Neighbors to learn the prior information from the training data. Meanwhile, the specific information is obtained by using an iterative algorithm to update the feature (or image pixel weights of the test sample. With the proposed model as a platform, we design two classifiers: basic general regression and representation classifier (B-GRR and robust general regression and representation classifier (R-GRR. The experimental results demonstrate the performance advantages of proposed methods over state-of-the-art algorithms.

Directional quantile regression in R

Czech Academy of Sciences Publication Activity Database

Boček, Pavel; Šiman, Miroslav

2017-01-01

Roč. 53, č. 3 (2017), s. 480-492 ISSN 0023-5954 R&D Projects: GA ČR GA14-07234S Institutional support: RVO:67985556 Keywords : multivariate quantile * regression quantile * halfspace depth * depth contour Subject RIV: BD - Theory of Information OBOR OECD: Applied mathematics Impact factor: 0.379, year: 2016 http://library.utia.cas.cz/separaty/2017/SI/bocek-0476587.pdf
Cactus: An Introduction to Regression

Science.gov (United States)

Hyde, Hartley

2008-01-01

When the author first used "VisiCalc," the author thought it a very useful tool when he had the formulas. But how could he design a spreadsheet if there was no known formula for the quantities he was trying to predict? A few months later, the author relates he learned to use multiple linear regression software and suddenly it all clicked into…
Wavelet regression model in forecasting crude oil price

Science.gov (United States)

Hamid, Mohd Helmie; Shabri, Ani

2017-05-01

This study presents the performance of wavelet multiple linear regression (WMLR) technique in daily crude oil forecasting. WMLR model was developed by integrating the discrete wavelet transform (DWT) and multiple linear regression (MLR) model. The original time series was decomposed to sub-time series with different scales by wavelet theory. Correlation analysis was conducted to assist in the selection of optimal decomposed components as inputs for the WMLR model. The daily WTI crude oil price series has been used in this study to test the prediction capability of the proposed model. The forecasting performance of WMLR model were also compared with regular multiple linear regression (MLR), Autoregressive Moving Average (ARIMA) and Generalized Autoregressive Conditional Heteroscedasticity (GARCH) using root mean square errors (RMSE) and mean absolute errors (MAE). Based on the experimental results, it appears that the WMLR model performs better than the other forecasting technique tested in this study.
Biostatistics Series Module 6: Correlation and Linear Regression.

Science.gov (United States)

Hazra, Avijit; Gogtay, Nithya

2016-01-01

Correlation and linear regression are the most commonly used techniques for quantifying the association between two numeric variables. Correlation quantifies the strength of the linear relationship between paired variables, expressing this as a correlation coefficient. If both variables x and y are normally distributed, we calculate Pearson's correlation coefficient ( r ). If normality assumption is not met for one or both variables in a correlation analysis, a rank correlation coefficient, such as Spearman's rho (ρ) may be calculated. A hypothesis test of correlation tests whether the linear relationship between the two variables holds in the underlying population, in which case it returns a P correlation coefficient can also be calculated for an idea of the correlation in the population. The value r 2 denotes the proportion of the variability of the dependent variable y that can be attributed to its linear relation with the independent variable x and is called the coefficient of determination. Linear regression is a technique that attempts to link two correlated variables x and y in the form of a mathematical equation ( y = a + bx ), such that given the value of one variable the other may be predicted. In general, the method of least squares is applied to obtain the equation of the regression line. Correlation and linear regression analysis are based on certain assumptions pertaining to the data sets. If these assumptions are not met, misleading conclusions may be drawn. The first assumption is that of linear relationship between the two variables. A scatter plot is essential before embarking on any correlation-regression analysis to show that this is indeed the case. Outliers or clustering within data sets can distort the correlation coefficient value. Finally, it is vital to remember that though strong correlation can be a pointer toward causation, the two are not synonymous.
Fast multi-output relevance vector regression

OpenAIRE

Ha, Youngmin

2017-01-01

This paper aims to decrease the time complexity of multi-output relevance vector regression from O(VM^3) to O(V^3+M^3), where V is the number of output dimensions, M is the number of basis functions, and V
Logistic Regression Modeling of Diminishing Manufacturing Sources for Integrated Circuits

National Research Council Canada - National Science Library

Gravier, Michael

1999-01-01

.... The research identified logistic regression as a powerful tool for analysis of DMSMS and further developed twenty models attempting to identify the "best" way to model and predict DMSMS using logistic regression...
A regression approach for Zircaloy-2 in-reactor creep constitutive equations

International Nuclear Information System (INIS)

Yung Liu, Y.; Bement, A.L.

1977-01-01

In this paper the methodology of multiple regressions as applied to Zircaloy-2 in-reactor creep data analysis and construction of constitutive equation are illustrated. While the resulting constitutive equation can be used in creep analysis of in-reactor Zircaloy structural components, the methodology itself is entirely general and can be applied to any creep data analysis. The promising aspects of multiple regression creep data analysis are briefly outlined as follows: (1) When there are more than one variable involved, there is no need to make the assumption that each variable affects the response independently. No separate normalizations are required either and the estimation of parameters is obtained by solving many simultaneous equations. The number of simultaneous equations is equal to the number of data sets. (2) Regression statistics such as R 2 - and F-statistics provide measures of the significance of regression creep equation in correlating the overall data. The relative weights of each variable on the response can also be obtained. (3) Special regression techniques such as step-wise, ridge, and robust regressions and residual plots, etc., provide diagnostic tools for model selections. Multiple regression analysis performed on a set of carefully selected Zircaloy-2 in-reactor creep data leads to a model which provides excellent correlations for the data. (Auth.)
On a Robust MaxEnt Process Regression Model with Sample-Selection

Directory of Open Access Journals (Sweden)

Hea-Jung Kim

2018-04-01

Full Text Available In a regression analysis, a sample-selection bias arises when a dependent variable is partially observed as a result of the sample selection. This study introduces a Maximum Entropy (MaxEnt process regression model that assumes a MaxEnt prior distribution for its nonparametric regression function and finds that the MaxEnt process regression model includes the well-known Gaussian process regression (GPR model as a special case. Then, this special MaxEnt process regression model, i.e., the GPR model, is generalized to obtain a robust sample-selection Gaussian process regression (RSGPR model that deals with non-normal data in the sample selection. Various properties of the RSGPR model are established, including the stochastic representation, distributional hierarchy, and magnitude of the sample-selection bias. These properties are used in the paper to develop a hierarchical Bayesian methodology to estimate the model. This involves a simple and computationally feasible Markov chain Monte Carlo algorithm that avoids analytical or numerical derivatives of the log-likelihood function of the model. The performance of the RSGPR model in terms of the sample-selection bias correction, robustness to non-normality, and prediction, is demonstrated through results in simulations that attest to its good finite-sample performance.
Adaptive Metric Kernel Regression

DEFF Research Database (Denmark)

Goutte, Cyril; Larsen, Jan

1998-01-01

Kernel smoothing is a widely used nonparametric pattern recognition technique. By nature, it suffers from the curse of dimensionality and is usually difficult to apply to high input dimensions. In this paper, we propose an algorithm that adapts the input metric used in multivariate regression...... by minimising a cross-validation estimate of the generalisation error. This allows one to automatically adjust the importance of different dimensions. The improvement in terms of modelling performance is illustrated on a variable selection task where the adaptive metric kernel clearly outperforms the standard...
Intermediate and advanced topics in multilevel logistic regression analysis.

Science.gov (United States)

Austin, Peter C; Merlo, Juan

2017-09-10

Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher-level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within-cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population-average effect of covariates measured at the subject and cluster level, in contrast to the within-cluster or cluster-specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster-level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Linear regression and sensitivity analysis in nuclear reactor design

International Nuclear Information System (INIS)

Kumar, Akansha; Tsvetkov, Pavel V.; McClarren, Ryan G.

2015-01-01

Highlights: • Presented a benchmark for the applicability of linear regression to complex systems. • Applied linear regression to a nuclear reactor power system. • Performed neutronics, thermal–hydraulics, and energy conversion using Brayton’s cycle for the design of a GCFBR. • Performed detailed sensitivity analysis to a set of parameters in a nuclear reactor power system. • Modeled and developed reactor design using MCNP, regression using R, and thermal–hydraulics in Java. - Abstract: The paper presents a general strategy applicable for sensitivity analysis (SA), and uncertainity quantification analysis (UA) of parameters related to a nuclear reactor design. This work also validates the use of linear regression (LR) for predictive analysis in a nuclear reactor design. The analysis helps to determine the parameters on which a LR model can be fit for predictive analysis. For those parameters, a regression surface is created based on trial data and predictions are made using this surface. A general strategy of SA to determine and identify the influential parameters those affect the operation of the reactor is mentioned. Identification of design parameters and validation of linearity assumption for the application of LR of reactor design based on a set of tests is performed. The testing methods used to determine the behavior of the parameters can be used as a general strategy for UA, and SA of nuclear reactor models, and thermal hydraulics calculations. A design of a gas cooled fast breeder reactor (GCFBR), with thermal–hydraulics, and energy transfer has been used for the demonstration of this method. MCNP6 is used to simulate the GCFBR design, and perform the necessary criticality calculations. Java is used to build and run input samples, and to extract data from the output files of MCNP6, and R is used to perform regression analysis and other multivariate variance, and analysis of the collinearity of data
Geodesic least squares regression for scaling studies in magnetic confinement fusion

International Nuclear Information System (INIS)

Verdoolaege, Geert

2015-01-01

In regression analyses for deriving scaling laws that occur in various scientific disciplines, usually standard regression methods have been applied, of which ordinary least squares (OLS) is the most popular. However, concerns have been raised with respect to several assumptions underlying OLS in its application to scaling laws. We here discuss a new regression method that is robust in the presence of significant uncertainty on both the data and the regression model. The method, which we call geodesic least squares regression (GLS), is based on minimization of the Rao geodesic distance on a probabilistic manifold. We demonstrate the superiority of the method using synthetic data and we present an application to the scaling law for the power threshold for the transition to the high confinement regime in magnetic confinement fusion devices
Prenatal diagnosis of Caudal Regression Syndrome : a case report

Directory of Open Access Journals (Sweden)

Celikaslan Nurgul

2001-12-01

Full Text Available Abstract Background Caudal regression is a rare syndrome which has a spectrum of congenital malformations ranging from simple anal atresia to absence of sacral, lumbar and possibly lower thoracic vertebrae, to the most severe form which is known as sirenomelia. Maternal diabetes, genetic predisposition and vascular hypoperfusion have been suggested as possible causative factors. Case presentation We report a case of caudal regression syndrome diagnosed in utero at 22 weeks' of gestation. Prenatal ultrasound examination revealed a sudden interruption of the spine and "frog-like" position of lower limbs. Termination of pregnancy and autopsy findings confirmed the diagnosis. Conclusion Prenatal ultrasonographic diagnosis of caudal regression syndrome is possible at 22 weeks' of gestation by ultrasound examination.
Modeling Personalized Email Prioritization: Classification-based and Regression-based Approaches

Energy Technology Data Exchange (ETDEWEB)

Yoo S.; Yang, Y.; Carbonell, J.

2011-10-24

Email overload, even after spam filtering, presents a serious productivity challenge for busy professionals and executives. One solution is automated prioritization of incoming emails to ensure the most important are read and processed quickly, while others are processed later as/if time permits in declining priority levels. This paper presents a study of machine learning approaches to email prioritization into discrete levels, comparing ordinal regression versus classier cascades. Given the ordinal nature of discrete email priority levels, SVM ordinal regression would be expected to perform well, but surprisingly a cascade of SVM classifiers significantly outperforms ordinal regression for email prioritization. In contrast, SVM regression performs well -- better than classifiers -- on selected UCI data sets. This unexpected performance inversion is analyzed and results are presented, providing core functionality for email prioritization systems.
Yet another look at MIDAS regression

NARCIS (Netherlands)

Ph.H.B.F. Franses (Philip Hans)

2016-01-01

textabstractA MIDAS regression involves a dependent variable observed at a low frequency and independent variables observed at a higher frequency. This paper relates a true high frequency data generating process, where also the dependent variable is observed (hypothetically) at the high frequency,
Testing the equality of nonparametric regression curves based on ...

African Journals Online (AJOL)

Abstract. In this work we propose a new methodology for the comparison of two regression functions f1 and f2 in the case of homoscedastic error structure and a fixed design. Our approach is based on the empirical Fourier coefficients of the regression functions f1 and f2 respectively. As our main results we obtain the ...
Multivariate Regression of Liver on Intestine of Mice: A ...

African Journals Online (AJOL)

FIRST LADY

pairs recovered. Linear, semi-logarithmic and logarithmic-logarithmic (log- log) regressions were performed. He chose the log-log curves because its variance was more uniform. The statistical comparison of .... E(U1| U2 = u2) is the regression function of U1 on U2, and Var (U1|U2 = u2) is the conditional covariance matrix.
Testing homogeneity in Weibull-regression models.

Science.gov (United States)

Bolfarine, Heleno; Valença, Dione M

2005-10-01

In survival studies with families or geographical units it may be of interest testing whether such groups are homogeneous for given explanatory variables. In this paper we consider score type tests for group homogeneity based on a mixing model in which the group effect is modelled as a random variable. As opposed to hazard-based frailty models, this model presents survival times that conditioned on the random effect, has an accelerated failure time representation. The test statistics requires only estimation of the conventional regression model without the random effect and does not require specifying the distribution of the random effect. The tests are derived for a Weibull regression model and in the uncensored situation, a closed form is obtained for the test statistic. A simulation study is used for comparing the power of the tests. The proposed tests are applied to real data sets with censored data.
Confidence bands for inverse regression models

International Nuclear Information System (INIS)

Birke, Melanie; Bissantz, Nicolai; Holzmann, Hajo

2010-01-01

We construct uniform confidence bands for the regression function in inverse, homoscedastic regression models with convolution-type operators. Here, the convolution is between two non-periodic functions on the whole real line rather than between two periodic functions on a compact interval, since the former situation arguably arises more often in applications. First, following Bickel and Rosenblatt (1973 Ann. Stat. 1 1071–95) we construct asymptotic confidence bands which are based on strong approximations and on a limit theorem for the supremum of a stationary Gaussian process. Further, we propose bootstrap confidence bands based on the residual bootstrap and prove consistency of the bootstrap procedure. A simulation study shows that the bootstrap confidence bands perform reasonably well for moderate sample sizes. Finally, we apply our method to data from a gel electrophoresis experiment with genetically engineered neuronal receptor subunits incubated with rat brain extract
FBH1 Catalyzes Regression of Stalled Replication Forks

Directory of Open Access Journals (Sweden)

Kasper Fugger

2015-03-01

Full Text Available DNA replication fork perturbation is a major challenge to the maintenance of genome integrity. It has been suggested that processing of stalled forks might involve fork regression, in which the fork reverses and the two nascent DNA strands anneal. Here, we show that FBH1 catalyzes regression of a model replication fork in vitro and promotes fork regression in vivo in response to replication perturbation. Cells respond to fork stalling by activating checkpoint responses requiring signaling through stress-activated protein kinases. Importantly, we show that FBH1, through its helicase activity, is required for early phosphorylation of ATM substrates such as CHK2 and CtIP as well as hyperphosphorylation of RPA. These phosphorylations occur prior to apparent DNA double-strand break formation. Furthermore, FBH1-dependent signaling promotes checkpoint control and preserves genome integrity. We propose a model whereby FBH1 promotes early checkpoint signaling by remodeling of stalled DNA replication forks.

[Application of detecting and taking overdispersion into account in Poisson regression model].

Science.gov (United States)

Bouche, G; Lepage, B; Migeot, V; Ingrand, P

2009-08-01

Researchers often use the Poisson regression model to analyze count data. Overdispersion can occur when a Poisson regression model is used, resulting in an underestimation of variance of the regression model parameters. Our objective was to take overdispersion into account and assess its impact with an illustration based on the data of a study investigating the relationship between use of the Internet to seek health information and number of primary care consultations. Three methods, overdispersed Poisson, a robust estimator, and negative binomial regression, were performed to take overdispersion into account in explaining variation in the number (Y) of primary care consultations. We tested overdispersion in the Poisson regression model using the ratio of the sum of Pearson residuals over the number of degrees of freedom (chi(2)/df). We then fitted the three models and compared parameter estimation to the estimations given by Poisson regression model. Variance of the number of primary care consultations (Var[Y]=21.03) was greater than the mean (E[Y]=5.93) and the chi(2)/df ratio was 3.26, which confirmed overdispersion. Standard errors of the parameters varied greatly between the Poisson regression model and the three other regression models. Interpretation of estimates from two variables (using the Internet to seek health information and single parent family) would have changed according to the model retained, with significant levels of 0.06 and 0.002 (Poisson), 0.29 and 0.09 (overdispersed Poisson), 0.29 and 0.13 (use of a robust estimator) and 0.45 and 0.13 (negative binomial) respectively. Different methods exist to solve the problem of underestimating variance in the Poisson regression model when overdispersion is present. The negative binomial regression model seems to be particularly accurate because of its theorical distribution ; in addition this regression is easy to perform with ordinary statistical software packages.
Regressão e crescimento do primogênito no processo de tornar-se irmão Firstborn's regression and growth in the process of becoming a sibling

Directory of Open Access Journals (Sweden)

Débora Silva Oliveira

2013-03-01

Full Text Available Investigaram-se indicadores de regressão e crescimento do primogênito no processo de tornar-se irmão. Participaram três primogênitos pré-escolares no terceiro trimestre de gestação, aos 12 e 24 meses do irmão. Foi aplicado o Teste das Fábulas e realizada análise qualitativa de conteúdo. Os resultados revelaram regressão do primogênito na gestação materna e crescimento, aos 12 e aos 24 meses de idade do irmão. A regressão foi uma forma de enfrentar a chegada do irmão, enquanto que o crescimento revelou capacidade para conquistas ou custos de ser mais velho. Tanto a regressão quanto o crescimento oportunizaram um ir e vir saudável, fundamental para o desenvolvimento rumo à independência. Esses achados têm implicações para a pesquisa e para a clínica.Regression and growth indicators in the process of becoming a sibling were investigated. Three firstborns took part in the study during the first sibling's third trimester of pregnancy, and when the sibling was 12 and 24 months old, respectively. The Fables Test was used and a qualitative content analysis was carried out. Results revealed regression indicators during pregnancy. At 12 and 24 months there were growth indicators together with regression indicators. Regression was used by the firstborn for coping with the sibling's arrival while growth revealed the capacity for acquisitions or the costs of being an older sibling. Both regressive and growth manifestations enabled a healthy to and fro, which is fundamental for development towards independence. These findings have both research and clinical implications.
A SAS-macro for estimation of the cumulative incidence using Poisson regression

DEFF Research Database (Denmark)

Waltoft, Berit Lindum

2009-01-01

the hazard rates, and the hazard rates are often estimated by the Cox regression. This procedure may not be suitable for large studies due to limited computer resources. Instead one uses Poisson regression, which approximates the Cox regression. Rosthøj et al. presented a SAS-macro for the estimation...... of the cumulative incidences based on the Cox regression. I present the functional form of the probabilities and variances when using piecewise constant hazard rates and a SAS-macro for the estimation using Poisson regression. The use of the macro is demonstrated through examples and compared to the macro presented...
Dimension Reduction and Discretization in Stochastic Problems by Regression Method

DEFF Research Database (Denmark)

Ditlevsen, Ove Dalager

1996-01-01

The chapter mainly deals with dimension reduction and field discretizations based directly on the concept of linear regression. Several examples of interesting applications in stochastic mechanics are also given.Keywords: Random fields discretization, Linear regression, Stochastic interpolation, ...
Stochastic development regression using method of moments

DEFF Research Database (Denmark)

Kühnel, Line; Sommer, Stefan Horst

2017-01-01

This paper considers the estimation problem arising when inferring parameters in the stochastic development regression model for manifold valued non-linear data. Stochastic development regression captures the relation between manifold-valued response and Euclidean covariate variables using...... the stochastic development construction. It is thereby able to incorporate several covariate variables and random effects. The model is intrinsically defined using the connection of the manifold, and the use of stochastic development avoids linearizing the geometry. We propose to infer parameters using...... the Method of Moments procedure that matches known constraints on moments of the observations conditional on the latent variables. The performance of the model is investigated in a simulation example using data on finite dimensional landmark manifolds....
Penalized estimation for competing risks regression with applications to high-dimensional covariates

DEFF Research Database (Denmark)

Ambrogi, Federico; Scheike, Thomas H.

2016-01-01

of competing events. The direct binomial regression model of Scheike and others (2008. Predicting cumulative incidence probability by direct binomial regression. Biometrika 95: (1), 205-220) is reformulated in a penalized framework to possibly fit a sparse regression model. The developed approach is easily...... Research 19: (1), 29-51), the research regarding competing risks is less developed (Binder and others, 2009. Boosting for high-dimensional time-to-event data with competing risks. Bioinformatics 25: (7), 890-896). The aim of this work is to consider how to do penalized regression in the presence...... implementable using existing high-performance software to do penalized regression. Results from simulation studies are presented together with an application to genomic data when the endpoint is progression-free survival. An R function is provided to perform regularized competing risks regression according...
Use of multiple linear regression and logistic regression models to investigate changes in birthweight for term singleton infants in Scotland.

Science.gov (United States)

Bonellie, Sandra R

2012-10-01

To illustrate the use of regression and logistic regression models to investigate changes over time in size of babies particularly in relation to social deprivation, age of the mother and smoking. Mean birthweight has been found to be increasing in many countries in recent years, but there are still a group of babies who are born with low birthweights. Population-based retrospective cohort study. Multiple linear regression and logistic regression models are used to analyse data on term 'singleton births' from Scottish hospitals between 1994-2003. Mothers who smoke are shown to give birth to lighter babies on average, a difference of approximately 0.57 Standard deviations lower (95% confidence interval. 0.55-0.58) when adjusted for sex and parity. These mothers are also more likely to have babies that are low birthweight (odds ratio 3.46, 95% confidence interval 3.30-3.63) compared with non-smokers. Low birthweight is 30% more likely where the mother lives in the most deprived areas compared with the least deprived, (odds ratio 1.30, 95% confidence interval 1.21-1.40). Smoking during pregnancy is shown to have a detrimental effect on the size of infants at birth. This effect explains some, though not all, of the observed socioeconomic birthweight. It also explains much of the observed birthweight differences by the age of the mother. Identifying mothers at greater risk of having a low birthweight baby as important implications for the care and advice this group receives. © 2012 Blackwell Publishing Ltd.
On two flexible methods of 2-dimensional regression analysis

Czech Academy of Sciences Publication Activity Database

Volf, Petr

2012-01-01

Roč. 18, č. 4 (2012), s. 154-164 ISSN 1803-9782 Grant - others:GA ČR(CZ) GAP209/10/2045 Institutional support: RVO:67985556 Keywords : regression analysis * Gordon surface * prediction error * projection pursuit Subject RIV: BB - Applied Statistics, Operational Research http://library.utia.cas.cz/separaty/2013/SI/volf-on two flexible methods of 2-dimensional regression analysis.pdf
Time course for tail regression during metamorphosis of the ascidian Ciona intestinalis.

Science.gov (United States)

Matsunobu, Shohei; Sasakura, Yasunori

2015-09-01

In most ascidians, the tadpole-like swimming larvae dramatically change their body-plans during metamorphosis and develop into sessile adults. The mechanisms of ascidian metamorphosis have been researched and debated for many years. Until now information on the detailed time course of the initiation and completion of each metamorphic event has not been described. One dramatic and important event in ascidian metamorphosis is tail regression, in which ascidian larvae lose their tails to adjust themselves to sessile life. In the present study, we measured the time associated with tail regression in the ascidian Ciona intestinalis. Larvae are thought to acquire competency for each metamorphic event in certain developmental periods. We show that the timing with which the competence for tail regression is acquired is determined by the time since hatching, and this timing is not affected by the timing of post-hatching events such as adhesion. Because larvae need to adhere to substrates with their papillae to induce tail regression, we measured the duration for which larvae need to remain adhered in order to initiate tail regression and the time needed for the tail to regress. Larvae acquire the ability to adhere to substrates before they acquire tail regression competence. We found that when larvae adhered before they acquired tail regression competence, they were able to remember the experience of adhesion until they acquired the ability to undergo tail regression. The time course of the events associated with tail regression provides a valuable reference, upon which the cellular and molecular mechanisms of ascidian metamorphosis can be elucidated. Copyright © 2015 Elsevier Inc. All rights reserved.
Group-wise partial least square regression

NARCIS (Netherlands)

Camacho, José; Saccenti, Edoardo

2018-01-01

This paper introduces the group-wise partial least squares (GPLS) regression. GPLS is a new sparse PLS technique where the sparsity structure is defined in terms of groups of correlated variables, similarly to what is done in the related group-wise principal component analysis. These groups are
Function approximation with polynomial regression slines

International Nuclear Information System (INIS)

Urbanski, P.

1996-01-01

Principles of the polynomial regression splines as well as algorithms and programs for their computation are presented. The programs prepared using software package MATLAB are generally intended for approximation of the X-ray spectra and can be applied in the multivariate calibration of radiometric gauges. (author)
Finite Algorithms for Robust Linear Regression

DEFF Research Database (Denmark)

Madsen, Kaj; Nielsen, Hans Bruun

1990-01-01

The Huber M-estimator for robust linear regression is analyzed. Newton type methods for solution of the problem are defined and analyzed, and finite convergence is proved. Numerical experiments with a large number of test problems demonstrate efficiency and indicate that this kind of approach may...
Detecting overdispersion in count data: A zero-inflated Poisson regression analysis

Science.gov (United States)

Afiqah Muhamad Jamil, Siti; Asrul Affendi Abdullah, M.; Kek, Sie Long; Nor, Maria Elena; Mohamed, Maryati; Ismail, Norradihah

2017-09-01

This study focusing on analysing count data of butterflies communities in Jasin, Melaka. In analysing count dependent variable, the Poisson regression model has been known as a benchmark model for regression analysis. Continuing from the previous literature that used Poisson regression analysis, this study comprising the used of zero-inflated Poisson (ZIP) regression analysis to gain acute precision on analysing the count data of butterfly communities in Jasin, Melaka. On the other hands, Poisson regression should be abandoned in the favour of count data models, which are capable of taking into account the extra zeros explicitly. By far, one of the most popular models include ZIP regression model. The data of butterfly communities which had been called as the number of subjects in this study had been taken in Jasin, Melaka and consisted of 131 number of subjects visits Jasin, Melaka. Since the researchers are considering the number of subjects, this data set consists of five families of butterfly and represent the five variables involve in the analysis which are the types of subjects. Besides, the analysis of ZIP used the SAS procedure of overdispersion in analysing zeros value and the main purpose of continuing the previous study is to compare which models would be better than when exists zero values for the observation of the count data. The analysis used AIC, BIC and Voung test of 5% level significance in order to achieve the objectives. The finding indicates that there is a presence of over-dispersion in analysing zero value. The ZIP regression model is better than Poisson regression model when zero values exist.
Regression algorithm for emotion detection

OpenAIRE

Berthelon , Franck; Sander , Peter

2013-01-01

International audience; We present here two components of a computational system for emotion detection. PEMs (Personalized Emotion Maps) store links between bodily expressions and emotion values, and are individually calibrated to capture each person's emotion profile. They are an implementation based on aspects of Scherer's theoretical complex system model of emotion~\\cite{scherer00, scherer09}. We also present a regression algorithm that determines a person's emotional feeling from sensor m...
Multivariate and semiparametric kernel regression

OpenAIRE

Härdle, Wolfgang; Müller, Marlene

1997-01-01

The paper gives an introduction to theory and application of multivariate and semiparametric kernel smoothing. Multivariate nonparametric density estimation is an often used pilot tool for examining the structure of data. Regression smoothing helps in investigating the association between covariates and responses. We concentrate on kernel smoothing using local polynomial fitting which includes the Nadaraya-Watson estimator. Some theory on the asymptotic behavior and bandwidth selection is pro...
Application of principal component regression and partial least squares regression in ultraviolet spectrum water quality detection

Science.gov (United States)

Li, Jiangtong; Luo, Yongdao; Dai, Honglin

2018-01-01

Water is the source of life and the essential foundation of all life. With the development of industrialization, the phenomenon of water pollution is becoming more and more frequent, which directly affects the survival and development of human. Water quality detection is one of the necessary measures to protect water resources. Ultraviolet (UV) spectral analysis is an important research method in the field of water quality detection, which partial least squares regression (PLSR) analysis method is becoming predominant technology, however, in some special cases, PLSR's analysis produce considerable errors. In order to solve this problem, the traditional principal component regression (PCR) analysis method was improved by using the principle of PLSR in this paper. The experimental results show that for some special experimental data set, improved PCR analysis method performance is better than PLSR. The PCR and PLSR is the focus of this paper. Firstly, the principal component analysis (PCA) is performed by MATLAB to reduce the dimensionality of the spectral data; on the basis of a large number of experiments, the optimized principal component is extracted by using the principle of PLSR, which carries most of the original data information. Secondly, the linear regression analysis of the principal component is carried out with statistic package for social science (SPSS), which the coefficients and relations of principal components can be obtained. Finally, calculating a same water spectral data set by PLSR and improved PCR, analyzing and comparing two results, improved PCR and PLSR is similar for most data, but improved PCR is better than PLSR for data near the detection limit. Both PLSR and improved PCR can be used in Ultraviolet spectral analysis of water, but for data near the detection limit, improved PCR's result better than PLSR.
Controlling attribute effect in linear regression

KAUST Repository

Calders, Toon; Karim, Asim A.; Kamiran, Faisal; Ali, Wasif Mohammad; Zhang, Xiangliang

2013-01-01

In data mining we often have to learn from biased data, because, for instance, data comes from different batches or there was a gender or racial bias in the collection of social data. In some applications it may be necessary to explicitly control this bias in the models we learn from the data. This paper is the first to study learning linear regression models under constraints that control the biasing effect of a given attribute such as gender or batch number. We show how propensity modeling can be used for factoring out the part of the bias that can be justified by externally provided explanatory attributes. Then we analytically derive linear models that minimize squared error while controlling the bias by imposing constraints on the mean outcome or residuals of the models. Experiments with discrimination-aware crime prediction and batch effect normalization tasks show that the proposed techniques are successful in controlling attribute effects in linear regression models. © 2013 IEEE.
Controlling attribute effect in linear regression

KAUST Repository

Calders, Toon

2013-12-01

In data mining we often have to learn from biased data, because, for instance, data comes from different batches or there was a gender or racial bias in the collection of social data. In some applications it may be necessary to explicitly control this bias in the models we learn from the data. This paper is the first to study learning linear regression models under constraints that control the biasing effect of a given attribute such as gender or batch number. We show how propensity modeling can be used for factoring out the part of the bias that can be justified by externally provided explanatory attributes. Then we analytically derive linear models that minimize squared error while controlling the bias by imposing constraints on the mean outcome or residuals of the models. Experiments with discrimination-aware crime prediction and batch effect normalization tasks show that the proposed techniques are successful in controlling attribute effects in linear regression models. © 2013 IEEE.
Ordinary Least Squares and Quantile Regression: An Inquiry-Based Learning Approach to a Comparison of Regression Methods

Science.gov (United States)

Helmreich, James E.; Krog, K. Peter

2018-01-01

We present a short, inquiry-based learning course on concepts and methods underlying ordinary least squares (OLS), least absolute deviation (LAD), and quantile regression (QR). Students investigate squared, absolute, and weighted absolute distance functions (metrics) as location measures. Using differential calculus and properties of convex…
Identifying Interacting Genetic Variations by Fish-Swarm Logic Regression

Science.gov (United States)

Yang, Aiyuan; Yan, Chunxia; Zhu, Feng; Zhao, Zhongmeng; Cao, Zhi

2013-01-01

Understanding associations between genotypes and complex traits is a fundamental problem in human genetics. A major open problem in mapping phenotypes is that of identifying a set of interacting genetic variants, which might contribute to complex traits. Logic regression (LR) is a powerful multivariant association tool. Several LR-based approaches have been successfully applied to different datasets. However, these approaches are not adequate with regard to accuracy and efficiency. In this paper, we propose a new LR-based approach, called fish-swarm logic regression (FSLR), which improves the logic regression process by incorporating swarm optimization. In our approach, a school of fish agents are conducted in parallel. Each fish agent holds a regression model, while the school searches for better models through various preset behaviors. A swarm algorithm improves the accuracy and the efficiency by speeding up the convergence and preventing it from dropping into local optimums. We apply our approach on a real screening dataset and a series of simulation scenarios. Compared to three existing LR-based approaches, our approach outperforms them by having lower type I and type II error rates, being able to identify more preset causal sites, and performing at faster speeds. PMID:23984382

SPLINE LINEAR REGRESSION USED FOR EVALUATING FINANCIAL ASSETS 1

Directory of Open Access Journals (Sweden)

Liviu GEAMBAŞU

2010-12-01

Full Text Available One of the most important preoccupations of financial markets participants was and still is the problem of determining more precise the trend of financial assets prices. For solving this problem there were written many scientific papers and were developed many mathematical and statistical models in order to better determine the financial assets price trend. If until recently the simple linear models were largely used due to their facile utilization, the financial crises that affected the world economy starting with 2008 highlight the necessity of adapting the mathematical models to variation of economy. A simple to use model but adapted to economic life realities is the spline linear regression. This type of regression keeps the continuity of regression function, but split the studied data in intervals with homogenous characteristics. The characteristics of each interval are highlighted and also the evolution of market over all the intervals, resulting reduced standard errors. The first objective of the article is the theoretical presentation of the spline linear regression, also referring to scientific national and international papers related to this subject. The second objective is applying the theoretical model to data from the Bucharest Stock Exchange
Identifying Interacting Genetic Variations by Fish-Swarm Logic Regression

Directory of Open Access Journals (Sweden)

Xuanping Zhang

2013-01-01

Full Text Available Understanding associations between genotypes and complex traits is a fundamental problem in human genetics. A major open problem in mapping phenotypes is that of identifying a set of interacting genetic variants, which might contribute to complex traits. Logic regression (LR is a powerful multivariant association tool. Several LR-based approaches have been successfully applied to different datasets. However, these approaches are not adequate with regard to accuracy and efficiency. In this paper, we propose a new LR-based approach, called fish-swarm logic regression (FSLR, which improves the logic regression process by incorporating swarm optimization. In our approach, a school of fish agents are conducted in parallel. Each fish agent holds a regression model, while the school searches for better models through various preset behaviors. A swarm algorithm improves the accuracy and the efficiency by speeding up the convergence and preventing it from dropping into local optimums. We apply our approach on a real screening dataset and a series of simulation scenarios. Compared to three existing LR-based approaches, our approach outperforms them by having lower type I and type II error rates, being able to identify more preset causal sites, and performing at faster speeds.
Semiparametric Mixtures of Regressions with Single-index for Model Based Clustering

OpenAIRE

Xiang, Sijia; Yao, Weixin

2017-01-01

In this article, we propose two classes of semiparametric mixture regression models with single-index for model based clustering. Unlike many semiparametric/nonparametric mixture regression models that can only be applied to low dimensional predictors, the new semiparametric models can easily incorporate high dimensional predictors into the nonparametric components. The proposed models are very general, and many of the recently proposed semiparametric/nonparametric mixture regression models a...
A comparison of regression algorithms for wind speed forecasting at Alexander Bay

CSIR Research Space (South Africa)

Botha, Nicolene

2016-12-01

Full Text Available to forecast 1 to 24 hours ahead, in hourly intervals. Predictions are performed on a wind speed time series with three machine learning regression algorithms, namely support vector regression, ordinary least squares and Bayesian ridge regression. The resulting...
Statistical analysis of sediment toxicity by additive monotone regression splines

NARCIS (Netherlands)

Boer, de W.J.; Besten, den P.J.; Braak, ter C.J.F.

2002-01-01

Modeling nonlinearity and thresholds in dose-effect relations is a major challenge, particularly in noisy data sets. Here we show the utility of nonlinear regression with additive monotone regression splines. These splines lead almost automatically to the estimation of thresholds. We applied this
Prediction accuracy and stability of regression with optimal scaling transformations

NARCIS (Netherlands)

Kooij, van der Anita J.

2007-01-01

The central topic of this thesis is the CATREG approach to nonlinear regression. This approach finds optimal quantifications for categorical variables and/or nonlinear transformations for numerical variables in regression analysis. (CATREG is implemented in SPSS Categories by the author of the
Ajuste de modelos de platô de resposta via regressão isotônica Response plateau models fitting via isotonic regression

Directory of Open Access Journals (Sweden)

Renata Pires Gonçalves

2012-02-01

Full Text Available Dentro do contexto nutricional, a suplementação de microminerais em rações para aves frequentemente é feita em quantidades superiores às exigidas na tentativa de assegurar o bom desempenho dos animais. Os experimentos do tipo dose resposta são muito comuns na determinação de níveis ótimos dos nutrientes na ração e contemplam a utilização de modelos de regressão para atingir tal objetivo. Porém, na análise de regressão usual, geralmente, não se usa uma informação a priori sobre uma possível relação de ordem na variável resposta. A regressão isotônica é um método de estimação por mínimos quadrados que gera estimativas que satisfazem a mesma ordenação dos dados. Na teoria da regressão isotônica, essa informação é utilizada de forma essencial e espera-se que a eficiência do ajuste seja aumentada quando se faz uso dela. Diante do exposto, o presente trabalho tem como objetivo utilizar uma metodologia de regressão isotônica, como uma forma alternativa para analisar dados de deposição de zinco (Zn na tíbia de aves machos da linhagem Hubbard. No estudo, foram considerados os modelos de platô de resposta polinomial quadrático e não linear exponencial. Além desses modelos, também foi proposto o ajuste de um modelo logarítmico para os dados e a eficiência da metodologia foi avaliada por meio de um estudo de simulação Monte Carlo, considerando diferentes cenários para os valores paramétricos. A isotonização dos dados propiciou uma melhora em todos os avaliadores de qualidade de ajuste considerados no trabalho. Dentre os modelos utilizados, o logarítmico apresentou estimativas dos parâmetros mais coerentes com os valores relatados na literatura, para os dados de deposição de Zn na tíbia de aves machos.Within the nutritional context, the supplementation of microminerals in bird food is often made in quantities exceeding those required in the attempt to ensure the proper performance of the animals
Estimating the exceedance probability of rain rate by logistic regression

Science.gov (United States)

Chiu, Long S.; Kedem, Benjamin

1990-01-01

Recent studies have shown that the fraction of an area with rain intensity above a fixed threshold is highly correlated with the area-averaged rain rate. To estimate the fractional rainy area, a logistic regression model, which estimates the conditional probability that rain rate over an area exceeds a fixed threshold given the values of related covariates, is developed. The problem of dependency in the data in the estimation procedure is bypassed by the method of partial likelihood. Analyses of simulated scanning multichannel microwave radiometer and observed electrically scanning microwave radiometer data during the Global Atlantic Tropical Experiment period show that the use of logistic regression in pixel classification is superior to multiple regression in predicting whether rain rate at each pixel exceeds a given threshold, even in the presence of noisy data. The potential of the logistic regression technique in satellite rain rate estimation is discussed.
Virtual machine consolidation enhancement using hybrid regression algorithms

Directory of Open Access Journals (Sweden)

Amany Abdelsamea

2017-11-01

Full Text Available Cloud computing data centers are growing rapidly in both number and capacity to meet the increasing demands for highly-responsive computing and massive storage. Such data centers consume enormous amounts of electrical energy resulting in high operating costs and carbon dioxide emissions. The reason for this extremely high energy consumption is not just the quantity of computing resources and the power inefficiency of hardware, but rather lies in the inefficient usage of these resources. VM consolidation involves live migration of VMs hence the capability of transferring a VM between physical servers with a close to zero down time. It is an effective way to improve the utilization of resources and increase energy efficiency in cloud data centers. VM consolidation consists of host overload/underload detection, VM selection and VM placement. Most of the current VM consolidation approaches apply either heuristic-based techniques, such as static utilization thresholds, decision-making based on statistical analysis of historical data; or simply periodic adaptation of the VM allocation. Most of those algorithms rely on CPU utilization only for host overload detection. In this paper we propose using hybrid factors to enhance VM consolidation. Specifically we developed a multiple regression algorithm that uses CPU utilization, memory utilization and bandwidth utilization for host overload detection. The proposed algorithm, Multiple Regression Host Overload Detection (MRHOD, significantly reduces energy consumption while ensuring a high level of adherence to Service Level Agreements (SLA since it gives a real indication of host utilization based on three parameters (CPU, Memory, Bandwidth utilizations instead of one parameter only (CPU utilization. Through simulations we show that our approach reduces power consumption by 6 times compared to single factor algorithms using random workload. Also using PlanetLab workload traces we show that MRHOD improves
Variable Selection for Regression Models of Percentile Flows

Science.gov (United States)

Fouad, G.

2017-12-01

Percentile flows describe the flow magnitude equaled or exceeded for a given percent of time, and are widely used in water resource management. However, these statistics are normally unavailable since most basins are ungauged. Percentile flows of ungauged basins are often predicted using regression models based on readily observable basin characteristics, such as mean elevation. The number of these independent variables is too large to evaluate all possible models. A subset of models is typically evaluated using automatic procedures, like stepwise regression. This ignores a large variety of methods from the field of feature (variable) selection and physical understanding of percentile flows. A study of 918 basins in the United States was conducted to compare an automatic regression procedure to the following variable selection methods: (1) principal component analysis, (2) correlation analysis, (3) random forests, (4) genetic programming, (5) Bayesian networks, and (6) physical understanding. The automatic regression procedure only performed better than principal component analysis. Poor performance of the regression procedure was due to a commonly used filter for multicollinearity, which rejected the strongest models because they had cross-correlated independent variables. Multicollinearity did not decrease model performance in validation because of a representative set of calibration basins. Variable selection methods based strictly on predictive power (numbers 2-5 from above) performed similarly, likely indicating a limit to the predictive power of the variables. Similar performance was also reached using variables selected based on physical understanding, a finding that substantiates recent calls to emphasize physical understanding in modeling for predictions in ungauged basins. The strongest variables highlighted the importance of geology and land cover, whereas widely used topographic variables were the weakest predictors. Variables suffered from a high
Method for nonlinear exponential regression analysis

Science.gov (United States)

Junkin, B. G.

1972-01-01

Two computer programs developed according to two general types of exponential models for conducting nonlinear exponential regression analysis are described. Least squares procedure is used in which the nonlinear problem is linearized by expanding in a Taylor series. Program is written in FORTRAN 5 for the Univac 1108 computer.
Quantum algorithm for linear regression

Science.gov (United States)

Wang, Guoming

2017-07-01

We present a quantum algorithm for fitting a linear regression model to a given data set using the least-squares approach. Differently from previous algorithms which yield a quantum state encoding the optimal parameters, our algorithm outputs these numbers in the classical form. So by running it once, one completely determines the fitted model and then can use it to make predictions on new data at little cost. Moreover, our algorithm works in the standard oracle model, and can handle data sets with nonsparse design matrices. It runs in time poly( log2(N ) ,d ,κ ,1 /ɛ ) , where N is the size of the data set, d is the number of adjustable parameters, κ is the condition number of the design matrix, and ɛ is the desired precision in the output. We also show that the polynomial dependence on d and κ is necessary. Thus, our algorithm cannot be significantly improved. Furthermore, we also give a quantum algorithm that estimates the quality of the least-squares fit (without computing its parameters explicitly). This algorithm runs faster than the one for finding this fit, and can be used to check whether the given data set qualifies for linear regression in the first place.
Spontaneous regression of metastases from malignant melanoma: a case report

DEFF Research Database (Denmark)

Kalialis, Louise V; Drzewiecki, Krzysztof T; Mohammadi, Mahin

2008-01-01

A case of a 61-year-old male with widespread metastatic melanoma is presented 5 years after complete spontaneous cure. Spontaneous regression occurred in cutaneous, pulmonary, hepatic and cerebral metastases. A review of the literature reveals seven cases of regression of cerebral metastases; thi...
The Offlap Break Position Vs Sea Level: A Discussion

Science.gov (United States)

Tropeano, M.; Pieri, P.; Pomar, L.; Sabato, L.

the offlap break might cause a misinterpretation of the ancient sea-level positions and the inferred relative sea-level changes. 2) both baselevels, the sea level and the wave/tide base, govern sedimentary accumulation in wave/tide dominated shelves and, consequently, two offlap breaks may coexist (beach edge and shoreface edge) in shallow-marine depositional profiles (Carter et al., 1991). In this setting, two seaward-clinobedded lithosomes, separated by an unconformity, may develop during relative still-stand or falls of the sea-level (Hill et al., 1998). In this case, the two stacked lithosomes could be misinterpreted as two different systems tracts, or sequences, and it could led to the construction of an 1 uncorrect curve of sea-level changes. Carter R.M., Abbott S.T., Fulthorpe C.S., Haywick D.W. and Henderson R.A. (1991): Application of global sea-level and sequence-stratigraphic models in Southern Hemi- sphere Neogene strata from New Zealand. Sp. Publ. IAS, 12, 41-65. Hernández- Molina F.J., Fernández-Salas L.M., Lobo F., Somoza L., Diaz-del-Rio V. and Alver- inho Dias J.M. (2000): The infralittoral prograding wedge: a new large-scale prograda- tional sedimentary body in shallow marine environments. Geo-Marine Letters, 20, 109-117. Hill P.R., Longuépée H. and Roberge M. (1998). Live from Canada: forced regression in action; deltaic shoreface sandbodies being formed. Abstracts, 15th Int. Cong. IAS, Alicante (Spain), 427-428. Pomar L. and Tropeano M. (2001). The Cal- carenite di Gravina Formation in Matera (southern Italy): new insights for coarse- grained, large-scale, cross-bedded bodies encased in offshore deposits. AAPG Bull., 85, 661-689. 2
Design and analysis of experiments classical and regression approaches with SAS

CERN Document Server

Onyiah, Leonard C

2008-01-01

Introductory Statistical Inference and Regression Analysis Elementary Statistical Inference Regression Analysis Experiments, the Completely Randomized Design (CRD)-Classical and Regression Approaches Experiments Experiments to Compare Treatments Some Basic Ideas Requirements of a Good Experiment One-Way Experimental Layout or the CRD: Design and Analysis Analysis of Experimental Data (Fixed Effects Model) Expected Values for the Sums of Squares The Analysis of Variance (ANOVA) Table Follow-Up Analysis to Check fo
Clinical Considerations Regarding Regression in Psychotherapy with Patients with Conversion Disorder.

Science.gov (United States)

Kaplan, Marcia

Regression is a ubiquitous phenomenon in psychodynamic psychotherapy and psychoanalysis, typically part of a reorganization that leads to progression, at least with respect to recruiting elements in the unconscious to consciousness. Regression in patients with conversion disorder (i.e., pseudo-neurological symptoms without an organic basis) is often itself somatic/physical rather than psychic in nature. Psychotherapists working with these patients must be prepared for confusing or frightening forms of regression that should be expected as part of the therapeutic process. In conversion disorder patients with adequate character structure, this regression, when handled effectively by the psychotherapist, ultimately leads to verbalized thoughts and feelings and a gradually strengthening alternative to physically experienced psychic conflict.
A Comparison of Advanced Regression Algorithms for Quantifying Urban Land Cover

Directory of Open Access Journals (Sweden)

Akpona Okujeni

2014-07-01

Full Text Available Quantitative methods for mapping sub-pixel land cover fractions are gaining increasing attention, particularly with regard to upcoming hyperspectral satellite missions. We evaluated five advanced regression algorithms combined with synthetically mixed training data for quantifying urban land cover from HyMap data at 3.6 and 9 m spatial resolution. Methods included support vector regression (SVR, kernel ridge regression (KRR, artificial neural networks (NN, random forest regression (RFR and partial least squares regression (PLSR. Our experiments demonstrate that both kernel methods SVR and KRR yield high accuracies for mapping complex urban surface types, i.e., rooftops, pavements, grass- and tree-covered areas. SVR and KRR models proved to be stable with regard to the spatial and spectral differences between both images and effectively utilized the higher complexity of the synthetic training mixtures for improving estimates for coarser resolution data. Observed deficiencies mainly relate to known problems arising from spectral similarities or shadowing. The remaining regressors either revealed erratic (NN or limited (RFR and PLSR performances when comprehensively mapping urban land cover. Our findings suggest that the combination of kernel-based regression methods, such as SVR and KRR, with synthetically mixed training data is well suited for quantifying urban land cover from imaging spectrometer data at multiple scales.
A Solution to Separation and Multicollinearity in Multiple Logistic Regression.

Science.gov (United States)

Shen, Jianzhao; Gao, Sujuan

2008-10-01

In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.
Modeling oil production based on symbolic regression

International Nuclear Information System (INIS)

Yang, Guangfei; Li, Xianneng; Wang, Jianliang; Lian, Lian; Ma, Tieju

2015-01-01

Numerous models have been proposed to forecast the future trends of oil production and almost all of them are based on some predefined assumptions with various uncertainties. In this study, we propose a novel data-driven approach that uses symbolic regression to model oil production. We validate our approach on both synthetic and real data, and the results prove that symbolic regression could effectively identify the true models beneath the oil production data and also make reliable predictions. Symbolic regression indicates that world oil production will peak in 2021, which broadly agrees with other techniques used by researchers. Our results also show that the rate of decline after the peak is almost half the rate of increase before the peak, and it takes nearly 12 years to drop 4% from the peak. These predictions are more optimistic than those in several other reports, and the smoother decline will provide the world, especially the developing countries, with more time to orchestrate mitigation plans. -- Highlights: •A data-driven approach has been shown to be effective at modeling the oil production. •The Hubbert model could be discovered automatically from data. •The peak of world oil production is predicted to appear in 2021. •The decline rate after peak is half of the increase rate before peak. •Oil production projected to decline 4% post-peak
Face Alignment via Regressing Local Binary Features.

Science.gov (United States)

Ren, Shaoqing; Cao, Xudong; Wei, Yichen; Sun, Jian

2016-03-01

This paper presents a highly efficient and accurate regression approach for face alignment. Our approach has two novel components: 1) a set of local binary features and 2) a locality principle for learning those features. The locality principle guides us to learn a set of highly discriminative local binary features for each facial landmark independently. The obtained local binary features are used to jointly learn a linear regression for the final output. This approach achieves the state-of-the-art results when tested on the most challenging benchmarks to date. Furthermore, because extracting and regressing local binary features are computationally very cheap, our system is much faster than previous methods. It achieves over 3000 frames per second (FPS) on a desktop or 300 FPS on a mobile phone for locating a few dozens of landmarks. We also study a key issue that is important but has received little attention in the previous research, which is the face detector used to initialize alignment. We investigate several face detectors and perform quantitative evaluation on how they affect alignment accuracy. We find that an alignment friendly detector can further greatly boost the accuracy of our alignment method, reducing the error up to 16% relatively. To facilitate practical usage of face detection/alignment methods, we also propose a convenient metric to measure how good a detector is for alignment initialization.

On logistic regression analysis of dichotomized responses.

Science.gov (United States)

Lu, Kaifeng

2017-01-01

We study the properties of treatment effect estimate in terms of odds ratio at the study end point from logistic regression model adjusting for the baseline value when the underlying continuous repeated measurements follow a multivariate normal distribution. Compared with the analysis that does not adjust for the baseline value, the adjusted analysis produces a larger treatment effect as well as a larger standard error. However, the increase in standard error is more than offset by the increase in treatment effect so that the adjusted analysis is more powerful than the unadjusted analysis for detecting the treatment effect. On the other hand, the true adjusted odds ratio implied by the normal distribution of the underlying continuous variable is a function of the baseline value and hence is unlikely to be able to be adequately represented by a single value of adjusted odds ratio from the logistic regression model. In contrast, the risk difference function derived from the logistic regression model provides a reasonable approximation to the true risk difference function implied by the normal distribution of the underlying continuous variable over the range of the baseline distribution. We show that different metrics of treatment effect have similar statistical power when evaluated at the baseline mean. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Cointegrating MiDaS Regressions and a MiDaS Test

OpenAIRE

J. Isaac Miller

2011-01-01

This paper introduces cointegrating mixed data sampling (CoMiDaS) regressions, generalizing nonlinear MiDaS regressions in the extant literature. Under a linear mixed-frequency data-generating process, MiDaS regressions provide a parsimoniously parameterized nonlinear alternative when the linear forecasting model is over-parameterized and may be infeasible. In spite of potential correlation of the error term both serially and with the regressors, I find that nonlinear least squares consistent...
Who Will Win?: Predicting the Presidential Election Using Linear Regression

Science.gov (United States)

Lamb, John H.

2007-01-01

This article outlines a linear regression activity that engages learners, uses technology, and fosters cooperation. Students generated least-squares linear regression equations using TI-83 Plus[TM] graphing calculators, Microsoft[C] Excel, and paper-and-pencil calculations using derived normal equations to predict the 2004 presidential election.…
Time series regression model for infectious disease and weather.

Science.gov (United States)

Imai, Chisato; Armstrong, Ben; Chalabi, Zaid; Mangtani, Punam; Hashizume, Masahiro

2015-10-01

Time series regression has been developed and long used to evaluate the short-term associations of air pollution and weather with mortality or morbidity of non-infectious diseases. The application of the regression approaches from this tradition to infectious diseases, however, is less well explored and raises some new issues. We discuss and present potential solutions for five issues often arising in such analyses: changes in immune population, strong autocorrelations, a wide range of plausible lag structures and association patterns, seasonality adjustments, and large overdispersion. The potential approaches are illustrated with datasets of cholera cases and rainfall from Bangladesh and influenza and temperature in Tokyo. Though this article focuses on the application of the traditional time series regression to infectious diseases and weather factors, we also briefly introduce alternative approaches, including mathematical modeling, wavelet analysis, and autoregressive integrated moving average (ARIMA) models. Modifications proposed to standard time series regression practice include using sums of past cases as proxies for the immune population, and using the logarithm of lagged disease counts to control autocorrelation due to true contagion, both of which are motivated from "susceptible-infectious-recovered" (SIR) models. The complexity of lag structures and association patterns can often be informed by biological mechanisms and explored by using distributed lag non-linear models. For overdispersed models, alternative distribution models such as quasi-Poisson and negative binomial should be considered. Time series regression can be used to investigate dependence of infectious diseases on weather, but may need modifying to allow for features specific to this context. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
The crux of the method: assumptions in ordinary least squares and logistic regression.

Science.gov (United States)

Long, Rebecca G

2008-10-01

Logistic regression has increasingly become the tool of choice when analyzing data with a binary dependent variable. While resources relating to the technique are widely available, clear discussions of why logistic regression should be used in place of ordinary least squares regression are difficult to find. The current paper compares and contrasts the assumptions of ordinary least squares with those of logistic regression and explains why logistic regression's looser assumptions make it adept at handling violations of the more important assumptions in ordinary least squares.
Steganalysis using logistic regression

Science.gov (United States)

Lubenko, Ivans; Ker, Andrew D.

2011-02-01

We advocate Logistic Regression (LR) as an alternative to the Support Vector Machine (SVM) classifiers commonly used in steganalysis. LR offers more information than traditional SVM methods - it estimates class probabilities as well as providing a simple classification - and can be adapted more easily and efficiently for multiclass problems. Like SVM, LR can be kernelised for nonlinear classification, and it shows comparable classification accuracy to SVM methods. This work is a case study, comparing accuracy and speed of SVM and LR classifiers in detection of LSB Matching and other related spatial-domain image steganography, through the state-of-art 686-dimensional SPAM feature set, in three image sets.
SEPARATION PHENOMENA LOGISTIC REGRESSION

Directory of Open Access Journals (Sweden)

Ikaro Daniel de Carvalho Barreto

2014-03-01

Full Text Available This paper proposes an application of concepts about the maximum likelihood estimation of the binomial logistic regression model to the separation phenomena. It generates bias in the estimation and provides different interpretations of the estimates on the different statistical tests (Wald, Likelihood Ratio and Score and provides different estimates on the different iterative methods (Newton-Raphson and Fisher Score. It also presents an example that demonstrates the direct implications for the validation of the model and validation of variables, the implications for estimates of odds ratios and confidence intervals, generated from the Wald statistics. Furthermore, we present, briefly, the Firth correction to circumvent the phenomena of separation.
Detection of Outliers in Regression Model for Medical Data

Directory of Open Access Journals (Sweden)

Stephen Raj S

2017-07-01

Full Text Available In regression analysis, an outlier is an observation for which the residual is large in magnitude compared to other observations in the data set. The detection of outliers and influential points is an important step of the regression analysis. Outlier detection methods have been used to detect and remove anomalous values from data. In this paper, we detect the presence of outliers in simple linear regression models for medical data set. Chatterjee and Hadi mentioned that the ordinary residuals are not appropriate for diagnostic purposes; a transformed version of them is preferable. First, we investigate the presence of outliers based on existing procedures of residuals and standardized residuals. Next, we have used the new approach of standardized scores for detecting outliers without the use of predicted values. The performance of the new approach was verified with the real-life data.
Tools to support interpreting multiple regression in the face of multicollinearity.

Science.gov (United States)

Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K

2012-01-01

While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses.
Testing and Modeling Fuel Regression Rate in a Miniature Hybrid Burner

Directory of Open Access Journals (Sweden)

Luciano Fanton

2012-01-01

Full Text Available Ballistic characterization of an extended group of innovative HTPB-based solid fuel formulations for hybrid rocket propulsion was performed in a lab-scale burner. An optical time-resolved technique was used to assess the quasisteady regression history of single perforation, cylindrical samples. The effects of metalized additives and radiant heat transfer on the regression rate of such formulations were assessed. Under the investigated operating conditions and based on phenomenological models from the literature, analyses of the collected experimental data show an appreciable influence of the radiant heat flux from burnt gases and soot for both unloaded and loaded fuel formulations. Pure HTPB regression rate data are satisfactorily reproduced, while the impressive initial regression rates of metalized formulations require further assessment.
Teaching the Concept of Breakdown Point in Simple Linear Regression.

Science.gov (United States)

Chan, Wai-Sum

2001-01-01

Most introductory textbooks on simple linear regression analysis mention the fact that extreme data points have a great influence on ordinary least-squares regression estimation; however, not many textbooks provide a rigorous mathematical explanation of this phenomenon. Suggests a way to fill this gap by teaching students the concept of breakdown…
Multiple regression analysis of Jominy hardenability data for boron treated steels

International Nuclear Information System (INIS)

Komenda, J.; Sandstroem, R.; Tukiainen, M.

1997-01-01

The relations between chemical composition and their hardenability of boron treated steels have been investigated using a multiple regression analysis method. A linear model of regression was chosen. The free boron content that is effective for the hardenability was calculated using a model proposed by Jansson. The regression analysis for 1261 steel heats provided equations that were statistically significant at the 95% level. All heats met the specification according to the nordic countries producers classification. The variation in chemical composition explained typically 80 to 90% of the variation in the hardenability. In the regression analysis elements which did not significantly contribute to the calculated hardness according to the F test were eliminated. Carbon, silicon, manganese, phosphorus and chromium were of importance at all Jominy distances, nickel, vanadium, boron and nitrogen at distances above 6 mm. After the regression analysis it was demonstrated that very few outliers were present in the data set, i.e. data points outside four times the standard deviation. The model has successfully been used in industrial practice replacing some of the necessary Jominy tests. (orig.)
[Evaluation of estimation of prevalence ratio using bayesian log-binomial regression model].

Science.gov (United States)

Gao, W L; Lin, H; Liu, X N; Ren, X W; Li, J S; Shen, X P; Zhu, S L

2017-03-10

To evaluate the estimation of prevalence ratio ( PR ) by using bayesian log-binomial regression model and its application, we estimated the PR of medical care-seeking prevalence to caregivers' recognition of risk signs of diarrhea in their infants by using bayesian log-binomial regression model in Openbugs software. The results showed that caregivers' recognition of infant' s risk signs of diarrhea was associated significantly with a 13% increase of medical care-seeking. Meanwhile, we compared the differences in PR 's point estimation and its interval estimation of medical care-seeking prevalence to caregivers' recognition of risk signs of diarrhea and convergence of three models (model 1: not adjusting for the covariates; model 2: adjusting for duration of caregivers' education, model 3: adjusting for distance between village and township and child month-age based on model 2) between bayesian log-binomial regression model and conventional log-binomial regression model. The results showed that all three bayesian log-binomial regression models were convergence and the estimated PRs were 1.130(95 %CI : 1.005-1.265), 1.128(95 %CI : 1.001-1.264) and 1.132(95 %CI : 1.004-1.267), respectively. Conventional log-binomial regression model 1 and model 2 were convergence and their PRs were 1.130(95 % CI : 1.055-1.206) and 1.126(95 % CI : 1.051-1.203), respectively, but the model 3 was misconvergence, so COPY method was used to estimate PR , which was 1.125 (95 %CI : 1.051-1.200). In addition, the point estimation and interval estimation of PRs from three bayesian log-binomial regression models differed slightly from those of PRs from conventional log-binomial regression model, but they had a good consistency in estimating PR . Therefore, bayesian log-binomial regression model can effectively estimate PR with less misconvergence and have more advantages in application compared with conventional log-binomial regression model.
Spontaneous regression of intracranial malignant lymphoma

International Nuclear Information System (INIS)

Kojo, Nobuto; Tokutomi, Takashi; Eguchi, Gihachirou; Takagi, Shigeyuki; Matsumoto, Tomie; Sasaguri, Yasuyuki; Shigemori, Minoru.

1988-01-01

In a 46-year-old female with a 1-month history of gait and speech disturbances, computed tomography (CT) demonstrated mass lesions of slightly high density in the left basal ganglia and left frontal lobe. The lesions were markedly enhanced by contrast medium. The patient received no specific treatment, but her clinical manifestations gradually abated and the lesions decreased in size. Five months after her initial examination, the lesions were absent on CT scans; only a small area of low density remained. Residual clinical symptoms included mild right hemiparesis and aphasia. After 14 months the patient again deteriorated, and a CT scan revealed mass lesions in the right frontal lobe and the pons. However, no enhancement was observed in the previously affected regions. A biopsy revealed malignant lymphoma. Despite treatment with steroids and radiation, the patient's clinical status progressively worsened and she died 27 months after initial presentation. Seven other cases of spontaneous regression of primary malignant lymphoma have been reported. In this case, the mechanism of the spontaneous regression was not clear, but changes in immunologic status may have been involved. (author)
Weighted SGD for ℓp Regression with Randomized Preconditioning*

Science.gov (United States)

Yang, Jiyan; Chow, Yin-Lam; Ré, Christopher; Mahoney, Michael W.

2018-01-01

In recent years, stochastic gradient descent (SGD) methods and randomized linear algebra (RLA) algorithms have been applied to many large-scale problems in machine learning and data analysis. SGD methods are easy to implement and applicable to a wide range of convex optimization problems. In contrast, RLA algorithms provide much stronger performance guarantees but are applicable to a narrower class of problems. We aim to bridge the gap between these two methods in solving constrained overdetermined linear regression problems—e.g., ℓ2 and ℓ1 regression problems. We propose a hybrid algorithm named pwSGD that uses RLA techniques for preconditioning and constructing an importance sampling distribution, and then performs an SGD-like iterative process with weighted sampling on the preconditioned system.By rewriting a deterministic ℓp regression problem as a stochastic optimization problem, we connect pwSGD to several existing ℓp solvers including RLA methods with algorithmic leveraging (RLA for short).We prove that pwSGD inherits faster convergence rates that only depend on the lower dimension of the linear system, while maintaining low computation complexity. Such SGD convergence rates are superior to other related SGD algorithm such as the weighted randomized Kaczmarz algorithm.Particularly, when solving ℓ1 regression with size n by d, pwSGD returns an approximate solution with ε relative error in the objective value in 𝒪(log n·nnz(A)+poly(d)/ε2) time. This complexity is uniformly better than that of RLA methods in terms of both ε and d when the problem is unconstrained. In the presence of constraints, pwSGD only has to solve a sequence of much simpler and smaller optimization problem over the same constraints. In general this is more efficient than solving the constrained subproblem required in RLA.For ℓ2 regression, pwSGD returns an approximate solution with ε relative error in the objective value and the solution vector measured in
Dynamic travel time estimation using regression trees.

Science.gov (United States)

2008-10-01

This report presents a methodology for travel time estimation by using regression trees. The dissemination of travel time information has become crucial for effective traffic management, especially under congested road conditions. In the absence of c...
Analysis of dental caries using generalized linear and count regression models

Directory of Open Access Journals (Sweden)

Javali M. Phil

2013-11-01

Full Text Available Generalized linear models (GLM are generalization of linear regression models, which allow fitting regression models to response data in all the sciences especially medical and dental sciences that follow a general exponential family. These are flexible and widely used class of such models that can accommodate response variables. Count data are frequently characterized by overdispersion and excess zeros. Zero-inflated count models provide a parsimonious yet powerful way to model this type of situation. Such models assume that the data are a mixture of two separate data generation processes: one generates only zeros, and the other is either a Poisson or a negative binomial data-generating process. Zero inflated count regression models such as the zero-inflated Poisson (ZIP, zero-inflated negative binomial (ZINB regression models have been used to handle dental caries count data with many zeros. We present an evaluation framework to the suitability of applying the GLM, Poisson, NB, ZIP and ZINB to dental caries data set where the count data may exhibit evidence of many zeros and over-dispersion. Estimation of the model parameters using the method of maximum likelihood is provided. Based on the Vuong test statistic and the goodness of fit measure for dental caries data, the NB and ZINB regression models perform better than other count regression models.
A Methodology for Generating Placement Rules that Utilizes Logistic Regression

Science.gov (United States)

Wurtz, Keith

2008-01-01

The purpose of this article is to provide the necessary tools for institutional researchers to conduct a logistic regression analysis and interpret the results. Aspects of the logistic regression procedure that are necessary to evaluate models are presented and discussed with an emphasis on cutoff values and choosing the appropriate number of…
A test for the parameters of multiple linear regression models ...

African Journals Online (AJOL)

A test for the parameters of multiple linear regression models is developed for conducting tests simultaneously on all the parameters of multiple linear regression models. The test is robust relative to the assumptions of homogeneity of variances and absence of serial correlation of the classical F-test. Under certain null and ...
Spontaneous regression of pulmonary bullae

International Nuclear Information System (INIS)

Satoh, H.; Ishikawa, H.; Ohtsuka, M.; Sekizawa, K.

2002-01-01

The natural history of pulmonary bullae is often characterized by gradual, progressive enlargement. Spontaneous regression of bullae is, however, very rare. We report a case in which complete resolution of pulmonary bullae in the left upper lung occurred spontaneously. The management of pulmonary bullae is occasionally made difficult because of gradual progressive enlargement associated with abnormal pulmonary function. Some patients have multiple bulla in both lungs and/or have a history of pulmonary emphysema. Others have a giant bulla without emphysematous change in the lungs. Our present case had treated lung cancer with no evidence of local recurrence. He had no emphysematous change in lung function test and had no complaints, although the high resolution CT scan shows evidence of underlying minimal changes of emphysema. Ortin and Gurney presented three cases of spontaneous reduction in size of bulla. Interestingly, one of them had a marked decrease in the size of a bulla in association with thickening of the wall of the bulla, which was observed in our patient. This case we describe is of interest, not only because of the rarity with which regression of pulmonary bulla has been reported in the literature, but also because of the spontaneous improvements in the radiological picture in the absence of overt infection or tumor. Copyright (2002) Blackwell Science Pty Ltd

Facies-succession and architecture of the third-order sequences and their stratigraphic framework of the Devonian in Yunnan-Guizhou-Guangxi area, South China

Directory of Open Access Journals (Sweden)

Mei Mingxiang

2013-01-01

Full Text Available The Caledonian orogeny at the end of the Silurian resulted in great changes in the palaeogeography in the Yunnan-Guizhou-Guangxi area of South China; the continental area of the Early Paleozoic evolved into the extensive Dian-Qian-Gui Sea in the Late Paleozoic. Early in the Devonian, as a result of a major transgression, seawater encroached gradually from the south to the north and clastic facies were deposited. Carbonate deposition was then established in the Yunnan-Guizhou-Guangxi area, with a palaeogeography marked by attached platforms, isolated platforms and narrow basins. As a result of the Ziyun movement towards the end of the Devonian, the Upper Devonian strata are regressive and thin out from the open-sea to the land areas. A study of the nature and distribution of sedimentary facies in space and time recognises 13 third-order sequences in the Devonian strata in Yunnan-Guizhou-Guangxi area, and these form two second-order sequences. The strata of the Lower Devonian comprise 5 third-order sequences (SQ1 to SQ5, which are dominated by transgressive clastics. 4 third-order sequences (SQ6 to SQ9 in the Middle Devonian are characterized by alternations of transgressive clastics and highstand carbonates. In the Upper Devonian, carbonates constitute 4 third-order sequences (SQ10 to SQ13, which are generally marked by the transgressive limestones and highstand dolomites. On the basis of earlier biostratigraphic studies, sea-level changes represented by the third-order sequences with their different facies successions are explored, and the sequence stratigraphic framework is established. Therefore, the Devonian strata in the study area provide an example for further understanding of depositional trends within the sequence-stratigraphic framework.
Deep ensemble learning of sparse regression models for brain disease diagnosis.

Science.gov (United States)

Suk, Heung-Il; Lee, Seong-Whan; Shen, Dinggang

2017-04-01

Recent studies on brain imaging analysis witnessed the core roles of machine learning techniques in computer-assisted intervention for brain disease diagnosis. Of various machine-learning techniques, sparse regression models have proved their effectiveness in handling high-dimensional data but with a small number of training samples, especially in medical problems. In the meantime, deep learning methods have been making great successes by outperforming the state-of-the-art performances in various applications. In this paper, we propose a novel framework that combines the two conceptually different methods of sparse regression and deep learning for Alzheimer's disease/mild cognitive impairment diagnosis and prognosis. Specifically, we first train multiple sparse regression models, each of which is trained with different values of a regularization control parameter. Thus, our multiple sparse regression models potentially select different feature subsets from the original feature set; thereby they have different powers to predict the response values, i.e., clinical label and clinical scores in our work. By regarding the response values from our sparse regression models as target-level representations, we then build a deep convolutional neural network for clinical decision making, which thus we call 'Deep Ensemble Sparse Regression Network.' To our best knowledge, this is the first work that combines sparse regression models with deep neural network. In our experiments with the ADNI cohort, we validated the effectiveness of the proposed method by achieving the highest diagnostic accuracies in three classification tasks. We also rigorously analyzed our results and compared with the previous studies on the ADNI cohort in the literature. Copyright © 2017 Elsevier B.V. All rights reserved.
A hybrid approach of stepwise regression, logistic regression, support vector machine, and decision tree for forecasting fraudulent financial statements.

Science.gov (United States)

Chen, Suduan; Goo, Yeong-Jia James; Shen, Zone-De

2014-01-01

As the fraudulent financial statement of an enterprise is increasingly serious with each passing day, establishing a valid forecasting fraudulent financial statement model of an enterprise has become an important question for academic research and financial practice. After screening the important variables using the stepwise regression, the study also matches the logistic regression, support vector machine, and decision tree to construct the classification models to make a comparison. The study adopts financial and nonfinancial variables to assist in establishment of the forecasting fraudulent financial statement model. Research objects are the companies to which the fraudulent and nonfraudulent financial statement happened between years 1998 to 2012. The findings are that financial and nonfinancial information are effectively used to distinguish the fraudulent financial statement, and decision tree C5.0 has the best classification effect 85.71%.
Modelling infant mortality rate in Central Java, Indonesia use generalized poisson regression method

Science.gov (United States)

Prahutama, Alan; Sudarno

2018-05-01

The infant mortality rate is the number of deaths under one year of age occurring among the live births in a given geographical area during a given year, per 1,000 live births occurring among the population of the given geographical area during the same year. This problem needs to be addressed because it is an important element of a country’s economic development. High infant mortality rate will disrupt the stability of a country as it relates to the sustainability of the population in the country. One of regression model that can be used to analyze the relationship between dependent variable Y in the form of discrete data and independent variable X is Poisson regression model. Recently The regression modeling used for data with dependent variable is discrete, among others, poisson regression, negative binomial regression and generalized poisson regression. In this research, generalized poisson regression modeling gives better AIC value than poisson regression. The most significant variable is the Number of health facilities (X1), while the variable that gives the most influence to infant mortality rate is the average breastfeeding (X9).
Vectors, a tool in statistical regression theory

NARCIS (Netherlands)

Corsten, L.C.A.

1958-01-01

Using linear algebra this thesis developed linear regression analysis including analysis of variance, covariance analysis, special experimental designs, linear and fertility adjustments, analysis of experiments at different places and times. The determination of the orthogonal projection, yielding
Deriving the Regression Line with Algebra

Science.gov (United States)

Quintanilla, John A.

2017-01-01

Exploration with spreadsheets and reliance on previous skills can lead students to determine the line of best fit. To perform linear regression on a set of data, students in Algebra 2 (or, in principle, Algebra 1) do not have to settle for using the mysterious "black box" of their graphing calculators (or other classroom technologies).…
Superquantile Regression: Theory, Algorithms, and Applications

Science.gov (United States)

2014-12-01

Highway, Suite 1204, Arlington, Va 22202-4302, and to the Office of Management and Budget, Paperwork Reduction Project (0704-0188) Washington DC 20503. 1...Navy submariners, reliability engineering, uncertainty quantification, and financial risk management . Superquantile, superquantile regression...Royset Carlos F. Borges Associate Professor of Operations Research Dissertation Supervisor Professor of Applied Mathematics Lyn R. Whitaker Javier
Multiple Linear Regression: A Realistic Reflector.

Science.gov (United States)

Nutt, A. T.; Batsell, R. R.

Examples of the use of Multiple Linear Regression (MLR) techniques are presented. This is done to show how MLR aids data processing and decision-making by providing the decision-maker with freedom in phrasing questions and by accurately reflecting the data on hand. A brief overview of the rationale underlying MLR is given, some basic definitions…
Simulation Experiments in Practice: Statistical Design and Regression Analysis

OpenAIRE

Kleijnen, J.P.C.

2007-01-01

In practice, simulation analysts often change only one factor at a time, and use graphical analysis of the resulting Input/Output (I/O) data. The goal of this article is to change these traditional, naïve methods of design and analysis, because statistical theory proves that more information is obtained when applying Design Of Experiments (DOE) and linear regression analysis. Unfortunately, classic DOE and regression analysis assume a single simulation response that is normally and independen...
Clinical value of regression of electrocardiographic left ventricular hypertrophy after aortic valve replacement.

Science.gov (United States)

Yamabe, Sayuri; Dohi, Yoshihiro; Higashi, Akifumi; Kinoshita, Hiroki; Sada, Yoshiharu; Hidaka, Takayuki; Kurisu, Satoshi; Shiode, Nobuo; Kihara, Yasuki

2016-09-01

Electrocardiographic left ventricular hypertrophy (ECG-LVH) gradually regressed after aortic valve replacement (AVR) in patients with severe aortic stenosis. Sokolow-Lyon voltage (SV1 + RV5/6) is possibly the most widely used criterion for ECG-LVH. The aim of this study was to determine whether decrease in Sokolow-Lyon voltage reflects left ventricular reverse remodeling detected by echocardiography after AVR. Of 129 consecutive patients who underwent AVR for severe aortic stenosis, 38 patients with preoperative ECG-LVH, defined by SV1 + RV5/6 of ≥3.5 mV, were enrolled in this study. Electrocardiography and echocardiography were performed preoperatively and 1 year postoperatively. The patients were divided into ECG-LVH regression group (n = 19) and non-regression group (n = 19) according to the median value of the absolute regression in SV1 + RV5/6. Multivariate logistic regression analysis was performed to assess determinants of ECG-LVH regression among echocardiographic indices. ECG-LVH regression group showed significantly greater decrease in left ventricular mass index and left ventricular dimensions than Non-regression group. ECG-LVH regression was independently determined by decrease in the left ventricular mass index [odds ratio (OR) 1.28, 95 % confidence interval (CI) 1.03-1.69, p = 0.048], left ventricular end-diastolic dimension (OR 1.18, 95 % CI 1.03-1.41, p = 0.014), and left ventricular end-systolic dimension (OR 1.24, 95 % CI 1.06-1.52, p = 0.0047). ECG-LVH regression could be a marker of the effect of AVR on both reducing the left ventricular mass index and left ventricular dimensions. The effect of AVR on reverse remodeling can be estimated, at least in part, by regression of ECG-LVH.
Variable selection and model choice in geoadditive regression models.

Science.gov (United States)

Kneib, Thomas; Hothorn, Torsten; Tutz, Gerhard

2009-06-01

Model choice and variable selection are issues of major concern in practical regression analyses, arising in many biometric applications such as habitat suitability analyses, where the aim is to identify the influence of potentially many environmental conditions on certain species. We describe regression models for breeding bird communities that facilitate both model choice and variable selection, by a boosting algorithm that works within a class of geoadditive regression models comprising spatial effects, nonparametric effects of continuous covariates, interaction surfaces, and varying coefficients. The major modeling components are penalized splines and their bivariate tensor product extensions. All smooth model terms are represented as the sum of a parametric component and a smooth component with one degree of freedom to obtain a fair comparison between the model terms. A generic representation of the geoadditive model allows us to devise a general boosting algorithm that automatically performs model choice and variable selection.
Regression periods in infancy: a case study from Catalonia.

Science.gov (United States)

Sadurní, Marta; Rostan, Carlos

2002-05-01

Based on Rijt-Plooij and Plooij's (1992) research on emergence of regression periods in the first two years of life, the presence of such periods in a group of 18 babies (10 boys and 8 girls, aged between 3 weeks and 14 months) from a Catalonian population was analyzed. The measurements were a questionnaire filled in by the infants' mothers, a semi-structured weekly tape-recorded interview, and observations in their homes. The procedure and the instruments used in the project follow those proposed by Rijt-Plooij and Plooij. Our results confirm the existence of the regression periods in the first year of children's life. Inter-coder agreement for trained coders was 78.2% and within-coder agreement was 90.1%. In the discussion, the possible meaning and relevance of regression periods in order to understand development from a psychobiological and social framework is commented upon.
Demonstration of a Fiber Optic Regression Probe in a High-Temperature Flow

Science.gov (United States)

Korman, Valentin; Polzin, Kurt

2011-01-01

The capability to provide localized, real-time monitoring of material regression rates in various applications has the potential to provide a new stream of data for development testing of various components and systems, as well as serving as a monitoring tool in flight applications. These applications include, but are not limited to, the regression of a combusting solid fuel surface, the ablation of the throat in a chemical rocket or the heat shield of an aeroshell, and the monitoring of erosion in long-life plasma thrusters. The rate of regression in the first application is very fast, while the second and third are increasingly slower. A recent fundamental sensor development effort has led to a novel regression, erosion, and ablation sensor technology (REAST). The REAST sensor allows for measurement of real-time surface erosion rates at a discrete surface location. The sensor is optical, using two different, co-located fiber-optics to perform the regression measurement. The disparate optical transmission properties of the two fiber-optics makes it possible to measure the regression rate by monitoring the relative light attenuation through the fibers. As the fibers regress along with the parent material in which they are embedded, the relative light intensities through the two fibers changes, providing a measure of the regression rate. The optical nature of the system makes it relatively easy to use in a variety of harsh, high temperature environments, and it is also unaffected by the presence of electric and magnetic fields. In addition, the sensor could be used to perform optical spectroscopy on the light emitted by a process and collected by fibers, giving localized measurements of various properties. The capability to perform an in-situ measurement of material regression rates is useful in addressing a variety of physical issues in various applications. An in-situ measurement allows for real-time data regarding the erosion rates, providing a quick method for
Physics constrained nonlinear regression models for time series

International Nuclear Information System (INIS)

Majda, Andrew J; Harlim, John

2013-01-01

A central issue in contemporary science is the development of data driven statistical nonlinear dynamical models for time series of partial observations of nature or a complex physical model. It has been established recently that ad hoc quadratic multi-level regression (MLR) models can have finite-time blow up of statistical solutions and/or pathological behaviour of their invariant measure. Here a new class of physics constrained multi-level quadratic regression models are introduced, analysed and applied to build reduced stochastic models from data of nonlinear systems. These models have the advantages of incorporating memory effects in time as well as the nonlinear noise from energy conserving nonlinear interactions. The mathematical guidelines for the performance and behaviour of these physics constrained MLR models as well as filtering algorithms for their implementation are developed here. Data driven applications of these new multi-level nonlinear regression models are developed for test models involving a nonlinear oscillator with memory effects and the difficult test case of the truncated Burgers–Hopf model. These new physics constrained quadratic MLR models are proposed here as process models for Bayesian estimation through Markov chain Monte Carlo algorithms of low frequency behaviour in complex physical data. (paper)
An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression

Science.gov (United States)

Weiss, Brandi A.; Dardick, William

2016-01-01

This article introduces an entropy-based measure of data-model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify…
Arcuate Fasciculus in Autism Spectrum Disorder Toddlers with Language Regression

Directory of Open Access Journals (Sweden)

Zhang Lin

2018-03-01

Full Text Available Language regression is observed in a subset of toddlers with autism spectrum disorder (ASD as initial symptom. However, such a phenomenon has not been fully explored, partly due to the lack of definite diagnostic evaluation methods and criteria. Materials and Methods: Fifteen toddlers with ASD exhibiting language regression and fourteen age-matched typically developing (TD controls underwent diffusion tensor imaging (DTI. DTI parameters including fractional anisotropy (FA, average fiber length (AFL, tract volume (TV and number of voxels (NV were analyzed by Neuro 3D in Siemens syngo workstation. Subsequently, the data were analyzed by using IBM SPSS Statistics 22. Results: Compared with TD children, a significant reduction of FA along with an increase in TV and NV was observed in ASD children with language regression. Note that there were no significant differences between ASD and TD children in AFL of the arcuate fasciculus (AF. Conclusions: These DTI changes in the AF suggest that microstructural anomalies of the AF white matter may be associated with language deficits in ASD children exhibiting language regression starting from an early age.
Depth-weighted robust multivariate regression with application to sparse data

KAUST Repository

Dutta, Subhajit; Genton, Marc G.

2017-01-01

A robust method for multivariate regression is developed based on robust estimators of the joint location and scatter matrix of the explanatory and response variables using the notion of data depth. The multivariate regression estimator possesses desirable affine equivariance properties, achieves the best breakdown point of any affine equivariant estimator, and has an influence function which is bounded in both the response as well as the predictor variable. To increase the efficiency of this estimator, a re-weighted estimator based on robust Mahalanobis distances of the residual vectors is proposed. In practice, the method is more stable than existing methods that are constructed using subsamples of the data. The resulting multivariate regression technique is computationally feasible, and turns out to perform better than several popular robust multivariate regression methods when applied to various simulated data as well as a real benchmark data set. When the data dimension is quite high compared to the sample size it is still possible to use meaningful notions of data depth along with the corresponding depth values to construct a robust estimator in a sparse setting.
Robust Face Recognition via Multi-Scale Patch-Based Matrix Regression.

Directory of Open Access Journals (Sweden)

Guangwei Gao

Full Text Available In many real-world applications such as smart card solutions, law enforcement, surveillance and access control, the limited training sample size is the most fundamental problem. By making use of the low-rank structural information of the reconstructed error image, the so-called nuclear norm-based matrix regression has been demonstrated to be effective for robust face recognition with continuous occlusions. However, the recognition performance of nuclear norm-based matrix regression degrades greatly in the face of the small sample size problem. An alternative solution to tackle this problem is performing matrix regression on each patch and then integrating the outputs from all patches. However, it is difficult to set an optimal patch size across different databases. To fully utilize the complementary information from different patch scales for the final decision, we propose a multi-scale patch-based matrix regression scheme based on which the ensemble of multi-scale outputs can be achieved optimally. Extensive experiments on benchmark face databases validate the effectiveness and robustness of our method, which outperforms several state-of-the-art patch-based face recognition algorithms.
Hierarchical Matching and Regression with Application to Photometric Redshift Estimation

Science.gov (United States)

Murtagh, Fionn

2017-06-01

This work emphasizes that heterogeneity, diversity, discontinuity, and discreteness in data is to be exploited in classification and regression problems. A global a priori model may not be desirable. For data analytics in cosmology, this is motivated by the variety of cosmological objects such as elliptical, spiral, active, and merging galaxies at a wide range of redshifts. Our aim is matching and similarity-based analytics that takes account of discrete relationships in the data. The information structure of the data is represented by a hierarchy or tree where the branch structure, rather than just the proximity, is important. The representation is related to p-adic number theory. The clustering or binning of the data values, related to the precision of the measurements, has a central role in this methodology. If used for regression, our approach is a method of cluster-wise regression, generalizing nearest neighbour regression. Both to exemplify this analytics approach, and to demonstrate computational benefits, we address the well-known photometric redshift or `photo-z' problem, seeking to match Sloan Digital Sky Survey (SDSS) spectroscopic and photometric redshifts.
Depth-weighted robust multivariate regression with application to sparse data

KAUST Repository

Dutta, Subhajit

2017-04-05

A robust method for multivariate regression is developed based on robust estimators of the joint location and scatter matrix of the explanatory and response variables using the notion of data depth. The multivariate regression estimator possesses desirable affine equivariance properties, achieves the best breakdown point of any affine equivariant estimator, and has an influence function which is bounded in both the response as well as the predictor variable. To increase the efficiency of this estimator, a re-weighted estimator based on robust Mahalanobis distances of the residual vectors is proposed. In practice, the method is more stable than existing methods that are constructed using subsamples of the data. The resulting multivariate regression technique is computationally feasible, and turns out to perform better than several popular robust multivariate regression methods when applied to various simulated data as well as a real benchmark data set. When the data dimension is quite high compared to the sample size it is still possible to use meaningful notions of data depth along with the corresponding depth values to construct a robust estimator in a sparse setting.

PARAMETRIC AND NON PARAMETRIC (MARS: MULTIVARIATE ADDITIVE REGRESSION SPLINES) LOGISTIC REGRESSIONS FOR PREDICTION OF A DICHOTOMOUS RESPONSE VARIABLE WITH AN EXAMPLE FOR PRESENCE/ABSENCE OF AMPHIBIANS

Science.gov (United States)

The purpose of this report is to provide a reference manual that could be used by investigators for making informed use of logistic regression using two methods (standard logistic regression and MARS). The details for analyses of relationships between a dependent binary response ...
Geographically weighted regression and multicollinearity: dispelling the myth

Science.gov (United States)

Fotheringham, A. Stewart; Oshan, Taylor M.

2016-10-01

Geographically weighted regression (GWR) extends the familiar regression framework by estimating a set of parameters for any number of locations within a study area, rather than producing a single parameter estimate for each relationship specified in the model. Recent literature has suggested that GWR is highly susceptible to the effects of multicollinearity between explanatory variables and has proposed a series of local measures of multicollinearity as an indicator of potential problems. In this paper, we employ a controlled simulation to demonstrate that GWR is in fact very robust to the effects of multicollinearity. Consequently, the contention that GWR is highly susceptible to multicollinearity issues needs rethinking.
Top Incomes, Heavy Tails, and Rank-Size Regressions

Directory of Open Access Journals (Sweden)

Christian Schluter

2018-03-01

Full Text Available In economics, rank-size regressions provide popular estimators of tail exponents of heavy-tailed distributions. We discuss the properties of this approach when the tail of the distribution is regularly varying rather than strictly Pareto. The estimator then over-estimates the true value in the leading parametric income models (so the upper income tail is less heavy than estimated, which leads to test size distortions and undermines inference. For practical work, we propose a sensitivity analysis based on regression diagnostics in order to assess the likely impact of the distortion. The methods are illustrated using data on top incomes in the UK.
DYNA3D/ParaDyn Regression Test Suite Inventory

Energy Technology Data Exchange (ETDEWEB)

Lin, Jerry I. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

2016-09-01

The following table constitutes an initial assessment of feature coverage across the regression test suite used for DYNA3D and ParaDyn. It documents the regression test suite at the time of preliminary release 16.1 in September 2016. The columns of the table represent groupings of functionalities, e.g., material models. Each problem in the test suite is represented by a row in the table. All features exercised by the problem are denoted by a check mark (√) in the corresponding column. The definition of “feature” has not been subdivided to its smallest unit of user input, e.g., algorithmic parameters specific to a particular type of contact surface. This represents a judgment to provide code developers and users a reasonable impression of feature coverage without expanding the width of the table by several multiples. All regression testing is run in parallel, typically with eight processors, except problems involving features only available in serial mode. Many are strictly regression tests acting as a check that the codes continue to produce adequately repeatable results as development unfolds; compilers change and platforms are replaced. A subset of the tests represents true verification problems that have been checked against analytical or other benchmark solutions. Users are welcomed to submit documented problems for inclusion in the test suite, especially if they are heavily exercising, and dependent upon, features that are currently underrepresented.
Multiple linear regression and regression with time series error models in forecasting PM10 concentrations in Peninsular Malaysia.

Science.gov (United States)

Ng, Kar Yong; Awang, Norhashidah

2018-01-06

Frequent haze occurrences in Malaysia have made the management of PM 10 (particulate matter with aerodynamic less than 10 μm) pollution a critical task. This requires knowledge on factors associating with PM 10 variation and good forecast of PM 10 concentrations. Hence, this paper demonstrates the prediction of 1-day-ahead daily average PM 10 concentrations based on predictor variables including meteorological parameters and gaseous pollutants. Three different models were built. They were multiple linear regression (MLR) model with lagged predictor variables (MLR1), MLR model with lagged predictor variables and PM 10 concentrations (MLR2) and regression with time series error (RTSE) model. The findings revealed that humidity, temperature, wind speed, wind direction, carbon monoxide and ozone were the main factors explaining the PM 10 variation in Peninsular Malaysia. Comparison among the three models showed that MLR2 model was on a same level with RTSE model in terms of forecasting accuracy, while MLR1 model was the worst.
Identification of Influential Points in a Linear Regression Model

Directory of Open Access Journals (Sweden)

Jan Grosz

2011-03-01

Full Text Available The article deals with the detection and identification of influential points in the linear regression model. Three methods of detection of outliers and leverage points are described. These procedures can also be used for one-sample (independentdatasets. This paper briefly describes theoretical aspects of several robust methods as well. Robust statistics is a powerful tool to increase the reliability and accuracy of statistical modelling and data analysis. A simulation model of the simple linear regression is presented.
Zero-Shot Learning via Attribute Regression and Class Prototype Rectification.

Science.gov (United States)

Luo, Changzhi; Li, Zhetao; Huang, Kaizhu; Feng, Jiashi; Wang, Meng

2018-02-01

Zero-shot learning (ZSL) aims at classifying examples for unseen classes (with no training examples) given some other seen classes (with training examples). Most existing approaches exploit intermedia-level information (e.g., attributes) to transfer knowledge from seen classes to unseen classes. A common practice is to first learn projections from samples to attributes on seen classes via a regression method, and then apply such projections to unseen classes directly. However, it turns out that such a manner of learning strategy easily causes projection domain shift problem and hubness problem, which hinder the performance of ZSL task. In this paper, we also formulate ZSL as an attribute regression problem. However, different from general regression-based solutions, the proposed approach is novel in three aspects. First, a class prototype rectification method is proposed to connect the unseen classes to the seen classes. Here, a class prototype refers to a vector representation of a class, and it is also known as a class center, class signature, or class exemplar. Second, an alternating learning scheme is proposed for jointly performing attribute regression and rectifying the class prototypes. Finally, a new objective function which takes into consideration both the attribute regression accuracy and the class prototype discrimination is proposed. By introducing such a solution, domain shift problem and hubness problem can be mitigated. Experimental results on three public datasets (i.e., CUB200-2011, SUN Attribute, and aPaY) well demonstrate the effectiveness of our approach.
Is there still a place for the concept of 'therapeutic regression' in psychoanalysis?

Science.gov (United States)

Spurling, Laurence S

2008-06-01

The author uses his own failure to find a place for the idea of therapeutic regression in his clinical thinking or practice as the basis for an investigation into its meaning and usefulness. He makes a distinction between three ways the term 'regression' is used in psychoanalytic discourse: as a way of evoking a primitive level of experience; as a reminder in some clinical situations of the value of non-intervention on the part of the analyst; and as a description of a phase of an analytic treatment with some patients where the analyst needs to put aside normal analytic technique in order to foster a regression in the patient. It is this third meaning, which the author terms "therapeutic regression" that this paper examines, principally by means of an extended discussion of two clinical examples of a patient making a so-called therapeutic regression, one given by Winnicott and the other by Masud Khan. The author argues that in these examples the introduction of the concept of therapeutic regression obscures rather than clarifies the clinical process. He concludes that, as a substantial clinical concept, the idea of therapeutic regression has outlived its usefulness. However he also notes that many psychoanalytic writers continue to find a use for the more generic concept of regression, and that the very engagement with the more particular idea of therapeutic regression has value in provoking questions as to what is truly therapeutic in psychoanalytic treatment.
Development of a User Interface for a Regression Analysis Software Tool

Science.gov (United States)

Ulbrich, Norbert Manfred; Volden, Thomas R.

2010-01-01

An easy-to -use user interface was implemented in a highly automated regression analysis tool. The user interface was developed from the start to run on computers that use the Windows, Macintosh, Linux, or UNIX operating system. Many user interface features were specifically designed such that a novice or inexperienced user can apply the regression analysis tool with confidence. Therefore, the user interface s design minimizes interactive input from the user. In addition, reasonable default combinations are assigned to those analysis settings that influence the outcome of the regression analysis. These default combinations will lead to a successful regression analysis result for most experimental data sets. The user interface comes in two versions. The text user interface version is used for the ongoing development of the regression analysis tool. The official release of the regression analysis tool, on the other hand, has a graphical user interface that is more efficient to use. This graphical user interface displays all input file names, output file names, and analysis settings for a specific software application mode on a single screen which makes it easier to generate reliable analysis results and to perform input parameter studies. An object-oriented approach was used for the development of the graphical user interface. This choice keeps future software maintenance costs to a reasonable limit. Examples of both the text user interface and graphical user interface are discussed in order to illustrate the user interface s overall design approach.
The M Word: Multicollinearity in Multiple Regression.

Science.gov (United States)

Morrow-Howell, Nancy

1994-01-01

Notes that existence of substantial correlation between two or more independent variables creates problems of multicollinearity in multiple regression. Discusses multicollinearity problem in social work research in which independent variables are usually intercorrelated. Clarifies problems created by multicollinearity, explains detection of…
Regression Discontinuity Designs Based on Population Thresholds

DEFF Research Database (Denmark)

Eggers, Andrew C.; Freier, Ronny; Grembi, Veronica

In many countries, important features of municipal government (such as the electoral system, mayors' salaries, and the number of councillors) depend on whether the municipality is above or below arbitrary population thresholds. Several papers have used a regression discontinuity design (RDD...
Transient simulation of regression rate on thrust regulation process in hybrid rocket motor

Directory of Open Access Journals (Sweden)

Tian Hui

2014-12-01

Full Text Available The main goal of this paper is to study the characteristics of regression rate of solid grain during thrust regulation process. For this purpose, an unsteady numerical model of regression rate is established. Gas–solid coupling is considered between the solid grain surface and combustion gas. Dynamic mesh is used to simulate the regression process of the solid fuel surface. Based on this model, numerical simulations on a H2O2/HTPB (hydroxyl-terminated polybutadiene hybrid motor have been performed in the flow control process. The simulation results show that under the step change of the oxidizer mass flow rate condition, the regression rate cannot reach a stable value instantly because the flow field requires a short time period to adjust. The regression rate increases with the linear gain of oxidizer mass flow rate, and has a higher slope than the relative inlet function of oxidizer flow rate. A shorter regulation time can cause a higher regression rate during regulation process. The results also show that transient calculation can better simulate the instantaneous regression rate in the operation process.
Estimation Methods for Non-Homogeneous Regression - Minimum CRPS vs Maximum Likelihood

Science.gov (United States)

Gebetsberger, Manuel; Messner, Jakob W.; Mayr, Georg J.; Zeileis, Achim

2017-04-01

Non-homogeneous regression models are widely used to statistically post-process numerical weather prediction models. Such regression models correct for errors in mean and variance and are capable to forecast a full probability distribution. In order to estimate the corresponding regression coefficients, CRPS minimization is performed in many meteorological post-processing studies since the last decade. In contrast to maximum likelihood estimation, CRPS minimization is claimed to yield more calibrated forecasts. Theoretically, both scoring rules used as an optimization score should be able to locate a similar and unknown optimum. Discrepancies might result from a wrong distributional assumption of the observed quantity. To address this theoretical concept, this study compares maximum likelihood and minimum CRPS estimation for different distributional assumptions. First, a synthetic case study shows that, for an appropriate distributional assumption, both estimation methods yield to similar regression coefficients. The log-likelihood estimator is slightly more efficient. A real world case study for surface temperature forecasts at different sites in Europe confirms these results but shows that surface temperature does not always follow the classical assumption of a Gaussian distribution. KEYWORDS: ensemble post-processing, maximum likelihood estimation, CRPS minimization, probabilistic temperature forecasting, distributional regression models
Improving sub-pixel imperviousness change prediction by ensembling heterogeneous non-linear regression models

Directory of Open Access Journals (Sweden)

Drzewiecki Wojciech

2016-12-01

Full Text Available In this work nine non-linear regression models were compared for sub-pixel impervious surface area mapping from Landsat images. The comparison was done in three study areas both for accuracy of imperviousness coverage evaluation in individual points in time and accuracy of imperviousness change assessment. The performance of individual machine learning algorithms (Cubist, Random Forest, stochastic gradient boosting of regression trees, k-nearest neighbors regression, random k-nearest neighbors regression, Multivariate Adaptive Regression Splines, averaged neural networks, and support vector machines with polynomial and radial kernels was also compared with the performance of heterogeneous model ensembles constructed from the best models trained using particular techniques.
A Hybrid Approach of Stepwise Regression, Logistic Regression, Support Vector Machine, and Decision Tree for Forecasting Fraudulent Financial Statements

Directory of Open Access Journals (Sweden)

Suduan Chen

2014-01-01

Full Text Available As the fraudulent financial statement of an enterprise is increasingly serious with each passing day, establishing a valid forecasting fraudulent financial statement model of an enterprise has become an important question for academic research and financial practice. After screening the important variables using the stepwise regression, the study also matches the logistic regression, support vector machine, and decision tree to construct the classification models to make a comparison. The study adopts financial and nonfinancial variables to assist in establishment of the forecasting fraudulent financial statement model. Research objects are the companies to which the fraudulent and nonfraudulent financial statement happened between years 1998 to 2012. The findings are that financial and nonfinancial information are effectively used to distinguish the fraudulent financial statement, and decision tree C5.0 has the best classification effect 85.71%.
DIABETES MELLITUS AND ITS ROLE IN CAUDAL REGRESSION SYNDROME

Directory of Open Access Journals (Sweden)

Sandeep

2016-03-01

Full Text Available BACKGROUND Caudal regression syndrome also called as sacral agenesis or hypoplasia of the sacrum is a congenital disorder in which there is abnormal development of the lower part of the vertebral column 1 due to which there is a plethora of abnormalities such as gross motor deficiencies and other genitor-urinary malformations which in deed depends on the extent of malformations that is seen. Caudal regression syndrome is rare, with an estimated incidence of 1:7500-100,000. The aim of the study is to find the frequency of manifestations and the manifestations itself. METHODS Fifty patients who were pregnant and were diagnosed with diabetes mellitus were identified and were referred to the Department of Medicine. RESULTS In the present study the frequency of manifestations of caudal regression syndrome is 8 in 100 diagnosed patients. CONCLUSION The malformations in the babies born to diabetic mothers are high in the population of costal Karnataka and Kerala.
Regression analysis understanding and building business and economic models using Excel

CERN Document Server

Wilson, J Holton

2012-01-01

The technique of regression analysis is used so often in business and economics today that an understanding of its use is necessary for almost everyone engaged in the field. This book will teach you the essential elements of building and understanding regression models in a business/economic context in an intuitive manner. The authors take a non-theoretical treatment that is accessible even if you have a limited statistical background. It is specifically designed to teach the correct use of regression, while advising you of its limitations and teaching about common pitfalls. This book describe
Combining logistic regression with classification and regression tree to predict quality of care in a home health nursing data set.

Science.gov (United States)

Guo, Huey-Ming; Shyu, Yea-Ing Lotus; Chang, Her-Kun

2006-01-01

In this article, the authors provide an overview of a research method to predict quality of care in home health nursing data set. The results of this study can be visualized through classification an regression tree (CART) graphs. The analysis was more effective, and the results were more informative since the home health nursing dataset was analyzed with a combination of the logistic regression and CART, these two techniques complete each other. And the results more informative that more patients' characters were related to quality of care in home care. The results contributed to home health nurse predict patient outcome in case management. Improved prediction is needed for interventions to be appropriately targeted for improved patient outcome and quality of care.
Robust geographically weighted regression of modeling the Air Polluter Standard Index (APSI)

Science.gov (United States)

Warsito, Budi; Yasin, Hasbi; Ispriyanti, Dwi; Hoyyi, Abdul

2018-05-01

The Geographically Weighted Regression (GWR) model has been widely applied to many practical fields for exploring spatial heterogenity of a regression model. However, this method is inherently not robust to outliers. Outliers commonly exist in data sets and may lead to a distorted estimate of the underlying regression model. One of solution to handle the outliers in the regression model is to use the robust models. So this model was called Robust Geographically Weighted Regression (RGWR). This research aims to aid the government in the policy making process related to air pollution mitigation by developing a standard index model for air polluter (Air Polluter Standard Index - APSI) based on the RGWR approach. In this research, we also consider seven variables that are directly related to the air pollution level, which are the traffic velocity, the population density, the business center aspect, the air humidity, the wind velocity, the air temperature, and the area size of the urban forest. The best model is determined by the smallest AIC value. There are significance differences between Regression and RGWR in this case, but Basic GWR using the Gaussian kernel is the best model to modeling APSI because it has smallest AIC.
Analysis of γ spectra in airborne radioactivity measurements using multiple linear regressions

International Nuclear Information System (INIS)

Bao Min; Shi Quanlin; Zhang Jiamei

2004-01-01

This paper describes the net peak counts calculating of nuclide 137 Cs at 662 keV of γ spectra in airborne radioactivity measurements using multiple linear regressions. Mathematic model is founded by analyzing every factor that has contribution to Cs peak counts in spectra, and multiple linear regression function is established. Calculating process adopts stepwise regression, and the indistinctive factors are eliminated by F check. The regression results and its uncertainty are calculated using Least Square Estimation, then the Cs peak net counts and its uncertainty can be gotten. The analysis results for experimental spectrum are displayed. The influence of energy shift and energy resolution on the analyzing result is discussed. In comparison with the stripping spectra method, multiple linear regression method needn't stripping radios, and the calculating result has relation with the counts in Cs peak only, and the calculating uncertainty is reduced. (authors)

Some links on this page may take you to non-federal websites. Their policies may differ from this site.