Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jul;3(7):322-329.

Exploratory Analysis in Time-Varying Data Sets: a Healthcare Network Application

Exploratory Analysis in Time-Varying Data Sets: a Healthcare Network Application

Narine Manukyan et al. Int J Adv Comput Sci. 2013 Jul.

Abstract

We introduce a new method for exploratory analysis of large data sets with time-varying features, where the aim is to automatically discover novel relationships between features (over some time period) that are predictive of any of a number of time-varying outcomes (over some other time period). Using a genetic algorithm, we co-evolve (i) a subset of predictive features, (ii) which attribute will be predicted (iii) the time period over which to assess the predictive features, and (iv) the time period over which to assess the predicted attribute. After validating the method on 15 synthetic test problems, we used the approach for exploratory analysis of a large healthcare network data set. We discovered a strong association, with 100% sensitivity, between hospital participation in multi-institutional quality improvement collaboratives during or before 2002, and changes in the risk-adjusted rates of mortality and morbidity observed after a 1-2 year lag. The proposed approach is a potentially powerful and general tool for exploratory analysis of a wide range of time-series data sets.

Keywords: Artificial intelligence; genetic algorithm; knowledge discovery; pattern recognition.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
a) Hypothesis of causality. b) Inverted hypothesis tested by the classifier.
Fig. 2
Fig. 2
Information is extracted and aggregated from the time-series data relative to a dividing year (2004, in this example) and lag (2 years, in this example).
Fig. 3
Fig. 3
Overall architecture of the approach, illustrated for use with the VON data set. Items outlined in red are co-evolved by GAMET.
Fig. 4
Fig. 4
Experimental results on the VON data set. The bars indicate the frequency with which each of the individual features was selected in 10 GAMET trials. The red asterisks near the top indicate the features selected in the single best individual.

References

    1. Banks A. Cross-national time-series data archive 1815–2007. Jerusalem, Israel: Databanks International; 2008.
    1. Bernstein I, Horbar J, Badger G, Ohlsson A, Golan A, et al. Morbidity and mortality among very-low-birth-weight neonates with intrauterine growth restriction. American journal of obstetrics and gynecology. 2000;182(1):198–206. - PubMed
    1. DeHaas D, Craig J, Rickert C, Haake P, Stor K, Eppstein M. Feature selection and classification in noisy epistatic problems using a hybrid evolutionary approach. poster and published extended abstract accepted for Genetic and Evolutionary Computation Conference (GECCO) 2007
    1. Fayyad U, Piatetsky-Shapiro G, Smyth P. From data mining to knowledge discovery in databases. AI magazine. 1996;17(3):37.
    1. Hecht-Nielsen R. Counterpropagation networks. Applied optics. 1987;26(23):4979–4983. - PubMed

LinkOut - more resources