Data fusion


Data fusion

Data fusion, is generally defined as the use of techniques that combine data from multiple sources and gather that information into discrete, actionable items in order to achieve inferences, which will be more efficient and narrowly tailored than if they were achieved by means of disparate sources.

fusion of the data from 2 sources (dimension #1 & #2) can yield classifier superior to any classifiers based on dimension #1 or dimension #2.

Data fusion processes are often categorized as low, intermediate or high, depending on the processing stage at which fusion takes place.[1] Low level fusion, (Data fusion) combines several sources of raw data to produce new raw data. The expectation is that fused data is more informative and synthetic than the original inputs.

For example, sensor fusion is also known as (multi-sensor) data fusion and is a subset of information fusion.

Contents

Data Fusion in Geospatial Applications

In the geospatial (GIS) domain, data fusion is often synonymous with data integration. In these applications, there is often a need to combine diverse data sets into a unified (fused) data set which includes all of the data points and time steps from the input data sets. The fused data set is different from a simple combined superset in that the points in the fused data set contain attributes and metadata which might not have been included for these points in the original data set.

A simplified example of this process is shown below where data set "α" is fused with data set β to form the fused data set δ. Data points in set "α" have spatial coordinates X and Y and attributes A1 and A2. Data points in set β have spatial coordinates X and Y and attributes B1 and B2. The fused data set contains all points and attributes

Input Data Set α

Point X Y A1 A2
α1 10 10 M N
α2 10 30 M N
α3 30 10 M N
α4 30 30 M N

Input Data Set β

Point X Y B1 B2
β1 20 20 Q R
β2 20 40 Q R
β3 40 20 Q R
β4 40 40 Q R

Fused Data Set δ

Point X Y A1 A2 B1 B2
δ1 10 10 M N Q R
δ2 10 30 M N Q R
δ3 30 10 M N Q R
δ4 30 30 M N Q R
δ5 20 20 M N Q R
δ6 20 40 M N Q R
δ7 40 20 M N Q R
δ8 40 40 M N Q R

In this simple case all attributes are uniform across the entire analysis domain, so attributes may be simply assigned. In more realistic applications, attributes are rarely uniform and some type of interpolation is usually required to properly assign attributes to the data points in the fused set.

Visualization of fused data sets for rock lobster tracks in the Tasman Sea.  Image generated using Eonfusion software by Myriax Pty. Ltd. - eonfusion.myriax.com

In a much more complicated application, marine animal researchers use data fusion to combine animal tracking data with bathymetric, meteorological, sea surface temperature (SST) and animal habitat data to examine and understand habitat utilization and animal behavior in reaction to external forces such as weather or water temperature. Each of these data sets exhibit a different spatial grid and sampling rate so a simple combination would likely create erroneous assumptions and taint the results of the analysis. But through the use of data fusion, all data and attributes are brought together into a single view in which a more complete picture of the environment is created. This enables scientists to identify key locations and times and form new insights into the interactions between the environment and animal behaviors.

In the figure at right, rock lobsters are studied off the coast of Tasmania. Dr. Hugh Pederson of the University of Tasmania used data fusion software to fuse southern rock lobster tracking data (color-coded for in yellow and black for day and night, respectively) with bathymetry and habitat data to create a unique 4D picture of rock lobster behavior.

Data fusion vs. Data integration

In applications outside of the geospatial domain, differences in the usage of the terms Data integration and Data fusion apply. In areas such as business intelligence, for example, data integration is used to describe the combining of data, whereas data fusion is integration followed by reduction or replacement. Data integration might be viewed as set combination wherein the larger set is retained, whereas fusion is a set reduction technique with improved confidence.

Data Fusion and the JDL Model

In the mid-1980s, the Joint Directors of Laboratories formed the Data Fusion Subpanel (which later became known as the Data Fusion Group). The JDL/DFG introduced a model of data fusion that divided the various processes into 6 levels:

Level 0: Source Preprocessing/subobject refinement

Level 1: Object refinement

Level 2: Situation refinement

Level 3: Impact Assessment (or Threat Refinement)

Level 4: Process Refinement

Level 5: User Refinement (or Cognitive Refinement)

Although the JDL Model is still in use today, it is often criticized for its implication that the levels necessarily happen in order from 0-5 and also for its lack of adequate representation of the potential for a human-in-the-loop. Despite these shortcomings, the JDL model is useful for visualizing the data fusion process and also for facilitating discussion and common understanding (Hall et al. 2007).

See also

Application areas

References

  1. ^ Lawrence A. Klein (2004). Sensor and data fusion: A tool for information assessment and decision making. SPIE Press. p. 51. ISBN 0819454354. http://books.google.co.za/books?id=-782bo4u_ogC. 

General references

  1. Dave L. Hall and James Llinas, “Introduction to Multisensor Data Fusion”, Proc. of IEEE , Vol. 85, No. 1, pp. 6 – 23, Jan 1997.
  2. Erik Blasch, Ivan Kadar, John Salerno, Mieczyslaw Kokar, Subrata Dase, Gerald Powell, Daniel Corkill, and E. Euspini (2006), Issues and Challenges in Situation Assessment (Level 2 Fusion), Journal of Advances in Information Fusion, Vol 1, No 2, Dec. 2006.

Books

  • Liggins, Martin E., David L. Hall, and James Llinas. Multisensor Data Fusion, Second Edition Theory and Practice (Multisensor Data Fusion). CRC, 2008. ISBN 978-1-4200-5308-1
  • David L. Hall, Sonya A. H. McMullen, Mathematical Techniques in Multisensor Data Fusion (2004), ISBN 1580533353
  • Springer, Information Fusion in Data Mining (2003), ISBN 3540006761
  • H. B. Mitchell, Multi-sensor Data Fusion – An Introduction (2007) Springer-Verlag, Berlin, ISBN 9783540714637
  • S. Das, High-Level Data Fusion (2008), Artech House Publishers, Norwood, MA, ISBN 9781596932814 and 1596932813

External links


Wikimedia Foundation. 2010.

Look at other dictionaries:

  • Data Fusion — Datenfusion (engl. data fusion) bezeichnet die Zusammenführung und Vervollständigung lückenhafter Datensätze zur Datenbereinigung. Während bei der Duplikaterkennung die Datensätze weitgehend vollständig sind und nur kleine Abweichungen aufweisen …   Deutsch Wikipedia

  • data fusion — noun Set of methodologies for fusing information coming from different, and sometimes non homogeneous, sources. The result of fusion is a qualitatively different knowledge always referred to a context …   Wiktionary

  • Multi-Sensor Data Fusion — Multi Sensor Datenfusion (engl. multi sensor data fusion, kurz oft auch nur Data Fusion genannt) bezeichnet die Zusammenführung und Aufbereitung von bruchstückhaften und teilweise widersprüchlichen Sensordaten in ein homogenes, für den Menschen… …   Deutsch Wikipedia

  • Data integration — involves combining data residing in different sources and providing users with a unified view of these data.[1] This process becomes significant in a variety of situations, which include both commercial (when two similar companies need to merge… …   Wikipedia

  • Fusion — can refer to combining two or more distinct things *Cell fusion *Freezing, a chemistry term for a liquid undergoing a phase change into a solid *Gene fusion, a genetic event and molecular biology technique *Nuclear fusion, the process by which… …   Wikipedia

  • Data-centric programming language — defines a category of programming languages where the primary function is the management and manipulation of data. A data centric programming language includes built in processing primitives for accessing data stored in sets, tables, lists, and… …   Wikipedia

  • Data's Day — Star Trek: The Next Generation episode Dr. Crusher teaches Data tap dancing. Episode no …   Wikipedia

  • Data Owner — selten auch deutsch Dateneigner – ist ein Begriff aus dem Informationsmanagement. Entsprechend zum Process Owner, der für einen bestimmten Prozess zuständig ist, ist der Data Owner im Rahmen der Governance und Qualität von Daten für einen… …   Deutsch Wikipedia

  • Fusion power — The Sun is a natural fusion reactor. Fusion power is the power generated by nuclear fusion processes. In fusion reactions two light atomic nuclei fuse together to form a heavier nucleus (in contrast with fission power). In doing so they release a …   Wikipedia

  • Fusión en burbujas — La fusión en burbujas, también conocida como sonofusión , es el nombre no técnico para una reacción de fusión nuclear que algunos investigadores creen que ocurre durante una versión de alta presión de la sonoluminiscencia, una forma extrema de la …   Wikipedia Español


Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”

We are using cookies for the best presentation of our site. Continuing to use this site, you agree with this.