Data (computing)

Data (computing)

In computer science, data (play /ˈdtə/ day-tə or /ˈdætə/) is information in a form suitable for use with a computer.[1] Data is often distinguished from programs. A program is a sequence of instructions that detail a task for the computer to perform. In this sense, data is thus everything that is not program code.[2]

In an alternate usage, binary files (which are not human-readable) are sometimes called "data" as distinguished from human-readable "text".[3] The total amount of digital data in 2007 was estimated to be 281 billion gigabytes (= 281 exabytes).[4][5]

Data vs programs

Fundamentally, computers follow the instructions they are given. A set of instructions to perform a given task (or tasks) is called a "program". In the nominal case, the program, as executed by the computer, will consist of binary machine code. The elements of storage manipulated by the program, but not actually executed by the CPU, contain data.

Typically, different files are used to store programs vs data. Executable files contain programs; all other files are data files. However, executable files may also contain data which is "built-in" to the program. In particular, some executable files have a data segment, which nominally contains constants and initial values (both data).

For example: a user might first instruct the operating system to load a word processor program from one file, and then edit a document stored in another file. In this example, the document would be considered data. If the word processor also features a spell checker, then the dictionary (word list) for the spell checker would also be considered data. The algorithms used by the spell checker to suggest corrections would be considered code.

The line between program and data can become blurry. An interpreter, for example, is a program. The input data to an interpreter is itself a program—just not one expressed in native machine language. In many cases, the interpreted program will be a human-readable text file, which is manipulated with a text editor—more normally associated with plain text data. Metaprogramming similarly involves programs manipulating other programs as data. Also, for programs like compilers, linkers, debuggers, program updaters, etc. may other programs serve as data. The most special case is the case of self-modifying code, which manipulates itself.

See also


  1. ^ "data". American Heritage Dictionary of the English Language. Archived from the original on 2007-11-04. Retrieved 2007-03-19. 
  2. ^ "data". Webopedia. Retrieved 2007-03-19. 
  3. ^ "file(1)". OpenBSD Manual Pages. 2004-12-04. Retrieved 2007-03-19. 
  4. ^ Paul, Ryan (March 12, 2008). "Study: amount of digital info > global storage capacity". Ars Technica. Retrieved 2008-03-12. 
  5. ^ Gantz, John F. et al. (2008). "The Diverse and Exploding Digital Universe". International Data Corporation via EMC. Retrieved 2008-03-12. 

Wikimedia Foundation. 2010.

См. также в других словарях:

  • Data Intensive Computing — is a class of parallel computing applications which use a data parallel approach to processing large volumes of data typically terabytes or petabytes in size and typically referred to as Big Data. Computing applications which devote most of their …   Wikipedia

  • Data (disambiguation) — Data consist of propositions that reflect reality, such as measurements or observations of a variable. Data may also refer to: Data (computing), in computer science, often distinguished from code or software Data (Euclid), a book by ancient Greek …   Wikipedia

  • Data — For data in a computer science context, see Data (computing). For other senses of the word, see Data (disambiguation). See also datum, a disambiguation page. The term data refers to qualitative or quantitative attributes of a variable or set of… …   Wikipedia

  • Data modeling — The data modeling process. The figure illustrates the way data models are developed and used today. A conceptual data model is developed based on the data requirements for the application that is being developed, perhaps in the context of an… …   Wikipedia

  • Data syncretization — Date syncretization is the combining of differing data elements from a variety of source data, and managing the set as a tightly blended arrangement of data. It may also involve the unification of various data operations into a single framework… …   Wikipedia

  • Data-centric programming language — defines a category of programming languages where the primary function is the management and manipulation of data. A data centric programming language includes built in processing primitives for accessing data stored in sets, tables, lists, and… …   Wikipedia

  • Data parallelism — (also known as loop level parallelism) is a form of parallelization of computing across multiple processors in parallel computing environments. Data parallelism focuses on distributing the data across different parallel computing nodes. It… …   Wikipedia

  • Data Foundry — Type Privately held company Industry Telecommunications IT Services Area served Worldwide Key people Ron Yokubaitis Co CEO and Founder Carolyn Yokubaitis Co CEO and Founder Jonah Yokubaitis – Principal and Founder …   Wikipedia

  • Data security — is the means of ensuring that data is kept safe from corruption and that access to it is suitably controlled. Thus data security helps to ensure privacy. It also helps in protecting personal data. Data security is part of the larger practice of… …   Wikipedia

  • Data stream mining — is the process of extracting knowledge structures from continuous, rapid data records. A data stream is an ordered sequence of instances that in many applications of data stream mining can be read only once or a small number of times using… …   Wikipedia

Поделиться ссылкой на выделенное

Прямая ссылка:
Нажмите правой клавишей мыши и выберите «Копировать ссылку»