Data Collection Fundamentals 121

“Data Collection Fundamentals" provides an overview of the basic life cycle, structures, and qualities of common data used in Industry 4.0. Data collection describes the process of collecting and analyzing various types of electronic information. As the collection develops, analytics add value to data and provide a competitive advantage to manufacturers.

After taking this class, users will be able to define what data collection is, describe how it functions and is deployed, and identify the different ways that collected data is processed and stored. Additionally, users will better understand the value and importance of the data being collected and appreciate the safety steps needed to protect it from internal and external threats.

Class Details

Class Name:
Data Collection Fundamentals 121
Number of Lessons:

Class Outline

  • Data
  • Common Types of Manufacturing Data
  • Data Collection
  • Quantitative and Qualitative Data
  • Data Sources
  • Data Collection Methods
  • Data Collection Review
  • Data Storage
  • Data Analysis
  • Data Safety
  • Final Review


  • Describe data.
  • Identify common data types in manufacturing.
  • Describe the use of a hypothesis in data collection.
  • Distinguish between quantitative and qualitative data.
  • Describe sources of data.
  • Distinguish between the inline and sampling methods.
  • Identify storage issues.
  • Describe data analysis.
  • Describe data collection safety.

Job Roles



Vocabulary Term Definition
5 Whys Technique A troubleshooting process where operators ask a series of "why" questions, usually five, in order to isolate the root cause of a problem. The 5 Whys Technique is also a useful starting point for other troubleshooting methods.
backup A strategy in which copies of original data files are stored on one or more separate devices. Backups are critical in order protect data that is lost or damaged.
CAD Computer-aided design. A computer software program that aids in the automated design and technical precision drawing of a part, product, process, or building. CAD can create three-dimensional (3D) digital models used for digital twins.
carbide insert A replaceable cutting bit made out of carbide that has multiple cutting edges. Carbide inserts can be indexed to another cutting edge once one is excessively worn.
cloud A network of remote servers that can be accessed through the internet. The cloud stores many software applications and can be used to back up data.
clutter Messy or disordered. Determining clutter during data collection helps identify what to store and what not to store.
computer numerical control CNC. A self-contained system of computers and precise motors that executes program instructions to guide machine tool components. Computer numerical control allows operators to program sequences of machining operations.
cyber threat Any potential event or attack that could access or damage computers or digital networks. Cyber threats may include inadvertent events or malicious attacks from hackers.
data A collection of numbers, facts, and information about a process or product. Data can be created, communicated, and recorded by sensors in smart objects.
data collection plan A document that defines all the details concerning an information gathering project. Data collection includes how much and what type of data is required and when and how it should be collected.
data warehouse A computer application or system that manages massive amounts of data from a variety of sources. Data warehouses can be located on premises or are cloud-based.
database Computer storage that holds data and is searchable. A database both stores and organizes information.
dataset Similar types of information collected into a single asset, or unit. Datasets are used by manufacturers to organize the information they collect.
external hard drives A memory storage device that stores and retrieves data on a computer. An external hard drive can connect to and access files on various computing devices, usually through a USB connection.
flash drives A small, portable memory card that can be used to store data, such as CNC part programs. Flash drives connect to hardware devices through a USB port.
forecast A prediction of demand patterns for a product, which is used to calculate future inventory levels. Forecasting in the digital supply chain typically results in more accurate estimates when ordering raw material and other supplies.
frequency An amount or rate of something measured in units like seconds. Frequencies are common measurements used with data collection.
HIPAA Health Insurance Portability and Accountability Act. Establishes national standards to protect medical records and other private health information. HIPAA rules typically apply to all personal data collected from healthcare providers.
hypothesis A tentative theory based on observation. Hypotheses can be tested through further observation and research.
Industrial Internet of Things IIoT. A network of physical devices used in manufacturing that contain computing systems that allow them to send and receive data. The Industrial Internet of Things allows devices to exchange data and automate processes without any human intervention.
Industry 4.0 A stage in manufacturing that uses connected devices and digital technologies. Industry 4.0 uses automation and data exchange to achieve advancements in a variety of industries.
inline method A process of counting each item in sequence during manufacturing or other process. Inline methods of data collection typically capture all data during a specific period.
International Automotive Task Force IATF. A group of automotive manufacturers and their respective trade associations formed to improve product quality. International Automotive Task Force members include automakers from the U.S., the U.K., and Europe.
International Organization for Standardization ISO. A non-governmental organization based in Switzerland that develops and establishes standards, rules, and guidelines designed to ensure that products, processes, and services are fit for their purposes. The International Organization for Standardization took its abbreviation ISO from the Greek word isos, which means equal.
machine tool A power-driven machine that holds a variety of tools. Machine tools can hold a variety of cutting and manufacturing tools.
malware Any malicious code or software that can potentially harm a computer, device, or network, or retrieve data from the network or device without authorization. Malware often exists undetected on systems for extended periods of time.
passwords A series of characters, known only by authorized users, that allow the users to access an otherwise locked digital system. Passwords effectively prevent unauthorized access as long as they are not shared or discovered by unauthorized users.
production metrics Data that tracks the number or rate at which parts are produced. Production metrics can be used to detect errors and track an operation&#8217s progress.
production rates The speed at which a manufacturing operation produces parts. Production rates, or build rates, for smart manufacturing can be different than traditional manufacturing..
qualitative data Measuring the descriptive characteristics of a thing such as height, weight, or gender. Qualitative typically describes any non-numeric data.
quantitative data Measuring an amount or number. Quantitative data typically includes anything that can be counted or measured numerically.
sampling method A data collection strategy that uses a representative part or small group of parts from a larger group. In sampling methods, a larger sample increases accuracy.
scrap rate The percentage of material not used to create the final part. Scrap rates describe the material removed during machining and also parts that are out of tolerance that can’t be sold or used.
server The physical computer that shares information with other computers within its network. The server for a network of CNC machines would share part programs.
server clustering A system backup strategy in which contents of a primary server are duplicated and constantly updated on a group of synchronized servers. Server clustering helps prevent data loss and slower processing speeds when the volume of data being transferred is high.
server mirroring A system backup strategy in which contents of a primary server are duplicated and constantly updated on a separate server or storage device. Server mirroring can help organizations recover from cyber attacks by restoring lost or compromised files.
smart manufacturing Technologically integrated manufacturing that creates and uses data in real time to address the needs of the factory, supplier, and customer. Smart manufacturing is an advancement of traditional manufacturing automation.
smart sensor A device equipped with software that can detect physical inputs, process them as data, and output digital signals. Smart sensors are more advanced than normal digital sensors since they can process data internally rather than simply sending digital signals to an external system to be processed.
tooling Assorted tools used in various manufacturing processes. Tooling is used in many machine operations such as milling, turning, and additive manufacturing among other processes.
two-step verification A security measure that requires users to enter additional information in addition to a password when logging into or accessing a system. Two-step verification methods include entering temporary codes sent to trusted devices and answering security questions.
volume A measurement of the amount of data used within a network or stored by a database. Volume is a key calculation used with data collection.