Transactional data

Transactional data is data generated by a transaction, such as a sale, a payment, or a transfer. It can include the date and time of the transaction, the amount of money involved, the parties involved, and any other relevant details. This data can be used to track the performance of a business, to identify trends, and … Read more
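As a rough illustration of the fields listed above, a transaction record can be modeled and then summarized per day to expose a simple trend. The `Transaction` class, its field names, and the sample values are assumptions made for this sketch, not a standard schema.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime
from decimal import Decimal


@dataclass(frozen=True)
class Transaction:
    # Hypothetical schema: the field names are illustrative only.
    timestamp: datetime
    amount: Decimal
    payer: str
    payee: str


def daily_totals(transactions):
    """Sum transaction amounts per calendar day."""
    totals = defaultdict(Decimal)  # Decimal() starts each day at 0
    for tx in transactions:
        totals[tx.timestamp.date()] += tx.amount
    return dict(totals)


sample = [
    Transaction(datetime(2024, 1, 1, 9, 30), Decimal("19.99"), "alice", "shop"),
    Transaction(datetime(2024, 1, 1, 14, 5), Decimal("5.01"), "bob", "shop"),
    Transaction(datetime(2024, 1, 2, 11, 0), Decimal("42.00"), "alice", "shop"),
]
totals = daily_totals(sample)
```

`Decimal` is used instead of `float` here so that monetary amounts add up without binary rounding error.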

Data janitor (data wrangler)

A data janitor (data wrangler) is a person responsible for organizing, cleaning, and maintaining large data sets. Typical tasks include sorting data, removing duplicates, filling in missing values, and resolving inconsistencies. Data janitors often work with data scientists and analysts to help them prepare data for analysis. … Read more
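The cleaning tasks named above can be sketched in a few lines of plain Python. The record layout, the choice of the email address as the duplicate key, and the `"unknown"` placeholder are all assumptions made for illustration; real cleaning rules depend on the data set.

```python
def clean_records(records, default_city="unknown"):
    """Normalize, de-duplicate, fill missing values, and sort a list of dicts."""
    seen = set()
    cleaned = []
    for record in records:
        # Resolve inconsistencies: trim whitespace and normalize case.
        name = record["name"].strip().title()
        email = record["email"].strip().lower()
        # Fill in a missing value with a placeholder.
        city = (record.get("city") or default_city).strip()
        # Remove duplicates, treating the normalized email as the key.
        if email in seen:
            continue
        seen.add(email)
        cleaned.append({"name": name, "email": email, "city": city})
    # Sort by name for stable, predictable output.
    return sorted(cleaned, key=lambda r: r["name"])


raw = [
    {"name": " bob smith", "email": "BOB@example.com", "city": "Oslo"},
    {"name": "Alice Jones", "email": "alice@example.com", "city": None},
    {"name": "Bob Smith ", "email": "bob@example.com ", "city": "Oslo"},
]
cleaned = clean_records(raw)
```

Normalizing before comparing matters: the two "Bob Smith" rows only count as duplicates once case and whitespace differences are removed.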

Reference data

Reference data is data that other data refers to in order to identify, classify, or categorize a record or entity within a dataset. It can be used to cross-reference other data, or to provide additional context for analysis. For example, a reference dataset may contain customer IDs, product IDs, or geographic codes that can be used to link to other … Read more
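A minimal sketch of cross-referencing against a reference dataset, assuming a small lookup table of geographic codes. The table contents, the `orders` layout, and the `enrich` helper are hypothetical.

```python
# Hypothetical reference dataset: country code -> country name.
COUNTRY_CODES = {"DE": "Germany", "FR": "France", "JP": "Japan"}

orders = [
    {"order_id": 1, "country_code": "DE", "total": 120.0},
    {"order_id": 2, "country_code": "JP", "total": 75.5},
    {"order_id": 3, "country_code": "XX", "total": 10.0},
]


def enrich(orders, reference):
    """Attach the referenced name to each order and flag unknown codes."""
    enriched = []
    for order in orders:
        name = reference.get(order["country_code"])
        enriched.append({**order, "country": name, "valid_code": name is not None})
    return enriched


enriched = enrich(orders, COUNTRY_CODES)
```

Besides adding context, the lookup doubles as a validity check: the order with code `"XX"` is flagged because that code does not appear in the reference set.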

Apache Parquet

Apache Parquet is a columnar storage format from the Hadoop ecosystem that supports nested data structures and provides efficient compression and encoding of data. A Parquet file consists of a header followed by a series of blocks (row groups) and ends with a footer that holds the file metadata. Each block contains compressed chunks of column data. The … Read more
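To see why a columnar layout tends to compress well, the sketch below pivots rows into columns and run-length encodes each column. This only illustrates the idea in plain Python; it does not use the Parquet file format or any Parquet library, and the helper names are made up for the example.

```python
from itertools import groupby


def to_columns(rows):
    """Pivot a list of row dicts into one list of values per column."""
    return {key: [row[key] for row in rows] for key in rows[0]}


def run_length_encode(values):
    """Collapse runs of equal adjacent values into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


rows = [
    {"country": "DE", "status": "paid"},
    {"country": "DE", "status": "paid"},
    {"country": "DE", "status": "open"},
    {"country": "FR", "status": "open"},
]
columns = to_columns(rows)
encoded = {name: run_length_encode(values) for name, values in columns.items()}
```

Storing values of the same column next to each other produces long runs of similar values, which is what makes encodings such as run-length or dictionary encoding effective.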

Data virtualization software

Data virtualization software is a type of software that lets users access and manipulate data through a virtual (abstraction) layer, without needing to know where or how the data is physically stored. Businesses often use it to give employees a unified view of data held in multiple systems or locations, or to let customers access data remotely. Data virtualization software can also … Read more
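One common reading of data virtualization is a layer that answers queries by reading the underlying sources on demand rather than copying them into a new store. The sketch below assumes two sources, an in-memory SQLite database and a CSV export; the `VirtualCustomerView` class and all table and column names are hypothetical.

```python
import csv
import io
import sqlite3

# Source 1 (assumed): a relational database holding customers.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
db.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Alice"), (2, "Bob")])

# Source 2 (assumed): a CSV export holding orders.
ORDERS_CSV = "customer_id,total\n1,30.0\n1,12.5\n2,8.0\n"


class VirtualCustomerView:
    """Hypothetical virtual layer: reads both sources on demand, stores nothing."""

    def __init__(self, connection, orders_csv):
        self._connection = connection
        self._orders_csv = orders_csv

    def customer_totals(self):
        # Aggregate order totals per customer from the CSV source.
        totals = {}
        for row in csv.DictReader(io.StringIO(self._orders_csv)):
            cid = int(row["customer_id"])
            totals[cid] = totals.get(cid, 0.0) + float(row["total"])
        # Combine with customer names from the database source.
        query = "SELECT id, name FROM customers ORDER BY id"
        return [
            {"name": name, "total": totals.get(cid, 0.0)}
            for cid, name in self._connection.execute(query)
        ]


view = VirtualCustomerView(db, ORDERS_CSV)
result = view.customer_totals()
```

The caller sees one combined result and never needs to know that it came from two differently shaped sources.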

Support vector machine (SVM)

A support vector machine (SVM) is a supervised learning algorithm that can be used for both classification and regression tasks. For classification, it is based on finding the hyperplane that best separates the data into two classes. Many hyperplanes may separate the data; from these candidates, the SVM chooses … Read more
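As a minimal sketch of the idea, the code below fits a linear separating hyperplane by sub-gradient descent on the regularized hinge loss, which is the soft-margin linear SVM objective. This is a simplified teaching version, not the solver that SVM libraries typically use; the function names, hyperparameter values, and toy data are assumptions.

```python
def train_linear_svm(points, labels, epochs=200, learning_rate=0.1, reg=0.01):
    """Fit w, b for the rule sign(w.x + b) by minimizing regularized hinge loss."""
    w = [0.0] * len(points[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(points, labels):
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            if margin < 1:
                # Inside the margin or misclassified: move the hyperplane away.
                w = [wi + learning_rate * (y * xi - reg * wi) for wi, xi in zip(w, x)]
                b += learning_rate * y
            else:
                # Safely classified: apply only the regularization shrinkage.
                w = [wi - learning_rate * reg * wi for wi in w]
    return w, b


def predict(w, b, x):
    """Return +1 or -1 depending on which side of the hyperplane x falls."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1


# Toy, linearly separable data with labels in {-1, +1}.
points = [(-2.0, -1.0), (-1.0, -2.0), (-2.0, -2.0), (2.0, 1.0), (1.0, 2.0), (2.0, 2.0)]
labels = [-1, -1, -1, 1, 1, 1]
w, b = train_linear_svm(points, labels)
predictions = [predict(w, b, x) for x in points]
```

The regularization term keeps the weights small, which corresponds to preferring a wide margin between the two classes.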

Golden record

A golden record is a single, consolidated view of an entity that contains the most accurate and complete information about that entity. A golden record can be used to provide a consistent view of an entity across different systems and can serve as the authoritative source of information for an organization. The term is often … Read more
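A small sketch of how a golden record might be assembled from several systems. The survivorship rule used here (for each field, take the value from the most recently updated record that has one) is just one possible rule, and the systems, fields, and values are invented for the example.

```python
from datetime import date

# Hypothetical records for the same customer held in three different systems.
source_records = [
    {"system": "crm", "updated": date(2023, 5, 1),
     "name": "A. Jones", "email": "alice@example.com", "phone": None},
    {"system": "billing", "updated": date(2024, 2, 10),
     "name": "Alice Jones", "email": None, "phone": "555-0100"},
    {"system": "support", "updated": date(2022, 11, 3),
     "name": "Alice J.", "email": "aj@example.com", "phone": "555-0199"},
]


def build_golden_record(records, fields=("name", "email", "phone")):
    """For each field, keep the value from the newest record that has one."""
    newest_first = sorted(records, key=lambda r: r["updated"], reverse=True)
    golden = {}
    for field in fields:
        golden[field] = next(
            (r[field] for r in newest_first if r[field] is not None), None
        )
    return golden


golden = build_golden_record(source_records)
```

Here the name and phone number come from the billing system, while the email falls back to the CRM because the newer billing record has none.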

System of record (SOR)

The system of record (SOR) is the authoritative data source for a particular data element or set of data elements. In most cases, the system of record is the source system where the data is first entered. The system of record is often the source of truth for a particular data element. For example, if … Read more
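One way to make the idea concrete is a registry that names the authoritative system for each data element, so that conflicting copies are resolved by looking up the owner. The registry contents, system names, and the `authoritative_value` helper are hypothetical.

```python
# Hypothetical registry: which system is authoritative for which data element.
SYSTEM_OF_RECORD = {
    "customer_email": "crm",
    "account_balance": "core_banking",
}


def authoritative_value(element, values_by_system):
    """Return the value held by the system of record, ignoring other copies."""
    system = SYSTEM_OF_RECORD[element]
    return values_by_system[system]


# The same element as seen by three systems; only the CRM copy is authoritative.
email_copies = {
    "crm": "alice@example.com",
    "billing": "alice@old-domain.example",
    "marketing": "a.jones@example.com",
}
email = authoritative_value("customer_email", email_copies)
```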

Driver’s Privacy Protection Act (DPPA)

The Driver’s Privacy Protection Act (DPPA) is a U.S. federal law, enacted in 1994, that restricts how state motor vehicle departments (DMVs) may disclose personal information contained in drivers’ motor vehicle records. Disclosure is generally prohibited unless it falls under one of the permitted uses listed in the statute or the driver has consented. … Read more

Core banking system

A core banking system (CBS) is the software used to support a bank’s most common transactions. This can include things like deposits, withdrawals, transfers, and account management. The CBS is the backbone of a bank’s operations, and as such, it is critical that it is secure, stable, and scalable. A CBS is usually implemented as … Read more