Geeks With Blogs
Saqib Ullah BootStrapper Know How

I have found very nice definition of Cardinality and its types.

Definition

In SQL (Structured Query Language), the term cardinality refers to the uniqueness of data values contained in a particular column (attribute) of a database table. The lower the cardinality, the more duplicated elements in a column. Thus, a column with the lowest possible cardinality would have the same value for every row. SQL databases use cardinality to help determine the optimal query plan for a given query.

 

Values of Cardinality

When dealing with columnar value sets, there are 3 types of cardinality: high-cardinality, normal-cardinality, and low-cardinality.

High-cardinality refers to columns with values that are very uncommon or unique. High-cardinality column values are typically identification numbers, email addresses, or user names. An example of a data table column with high-cardinality would be a USERS table with a column named USER_ID. This column would contain unique values of 1-n. Each time a new user is created in the USERS table, a new number would be created in the USER_ID column to identify them uniquely. Since the values held in the USER_ID column are unique, this column's cardinality type would be referred to as high-cardinality.

Normal-cardinality refers to columns with values that are somewhat uncommon. Normal-cardinality column values are typically names, street addresses, or vehicle types. An example of a data table column with normal-cardinality would be a CUSTOMER table with a column named LAST_NAME, containing the last names of customers. While some people have common last names, such as Smith, others have uncommon last names. Therefore, an examination of all of the values held in the LAST_NAME column would show "clumps" of names in some places (e.g.: a lot of Smith's ) surrounded on both sides by a long series of unique values. Since there is a variety of possible values held in this column, its cardinality type would be referred to as normal-cardinality.

Low-cardinality refers to columns with few unique values. Low-cardinality column values are typically status flags, Boolean values, or major classifications such as gender. An example of a data table column with low-cardinality would be a CUSTOMER table with a column named NEW_CUSTOMER. This column would contain only 2 distinct values: Y or N, denoting whether the customer was new or not. Since there are only 2 possible values held in this column, its cardinality type would be referred to as low-cardinality.

 

Posted on Friday, February 22, 2013 8:25 PM | Back to top


Comments on this post: What is Database Cardinality

# Fontaneros Madrid
Requesting Gravatar...
I've been looking for this information, I have done much
Left by leonardo osorio on Feb 23, 2013 3:17 AM

# re: What is Database Cardinality
Requesting Gravatar...
Thanks for the Cardinality definition. The new semester starts soon and need this information for a programming project.
Left by Sandra on Mar 30, 2013 12:43 AM

# re: What is Database Cardinality
Requesting Gravatar...
Thanks, I'm currently writing a new essay about Cardinality of DB. This is very useful for that.
Left by Jana on Apr 14, 2013 6:02 AM

# re: What is Database Cardinality
Requesting Gravatar...
My database is back to speed and works well again, thanks for your detailed tipps.
Left by Sarah on Apr 29, 2013 12:21 AM

# re: What is Database Cardinality
Requesting Gravatar...
Thanks for that tipp, this will be useful for my new Database project.
Left by Dana on May 20, 2013 6:31 AM

Your comment:
 (will show your gravatar)


Copyright © Saqib Ullah | Powered by: GeeksWithBlogs.net | Join free