Netezza Create Table Command and Examples

Use the Netezza CREATE TABLE command to create a new, initially empty table in the current database. The CREATE TABLE command automatically creates a data type that represents the tuple type (structure type) corresponding to one row of the table.

Netezza Create Table Command Syntax

CREATE [ TEMPORARY | TEMP ] TABLE <table>
( <col> <type> [<col_constraint>][,<col> <type> [<col_constraint>]…]
[<table_constraint>[,<table_constraint>…]] )
[ DISTRIBUTE ON { RANDOM | [HASH] (<col>[,<col>…]) } ]
[ ORGANIZE ON { (<col>) | NONE } ]
[ ROW SECURITY ]

Read: Importance of right Distribution…

Netezza Failover – High Availability Architecture

The Netezza appliance includes the components it needs to keep functioning seamlessly in the event of a hardware issue, so that its availability is more than 99.99%. This is called the Netezza high availability architecture, and it keeps the system running in case of a Netezza failover. Every Netezza appliance has two hosts in a cluster, so that if one fails the other can take over the tasks being carried out.

Netezza Failover - High Availability Architecture Overview

The Netezza appliance uses Linux-HA (high availability) and Distributed Replicated Block Device (DRBD) for the host cluster…

Changing Netezza Table Distribution key and Example

Choosing the right distribution key is one of the most important factors in improving the performance of a Netezza server. If you created the table with RANDOM distribution, or distributed it on a column with lots of duplicate values, you should change the distribution key promptly; otherwise performance will suffer. Changing the Netezza table distribution key is the process of redistributing the Netezza table using Netezza nzsql.

Changing Netezza Table Distribution key and Example

You can achieve the redistribution in a couple of ways:
Redistribute using CTAS
Creating new table and loading data at…
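
As a rough illustration of the CTAS route listed above, the sketch below builds the nzsql statements such a redistribution might use. The helper name `ctas_redistribute_sql`, the `_redist` and `_old` suffixes, and the table and column names are all hypothetical, and the rename steps after the CTAS are an assumption about how the swap is usually completed, not something taken from the post.

```python
def ctas_redistribute_sql(table, new_key_cols, suffix="_redist"):
    """Build nzsql statements for a CTAS-based redistribution (sketch only).

    All names are placeholders; review the statements before running them.
    """
    new_table = f"{table}{suffix}"
    cols = ", ".join(new_key_cols)
    return [
        # Copy the data into a new table distributed on the new key.
        f"CREATE TABLE {new_table} AS SELECT * FROM {table} DISTRIBUTE ON ({cols});",
        # Assumed swap: keep the original under a backup name...
        f"ALTER TABLE {table} RENAME TO {table}_old;",
        # ...and give the redistributed copy the original name.
        f"ALTER TABLE {new_table} RENAME TO {table};",
    ]


if __name__ == "__main__":
    # Hypothetical table "sales" redistributed on "customer_id".
    for stmt in ctas_redistribute_sql("sales", ["customer_id"]):
        print(stmt)
```

The backup table is deliberately left in place so that row counts can be compared before it is dropped.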

Netezza Best Practices to Improve Performance

Today there is an increased demand for advanced analytics on big data. Netezza is designed with built-in functionality to perform advanced analytics on really big data sets. To improve performance, you should follow some Netezza best practices. Best practice should not mean hundreds of rules and regulations to follow. It is recommended that basic principles are followed for the following Netezza features:
Distribution
Data types
Statistics
Zone maps
Clustered base tables
Groom table command

Netezza Best Practices on Distributions

In a Netezza data warehouse appliance, good distribution is a fundamental element…

Netezza Internal DataTypes: rowid, createxid, deletexid and datasliceid

Netezza supports internal datatypes that the system uses to perform internal tasks on table records. The Netezza internal datatypes include rowid, transaction ID (createxid, deletexid), and dataslice. The following table shows the Netezza internal datatypes:

Internal Datatype    Column Name
rowid                rowid
transaction ID       createxid, deletexid
dataslice            datasliceid

Netezza Internal DataTypes: Rowid

Identifies a specific and unique record in the database. These row numbers are not necessarily sequential within a table. Usually, the initial rowid value is 100,000. The Netezza host assigns a range of sequential rowids to each SPU in the system. When…

Netezza Hash Function Usage and Examples

You can use the Netezza hash function to encode data by transforming the input into a hash code or hash value. Ideally, no two inputs produce the same hash value; in practice, the hash algorithm is designed to minimise collisions (two inputs having the same hash value). You can use the hash function to speed up data record retrieval by using a simple one-way lookup. The hash code is used as an index into a hash table, which holds a pointer to the data records. Netezza hash function also…

Netezza Skew and How to avoid it

You will hear a lot about skew if you are developing a data warehouse on Netezza, Redshift, Teradata, Hive or Impala. The performance of the system is directly linked to the uniform distribution of user data across all of the data slices in the system. When you create a table and then load data into the system, the rows of the table should be distributed uniformly among all the data slices. If some data slices have more rows of a table than others, this scenario is called skew.…
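
To make the idea of skew concrete, here is a small simulation of how rows might land on data slices for two different distribution keys. It is only a sketch: the slice count of 8, the function names, and the use of SHA-256 as the hash are assumptions for illustration, and Netezza's actual internal hash function is not reproduced here.

```python
import hashlib
from collections import Counter

NUM_SLICES = 8  # hypothetical number of data slices


def slice_for(key, num_slices=NUM_SLICES):
    """Map a distribution-key value to a slice number.

    SHA-256 is a stand-in hash, not the one Netezza uses internally.
    """
    digest = hashlib.sha256(str(key).encode("utf-8")).hexdigest()
    return int(digest, 16) % num_slices


def rows_per_slice(keys, num_slices=NUM_SLICES):
    """Count how many rows each slice would receive."""
    counts = Counter(slice_for(k, num_slices) for k in keys)
    return [counts.get(s, 0) for s in range(num_slices)]


def skew_ratio(counts):
    """Largest slice divided by the average slice; 1.0 means perfectly even."""
    avg = sum(counts) / len(counts)
    return max(counts) / avg if avg else 0.0


# A high-cardinality key (e.g. a unique id) versus a two-valued key.
unique_keys = range(100_000)
low_card_keys = ["M" if i % 2 else "F" for i in range(100_000)]

even = rows_per_slice(unique_keys)
skewed = rows_per_slice(low_card_keys)

if __name__ == "__main__":
    print("unique key :", even, "ratio", round(skew_ratio(even), 2))
    print("2-value key:", skewed, "ratio", round(skew_ratio(skewed), 2))
```

With only two distinct key values, at most two slices can ever receive rows, which is why a column with many duplicate values makes a poor distribution key.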

Importance of right Netezza Distribution key

This post is all about how data is distributed (the Netezza distribution key) in a Netezza server. Feel free to make comments or suggestions to improve it, or pass it on if you like. Let’s first understand how NPS stores data on disk drives. Each Snippet Processor in the Snippet Processing Unit (SPU) has its own CPU, FPGA, RAM and dedicated hard drive, and the data stored on that drive is called a data slice.

Read: Changing Netezza Table Distribution Key
Cluster Based Tables (CBT) in…
