What is Netezza - learn Reading Time: 5 minutes

Netezza Twinfin is the advanced analytics and warehousing solution provided by IBM. It currently has been rebranded as IBM Puredata for analytics (PDA).

Netezza utilizes a restrictive design called Asymmetric Massively Parallel Processing (AMPP) which joins the enormous information processing proficiency of Massively Parallel Processing (MPP) where nothing (CPU, memory, stockpiling) is shared and symmetric multiprocessing to arrange the equal processing. The MPP is accomplished through an array of S-Blades which are workers on its own running its own working frameworks associated with plates. While there might be different items which follow comparable design, one extraordinary equipment part utilized by Netezza called the Database Accelerator card which is joined to the S-Blades. These quickening agent cards can play out a portion of the question processing stages while information is being perused from the circle rather than the processing being done in the CPU. Moving huge measure of information from the circle to the CPU and playing out all the phases of question processing in the CPU is one of the significant bottlenecks in the huge numbers of the data set administration frameworks utilized for information warehousing and investigation use cases.

The fundamental equipment segments of the Netezza machine are a host which is a Linux worker, which can convey to an array of S-Blades every one of which has 8 processor centers and 16 GB of RAM running Linux working framework. Every processor in the S-Blade is associated with plates in a circle array through a Database Accelerator card which utilizes FPGA innovation. Host is additionally liable for all the customer collaborations to the apparatus like dealing with information base questions, meetings and so on alongside dealing with the meta-information about the items like data set, tables and so on put away in the apparatus. The S-Blades among themselves and to the host can convey through an exclusively fabricated IP based superior organization.

Netezza S-Blades

The S-Blades are likewise alluded as Snippet Processing Array or SPA in short and every CPU in the S-Blades joined with the Database Accelerator card appended to the CPU is alluded as a Snippet Processor.

Netezza S-Blades

Let us use the example of a Data Warehouse for a huge retail firm and one of the tables store the insights concerning the entirety of its 10 million clients. Likewise expect that there are 25 columns in the tables and the absolute length of each table column is 250 bytes. In Netezza the 10 million client records will be stored fairly equally across all the disks available in the disk arrays connected to the snippet processors in the S-Blades in a compressed form. At the point when an user queries for state Customer Id, Name and State who joined the retail firm in a specific period arranged by state and name, the below is how the processing will occur:

Conclusion

FPGA Accelerator Full Netezza Course Next: Netezza Architecture



Meet Ananth Tirumanur. Hi there ๐Ÿ‘‹

I work on projects in data science, big data, data engineering, data modeling, software engineering, and system design.

Connect with me:

My Resources:

Languages and Tools:

AWS, Bash, Docker, Elasticsearch, Git, Grafana, Hadoop, Hive, EMR, Glue, Athena, Lambda, Step Functions, Airflow/MWAA, DynamoDB, Kafka, Kubernetes, Linux, MariaDB, MySQL, Pandas, PostgreSQL, Python, Redis, Scala, SQLite