Optimization of Loop Unrolling on dense Vectormatrix multiplication Parallel Processing - PowerPoint PPT Presentation

1 / 8

About This Presentation

Title:

Optimization of Loop Unrolling on dense Vectormatrix multiplication Parallel Processing

Description:

Optimization of Loop Unrolling on dense Vector-matrix multiplication -Parallel Processing ... Conclusion - Result Table. 1.00. 2.23, 2.23. 1048.576. 0.66. 2.000 ... – PowerPoint PPT presentation

Number of Views:216

Avg rating:3.0/5.0

Slides: 9

Provided by: smal9

Category:

Tags: dense | loop | multiplication | optimization | parallel | processing | table | unrolling | vectormatrix

Transcript and Presenter's Notes

Title: Optimization of Loop Unrolling on dense Vectormatrix multiplication Parallel Processing

1
Optimization of Loop Unrolling on dense
Vector-matrix multiplication
-Parallel Processing

By Sumit Malhotra
Computer Science, Florida Tech
767050340
Dr. Charles Fulton

2
Aim of project

To find the best loop unrolling parameters for
different number of processors on a 5120 X 5120
matrix.

3
Algorithm for Matrix Multiplication

m n 5120
for (i0 i lt local_m iUNROLL2)
for (j0 j lt n jUNROLL)
matrix multiplication
Where UNROLL2 and UNROLL are loop unrolling
parameters
and local_m m/p and p number of processors.
Therefore the size of matrix on each processor
will be local_m x n.

4
Size of matrix on each processor

Size of Matrix when p1 5120 X 5120
Size of Matrix when p2 2560 X 5120
Size of Matrix when p4 1280 X 5120
Size of Matrix when p8 640 X 5120
Where p
number of processors.

5
Sample Code

UNROLL2 UNROLL 2
for (i0 i lt local_m iUNROLL2)
for (j0 j lt n jUNROLL)
yi local_Aij xj
local_Aij1 xj1
yi1 local_Ai1j xj
local_Ai1j1
xj1

6
Sample Code

UNROLL2 2, UNROLL 4.
for (i0 i lt local_m iUNROLL2)
for (j0 j lt n jUNROLL)
yi local_Aij xj
local_Aij1 xj1
local_Aij2 xj2
local_Aij3 xj3
yi1 local_Ai1j xj
local_Ai1j1 xj1
local_Ai1j2 xj2
local_Ai1j3 xj3

7
Time Calculation

Start clock()
Multiplication code //Computation.
MPI_Gather() //Communication.
End clock()
Total Computation Communication Time Start
End
Start clock()
MPI_Gather() //Communication.
End clock()
Communication Time Start End
Start clock()
MPI_Scatter() //Communication.
End clock()
Scatter Time Start End

8
Conclusion - Result Table

Write a Comment

User Comments (0)

About PowerShow.com

Recommended Relevance Latest Highest Rated Most Viewed

Sort by:

Related More from user

CrystalGraphics Presentations

World's Best PowerPoint Templates PowerPoint PPT Presentation

World's Best PowerPoint Templates - CrystalGraphics offers more PowerPoint templates than anyone else in the world, with over 4 million to choose from. Winner of the Standing Ovation Award for “Best PowerPoint Templates” from Presentations Magazine. They'll give your presentations a professional, memorable appearance - the kind of sophisticated look that today's audiences expect. Boasting an impressive range of designs, they will support your presentations with inspiring background photos or videos that support your themes, set the right mood, enhance your credibility and inspire your audiences.

CrystalGraphics 3D Character Slides for PowerPoint PowerPoint PPT Presentation

CrystalGraphics 3D Character Slides for PowerPoint - CrystalGraphics 3D Character Slides for PowerPoint

Chart and Diagram Slides for PowerPoint PowerPoint PPT Presentation

Chart and Diagram Slides for PowerPoint - Beautifully designed chart and diagram s for PowerPoint with visually stunning graphics and animation effects. Our new CrystalGraphics Chart and Diagram Slides for PowerPoint is a collection of over 1000 impressively designed data-driven chart and editable diagram s guaranteed to impress any audience. They are all artistically enhanced with visually stunning color, shadow and lighting effects. Many of them are also animated. And they’re ready for you to use in your PowerPoint presentations the moment you need them. – PowerPoint PPT presentation

Related Presentations

Parallel%20Implementation%20of%20Ant%20Colony%20Optimization%20on%20%20Traveling%20Salesman%20problem PowerPoint PPT Presentation

Parallel%20Implementation%20of%20Ant%20Colony%20Optimization%20on%20%20Traveling%20Salesman%20problem - Parallel Implementation of Ant Colony ... Ant colony optimization algorithm is a ... Key concept of ACO based on communication among ants based on the use ... | PowerPoint PPT presentation | free to view

Pipeline Optimization PowerPoint PPT Presentation

Pipeline Optimization - Pipeline Optimization Pipeline with data forwarding and accelerated branch Loop Unrolling Dual Pipeline | PowerPoint PPT presentation | free to view

Optimization%20of%20application%20in%20virtual%20laboratory%20constructing%20workflows%20based%20on%20application%20sources%20and%20providing%20data%20for%20workflow%20scheduling%20algorithms PowerPoint PPT Presentation

Optimization%20of%20application%20in%20virtual%20laboratory%20constructing%20workflows%20based%20on%20application%20sources%20and%20providing%20data%20for%20workflow%20scheduling%20algorithms - Optimization of application in virtual laboratory constructing workflows based on application sources and providing data for workflow scheduling algorithms | PowerPoint PPT presentation | free to view

Parallel Robots Market 2019 By Global Industry Trends, Future Plans And Opportunity Assessment 2026 PowerPoint PPT Presentation

Parallel Robots Market 2019 By Global Industry Trends, Future Plans And Opportunity Assessment 2026 - Parallel robots are vertical overhead robots that have two or more arms which are interconnected through several joints/links. These robots are segregated based on their functioning namely, Hexapods which are used due to their motion freedom in different applications; whereas, Delta robots are used for simple pick and place applications. | PowerPoint PPT presentation | free to view

A Performance Optimization Framework for Compilation of Tensor Contraction Expressions Into Parallel Programs PowerPoint PPT Presentation

A Performance Optimization Framework for Compilation of Tensor Contraction Expressions Into Parallel Programs - A Performance Optimization Framework for Compilation of Tensor Contraction Expressions Into Parallel Programs Gerald Baumgartner, Ohio State David E. Bernholdt, ORNL | PowerPoint PPT presentation | free to view

Input-Series–Output-Parallel-Connected Buck Rectifiers for High-Voltage Applications || 2015-2016 IEEE Power electronics Projects Training PowerPoint PPT Presentation

Input-Series–Output-Parallel-Connected Buck Rectifiers for High-Voltage Applications || 2015-2016 IEEE Power electronics Projects Training - Input-Series–Output-Parallel-Connected Buck Rectifiers for High-Voltage Applications || 2015-2016 IEEE Power electronics Projects Training Contact: IIS TECHNOOGIES ph:9952077540,landline:044 42637391 mail:info@iistechnologies.in | PowerPoint PPT presentation | free to view

Parallel Robots Market Challenges On Upcoming Trends, Future Prediction Report 2019-2026 PowerPoint PPT Presentation

Parallel Robots Market Challenges On Upcoming Trends, Future Prediction Report 2019-2026 - Global parallel robots market is expected to grow with a steady CAGR in the forecast period of 2019-2026. The report contains data from the base year of 2018 and the historic year of 2017. This rise in market value can be attributed to the rise in the automation of industries and operations from various end-user verticals. | PowerPoint PPT presentation | free to view

Complete Magento Store Performance Optimization Process PowerPoint PPT Presentation

Complete Magento Store Performance Optimization Process - Complete Magento 2 Speed Optimization Process to Speed up your eCommerce Store. Our Magento 2 Developers follow each complex strategy to get the best results. read more at: https://www.vihadigitalcommerce.com/speed-up-magento-store-with-performance-optimization-techniques-part-1/ | PowerPoint PPT presentation | free to view

Query processing and optimization PowerPoint PPT Presentation

Query processing and optimization - Query processing and optimization Definitions Query processing translation of query into low-level activities evaluation of query data extraction Query optimization ... | PowerPoint PPT presentation | free to view

Instruction Level Parallelism (ILP) PowerPoint PPT Presentation

Instruction Level Parallelism (ILP) - Instruction Level Parallelism (ILP) Colin Stevens What is a parallel instruction? ILP is a measure of the number of instructions that can be performed during a single ... | PowerPoint PPT presentation | free to view

Tata Local Loop Service Provider in India | Price and Tariff Plans | Call: 9036000187 PowerPoint PPT Presentation

Tata Local Loop Service Provider in India | Price and Tariff Plans | Call: 9036000187 - Call: 9036000187 Tata Local Loop Service Provider in India - Bangalore. Tata Provides Best Price and tariff Plans | PowerPoint PPT presentation | free to view

Query Optimization Over Web Services PowerPoint PPT Presentation

Query Optimization Over Web Services - Query Optimization Over ... /Ki* is minimized Profiling combined with query processing for trying out various chunk ... Let WS1, . . . , WSn be a plan with a ... | PowerPoint PPT presentation | free to view

Optimizing single thread performance PowerPoint PPT Presentation

Optimizing single thread performance - Optimizing single thread performance Dependence Loop transformations | PowerPoint PPT presentation | free to view

Program Optimization PowerPoint PPT Presentation

Program Optimization - Matrix multiplication. Multiply n-by-n matrices A and B, and store in matrix C ... Daily, times TBA on course mailing list. Review sessions ... | PowerPoint PPT presentation | free to view

Using extremal optimization for Java program initial placement in clusters of JVMs PowerPoint PPT Presentation

Using extremal optimization for Java program initial placement in clusters of JVMs - Using extremal optimization for Java program initial ... the application A ProActive Java multi ... Poland 2Institute of High Performance Computing ... | PowerPoint PPT presentation | free to view

CS 213: Parallel Processing Architectures PowerPoint PPT Presentation

CS 213: Parallel Processing Architectures - Parallelism moved to instruction level. Microprocessor performance ... Process Level or Thread level parallelism; mainstream for general purpose computing? ... | PowerPoint PPT presentation | free to view

Parallel Optimization Tools for High Performance Design of Integrated Circuits PowerPoint PPT Presentation

Parallel Optimization Tools for High Performance Design of Integrated Circuits - have no idea how far might be from the optimal solution. 2. Months Late. Profit. 0. 3. 6 ... applies pruning of sub-optimal branches ... | PowerPoint PPT presentation | free to view

Program looping PowerPoint PPT Presentation

Program looping - Program looping Why we need loop Make code concise for repetitive processes When to use loop Run a block of code repetitively Process multiple data using same procedure | PowerPoint PPT presentation | free to view

A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms PowerPoint PPT Presentation

A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms - A Prototypical Self-Optimizing Package for Parallel Implementation of Fast Signal Transforms Kang Chen and Jeremy Johnson Department of Mathematics and Computer Science | PowerPoint PPT presentation | free to view

Parallel Job Deployment and Monitoring in a Hierarchy of Mobile Agents PowerPoint PPT Presentation

Parallel Job Deployment and Monitoring in a Hierarchy of Mobile Agents - Parallel Job Deployment and Monitoring in a Hierarchy of Mobile Agents Munehiro Fukuda Computing & Software Systems, University of Washington, Bothell | PowerPoint PPT presentation | free to view

Tutorial on Neural Network Models for Speech and Image Processing PowerPoint PPT Presentation

Tutorial on Neural Network Models for Speech and Image Processing - ... Applications in speech and image processing PART I Feature Extraction and Classification Problems in ... Analysis Feature extraction Image ... | PowerPoint PPT presentation | free to view

Dissertation Algorithms Tips For Developing Whale Optimization Algorithms - PhD Assistance PowerPoint PPT Presentation

Dissertation Algorithms Tips For Developing Whale Optimization Algorithms - PhD Assistance - Dissertation Algorithms Tips For Developing Whale Optimization Algorithms - PhD Assistance - http://bit.ly/2PRQ4u1 • You will find the best dissertation research areas / topics for future researchers enrolled in Engineering • In order to identify the future research topics, we have reviewed the Engineering literature (recent peer-reviewed studies) on optimization process. • The nature-inspired meta-heuristic optimization algorithm is the recent trend in Artificial Intelligence. Read More : http://www.phdassistance.com/industries/computer-science-information/ #WHALEOPTIMIZATIONALGORITHM #PhDAssistance #engineeringapplications For Any Queries : Website: www.phdassistance.com Phd Research Lab : www.research.phdassistance.com Email: info@phdassistance.com Phone : +91-4448137070 Contact Name Ganesh / Vinoth Kumar | PowerPoint PPT presentation | free to view

Design and Synthesis of Image Processing Systems using Reconfigurable Dataflow Graphs PowerPoint PPT Presentation

Design and Synthesis of Image Processing Systems using Reconfigurable Dataflow Graphs - Design and Synthesis of Image Processing Systems using Reconfigurable Dataflow Graphs Mainak Sen and Shuvra S. Bhattacharyya Department of Electrical and Computer ... | PowerPoint PPT presentation | free to view

A Parallel, High Performance Implementation of the Dot Plot Algorithm PowerPoint PPT Presentation

A Parallel, High Performance Implementation of the Dot Plot Algorithm - A Parallel, High Performance Implementation of the Dot Plot Algorithm Chris Mueller July 8, 2004 Overview Motivation Availability of large sequences Dot plot offers ... | PowerPoint PPT presentation | free to view

Operation Pretreatment Process of Textile PowerPoint PPT Presentation

Operation Pretreatment Process of Textile - Operation Pretreatment Process of Textile Pretreatment Process of Textile Materials Definition of Pretreatment textile processing Teknologi dan Rekayasa Process ... | PowerPoint PPT presentation | free to view

An Introduction and Overview of the Parallel Curriculum Model: Promise and Process PowerPoint PPT Presentation

An Introduction and Overview of the Parallel Curriculum Model: Promise and Process - An Introduction and Overview of the Parallel Curriculum Model: Promise and Process. 1. Explanation: Welcome to our online support materials for those of you ... | PowerPoint PPT presentation | free to view

Exploiting InstructionLevel Parallelism with Software Approach PowerPoint PPT Presentation

Exploiting InstructionLevel Parallelism with Software Approach - ... can be the source of a reasonable amount of parallelism. ... detecting loop-level parallelism ... support for more parallelism at compile time. Conditional ... | PowerPoint PPT presentation | free to view