SQL Server 2000 Query Processing and Optimization

1
SQL Server 2000: Query Processing and Optimization

Don Vilen
Program Manager
SQL Server Development Team

2
Agenda

SQL Server Overview
SQL Server Architecture
Storage and Access Methods
Query Processing and Optimization
Transaction Processing
Other Topics

3
Query Processing and Optimization
4
Query Processing and Optimization

SQL Server Architecture
Query Processor goals
Optimization techniques
Query plans
Futures

5
Server Architecture: The Big Picture
6
Query Processor Components

Query optimization
Selection of best execution plan
Cost-based, transformation-driven
Extensive logical inferences
Query execution
Algorithms to perform join, group by, ...
Hash-based, merge-based
Parallelism
Provides plan building blocks

7
Query Processor Goals

Responsibilities
Processing of DML queries (T-SQL and ...)
SELECT, INSERT, UPDATE, DELETE, ...
Processing of DDL operations
Index creation, DBCC CHECK, ...
Creation and maintenance of statistics
DBCC, UPDATE STATISTICS

8
Query Processor Goal: Performance

SQL Server 6.5 provides excellent OLTP performance
SQL Server 7.0 extends to set-oriented, decision support queries (star schema)
SQL Server 2000 extends to handle snowflake schemas, indexed views, partitioned views, and extensive parallelism

9
Query Processor Goal: Modularity

Robustness
Rapid future innovation
Uniform internal interfaces
Learn from the lessons of 6.5
Keep the code clean and extensible

10
Query Processor Goal: Functionality

Distributed and heterogeneous queries
More complex queries, > 16 tables
Some extensions, like SELECT TOP
Indexed views support
Statistics on non-indexed columns
Partitioned views
SELECT, INSERT, UPDATE, DELETE
Cascading DRI, etc
Rich heterogeneous query support (OLEDB)

11
Agenda

SQL Server Architecture
Query Processor goals
Optimization techniques
Query plans
Futures

12
Query Optimizer: Overview

Rewrite-, property-, and cost-driven
Like DB2 C/S v5 and Tandem, not Oracle v6 (but Oracle 8i does)
Extensive rewrite set
E.g. index selection, join order, ...
Rich inference capabilities
E.g. contradiction detection
Sensitive to query complexity
Optimization time-out based on estimated execution cost

13
Query Optimizer: Optimization model

[Diagram labels: Input tree; Pool of alternatives; Subtree; Rewrite; New subtree; Output cheapest plan]
14
Query Optimizer: Optimization time

Will you wait for the exhaustive optimization of your 20-table query?
Goal: make optimization time proportional to the query complexity
Query complexity ≈ cost of the optimal plan

15
Query Optimizer: Multi-stage optimization

Multiple stages
No-choice queries (trivial plan)
Transaction processing queries
Complex query I
Complex query II
Parallel queries
No knobs!

16
Query Optimizer: Tree-rewrite and search

Change join order
(R JOIN S) JOIN T can also be done as (R JOIN T) JOIN S
Alternative or replacement
Evaluate filter conditions early
Extent of search based on query
Sub-second queries optimized quickly
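
The join-reordering idea above can be sketched in a few lines of Python. This is only a toy illustration, not SQL Server's search algorithm: the row counts, the selectivities, the names `ROWS`, `SELECTIVITY`, `plan_cost`, and `cheapest_order`, and the cost model (sum of intermediate result sizes for a left-deep plan) are all assumptions made up for the example.

```python
from itertools import permutations

# Hypothetical table sizes and join selectivities, invented for illustration.
ROWS = {"R": 1000, "S": 100, "T": 10}
SELECTIVITY = {
    frozenset(("R", "S")): 0.01,
    frozenset(("R", "T")): 0.01,
    frozenset(("S", "T")): 1.0,  # no predicate between S and T: cross product
}

def plan_cost(order):
    """Toy cost: sum of the intermediate result sizes of a left-deep join order."""
    joined = [order[0]]
    size = float(ROWS[order[0]])
    cost = 0.0
    for table in order[1:]:
        size *= ROWS[table]
        # Apply every join predicate between the new table and those already joined.
        for prev in joined:
            size *= SELECTIVITY[frozenset((prev, table))]
        joined.append(table)
        cost += size
    return cost

def cheapest_order(tables):
    """Exhaustively enumerate join orders and keep the cheapest one."""
    return min(permutations(tables), key=plan_cost)

best = cheapest_order(["R", "S", "T"])
print(best, plan_cost(best))
print(("R", "S", "T"), plan_cost(("R", "S", "T")))
```

Under these made-up numbers, joining R with the small table T first keeps the intermediate result small, so (R JOIN T) JOIN S comes out cheaper than (R JOIN S) JOIN T. Exhaustive enumeration grows factorially with the number of tables, which is why the extent of the search has to depend on the query.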

17
Query Optimizer: Transformations

Lots of transformations in SQL Server (over 300)

[Diagram: example plan trees over tables A and B illustrating three kinds of transformation: Simplification, Exploration, and Implementation. Operators shown: Filter on A.x compared with 5; GrpBy A.x, sum(A.y); Join; Hash-Join.]
18
Query Optimizer: Over 300 transformation rules

Join reordering
Outerjoins
Subqueries
Aggregation
Star and snowflakes
Join elimination
Materialized views
Index plans
Update plans

Halloween protection
Empty table simplification
(Integrity constraints)
Partitioned tables
Parallelism
Remote queries

19
Query Optimizer: Logical inferences

Equivalence classes for columns
If a = b then sort(a) is the same as sort(b)
Implied join predicates
Keys and functional dependencies
GROUPBY(e,ename) same as GROUPBY(e)
Contradiction detection
Infer empty table(s) from check constraints
Join simplification using FK constraints
Outer join simplification using nullability
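
Equivalence classes and implied join predicates can be illustrated with a small union-find sketch. This is an assumption about one way to model the inference, not a description of the optimizer's internals; `equivalence_classes` and `implied_predicates` are hypothetical names, and the column names are invented.

```python
def equivalence_classes(equalities):
    """Group columns connected by equality predicates (union-find)."""
    parent = {}

    def find(col):
        parent.setdefault(col, col)
        while parent[col] != col:
            parent[col] = parent[parent[col]]  # path halving
            col = parent[col]
        return col

    for left, right in equalities:
        parent[find(left)] = find(right)

    classes = {}
    for col in list(parent):
        classes.setdefault(find(col), set()).add(col)
    return list(classes.values())

def implied_predicates(equalities):
    """Return equalities that follow from the stated ones but were not written."""
    stated = {frozenset(pair) for pair in equalities}
    implied = set()
    for cls in equivalence_classes(equalities):
        cols = sorted(cls)
        for i, a in enumerate(cols):
            for b in cols[i + 1:]:
                if frozenset((a, b)) not in stated:
                    implied.add((a, b))
    return implied

# R.a = S.b and S.b = T.c imply R.a = T.c, which opens up a direct R-T join.
print(implied_predicates([("R.a", "S.b"), ("S.b", "T.c")]))
```

An implied predicate such as R.a = T.c gives the search an extra join order to consider that the query text never stated.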

20
Query Optimizer: Result size estimation

Basis for cost estimation
Uses statistics on stored data
Densities and histograms
Keys (from unique indices)
Constraints (DRI, check constraints)
Available in showplan
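
How a histogram feeds a result size estimate can be sketched as follows. The histogram layout here (a list of step upper bounds with row counts, and linear interpolation inside a step) is a simplifying assumption for illustration; it is not SQL Server's actual statistics format, and `estimate_rows_le` is a hypothetical helper.

```python
def estimate_rows_le(steps, low, value):
    """Estimate how many rows satisfy `column <= value`.

    steps: list of (upper_bound, row_count) sorted by upper_bound
    low:   lower bound of the first step
    Rows are assumed to be spread uniformly within each step.
    """
    total = 0.0
    lower = low
    for upper, rows in steps:
        if value >= upper:
            total += rows  # whole step qualifies
        elif value > lower:
            total += rows * (value - lower) / (upper - lower)  # partial step
            break
        else:
            break
        lower = upper
    return total

# Made-up histogram: 100 rows in (0, 10], 300 rows in (10, 20], 50 rows in (20, 30].
histogram = [(10, 100), (20, 300), (30, 50)]
print(estimate_rows_le(histogram, 0, 15))  # 100 + half of 300
```

The estimated row count is what the cost formulas are then applied to, so a stale or missing histogram distorts every cost computed from it.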

21
Query Optimizer: Statistics on demand

Optimizer relies on up-to-date statistics
Automatic create, drop and update statistics
Fall-back mechanism
Quick statistics estimation (heuristics)
Statistics on histogram and density
MAXDIFF to capture frequent values

22
Query Optimizer: Cost calculation

Cost to first/last row
I/O + CPU, normalized to seconds
Row goals during optimization
E.g. optimize for first-10 rows retrieval
Available in showplan output
Forms basis for FAST-N/TOP-N hints

23
Query Optimizer: Choosing the right plan

Give as much information as possible!
You won't always know how your tables will be used
Declare constraints; they can help
Uniqueness
DRI
Nullability
Keep statistics up to date
Use auto-stats or your own maintenance plan
Provide useful indexes

24
Query Optimizer: Hints

You're smart, but ...
One or more indexes
Join order
Join, grouping, distinct algorithms
Row goal (FAST N)
But be careful!
Maintenance
Compatibility
Your data volume can change

25
Query Optimizer: Utility Operations

More than queries
Update plans
Bulk import, export, and convert
CREATE INDEX
DBCC CHECKDB/CHECKTABLE
ALTER TABLE
CREATE STATISTICS

26
Query Optimizer: Optimized update plans

Small updates (e.g., OLTP operations)
Row-by-row update all indexes for each row
Standard technique may use lots of random I/O
Large updates (e.g., warehouse refresh)
Index-by-index update
Pre-sorting per index merges change into index
Each index leaf is touched at most once
Saved index update cost often exceeds sort cost
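
The difference between row-by-row and index-by-index maintenance can be made concrete with a rough simulation. Everything here is an assumption for illustration: keys are integers packed 235 to a leaf page, a "visit" is counted each time consecutive changes land on a different leaf, and `leaf_visits` is a hypothetical helper, not part of any SQL Server interface.

```python
import random
from itertools import groupby

ENTRIES_PER_PAGE = 235  # assumed leaf capacity for the example

def leaf_visits(keys, entries_per_page=ENTRIES_PER_PAGE, presort=False):
    """Count leaf-page visits when changes are applied in the given key order.

    Consecutive changes that fall on the same leaf count as a single visit.
    """
    if presort:
        keys = sorted(keys)
    pages = (key // entries_per_page for key in keys)
    return sum(1 for _ in groupby(pages))

# Delete 1 key in 8 from an index spanning 10 leaf pages.
deleted = list(range(0, 10 * ENTRIES_PER_PAGE, 8))
arrival = deleted[:]
random.Random(0).shuffle(arrival)  # changes arrive in no particular key order

row_by_row = leaf_visits(arrival)                    # apply in arrival order
index_by_index = leaf_visits(arrival, presort=True)  # sort per index first
print(len(deleted), row_by_row, index_by_index)
```

With the changes pre-sorted, each of the 10 leaves is visited once; in arrival order the number of visits approaches the number of deleted keys, which is the random I/O the sort is meant to avoid.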

27
1 of every 8 rows is deleted
(These spool operations share one work file)
8 KB/page ÷ 24 B/entry ≈ 335 entries/page; × 70% fill factor ≈ 235 entries/page; 1 in 8 deleted ≈ 30 deletions/page
A sort operation per updated index: very fast index maintenance
Is each index leaf touched once or 30 times?
On average, after lots of random insertions and deletions, B-tree pages are about 70% full; that's why this fill factor is used in the example.
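
The page arithmetic on this slide can be checked with a few lines of Python. The inputs are the slide's own figures (335 entries per full page, 70% fill factor, 1 row in 8 deleted); the variable names are invented for the example.

```python
# Figures taken from the slide; variable names are hypothetical.
entries_per_full_page = 335   # slide's figure for 8 KB pages and 24-byte entries
fill_factor = 0.70            # typical B-tree fill after random inserts and deletes
deleted_fraction = 1 / 8      # 1 of every 8 rows is deleted

entries_per_page = entries_per_full_page * fill_factor
deletions_per_page = entries_per_page * deleted_fraction

print(f"{entries_per_page:.1f} entries per page")
print(f"{deletions_per_page:.1f} deletions per page")
```

The results land just under the slide's rounded values of 235 entries and 30 deletions per page, so without a per-index sort a leaf could be touched roughly 30 times instead of once.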
28
Query Optimizer: Utility example: BCP IN