The quirky and wonderful self-join optimization

2023-11-232023-11-23 / Daniel Hutmacher / 1 Comment

This blog post started as a “what if” contemplation in my head: Suppose you have a reasonably large table with a clustered index and a number of non-clustered indexes. If your WHERE clause filters by multiple columns covered by those non-clustered indexes, could it potentially be faster to rewrite that WHERE clause to use those non-clustered indexes?

The answer might surprise you.

Continue reading →

Optimizing for something else

2020-08-262020-08-26 / Daniel Hutmacher / Leave a comment

Ah, the feeling when you think of a new blog post topic, only to realize that you already wrote a post about that six years ago.

Well, I’ve done the work, so I might as well have a little fun with it.

Continue reading →

Last row per group

2016-04-112016-04-09 / Daniel Hutmacher / 6 Comments

A very common challenge in T-SQL development is filtering a result so it only shows the last row in each group (partition, in this context). Typically, you’ll see these types of queries for SCD 2 dimension tables, where you only want the most recent version for each dimension member. With the introduction of windowed functions in SQL Server, there are a number of ways to do this, and you’ll see that performance can vary considerably.

Continue reading →

Different query plans for “OR” type queries

2016-03-232016-03-23 / Daniel Hutmacher / 1 Comment

The SQL Server query optimizer can find interesting ways to tackle seemingly simple operations that can be hard to optimize. Consider the following query on a table with two indexes, one on (a), the other on (b):

SELECT a, b
FROM #data
WHERE a<=10 OR b<=10000;

The basic problem is that we would really want to use both indexes in a single query.

In this post, we’re going to take a look at a few examples of how this type of query would be optimized, as well as how statistics can affect the query plan, and finally, we’ll take a look at a slightly rare plan operator called “Merge Join (Concatenation)”.

Continue reading →

HASH JOIN deep-dive

2014-06-012014-05-13 / Daniel Hutmacher / 4 Comments

Among the three different types of join operators used by SQL Server, the HASH JOIN does some of the hardest work. It scales pretty well and is very suitable for parallel processing. As such, it can be very powerful in many applications, but hash joins can potentially consume quite a bit of memory, so seeing on in your query plan could be an indicator of a performance tuning issue in your query or data.

Continue reading →

Great article series on statistics

2013-08-09 / Daniel Hutmacher / 2 Comments

I took some time today to read up on Dale Burnett’s series on statistics in SQL Server. In seven detailed posts, Dale goes through different aspects of how statistics work in SQL Server, and how to use them to your advantage.

If you’re into query optimization, and you’ve got your glasses on, chances are this article is for you.

Reading a query plan

2013-02-182013-02-19 / Daniel Hutmacher / 6 Comments

Knowing how to read a query plan is absolutely key to optimizing SQL Server query performance. The query plan tells you how SQL Server goes about running your query, including what indexes are used (and how), what join strategies are applied and a lot of other information. If you can read the query plan, you can make the appropriate changes to indexes, query hints, join conditions, etc to tune your workload for optimum performance.

Continue reading →

sqlsunday.com

T-SQL tips and tricks, best practices and query plans from the field.

optimization

The quirky and wonderful self-join optimization

Optimizing for something else

Last row per group

Different query plans for “OR” type queries

HASH JOIN deep-dive

Great article series on statistics

Reading a query plan