Chapter 20 Advanced 62 Questions

Practice Questions — Indexes and Query Performance


8 Easy

16 Medium

12 Hard

Topic-Specific Questions

Question 1

Easy

What data structure does MySQL use for most indexes?

Balanced tree.

B-Tree

Question 2

Easy

In InnoDB, what is the clustered index?

The table itself.

The primary key's B-Tree — the table rows ARE the leaves. Data is physically ordered by PK.

Question 3

Easy

What command creates an index on a column?

Standard DDL.

CREATE INDEX idx_name ON table(column);

Question 4

Easy

Given an index on (a, b, c), which query CANNOT use it?

Leftmost prefix rule.

WHERE b = 5 AND c = 10 — missing the leading 'a' column.
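
The leftmost-prefix rule can be checked directly with EXPLAIN. The following is a minimal sketch, assuming a hypothetical table t and index idx_abc (neither name comes from the chapter); the payload column is there only so that SELECT * is not covered by the index.

```sql
-- Hypothetical table used only to illustrate the leftmost-prefix rule.
CREATE TABLE t (
  id      INT AUTO_INCREMENT PRIMARY KEY,
  a       INT NOT NULL,
  b       INT NOT NULL,
  c       INT NOT NULL,
  payload VARCHAR(100),
  INDEX idx_abc (a, b, c)
);

-- These predicates start at the leading column a, so MySQL can seek on idx_abc.
EXPLAIN SELECT * FROM t WHERE a = 1;
EXPLAIN SELECT * FROM t WHERE a = 1 AND b = 5;
EXPLAIN SELECT * FROM t WHERE a = 1 AND b = 5 AND c = 10;

-- a is missing: matching entries are scattered across the index, so no seek is possible.
EXPLAIN SELECT * FROM t WHERE b = 5 AND c = 10;
```

Compare the type and key columns across the four plans. On an empty or very small table the optimizer may pick a different plan, so load representative data before drawing conclusions.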

Question 5

Easy

What command shows MySQL's query plan?

Single keyword before the SELECT.

EXPLAIN

Question 6

Medium

In EXPLAIN output, what does type=ALL indicate?

The scanned rows are...

A full table scan — no index used. On large tables this is slow.

Question 7

Medium

What does 'Using index' in EXPLAIN's Extra column mean?

Related to covering index.

The query is served entirely from the index (no row lookup) — a covering index was used.

Question 8

Medium

Why does WHERE YEAR(date_col) = 2026 often not use an index on date_col?

Function on indexed column.

The function YEAR() must be evaluated for every row to check the predicate, which requires scanning every row — defeating the index.

Question 9

Medium

Given a 1M-row table, roughly how many B-Tree disk reads are needed for a PK lookup?

B-Trees are wide.

About 3-4 reads, assuming roughly 100 keys per page: log_100(1,000,000) = 3.
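
The estimate is just a logarithm, so it can be reproduced in MySQL itself. This is a rough sketch: the fanout values (100 and 500 keys per page) are assumptions for illustration, and real fanout depends on key size and page fill.

```sql
-- Rough B-Tree height estimate: levels ~ log_fanout(row_count).
-- LOG(B, X) returns the logarithm of X to base B.
SELECT
  ROUND(LOG(100, 1000000), 2)    AS levels_1m_rows_fanout_100,
  ROUND(LOG(500, 1000000), 2)    AS levels_1m_rows_fanout_500,
  ROUND(LOG(100, 1000000000), 2) AS levels_1b_rows_fanout_100;
```

The point of the comparison is that the height grows very slowly: multiplying the row count by 1,000 adds only a level or two.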

Question 10

Medium

Does WHERE name LIKE 'Aar%' use an index on name?

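Prefix match (trailing wildcard).

Yes. A pattern with a fixed prefix and a trailing wildcard is sargable; MySQL can treat it as a range scan on the index.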

Question 11

Medium

Does WHERE name LIKE '%sharma' use the index?

Leading wildcard.

No — leading wildcard prevents B-Tree use.

Question 12

Medium

What is cardinality in SHOW INDEX output?

Distinctness.

The estimated number of distinct values in the column. Higher cardinality = more selective index.
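
To see how the estimate compares with reality, the statistics can be set beside an exact count. A small sketch, assuming an orders table with status and customer_id columns as used elsewhere in these questions:

```sql
-- Optimizer's estimate: see the Cardinality column, one row per index key part.
SHOW INDEX FROM orders;

-- Exact selectivity for comparison (distinct values / total rows).
-- This scans the table, so run it off-peak on large tables.
SELECT
  COUNT(DISTINCT status)      / COUNT(*) AS status_selectivity,
  COUNT(DISTINCT customer_id) / COUNT(*) AS customer_selectivity
FROM orders;
```

A selectivity close to 1 means almost every value is unique; a value close to 0 means few distinct values, which usually makes a weak standalone index.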

Question 13

Hard

In a composite index on (a, b), does a query WHERE a = 1 ORDER BY b need a separate sort?

B-Trees store values in order.

No. The index already has rows sorted by b within a=1, so MySQL reads them in order.

Question 14

Hard

What is the 'Using filesort' note in EXPLAIN?

It's about sorting.

MySQL is sorting the result set in memory (or on disk if too big), because no index provided the required order. On large result sets this is slow.

Question 15

Hard

If you have indexes on (a), (b), and (a, b), which one is likely redundant?

Leftmost prefix.

The index on (a) alone is usually redundant — (a, b) can serve any query that (a) alone can.
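
Redundant indexes of this kind can often be found without reading every table definition. The sketch below assumes the sys schema is installed (it ships with MySQL 5.7 and later) and that you are connected to the database you want to inspect.

```sql
-- Lists indexes whose columns are a leftmost prefix of another index on the same table.
SELECT table_schema,
       table_name,
       redundant_index_name,
       redundant_index_columns,
       dominant_index_name,
       dominant_index_columns
FROM sys.schema_redundant_indexes
WHERE table_schema = DATABASE();
```

Treat the result as a list of candidates rather than a drop list: check how each index is actually used before removing it.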

Question 16

Medium

What is a covering index?

All columns needed.

A covering index contains ALL the columns the query references (SELECT list, WHERE, JOIN, GROUP BY, and ORDER BY). MySQL can answer the query entirely from the index without looking up rows. EXPLAIN shows 'Using index' in the Extra column.
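
A minimal sketch of a covering index, assuming a users table with email and name columns. The index name and the created_at column are hypothetical and used only for contrast.

```sql
-- email serves the WHERE clause, name serves the SELECT list.
CREATE INDEX idx_users_email_name ON users (email, name);

-- Every referenced column lives in the index, so Extra is expected to show 'Using index'.
EXPLAIN SELECT name FROM users WHERE email = 'aarav@x.com';

-- created_at is not in the index, so MySQL must also visit the row itself.
EXPLAIN SELECT name, created_at FROM users WHERE email = 'aarav@x.com';
```

In InnoDB, secondary indexes also carry the primary key columns, so a query that additionally selects the primary key is still covered.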

Question 17

Hard

Why do indexes slow down writes?

Writes have to maintain consistency.

Every INSERT must add entries to every index. Every UPDATE that changes indexed columns must reposition index entries. Every DELETE must remove index entries. The more indexes, the more work per write. Additionally, index pages must be updated in the buffer pool and eventually on disk, which contributes to I/O.

Question 18

Hard

When is it worth NOT indexing a column that appears in WHERE?

Think about cardinality and table size.

Don't index if: (1) the table is tiny (full scan is already fast), (2) the column has low cardinality AND the query distribution is balanced (e.g., status with 3 equal-frequency values), (3) the query is rare, (4) you have better indexes that already serve the predicate, (5) write performance is critical and the index isn't high-impact on read perf.

Question 19

Easy

Write the SQL to create an index on the email column of a users table.

Standard DDL.

CREATE INDEX idx_users_email ON users(email);

Question 20

Medium

Given a common query WHERE customer_id = ? AND order_date >= ?, write the best composite index.

Equality before range.

CREATE INDEX idx_orders_cust_date
  ON orders(customer_id, order_date);

Question 21

Medium

Rewrite this query to be sargable (index-friendly): SELECT * FROM orders WHERE MONTH(order_date) = 4 AND YEAR(order_date) = 2026;

Convert to a range.

SELECT * FROM orders
WHERE order_date >= '2026-04-01'
  AND order_date <  '2026-05-01';

Question 22

Hard

Given a query SELECT customer_id, SUM(amount) FROM orders WHERE status='paid' GROUP BY customer_id ORDER BY SUM(amount) DESC LIMIT 10; — propose an index that helps.

Filter by status, then group by customer, summing amount.

CREATE INDEX idx_orders_status_cust_amt
  ON orders(status, customer_id, amount);

Question 23

Medium

Write the command to see all indexes on the 'orders' table.

SHOW command.

SHOW INDEX FROM orders;
-- Or:
SHOW CREATE TABLE orders;

Question 24

Hard

Write an EXPLAIN statement for this query and describe what you would look for in the output: SELECT name FROM users WHERE email = 'aarav@x.com';

Check type, key, rows, Extra.

EXPLAIN SELECT name FROM users WHERE email = 'aarav@x.com';

Look for: type should be const or ref (unique or regular index hit), key should name your email index, rows should be 1 (or close), Extra should ideally say 'Using index' if the index covers 'name'.

Question 25

Hard

You see 'type=index' in EXPLAIN. Is that good?

Not as good as it sounds.

Not really. type=index means MySQL is doing a FULL index scan — reading every index entry in order. It's better than type=ALL (full table scan) because the index is usually smaller than the table, but it's still a scan. Aim for type=ref, range, eq_ref, or const.

Mixed & Application Questions

Question 1

Easy

If a query is running fast on 100 rows and slow on 1M rows, the likely cause is:

Full-scan cost grows linearly.

A full table scan — missing or unused index.

Question 2

Easy

A unique index does what in addition to speeding up queries?

Constraint.

Enforces uniqueness — duplicate inserts fail with a 1062 error.

Question 3

Medium

Given an index on (a, b), can WHERE a = 1 ORDER BY b DESC use the index for sorting?

Indexes can be scanned backward.

Yes. MySQL reads the index in reverse — 'Backward index scan' — no filesort needed.

Question 4

Medium

You run EXPLAIN and see rows=5,000,000. Is the query guaranteed to scan 5 million rows?

EXPLAIN shows estimates.

No — 'rows' is an estimate based on statistics, not an exact count.

Question 5

Medium

Which is typically faster on a 10M-row table: primary-key lookup or a secondary-index lookup fetching all columns?

PK lookup step.

Primary-key lookup. Secondary-index lookup requires an extra 'PK lookup' step to fetch non-indexed columns.

Question 6

Medium

Drop the index idx_old from the table t.

Two equivalent syntaxes.

DROP INDEX idx_old ON t;
-- or
ALTER TABLE t DROP INDEX idx_old;

Question 7

Hard

You have orders(id, customer_id, product_id, amount, status). Queries often filter by (customer_id, status) and (product_id, status). Propose good indexes.

Two composite indexes, each matching a query pattern.

CREATE INDEX idx_orders_cust_status
  ON orders(customer_id, status);

CREATE INDEX idx_orders_prod_status
  ON orders(product_id, status);

Question 8

Hard

Your EXPLAIN output shows 'Using where; Using filesort'. What does it mean and how do you improve?

Filesort = sorting without an index.

The query filters rows (using where) but the ORDER BY requires a separate sort step (filesort) because no index provided the sorted order. Improve by adding an index that covers both the WHERE columns and the ORDER BY column (equality first, then sort column).
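
As an illustration of that fix, here is a sketch using a hypothetical query on the orders table (filter on customer_id, sort on order_date). The query and the index name are assumptions, not part of the original question.

```sql
-- Before: without a suitable index, Extra may include 'Using filesort'.
EXPLAIN SELECT id, amount
FROM orders
WHERE customer_id = 101
ORDER BY order_date DESC
LIMIT 20;

-- Equality column first, then the sort column.
CREATE INDEX idx_orders_cust_date ON orders (customer_id, order_date);

-- After: re-run the same EXPLAIN and confirm that 'Using filesort' is gone.
EXPLAIN SELECT id, amount
FROM orders
WHERE customer_id = 101
ORDER BY order_date DESC
LIMIT 20;
```

With the index in place, MySQL can read the entries for one customer in date order (backward, for DESC) and stop after 20 rows.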

Question 9

Hard

Implicit type conversion can kill an index. Explain with an example.

Column type vs parameter type.

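If phone is VARCHAR and you query WHERE phone = 9810012345 (a number), MySQL compares the two as numbers, which means converting every phone value before comparing it. That is effectively a function applied to every row, so the index on phone cannot be used for a seek. Fix: pass the parameter as a string: WHERE phone = '9810012345'.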
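
The difference shows up clearly in EXPLAIN. A small sketch, assuming a users table with a VARCHAR phone column; the index name is hypothetical.

```sql
CREATE INDEX idx_users_phone ON users (phone);

-- Numeric literal against a string column: values are compared as numbers,
-- so MySQL cannot seek on idx_users_phone.
EXPLAIN SELECT id FROM users WHERE phone = 9810012345;

-- String literal matches the column type: the index can be used for a direct lookup.
EXPLAIN SELECT id FROM users WHERE phone = '9810012345';
```

The same problem appears when an application driver binds a numeric parameter to a string column, so check parameter types as well as literals.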

Question 10

Medium

What's the difference between PRIMARY KEY and UNIQUE INDEX?

Nullability and count.

A table has at most one PRIMARY KEY; PK columns cannot be NULL; in InnoDB the PK defines the clustered index (table order). A table can have many UNIQUE INDEXes; unique index columns CAN be NULL (multiple NULLs are allowed because NULL is never considered equal to another NULL). Both enforce uniqueness and speed up lookups.
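
The NULL behaviour is easy to demonstrate. The table below is hypothetical and exists only for this illustration.

```sql
CREATE TABLE members (
  id    INT AUTO_INCREMENT PRIMARY KEY,   -- PK: implicitly NOT NULL, one per table
  email VARCHAR(255) NULL,
  UNIQUE KEY uq_members_email (email)     -- UNIQUE: NULLs are permitted
);

INSERT INTO members (email) VALUES (NULL), (NULL);  -- both rows are accepted
INSERT INTO members (email) VALUES ('a@x.com');     -- accepted
INSERT INTO members (email) VALUES ('a@x.com');     -- rejected with error 1062 (duplicate entry)
```

If at most one missing value should be allowed, declare the column NOT NULL and use a sentinel value instead.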

Question 11

Hard

How does the optimizer choose between two candidate indexes?

Cost-based.

MySQL's optimizer is cost-based. It estimates the number of rows each plan would read (using index cardinality and histograms) and picks the plan with the lowest estimated cost. If stats are stale (table grew, data shifted), it may pick wrong. Run ANALYZE TABLE to refresh stats, or use FORCE INDEX as a last resort.
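
The remedies mentioned above look like this in practice. This is a sketch against the orders table; the index name idx_orders_cust_date and the bucket count are assumptions for illustration.

```sql
-- Refresh index statistics after large data changes.
ANALYZE TABLE orders;

-- MySQL 8.0+: build a histogram so the optimizer knows the value distribution of status.
ANALYZE TABLE orders UPDATE HISTOGRAM ON status WITH 16 BUCKETS;

-- Last resort: override the optimizer's index choice for one query.
SELECT id, amount
FROM orders FORCE INDEX (idx_orders_cust_date)
WHERE customer_id = 101
  AND order_date >= '2026-01-01';
```

FORCE INDEX hard-codes a decision that may stop being right as the data changes, which is why refreshing statistics is the first thing to try.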

Multiple Choice Questions

MCQ 1

What data structure does MySQL use for most indexes?

A. Hash table
B. B-Tree
C. Linked list
D. Skip list

Answer: B
B is correct. Default storage engine InnoDB uses B-Tree indexes for all standard indexes. MEMORY engine offers HASH indexes. FULLTEXT uses inverted indexes.

MCQ 2

Which command creates an index?

A. ADD INDEX table.col
B. CREATE INDEX idx ON t(col)
C. MAKE INDEX idx FOR t.col
D. INDEX t(col)

Answer: B
B is correct. CREATE INDEX name ON table(col) is the usual syntax, supported by MySQL and most other databases (indexes themselves are not part of the SQL standard). ALTER TABLE t ADD INDEX name (col) is equivalent.

MCQ 3

In InnoDB, the primary key defines:

A. A secondary index
B. The clustered index (the table's physical order)
C. A hash lookup
D. Nothing special

Answer: B
B is correct. Rows ARE the leaves of the PK B-Tree. Secondary indexes reference the PK for row lookups.

MCQ 4

Which EXPLAIN 'type' is the worst?

A. const
B. ref
C. range
D. ALL

Answer: D
D is correct. ALL = full table scan. The ranking best→worst is: const, eq_ref, ref, range, index, ALL.

MCQ 5

Which composite index is correct for the predicate WHERE a = ? AND b = ? AND c > ? (where ? is a bound parameter)?

A. (c, b, a)
B. (a, b, c)
C. (c, a, b)
D. (b, a, c)

Answer: B
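B is correct. Equality columns first (a, b), range column last (c). This order lets the index narrow on the equalities and then range-scan on c. Strictly, (b, a, c) would serve this predicate equally well, because a and b are both equality columns; the rule that matters is that the range column c comes last.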

MCQ 6

Which query can use a B-Tree index on name?

A. WHERE name LIKE '%sharma%'
B. WHERE name LIKE 'Aar%'
C. WHERE UPPER(name) = 'AARAV'
D. WHERE LENGTH(name) > 5

Answer: B
B is correct. Only prefix LIKE (no leading wildcard) is sargable. Options A, C, D all disable the index.

MCQ 7

What does 'Using index' in EXPLAIN's Extra column indicate?

A. The query used an index to filter
B. The query was answered entirely from the index (covering index)
C. The query did not use any index
D. An index scan was done instead of a seek

Answer: B
B is correct. Covering index — no need to access the table itself. Fastest possible read.

MCQ 8

Which operation in InnoDB is ALWAYS the fastest?

A. Secondary index lookup
B. Primary key lookup
C. Full-text search
D. Hash index lookup

Answer: B
B is correct. PK lookup = 1 B-Tree traversal in the clustered index, fetching the full row directly. No second lookup from a secondary index back to the clustered index is needed.

MCQ 9

Which query would NOT use an index on (customer_id, order_date)?

A. WHERE customer_id = 101
B. WHERE customer_id = 101 AND order_date = '2026-04-16'
C. WHERE order_date >= '2026-01-01'
D. WHERE customer_id = 101 AND order_date >= '2026-01-01'

Answer: C
C is correct. With no leading customer_id, the leftmost prefix rule is violated and MySQL cannot seek on the index.

MCQ 10

Why might adding too many indexes HURT performance?

A. Reads get slower
B. Every INSERT/UPDATE/DELETE must update every index
C. MySQL crashes
D. Connections drop

Answer: B
B is correct. Writes must maintain all indexes; disk and RAM are wasted on duplicated data.

MCQ 11

What does EXPLAIN ANALYZE do (MySQL 8.0.18+)?

A. Shows the plan without running the query
B. Runs the query and shows the plan with actual timing and row counts
C. Analyzes table statistics
D. Drops and recreates indexes

Answer: B
B is correct. Regular EXPLAIN shows estimates; EXPLAIN ANALYZE runs the query and returns actuals, exposing estimate errors.

MCQ 12

Which is the best fix for WHERE UPPER(email) = 'AARAV@X.COM'?

A. Add an index on email
B. Store email pre-lowercased and write WHERE email = 'aarav@x.com'
C. Use FORCE INDEX
D. Use a hash index

Answer: B
B is correct. Canonicalize on insert. Option A doesn't help because UPPER() on the column still kills sargability. MySQL 8 also supports functional indexes as an alternative, but canonical storage is simpler.
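
For completeness, the functional-index alternative can be sketched as follows. It assumes MySQL 8.0.13 or later and a users table with an email column; the index name is hypothetical.

```sql
-- Functional key part: note the extra pair of parentheses around the expression.
CREATE INDEX idx_users_email_upper ON users ((UPPER(email)));

-- The predicate must use the same expression for the index to be considered.
EXPLAIN SELECT id FROM users WHERE UPPER(email) = 'AARAV@X.COM';
```

This keeps existing queries unchanged, at the cost of an extra index that has to be maintained on every write to email.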

MCQ 13

A query uses 'Using temporary; Using filesort'. What does this suggest?

A. MySQL is caching results on disk
B. The query is producing a temp table (often for GROUP BY) AND sorting separately
C. The query is corrupt
D. The index is being rebuilt

Answer: B
B is correct. These are both red flags for performance on large result sets. Check if a composite index can serve both the GROUP BY and ORDER BY.

MCQ 14

When are HASH indexes (in MEMORY engine) faster than B-Trees?

A. For range queries
B. For exact-equality lookups on unique keys
C. For ORDER BY
D. For prefix LIKE

Answer: B
B is correct. Hash indexes are O(1) for exact matches but useless for ranges and ordering. B-Trees are O(log n) exact and O(log n + k) for ranges — more versatile.

MCQ 15

Why does SELECT * hurt covering-index performance?

A. It makes the query slower to parse
B. If any column isn't in the index, a row lookup is needed, defeating the covering benefit
C. It prevents MySQL from using any index
D. It's always slower than SELECT with columns

Answer: B
B is correct. SELECT * forces MySQL to fetch all columns, which usually means a PK lookup even if the filter is served by a secondary index. List only the columns you need to benefit from covering indexes.

MCQ 16

Which SHOW command lists all indexes on a table?

A. SHOW TABLES
B. SHOW INDEX FROM t
C. SHOW DATABASES
D. SHOW PROCESS

Answer: B
B is correct. SHOW INDEX FROM t or SHOW INDEXES FROM t — both work.

MCQ 17

Cardinality in SHOW INDEX represents:

A. The number of rows in the table
B. The estimated number of distinct values in the indexed column
C. The index depth
D. The index size in KB

Answer: B
B is correct. High cardinality = selective index. Low cardinality = few distinct values (often a poor index choice by itself).

MCQ 18

Why might the optimizer pick a full scan over an available index?

A. The index is new
B. Cost estimates suggest the index would fetch a large fraction of the table (random I/O is worse than sequential for big percentages)
C. MySQL doesn't like indexes
D. Bug

Answer: B
B is correct. If the optimizer estimates the index will read more than ~20-30% of the table, a sequential scan may actually be faster because of I/O patterns. Stale statistics can also trigger this — ANALYZE TABLE refreshes them.

MCQ 19

Which is the correct way to drop an index?

A. DROP INDEX idx ON table
B. DELETE INDEX idx FROM table
C. REMOVE INDEX idx FROM table
D. CREATE INDEX WITH DELETE

Answer: A
A is correct. DROP INDEX idx ON table or ALTER TABLE table DROP INDEX idx.

MCQ 20

Which scenario is a GOOD use case for a FULLTEXT index?

A. Exact email lookups
B. Range queries on dates
C. Free-text search in article bodies with MATCH(col) AGAINST(...)
D. Joining on numeric IDs

Answer: C
C is correct. FULLTEXT indexes are built for word-based search over text columns using MATCH ... AGAINST. Exact lookups, date ranges, and joins on numeric IDs are B-Tree workloads.

Use FULLTEXT correctly.

-- Leading wildcard LIKE kills any B-Tree index on body.
-- Full scan examines every row.

-- Fix: FULLTEXT index with natural-language search
CREATE FULLTEXT INDEX idx_posts_body ON posts(body);

-- Query becomes:
SELECT id, title
FROM posts
WHERE MATCH(body) AGAINST('django' IN NATURAL LANGUAGE MODE)
LIMIT 20;

-- EXPLAIN shows type=fulltext.
-- Supports phrase search, relevance ranking, stopword handling.
-- Caveats: min token length (InnoDB default 3 chars via innodb_ft_min_token_size,
-- MyISAM default 4 via ft_min_word_len), InnoDB stopword list, and the index
-- must be rebuilt after changing token-size or stopword settings.

Challenge 8: End-to-End Query Tuning

Hard

You get a complaint: 'The orders dashboard is slow'. The query:

SELECT customer_id, SUM(amount) AS t FROM orders WHERE status='paid' AND order_date >= '2026-01-01' GROUP BY customer_id ORDER BY t DESC LIMIT 10;

Walk through EXPLAIN, propose the index, and verify.

Sample Input

orders(id, customer_id, order_date, amount, status) with 50M rows.

Sample Output

Full diagnosis, index, and verified EXPLAIN.

Explain every step.

-- Step 1: Baseline EXPLAIN
-- type=ALL, rows=50M, Extra='Using where; Using temporary; Using filesort'
-- Dashboard takes 15+ seconds.

-- Step 2: Analyze the query
-- Filter: status + order_date (two columns, mixed equality/range)
-- Group: customer_id
-- Sort: SUM(amount) DESC (can't be served by an index -- aggregate result)
-- Project: customer_id, amount

-- Step 3: Design an index
-- Put equality first (status), then range (order_date). Include customer_id
-- and amount for covering, so aggregation doesn't need row lookups.

CREATE INDEX idx_orders_status_date_cust_amt
  ON orders(status, order_date, customer_id, amount);

-- Step 4: Re-run EXPLAIN
-- type=range, key=idx_orders_status_date_cust_amt
-- rows=~500k (filtered down from 50M)
-- Extra='Using where; Using index; Using temporary; Using filesort'
-- The 'Using temporary' and 'Using filesort' remain because the ORDER BY
-- is on the AGGREGATE SUM(amount), not the raw columns -- that's unavoidable.
-- But the 100x reduction in scanned rows takes the query from 15s to ~200ms.

-- Step 5: Verify with EXPLAIN ANALYZE to confirm actual timing.
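
Step 5 could look like the statement below. It is a sketch that assumes MySQL 8.0.18 or later; because EXPLAIN ANALYZE actually executes the query, run it somewhere the full runtime is acceptable.

```sql
-- Executes the query and reports actual row counts and timings for each plan step.
EXPLAIN ANALYZE
SELECT customer_id, SUM(amount) AS t
FROM orders
WHERE status = 'paid'
  AND order_date >= '2026-01-01'
GROUP BY customer_id
ORDER BY t DESC
LIMIT 10;
```

Compare the estimated rows against the actual rows in the output: a large gap suggests stale statistics, which ANALYZE TABLE orders can refresh.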
