On the importance to have good Hive statistics on your tables Posted on March 23, 2020, updated on March 20, 2020 by Yannick JaquierTable of contents Preamble The problematic queries Problem has gone with good Hive statistics References Continue reading
Hive fetch task really improving response time by bypassing MapReduce ? Posted on November 24, 2019, updated on February 28, 2020 by Yannick JaquierTable of contents Preamble Identical queries not same response time Partitions statistics and concatenation Fetch task performing worst than MapReduce ? To go further References Continue reading
ORC versus Parquet compression and response time Posted on August 2, 2019, updated on February 20, 2020 by Yannick JaquierTable of contents Preamble ORC versus Parquet compression ORC versus Parquet response time References Continue reading
How to identify table fragmentation and remove it ? Posted on December 18, 2018, updated on October 16, 2019 by Yannick JaquierTable of contents Preamble Legacy situation Newest methods to estimate tables size Table fragmentation identification Move, shrink or export/import ? References Continue reading
How to non intrusively find index rebuild or shrink candidates ? Posted on November 23, 2018, updated on November 28, 2019 by Yannick JaquierTable of contents Preamble Legacy situation Newest methods to estimate indexes size Index rebuild candidates list To go further Rebuild or shrink ? References Continue reading
AWR mining for performance trend analysis Posted on January 20, 2018, updated on May 28, 2019 by Yannick JaquierTable of contents Preamble Parallel downgrades Unstable execution time Direct path Checkpoint AWR figures extract and load References Continue reading