Large Extract

Rationale

The VIZYUL™ Large Extract performance tuning rule fires tableau data extracts with 500,000 or more rows of data.

Insight

Consider this, during the data discovery phase of designing a dashboard, often times data sources are added to a tableau workbook that don’t end up making the final cut.  Which means there are unused data sources in your workbook.

Unused data sources usually don’t negatively impact the time it takes tableau to render your dashboards, well, because it’s not used.  However, there are other areas where unused extracts can have an unfavorable impact of a tableau workbook.

Consider the case where a tableau user has finalized a dashboard and wants to share it with the team.  The author plans to save the workbook as a packaged tableau workbook.  If the unused data sources are extracts, each one increases the size of the packaged tableau workbook.  In this case, removing unused data sources can have a positive impact on the size of the overall workbook.

Action

  • Consider removing unused data sources from your workbook prior to publishing or giving access to your viewers.
  • Consider the additional resources below

Additional Resources

  • How to publish ONLY the metadata for  large tableau data extract (if you have a billion row extract you need to get to the tableau server without having your poor laptop process that much data, this post is FOR YOU!)
    • Method 1 – http://tableaulove.tumblr.com/post/18945358848/how-to-publish-an-unpopulated-tableau-extract
    • Method 2 – http://www.tableau.com/about/blog/2013/9/easy-empty-local-extracts-25152
  • http://kb.tableau.com/articles/knowledgebase/optimizing-incremental-refreshes