<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Ann&#039;s Blog &#187; pivot tables</title>
	<atom:link href="/tag/pivot-tables/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.annkemery.com</link>
	<description>Equipping you to collect, analyze, and visualize data</description>
	<lastBuildDate>Mon, 13 Apr 2015 17:29:38 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=4.1.1</generator>
	<item>
		<title>Private Data Analysis Webinar for the Emerging Practitioners in Philanthropy</title>
		<link>http://www.annkemery.com/event/2015-08-26_epip/</link>
		<comments>http://www.annkemery.com/event/2015-08-26_epip/#comments</comments>
		<pubDate>Wed, 26 Aug 2015 15:00:00 +0000</pubDate>
		<dc:creator><![CDATA[Ann K. Emery]]></dc:creator>
				<category><![CDATA[data analysis]]></category>
		<category><![CDATA[emerging practitioners in philanthropy]]></category>
		<category><![CDATA[Microsoft Excel]]></category>
		<category><![CDATA[pivot tables]]></category>
		<category><![CDATA[spreadsheets]]></category>

		<guid isPermaLink="false">http://www.annkemery.com/?post_type=tribe_events&#038;p=6199</guid>
		<description><![CDATA[On August 26, 2015, I'm leading an online workshop exclusively for members of Emerging Practitioners in Philanthropy.]]></description>
				<content:encoded><![CDATA[<h3><strong>Analyzing Grantmaking Data: Saving Time and Energy with Pivot Tables</strong></h3>
<p>Pivot tables are the fastest, easiest way to make sense of spreadsheets. Whether you spend 10% or 100% of your day working with data, pivot tables are a must-have spreadsheet skill.</p>
<p>During this 60-minute online workshop, you’ll learn how to:</p>
<ul>
<li>design your spreadsheet to be compatible with pivot tables (e.g., prerequisites like contiguous data);</li>
<li>create pivot tables from scratch;</li>
<li>navigate the field list, row headers, column headers, and filters;</li>
<li>drag and drop variables to calculate averages, tallies, and sums;</li>
<li>remove duplicate entries from your dataset;</li>
<li>make sure your pivot table still works even after you’ve updated the raw numbers in your spreadsheet; and</li>
<li>group items like names and dates together so that you can analyze them in aggregate.</li>
</ul>
<h3><strong>Format</strong></h3>
<p>This workshop is heavily hands-on, so bring your energy and enthusiasm! The instructor will demonstrate a skill and then you&#8217;ll have a chance to practice using the sample datasets provided. We’ll look at several different types of datasets together, like demographic data from your organization’s member database and survey responses similar to your organization’s latest consumer satisfaction survey.</p>
<h3><strong>Target Audience</strong></h3>
<p>Anyone who uses spreadsheets. This online workshop is perfect for anyone who&#8217;s brand new to pivot tables and serves as an excellent refresher for anyone who&#8217;s used pivot tables in the past and wants to learn new shortcuts.</p>
<h3><strong>What People Are Saying</strong></h3>
<p><em>&#8220;The workshop was really thought provoking about how we can be more efficient in our data analysis!&#8221;</em></p>
<p><em>&#8220;Webinar is amazing! I&#8217;m learning so much! Thank you! This is awesome.&#8221;</em></p>
<p><em>&#8220;This is sooo helpful!&#8221;</em></p>
<p><em>&#8220;I LOVE THIS!&#8221;</em></p>
<blockquote class="twitter-tweet" lang="en"><p><a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> is an amazing teacher! Pivot Tables are cool! <a href="https://twitter.com/EPIPDC">@EPIPDC</a> <a href="https://twitter.com/hashtag/data?src=hash">#data</a> <a href="https://twitter.com/hashtag/eval?src=hash">#eval</a> <a href="https://twitter.com/hashtag/nonprofits?src=hash">#nonprofits</a> <a href="https://twitter.com/hashtag/philanthropy?src=hash">#philanthropy</a> <a href="https://twitter.com/hashtag/skills?src=hash">#skills</a></p>
<p>— Jenny (@JennyB_DC) <a href="https://twitter.com/JennyB_DC/status/578640903440564225">March 19, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>
One thing I learned from <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> today: you don&#8217;t have to know what you&#8217;re looking for to start looking at your data <a href="https://twitter.com/hashtag/dataanalysis?src=hash">#dataanalysis</a> — Jenny (@JennyB_DC) <a href="https://twitter.com/JennyB_DC/status/578641384183328768">March 19, 2015</a>
</p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>Learning about pivot tables for data analysis with <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a>, which will be really useful for <a href="https://twitter.com/hashtag/Philamplify?src=hash">#Philamplify</a> &#8212; thanks <a href="https://twitter.com/EPIPDC">@EPIPDC</a>! <a href="https://twitter.com/hashtag/philanthropy?src=hash">#philanthropy</a></p>
<p>— Caitlin Duffy (@DuffyInDC) <a href="https://twitter.com/DuffyInDC/status/578594056596537345">March 19, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>
Pivot table webinar w <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> is fantastic!!&#8230; — Museology Evaluation (@UWNewDirections) <a href="https://twitter.com/UWNewDirections/status/554692982499069952">January 12, 2015</a>
</p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>Outstanding webinar by <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> on Pivot Tables! Stay tuned for highlights &amp; notes on <a href="http://t.co/ZHHYedOcdw">http://t.co/ZHHYedOcdw</a> — Katherine Haugh (@Katherine_Haugh) <a href="https://twitter.com/Katherine_Haugh/status/554708296352157697">January 12, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<h3><strong>Registration</strong></h3>
<p>This webinar is offered exclusively to EPIP members. To join EPIP, please visit <a href="http://www.epip.org/membership/">http://www.epip.org/membership/</a>.</p>
<p>&nbsp;</p>
<p><em>Want to discuss a possible workshop or webinar for your team? <a title="Contact" href="/contact/">Contact me</a>.</em></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.annkemery.com/event/2015-08-26_epip/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Private Data Analysis Workshop for the Emerging Practitioners in Philanthropy</title>
		<link>http://www.annkemery.com/event/2015-03-19_epip-workshop/</link>
		<comments>http://www.annkemery.com/event/2015-03-19_epip-workshop/#comments</comments>
		<pubDate>Thu, 19 Mar 2015 12:00:00 +0000</pubDate>
		<dc:creator><![CDATA[Ann K. Emery]]></dc:creator>
				<category><![CDATA[data analysis]]></category>
		<category><![CDATA[pivot tables]]></category>
		<category><![CDATA[spreadsheet navigation]]></category>

		<guid isPermaLink="false">http://www.annkemery.com/?post_type=tribe_events&#038;p=6111</guid>
		<description><![CDATA[On March 19, I'm leading a members-only workshop for the Emerging Practitioners in Philanthropy about using pivot tables to quickly and easily analyze data.]]></description>
				<content:encoded><![CDATA[<p>Pivot tables are the fastest, easiest way to make sense of spreadsheets. Whether you spend 10% or 100% of your day working with data, pivot tables are a must-have spreadsheet skill. In this 90-minute workshop, you’ll learn how to:</p>
<ul>
<li>design your spreadsheet to be compatible with pivot tables (e.g., prerequisites like contiguous data);</li>
<li>create pivot tables from scratch;</li>
<li>navigate the field list, row headers, column headers, and filters;</li>
<li>run descriptive statistics such as averages and frequencies;</li>
<li>clean data by checking for duplicates;</li>
<li>make sure your pivot table still works even after you’re updated the raw numbers in your spreadsheet; and</li>
<li>group items like names and dates together so that you can analyze them in aggregate.</li>
</ul>
<p>We’ll look at several different types of datasets together, like demographic data on people from your organization’s member database and survey responses similar to your organization’s latest consumer satisfaction survey.</p>
<p>Please bring laptops and power cords.</p>
<h3><strong>What People Are Saying</strong></h3>
<p><em>&#8220;The workshop was really thought provoking about how we can be more efficient in our data analysis!&#8221;</em></p>
<p><em>&#8220;Webinar is amazing! I&#8217;m learning so much! Thank you! This is awesome.&#8221;</em></p>
<p><em>&#8220;This is sooo helpful!&#8221;</em></p>
<p><em>&#8220;I LOVE THIS!&#8221;</em></p>
<blockquote class="twitter-tweet" lang="en"><p><a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> is an amazing teacher! Pivot Tables are cool! <a href="https://twitter.com/EPIPDC">@EPIPDC</a> <a href="https://twitter.com/hashtag/data?src=hash">#data</a> <a href="https://twitter.com/hashtag/eval?src=hash">#eval</a> <a href="https://twitter.com/hashtag/nonprofits?src=hash">#nonprofits</a> <a href="https://twitter.com/hashtag/philanthropy?src=hash">#philanthropy</a> <a href="https://twitter.com/hashtag/skills?src=hash">#skills</a></p>
<p>— Jenny (@JennyB_DC) <a href="https://twitter.com/JennyB_DC/status/578640903440564225">March 19, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>
One thing I learned from <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> today: you don&#8217;t have to know what you&#8217;re looking for to start looking at your data <a href="https://twitter.com/hashtag/dataanalysis?src=hash">#dataanalysis</a> — Jenny (@JennyB_DC) <a href="https://twitter.com/JennyB_DC/status/578641384183328768">March 19, 2015</a>
</p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>Learning about pivot tables for data analysis with <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a>, which will be really useful for <a href="https://twitter.com/hashtag/Philamplify?src=hash">#Philamplify</a> &#8212; thanks <a href="https://twitter.com/EPIPDC">@EPIPDC</a>! <a href="https://twitter.com/hashtag/philanthropy?src=hash">#philanthropy</a></p>
<p>— Caitlin Duffy (@DuffyInDC) <a href="https://twitter.com/DuffyInDC/status/578594056596537345">March 19, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>
Pivot table webinar w <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> is fantastic!!&#8230; — Museology Evaluation (@UWNewDirections) <a href="https://twitter.com/UWNewDirections/status/554692982499069952">January 12, 2015</a>
</p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>Outstanding webinar by <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> on Pivot Tables! Stay tuned for highlights &amp; notes on <a href="http://t.co/ZHHYedOcdw">http://t.co/ZHHYedOcdw</a> — Katherine Haugh (@Katherine_Haugh) <a href="https://twitter.com/Katherine_Haugh/status/554708296352157697">January 12, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<h3><strong>Registration</strong></h3>
<p><em>This is a members-only workshop for the Emerging Practitioners in Philanthropy (EPIP). Membership is $150/year for individuals ($85/year for students). Learn more at <a href="http://www.epip.org/">http://www.epip.org/</a>.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.annkemery.com/event/2015-03-19_epip-workshop/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Public Webinar: Saving Time and Energy with Pivot Tables</title>
		<link>http://www.annkemery.com/event/2015-01-12-pivot-tables/</link>
		<comments>http://www.annkemery.com/event/2015-01-12-pivot-tables/#comments</comments>
		<pubDate>Mon, 12 Jan 2015 12:00:00 +0000</pubDate>
		<dc:creator><![CDATA[Ann K. Emery]]></dc:creator>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[pivot table webinar]]></category>
		<category><![CDATA[pivot tables]]></category>
		<category><![CDATA[spreadsheets]]></category>

		<guid isPermaLink="false">http://www.annkemery.com/?post_type=tribe_events&#038;p=5653</guid>
		<description><![CDATA[Pivot tables are the fastest, easiest way to make sense of spreadsheets. Whether you spend 10% or 100% of your day working with data, pivot tables are a must-have spreadsheet skill. In this 90-minute webinar, you’ll learn how to: design your spreadsheet to be compatible...]]></description>
				<content:encoded><![CDATA[<p>Pivot tables are the fastest, easiest way to make sense of spreadsheets. Whether you spend 10% or 100% of your day working with data, pivot tables are a must-have spreadsheet skill. In this 90-minute webinar, you’ll learn how to:</p>
<ul>
<li>design your spreadsheet to be compatible with pivot tables (e.g., prerequisites like contiguous data);</li>
<li>create pivot tables from scratch;</li>
<li>navigate the field list, row headers, column headers, and filters;</li>
<li>run descriptive statistics such as averages, means, counts, and frequencies;</li>
<li>clean data by checking for duplicates;</li>
<li>make sure your pivot table still works even after you’re updated the raw numbers in your spreadsheet; and</li>
<li>group items like names and dates together so that you can analyze them in aggregate.</li>
</ul>
<p>We’ll look at several different types of datasets together, like demographic data on people from your organization’s member database and survey responses similar to your organization’s latest consumer satisfaction survey.</p>
<p>You’ll get to keep the spreadsheets used during the webinar, and I’ll give you access to the webinar recording so you can watch it again while you’re practicing on your own.</p>
<h3>What People Are Saying</h3>
<p><em>&#8220;Webinar is amazing!&nbsp;I&#8217;m learning so much! Thank you! This is awesome.&#8221;</em><br />
<em>&#8220;This is sooo helpful!&#8221;</em><br />
<em>&#8220;I LOVE THIS!&#8221;</em></p>
<blockquote class="twitter-tweet" lang="en"><p><a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> is an amazing teacher! Pivot Tables are cool! <a href="https://twitter.com/EPIPDC">@EPIPDC</a> <a href="https://twitter.com/hashtag/data?src=hash">#data</a> <a href="https://twitter.com/hashtag/eval?src=hash">#eval</a> <a href="https://twitter.com/hashtag/nonprofits?src=hash">#nonprofits</a> <a href="https://twitter.com/hashtag/philanthropy?src=hash">#philanthropy</a> <a href="https://twitter.com/hashtag/skills?src=hash">#skills</a></p>
<p>— Jenny (@JennyB_DC) <a href="https://twitter.com/JennyB_DC/status/578640903440564225">March 19, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>
One thing I learned from <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> today: you don&#8217;t have to know what you&#8217;re looking for to start looking at your data <a href="https://twitter.com/hashtag/dataanalysis?src=hash">#dataanalysis</a></p>
<p>— Jenny (@JennyB_DC) <a href="https://twitter.com/JennyB_DC/status/578641384183328768">March 19, 2015</a>
</p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>Learning about pivot tables for data analysis with <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a>, which will be really useful for <a href="https://twitter.com/hashtag/Philamplify?src=hash">#Philamplify</a> &#8212; thanks <a href="https://twitter.com/EPIPDC">@EPIPDC</a>! <a href="https://twitter.com/hashtag/philanthropy?src=hash">#philanthropy</a></p>
<p>— Caitlin Duffy (@DuffyInDC) <a href="https://twitter.com/DuffyInDC/status/578594056596537345">March 19, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>Pivot table webinar w <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> is fantastic!!&#8230; — Museology Evaluation (@UWNewDirections) <a href="https://twitter.com/UWNewDirections/status/554692982499069952">January 12, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<blockquote class="twitter-tweet" lang="en"><p>Outstanding webinar by <a href="https://twitter.com/AnnKEmery">@AnnKEmery</a> on Pivot Tables! Stay tuned for highlights &amp; notes on <a href="http://t.co/ZHHYedOcdw">http://t.co/ZHHYedOcdw</a> — Katherine Haugh (@Katherine_Haugh) <a href="https://twitter.com/Katherine_Haugh/status/554708296352157697">January 12, 2015</a></p></blockquote>
<p><script src="//platform.twitter.com/widgets.js" async="" charset="utf-8"></script></p>
<p>&nbsp;</p>
<h3>Purchase the Recording and Accompanying Materials</h3>
<p>This webinar has finished, but you can still purchase the recording and accompanying spreadsheets. <a title="Contact" href="/contact/">Contact me</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.annkemery.com/event/2015-01-12-pivot-tables/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pivot Tables: Your Tool for Exposing Miscreant Data</title>
		<link>http://www.annkemery.com/ozdusoleil/</link>
		<comments>http://www.annkemery.com/ozdusoleil/#comments</comments>
		<pubDate>Mon, 18 Mar 2013 14:28:59 +0000</pubDate>
		<dc:creator><![CDATA[Oz Du Soleil]]></dc:creator>
				<category><![CDATA[Analyzing Data]]></category>
		<category><![CDATA[crap data]]></category>
		<category><![CDATA[data quality]]></category>
		<category><![CDATA[DataScopic]]></category>
		<category><![CDATA[duplicate values]]></category>
		<category><![CDATA[Oz du Soleil]]></category>
		<category><![CDATA[pivot table as a diagnostic tool]]></category>
		<category><![CDATA[pivot tables]]></category>

		<guid isPermaLink="false">http://emeryevaluation.com/?p=2534</guid>
		<description><![CDATA[Thank you, Ann Emery and thanks especially to the visitors of www.AnnKEmery.com. It meant a lot to be asked to do a guest blogpost because Ann&#8217;s approach is practical, focused on real-world experience, and her dedication to empowerment of others is a key theme at my own...]]></description>
				<content:encoded><![CDATA[<div>
<blockquote style="border: 1px solid #666; padding: 8px; font-size: 12px; background-color: #ffffcc; margin-bottom: 20px;"><p><img class="alignright  wp-image-2624" style="margin: 10px;" src="http://i0.wp.com/datascopic.net/wp-content/uploads/2012/11/OzLive-Action-Hat.jpg?resize=110%2C184" alt="Oz du Soleil, DataScopic" data-recalc-dims="1" /><span style="font-style: normal;">Thank you, Ann Emery and thanks especially to the visitors of www.AnnKEmery.com. It meant a lot to be asked to do a guest blogpost because Ann&#8217;s approach is practical, focused on real-world experience, and her dedication to empowerment of others is a key theme at my own site, <strong><span style="color: #000000;"><a href="http://datascopic.net" target="_blank">DataScopic.net</a></span></strong>.</span></p>
<p>I&#8217;ve been working with Excel and data for 15 years and developed a skill for scrubbing data. So, data quality is always on my mind. This year, I&#8217;m teaching more workshops so that others are empowered to manage their own data quality and develop sound spreadsheets.</p>
<p>Hopefully, you&#8217;ll find this blogpost useful. Please comment, ask questions and be in touch.</p>
<p>&#8211; Oz du Soleil</p></blockquote>
</div>
<div></div>
<p>Today, we&#8217;re going to discuss data quality, messy data, or, as I&#8217;ve described in several blogposts, <a href="http://datascopic.net/tag/crap-data/" target="_blank">crap data</a>. As a complement to this post, there is a dataset of 455 rows for us to work. Click <a title="Sample Data for Pivot Tables &amp; Data Quality Blogpost" href="http://datascopic.net/wp-content/uploads/2013/03/AnnEmery-Data.xlsx" target="_blank">here</a> to download the sample dataset.</p>
<h2><span style="color: #2006b0;">WE HAVE OUR DATA:</span> What Do We Want To Know?</h2>
<p>We want to dig in and find out things like:</p>
<ul>
<li><span style="line-height: 14px;">Of the members we had in 2009, how many are still active members?</span></li>
<li><span style="line-height: 14px;"> What were the 2010 donation amounts of the currently active vs. inactive members?</span></li>
<li>What were the 2008-2010 donations for the states where we plan to close offices: IN, MA and SC?</li>
<li>What were the 2008 donation totals of the Gold, Platinum and Lifetime members?</li>
<li>Are our Lifetime members clustered in a region, in a few states or, is there no correlation between residence and Lifetime membership?</li>
</ul>
<p>This is so easy with pivot tables. You don&#8217;t need subtotals or SUMIFS formulae. But &#8230;</p>
<p style="text-align: center;"><img class="wp-image-3802 aligncenter" style="margin-top: 10px; margin-bottom: 10px;" src="http://i2.wp.com/datascopic.net/wp-content/uploads/2013/03/stop.jpg?resize=300%2C225" alt="stop" data-recalc-dims="1" /></p>
<div></div>
<div>
<h2><span style="color: #2006b0;">QUESTION:</span> Can We Trust This Data?</h2>
<p>A clever use of pivot tables is to throw your information into a pivot table as soon as it arrives at your desk. There are two things that we want to know before we start our analysis:</p>
</div>
<ol>
<li><span style="line-height: 13px;">What are we dealing with?</span>
<ul>
<li>We have states in the dataset. Do we have 50 states and Washington, DC? How about Puerto Rico, Guam and US Virgin Islands?</li>
<li>For Marital Status, do we have Married, Single, Widowed, Divorced and Domestic Partnership? Or, just Married or Single?</li>
</ul>
</li>
<li>Is this clean?
<ul>
<li>Are there empty fields? Where are they and are they critical? We can live with a missing fax number, we can&#8217;t live without a missing membership level.</li>
<li>Are there any duplicates that need to be merged into single entries?</li>
<li>Is there anything just plain bizarre? Are there complete addresses in the State field or, &#8220;NOV&#8221; in a field that should only have YES or NO?)</li>
</ul>
</li>
</ol>
<div>Don&#8217;t make a single bar chart or summary table until we know the answers to those two questions.</div>
<div>
<h2><span style="color: #2006b0;">ANSWER: </span>Use A Pivot Table to Assess Data Quality</h2>
</div>
<p>The old way of checking our data quality would be to scroll through, eyeballing for obvious duplicates; we would sort by the State field and eyeball through for blanks and nonsensical entries.</p>
<p>That is painful, tedious, and time-consuming. Eyeballing datasets is also prone to errors and must end. TODAY. Rest your eyes. A pivot table can save hours or even days, depending on the size and complexity of the dataset.</p>
<p>In this video, I generate a pivot table and focus only on the data quality. We see that there are duplicates and bizarre information that render the dataset untrustworthy until we get it cleaned up. As you watch the video, don&#8217;t focus too much on the results or the &#8220;how to.&#8221; Instead, listen to the thought process and questions I ask about the dataset.</p>
<div style="text-align: center;"><span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='420' height='315' src='http://www.youtube.com/embed/abFDNiiLJa0?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0' allowfullscreen='true'></iframe></span></div>
<div></div>
<div></div>
<div>Now we know:</div>
<div></div>
<div>
<ol>
<li>What are we dealing with?
<ul>
<li>There are 25 states represented in our dataset, including Puerto Rico. Eight people aren&#8217;t assigned to a state.</li>
<li>There are 5 membership levels: Rookie, Silver, Gold, Platinum, and Lifetime.</li>
<li>There are 422 members.</li>
<li>In terms of marital status, we only have Married and Single options represented in the dataset.</li>
<li>We also know our donation levels between 2008 and 2010.</li>
</ul>
</li>
<li>Is this clean? <em><strong>NO!</strong></em>
<ul>
<li>Kenneth is in the dataset 5 times. Adara is represented 3 times.</li>
<li>There are 422 members and 455 rows of data. That&#8217;s 33 rows too many. They need to be investigated and merged into single entries.</li>
<li>&#8220;17&#8221; is not a state. The people who live in &#8220;17&#8221; need to be researched and corrected. Also, review the data-entry process to see how that was allowed.</li>
<li>There are 56 people whose active/inactive status is unknown.</li>
</ul>
</li>
</ol>
</div>
<div>
<h2><span style="color: #2006b0;">NOW WHAT?</span> Conclusions</h2>
<p>The dataset has to be scrubbed. More importantly, Ann said it best in one of our conversations: &#8220;The main skill in working with data is developing your personal computer program: your brain.&#8221;</p>
<p>One goal of the video is to show how to think through the ways we might expose crap data. Using pivot tables eliminates the need to eyeball for errant data. This minimizes the filtering, sorting and scrolling that we&#8217;d otherwise use. Pivot tables save time and yield more accurate insight than our old ways.</p>
<p>For many years this wasn&#8217;t something I even thought of, and I was pretty darned lucky that nothing went wrong. Eventually I just got embarrassed when my boss kept noticing things that didn&#8217;t add up. The problem wasn&#8217;t my math. It was naive trust of the data that had been handed to me.</p>
<p>I&#8217;m curious. How do you go about investigating data quality? How much time do you spend on it? What happens when you expose miscreant information?</p>
</div>
<div></div>
<div>
<div></div>
<p style="text-align: right;"><span style="font-size: 11px;">stop sign photo credit: <a href="http://www.flickr.com/photos/ladybeames/2896787167/">ladybeames</a> via <a href="http://photopin.com">photopin</a> <a href="http://creativecommons.org/licenses/by-nc/2.0/">cc</a></span></p>
<p style="display: inline!important;"><span style="font-style: normal;"> </span></p>
</div>
<div style="line-height: 1.2em;">
<blockquote style="border: 1px solid #666; padding: 8px; font-size: 14px; background-color: #ffffcc; margin-bottom: 20px;"><p><span style="font-style: normal;">Oz du Soleil is a Chicago-based Excel instructor and evangelist for clean data, sriracha and bowties. You can find Oz at <a style="color: #0f3647; font-size: 14px; font-style: normal; line-height: 24px;" href="https://plus.google.com/107967746448846350911/posts" rel="author">Google+</a> and his website: <a title="DataScopic" href="http://datascopic.net" target="_blank">DataScopic.net</a>.</span></p></blockquote>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.annkemery.com/ozdusoleil/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
	</channel>
</rss>
