Skip to main content Accessibility help
×
Hostname: page-component-84b7d79bbc-dwq4g Total loading time: 0 Render date: 2024-07-28T12:30:10.150Z Has data issue: false hasContentIssue false

6 - Other Methodologies

Published online by Cambridge University Press:  09 November 2021

Get access

Summary

In the previous chapters, I’ve shown you how to do everything in Excel. This is because it's a widely used tool and I want to make data classification, normalisation and cleansing as simple and accessible as possible for everyone. There's no denying it, Excel isn't necessarily the easiest option! It's cumbersome, slow and open to a lot of errors, especially if you’re new to using it.

Alternative tools

There are tools other than Excel that you can use, but unfortunately many of them involve tiresome processes and still require editing of the source data separately. For example, in place of using pivot tables to check the quality and accuracy of your data, you could use a visualisation tool such as Power BI, Tableau, Qlik or one of the many others available.

Visualisation tools plug into the data source, i.e. Excel, SQL or Python. Some might help you with parts of the cleansing, but you’ll still have to work in two separate tools to get the job done. This can be frustrating, timeconsuming and it can be hard to trace the source of any problems.

While I try to stay as system agnostic as possible, I have only found one solution so far that gives the ability to view and edit the source data in the same window as the visualisation and update in real-time. There may be other solutions out there that I’m unaware of and I’m always looking for ways to improve my processes so I encourage you to take the methods from this book, improve on them if you can and let me know!

The more options that are available for people to work efficiently with data, the better the quality and accuracy. The most important thing for me is to get as many people working with, cleaning, classifying and normal - ising data as possible, however that may be.

Omniscope

The tool that I have been working with for almost a decade is called Omniscope. From this, I have created my own proprietary methodology for efficiently and accurately normalising, classifying and cleansing data.

Type
Chapter
Information
Between the Spreadsheets
Classifying and Fixing Dirty Data
, pp. 111 - 130
Publisher: Facet
Print publication year: 2021

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

  • Other Methodologies
  • Susan Walsh
  • Book: Between the Spreadsheets
  • Online publication: 09 November 2021
  • Chapter DOI: https://doi.org/10.29085/9781783305049.007
Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

  • Other Methodologies
  • Susan Walsh
  • Book: Between the Spreadsheets
  • Online publication: 09 November 2021
  • Chapter DOI: https://doi.org/10.29085/9781783305049.007
Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

  • Other Methodologies
  • Susan Walsh
  • Book: Between the Spreadsheets
  • Online publication: 09 November 2021
  • Chapter DOI: https://doi.org/10.29085/9781783305049.007
Available formats
×