integrity/

directory
v0.0.0-...-361c87b Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jan 4, 2017 License: Apache-2.0

README

Maintaining Integrity in Data Science Applications

Integity in data science applications is crucial, because data science applications exist to provide data-driven insights. As soon as the integrity of a data science application breaks down, people lose trust in the output and, as a result, will refuse to make decisions based on the output. Go helps us maintain integrity in terms of reproducibility and deployment, which are common struggles for data scientists.

Notes

  • Data science applications should consider integrity before performance or sophistication.
  • A lack of reproducibility destroys the credibility of a data science application.
  • Integrity cannot be maintained with a complicated deploy.
  • If errors and edge cases are handled gracefully in Go, you can have confidence in how your application will behave.
  • There are ways of deploying Go that maintain integrity, even if you utilize various dependencies for your statistics, ML, etc.

Example python data science Dockerfile
Example Go Dockerfile

Code Review

Parse a clean CSV with python
Parse a clean CSV with Go
Force Integrity breakdown with python CSV parsing
Maintain integrity in Go CSV parsing

Exercises

Exercise 1

Implement another way of handling the CSV parsing error we encountered above. That is, handle the missing value in a way other than throwing an error.

Template | Answer


All material is licensed under the Apache License Version 2.0, January 2004.

Directories

Path Synopsis
Sample program to compare parsing a clean CSV with Go to parsing a clean CSV with python.
Sample program to compare parsing a clean CSV with Go to parsing a clean CSV with python.
Sample program to illustrate maintaining integrity with Go in the presence of messy data.
Sample program to illustrate maintaining integrity with Go in the presence of messy data.
exercises
exercise1
Sample program to illustrate maintaining integrity with Go in the presence of messy data.
Sample program to illustrate maintaining integrity with Go in the presence of messy data.
template1
Sample program to illustrate maintaining integrity with Go in the presence of messy data.
Sample program to illustrate maintaining integrity with Go in the presence of messy data.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL