Tools

Notes on data science tools: Git, the command line, notebooks, AWS, etc.

21 Data Processing - (Py)Spark

Published: 2022-01-31

Category: { Tools }

Tags: #Tools #Data Engineering #Spark #PySpark

References:
- Introduction to PySpark on DataCamp
- Cluster configurations
- Cleaning Data with PySpark

Summary: Processing Data using (Py)Spark

8 Documentation

Published: 2021-08-28

Category: { Tools }

Tags: #Tools #Python #Documentation

References:
- sphinx-doc. sphinx-doc/sphinx: Main repository for the Sphinx documentation builder. In: GitHub [Internet]. [cited 28 Aug 2021]. Available: https://github.com/sphinx-doc/sphinx
- Read the Docs
- squidfunk. squidfunk/mkdocs-material: Technical documentation that just works. In: GitHub [Internet]. [cited 28 Aug 2021]. Available: https://github.com/squidfunk/mkdocs-material

Summary: Documenting my data science project using Sphinx or mkdocs-material

7 Cookiecutter

Published: 2021-08-27

Category: { Tools }

Tags: #Tools #Python #Data Science

References:
- drivendata. drivendata/cookiecutter-data-science: A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. In: GitHub [Internet]. [cited 27 Aug 2021]. Available: https://github.com/drivendata/cookiecutter-data-science
- cookiecutter. cookiecutter/cookiecutter: A command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, VueJS projects. In: GitHub [Internet]. [cited 27 Aug 2021]. Available: https://github.com/cookiecutter/cookiecutter

Summary: Use cookiecutter to initialize a project

6 Some ML Workflow Frameworks

Published: 2021-01-13

Category: { Tools }

Tags: #Machine Learning #Workflow

References:
- Built-in magic commands for Jupyter
- AWS Cloudwatch

Summary: Managing workflows in machine learning projects is not trivial.

5 Git

Published: 2016-06-22

Category: { Tools }

Tags: #Tools #Command Line #Git

References:
- Must Have Git Aliases: Advanced Examples
- Setting up a repository @ atlassian.com

Summary: Git, the tool you need for your everyday work

4 Terminal

Published: 2019-12-31

Category: { Tools }

Tags: #Tools #Command Line #Terminal

References: - Rapidly invoke an editor to write a long, complex, or tricky command

Summary: Work more efficiently in the terminal

3 GNUPlot

Published: 2017-09-04

Category: { Tools }

Tags: #Tools #Command Line #Bash

References: - GNUPLOT

Summary: Quickly make a graph from your command line

2 Amazon CloudWatch Logs

Published: 2019-03-11

Category: { Tools }

Tags: #Tools #AWS

References:
- Built-in magic commands for Jupyter
- AWS Cloudwatch

Summary: CloudWatch Logs as a tool for pipeline logs

1 Jupyter Notebook

Published: 2018-06-20

Category: { Tools }

Tags: #Tools #Jupyter

References: - Built-in magic command @ ipython

Summary: Jupyter Notebook is a useful tool for data scientists
