alnoda-workspaces/workspaces/notebook-workspace
2022-07-05 19:02:07 +00:00
..
docs candidates notebook additions 2022-07-05 19:02:07 +00:00
examples notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
img notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
luigi notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
mercury notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
mkdocs notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
nbviewer notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
tutorials notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
1.15.0.package.json notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
Dockerfile notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
frontpage_copy.json notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
jupyter-requirements.txt notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
notebooks-requirements.txt notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
README.md notebook readme typo fix 2022-06-23 08:02:47 +00:00
settings.json notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00
supervisord-notebooks.conf notebook and kubespray workspaces 2022-06-20 18:24:28 +00:00

Alnoda logo

Jupyter notebook workspace

Opinionated Jupyter notebook & Jupyter Lab installation, together with the toolset to get the most out of the Jupyter notebooks.

Explore and analyze data, build reports, presentations and documentation. Convert notebooks to various formats. Serve notebooks for reporting. Create data pipelines from notebooks. Schedule executions with UI and monitoring tool.

Workspace collage

Why this images

  1. You need a ready-to-go Jupyter environment with batteries included, such as pandas dataframes vizualisation and exploration, code autocompletion, etc. And you don't want to install all the extensions and struggle with dependencies resolution.
  2. You need to schedule executions of your notebooks, whether it is on local laptop or on the server
  3. You need to make use of your notebooks - create slides, PDFs, HTML, reports, docs
  4. You want to turn notebooks into analytical reporting apps or interactive web applications
  5. You want to productionalize notebooks - create pipelines of batch jobs and schedule their executions

There are short tutorials on how to use most of the mentioned features, with example notebooks.

Start

docker run --name space-1 -d -p 8020-8040:8020-8040 alnoda/notebook-workspace

and open localhost:8020 in browser.

Features

Notebooks

  • Jupyter Notebook - Classic web-based notebook environment for interactive computing.
  • JupyterLab - An extensible environment for interactive and reproducible computing, based on the Jupyter Notebook.
  • VS-Code notebooks extension - Jupyter Notebooks in Visual Studio Code

Notebook execution

  • NBClient - Run Jupyter Notebooks in different execution contexts, including the command line.
  • Nbterm - Lets you view, edit and execute Jupyter Notebooks in the terminal, and produce the new notebook as a result of execution.
  • Papermill - A tool for parameterizing, executing, and analyzing Jupyter Notebooks.

Convert Notebooks to other formats

  • Jupytext - Can save Jupyter notebooks as python or markdown.
  • Nbconvert - Tool that allows to convert an .ipynb notebook file into various static formats including executable scripts.
  • nbless - Conversion of python notebooks into various commands, including .py executable scripts.

Notebook as reports

  • Voila - Voilà turns Jupyter notebooks into standalone web applications.
  • NBViewer - A simple way to view and share Jupyter Notebooks.
  • Datapane -The easiest way to create data science reports from Python.

Interactive web applications

  • Mercury - Mercury is a perfect tool to convert any Jupyter notebook to interactive web app.
  • Jupyter Dash - Develop Plotly Dash apps interactively from within Jupyter environments.

Data visualisation and exploration

  • D-Tale - is a comprehensive solution that allows to explore and analyze raw data files, as well as pandas dataframes. D-Tale is extremely powerful, is great toolset to visualise, explore and study the datasets.
  • Lux - Recommends a set of visualizations highlighting interesting trends and patterns in the pandas data frame.
  • Qgrid - Jupyter notebook widget which uses SlickGrid to render pandas data frames.
  • Data Preview - VS-Code extension which provides tabular view for csv files and allows interactive exploration of data.

Pipelines: scheduling and execution

  • Ploomber - The fastest way to build data pipelines with Jupyter notebooks.
  • Luigi - Build complex pipelines of batch jobs. Luigi handles dependency resolution, workflow management, visualization etc.
  • Cronicle - Task scheduler and runner, with a web based front-end UI. It handles both scheduled, repeating and on-demand jobs, targeting any number of worker servers, with real-time stats and live log viewer.

Documentation and slideshow

  • Jupyter book - Create beautiful, publication-quality books and documents from computational content.
  • MkDocs - Create awesome documentation for your project with only markdown.
  • RISE - Allows you to instantly turn your Jupyter Notebooks into a slideshow.

Jupyter extensions

  • Jupyterlab-lsp - Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename.
  • Jupyterlab-theme-toggle - Toggle light/dark themes in the JupyterLab Top Bar area.

Python data and viz packages

  • Pandas. Fast, powerful, flexible and easy to use open source data analysis and manipulation tool.
  • Fugue SQL - SQL for Pandas.
  • Numpy - Fundamental package for scientific computing with Python.
  • SciPy - Software for mathematics, science, and engineering.
  • Matplotlib - Comprehensive library for creating static, animated, and interactive visualizations in Python
  • Seaborn - Data visualization library which provides a high-level interface for drawing attractive and informative statistical graphics.
  • Plotline - An implementation of a grammar of graphics in Python, it is based on ggplot2.
  • Plotly - Interactive, open-source plotting library that supports over 40 unique chart types.
  • Altair - Altair is a declarative statistical visualization library for Python. Altair's API is simple, friendly and consistent.

Python tools:

Dev tools:

  • VS-Code - browser-based open source version of popular Visual Studio Code IDE. It has VS-Code extensions and works in browser. This means it can run inside a docker container on local machine or in cloud. A lot of beautiful color themes and many common plugins are already installed to save time.
  • Terminal - secure browser-based terminal.
  • FileBrowser - manage files and folders inside the workspace, and exchange data between local environment and the workspace
  • Static File Server - view any static html sites as easy as if you do it on your local machine. Serve static websites easily.
  • Ungit - rings user friendliness to git without sacrificing the versatility of it.
  • Midnight Commander - Feature rich visual file manager with internal text viewer and editor.
  • Process Monitor - Monitor running process and resource utilization.
  • Quicklaunch UI with getting started tutorial

Image is built from Ubuntu 20.4 with the additional CLI apps

  • Zsh, Oh my Zsh
  • Python 3, Pip
  • Node/nodeenv
  • curl, wget, telnet, jq
  • Git: git, git-flow, lazygit
  • File browsers: mc, xplr
  • Text editors: nano, vim, mcedit
  • System monitors: ncdu, htop, glances, vizex
  • Process Control: supervisord
  • Job scheduler: cron

Docs

See our guides on

Demo

Demo: Notebook workspace

notebook-wid-demo