Calling Julia from Python - an experiment on data loading

Last update: Jun 07, 2022

Overview

Calling Julia from Python - an experiment on data loading

See the slides.

TLDR

After reading Patrick's blog post, we decided to try to replace C++ with Julia to check:

How easy/hard it is
How much improvement can be gained with a basic version
How much improvement can be gained with an optimized version

A basic version is already an improvement over the pure Python version, and an optimized version was faster than the C++ version.

Reproduction

Follow Patrick's blog post to install the C++ part.
Install Julia (We've used Julia 1.6.3)
- I recommend using Jill
- We'll refer to this Julia as path/to/julia.
Install Python
- Ideally, one dynamically linked to libpython.
- To test it, use ldd path/to/python and look for libpython3.9. It should exist for the shared version.
- If you don't have, look into workarounds here
- Tip: Archlinux's system Python is dynamically linked.
- We've used Python 3.9.7 from Archlinux.
Open Julia and enter the following commands:
- ENV["PYTHON"] = "path/to/python"
- using Pkg
- Pkg.add("PyCall")
- This will make sure that the packages we are installing use the correct Python version
Install juliapy with path/to/python -m pip install julia
Run path/to/python and enter
- import julia
- julia.install("julia=path/to/julia")
Download dataset and store in gen-data folder:
Run scalability_test.py - it should take several hours (over 10) and consume a moderate amount of memory.
Run scalability_analysis.py.

You might also like...

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Apache MXNet (incubating) for Deep Learning Master Docs License Apache MXNet (incubating) is a deep learning framework designed for both efficiency an

29 Nov 16, 2022

Numba-accelerated Pythonic implementation of MPDATA with examples in Python, Julia and Matlab

PyMPDATA PyMPDATA is a high-performance Numba-accelerated Pythonic implementation of the MPDATA algorithm of Smolarkiewicz et al. used in geophysical

Atmospheric Cloud Simulation Group @ Jagiellonian University

15 Nov 23, 2022

Python and Julia in harmony.

PythonCall & JuliaCall Bringing Python® and Julia together in seamless harmony: Call Python code from Julia and Julia code from Python via a symmetric

414 Jan 7, 2023

Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab

PySDM PySDM is a package for simulating the dynamics of population of particles. It is intended to serve as a building block for simulation systems mo

32 Oct 18, 2022

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

Comments

Fix python versions ~~using poetry~~

To prevent this pull request from becoming too large, I'll merge this and create a new issue to set the python versions.

Originally posted by @abelsiqueira in https://github.com/abelsiqueira/call-julia-from-python-experiments/issues/1#issuecomment-987970132

opened by abelsiqueira 1
Improve docker-10
Fixes: #10

Changes Ubuntu version to 21.10

Adds extra environment variables

Removes the Python virtual environment

Add make flags to compile the tools faster

Remove the downloaded tar files

Uninstall dev dependencies
opened by fdiblen 0

Calling Julia from Python - an experiment on data loading

Related tags

Overview

Calling Julia from Python - an experiment on data loading

TLDR

Reproduction

You might also like...

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Numba-accelerated Pythonic implementation of MPDATA with examples in Python, Julia and Matlab

Python and Julia in harmony.

Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

Small-bets - Ergodic Experiment With Python

Perspective: Julia for Biologists

MacroTools provides a library of tools for working with Julia code and expressions.

✔️ Visual, reactive testing library for Julia. Time machine included.

Comments

Fix python versions ~~using poetry~~

Improve docker-10

Releases(v0.3.0)

v0.3.0(Jan 4, 2022)

v0.2.0(Dec 10, 2021)

v0.1.0(Nov 17, 2021)

v0.1.0-rc2(Nov 17, 2021)

v0.1.0-rc(Nov 17, 2021)

Owner

Abel Siqueira

Kaggle Ultrasound Nerve Segmentation competition [Keras]

Official PyTorch implementation of "Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks" (AAAI 2022)

Training a deep learning model on the noisy CIFAR dataset

Ascend your Jupyter Notebook usage

Automatic number plate recognition using tech: Yolo, OCR, Scene text detection, scene text recognation, flask, torch

Bib-parser - Convenient script to parse .bib files with the ACM Digital Library like metadata

Apply AnimeGAN-v2 across frames of a video clip

Collection of TensorFlow2 implementations of Generative Adversarial Network varieties presented in research papers.

(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification

GitHub repository for "Improving Video Generation for Multi-functional Applications"

Stochastic Extragradient: General Analysis and Improved Rates

This is the repo for the paper "Improving the Accuracy-Memory Trade-Off of Random Forests Via Leaf-Refinement".

Joint Learning of 3D Shape Retrieval and Deformation, CVPR 2021

HGCAE Pytorch implementation. CVPR2021 accepted.

Object detection, 3D detection, and pose estimation using center point detection:

Storchastic is a PyTorch library for stochastic gradient estimation in Deep Learning

Implementation of "Debiasing Item-to-Item Recommendations With Small Annotated Datasets" (RecSys '20)

[CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)

Keras implementation of the GNM model in paper ’Graph-Based Semi-Supervised Learning with Nonignorable Nonresponses‘

BanditPAM: Almost Linear-Time k-Medoids Clustering

Fix python versions using poetry