Performant, differentiable reinforcement learning

Last update: Dec 27, 2022

Related tags

Deep Learning deluca

Overview

`deluca`

Performant, differentiable reinforcement learning

Notes

This is pre-alpha software and is undergoing a number of core changes. Updates to follow.
Please see the examples for guidance on how to use deluca

Comments

Exception error during installing deluca

Hi.

I am trying to install deluca and I get an Exception error. I am using

Ubuntu 64 on a virtual machine Pycharm CE 2021.2, Python 3.8 pip 212.1.2

I tried to install deluca with the package manager in Pycharm, the terminal in Pycharm and also the Ubuntu terminal. The error is the same. Note that I can install other normal packages like Numpy, Scipy, etc with no problem. Thanks in advance and I am looking forward to using this amazing package!

pip install deluca
Collecting deluca
   Using cached deluca-0.0.17-py3-none-any.whl (52 kB)
Collecting flax
   Using cached flax-0.3.4-py3-none-any.whl (183 kB)
Collecting brax
   Using cached brax-0.0.4-py3-none-any.whl (117 kB)
Processing
./.cache/pip/wheels/78/ae/07/bd3adac873fa80efc909c09331831905ac657dbb8d1278235e/jax-0.2.19-py3-none-any.whl
Collecting optax
   Using cached optax-0.0.9-py3-none-any.whl (118 kB)
Collecting scipy
   Using cached
scipy-1.7.1-cp38-cp38-manylinux_2_5_x86_64.manylinux1_x86_64.whl (28.4 MB)
Collecting numpy
   Using cached
numpy-1.21.2-cp38-cp38-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
(15.8 MB)
Collecting matplotlib
   Using cached matplotlib-3.4.3-cp38-cp38-manylinux1_x86_64.whl (10.3 MB)
Collecting msgpack
   Using cached msgpack-1.0.2-cp38-cp38-manylinux1_x86_64.whl (302 kB)
Collecting grpcio
   Using cached grpcio-1.39.0-cp38-cp38-manylinux2014_x86_64.whl (4.3 MB)
Collecting clu
   Using cached clu-0.0.6-py3-none-any.whl (77 kB)
Collecting gym
   Using cached gym-0.19.0.tar.gz (1.6 MB)
Collecting absl-py
   Using cached absl_py-0.13.0-py3-none-any.whl (132 kB)
Collecting tfp-nightly[jax]<=0.13.0.dev20210422
   Using cached tfp_nightly-0.13.0.dev20210422-py2.py3-none-any.whl (5.3 MB)
Collecting jaxlib
   Using cached jaxlib-0.1.70-cp38-none-manylinux2010_x86_64.whl (46.9 MB)
Collecting dataclasses
   Using cached dataclasses-0.6-py3-none-any.whl (14 kB)
Collecting opt-einsum
   Using cached opt_einsum-3.3.0-py3-none-any.whl (65 kB)
Collecting chex>=0.0.4
   Using cached chex-0.0.8-py3-none-any.whl (57 kB)
Requirement already satisfied: pillow>=6.2.0 in
/usr/lib/python3/dist-packages (from matplotlib->flax->deluca) (7.0.0)
Collecting cycler>=0.10
   Using cached cycler-0.10.0-py2.py3-none-any.whl (6.5 kB)
Collecting pyparsing>=2.2.1
   Using cached pyparsing-2.4.7-py2.py3-none-any.whl (67 kB)
Collecting kiwisolver>=1.0.1
   Using cached kiwisolver-1.3.1-cp38-cp38-manylinux1_x86_64.whl (1.2 MB)
Requirement already satisfied: python-dateutil>=2.7 in
/usr/lib/python3/dist-packages (from matplotlib->flax->deluca) (2.7.3)
Requirement already satisfied: six>=1.5.2 in
/usr/lib/python3/dist-packages (from grpcio->brax->deluca) (1.14.0)
Collecting tensorflow-datasets
   Using cached tensorflow_datasets-4.4.0-py3-none-any.whl (4.0 MB)
Collecting packaging
   Using cached packaging-21.0-py3-none-any.whl (40 kB)
Collecting ml-collections
   Using cached ml_collections-0.1.0-py3-none-any.whl (88 kB)
Collecting tensorflow
   Downloading tensorflow-2.6.0-cp38-cp38-manylinux2010_x86_64.whl
(458.4 MB)
      |▋                               | 8.4 MB 16 kB/s eta
7:44:54ERROR: Exception:
Traceback (most recent call last):
   File
"/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
line 425, in _error_catcher
     yield
   File
"/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
line 507, in read
     data = self._fp.read(amt) if not fp_closed else b""
   File
"/usr/share/python-wheels/CacheControl-0.12.6-py2.py3-none-any.whl/cachecontrol/filewrapper.py",
line 62, in read
     data = self.__fp.read(amt)
   File "/usr/lib/python3.8/http/client.py", line 455, in read
     n = self.readinto(b)
   File "/usr/lib/python3.8/http/client.py", line 499, in readinto
     n = self.fp.readinto(b)
   File "/usr/lib/python3.8/socket.py", line 669, in readinto
     return self._sock.recv_into(b)
   File "/usr/lib/python3.8/ssl.py", line 1241, in recv_into
     return self.read(nbytes, buffer)
   File "/usr/lib/python3.8/ssl.py", line 1099, in read
     return self._sslobj.read(len, buffer)
socket.timeout: The read operation timed out

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
   File
"/usr/lib/python3/dist-packages/pip/_internal/cli/base_command.py", line
186, in _main
     status = self.run(options, args)
   File
"/usr/lib/python3/dist-packages/pip/_internal/commands/install.py", line
357, in run
     resolver.resolve(requirement_set)
   File
"/usr/lib/python3/dist-packages/pip/_internal/legacy_resolve.py", line
177, in resolve
     discovered_reqs.extend(self._resolve_one(requirement_set, req))
   File
"/usr/lib/python3/dist-packages/pip/_internal/legacy_resolve.py", line
333, in _resolve_one
     abstract_dist = self._get_abstract_dist_for(req_to_install)
   File
"/usr/lib/python3/dist-packages/pip/_internal/legacy_resolve.py", line
282, in _get_abstract_dist_for
     abstract_dist = self.preparer.prepare_linked_requirement(req)
   File
"/usr/lib/python3/dist-packages/pip/_internal/operations/prepare.py",
line 480, in prepare_linked_requirement
     local_path = unpack_url(
   File
"/usr/lib/python3/dist-packages/pip/_internal/operations/prepare.py",
line 282, in unpack_url
     return unpack_http_url(
   File
"/usr/lib/python3/dist-packages/pip/_internal/operations/prepare.py",
line 158, in unpack_http_url
     from_path, content_type = _download_http_url(
   File
"/usr/lib/python3/dist-packages/pip/_internal/operations/prepare.py",
line 303, in _download_http_url
     for chunk in download.chunks:
   File "/usr/lib/python3/dist-packages/pip/_internal/utils/ui.py", line
160, in iter
     for x in it:
   File "/usr/lib/python3/dist-packages/pip/_internal/network/utils.py",
line 15, in response_chunks
     for chunk in response.raw.stream(
   File
"/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
line 564, in stream
     data = self.read(amt=amt, decode_content=decode_content)
   File
"/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
line 529, in read
     raise IncompleteRead(self._fp_bytes_read, self.length_remaining)
   File "/usr/lib/python3.8/contextlib.py", line 131, in __exit__
     self.gen.throw(type, value, traceback)
   File
"/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
line 430, in _error_catcher
     raise ReadTimeoutError(self._pool, None, "Read timed out.")
urllib3.exceptions.ReadTimeoutError:
HTTPSConnectionPool(host='files.pythonhosted.org', port=443): Read timed
out.

opened by FarnazAdib 4

Internal change

Internal change

FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc

opened by copybara-service[bot] 0
Internal change

Internal change

FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc

opened by copybara-service[bot] 0
Internal change

Internal change

FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc
cla: yes

opened by copybara-service[bot] 0
Internal change

Internal change

FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc
cla: yes

opened by copybara-service[bot] 0
Consider dependency on OpenAI Gym
Not clear what the benefits of compatibility are since existing agents that work on OpenAI Gym environments have no guarantee of working on deluca environments

OpenAI Gym bundles environment with initialization and task. Not necessarily something we want to do.
opened by danielsuo 0
Changes to _adaptive.py

Hello! I made some modifications to AdaGPC (in _adaptive.py). In the existing implementation, GPC outperforms AdaGPC in the known LDS setting, which is the opposite of what one should expect. Based on some preliminary experiments, I believe AdaGPC is now working properly (at least in the known dynamics version). (I also made some miscellaneous changes in other files, e.g., to the imports in some of the agent files -- I think there might have been some file restructuring across different versions of deluca, but the imports were not updated to reflect this change, causing some errors at runtime.) Please let me know if you have any questions/concerns. Thanks!

opened by simran135 1
[JAX] Avoid private implementation detail _ScalarMeta.

[JAX] Avoid private implementation detail _ScalarMeta.

The closest public approximation to type(jnp.float32) is type[Any]. Nothing is ever actually an instance of one of these types, either (they build DeviceArrays if instantiated.)

opened by copybara-service[bot] 0
[JAX] Avoid private implementation detail _ScalarMeta.

[JAX] Avoid private implementation detail _ScalarMeta.

The closest public approximation to type(jnp.float32) is type[Any]. Nothing is ever actually an instance of one of these types, either (they build DeviceArrays if instantiated.)

opened by copybara-service[bot] 0
Internal change

Internal change

FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc

opened by copybara-service[bot] 0
Implementation of drc

Hi

Thanks for providing this interesting package.

I am trying to test drc on a simple setup and I notice that the current implementation of drc does not work. I mean when I try it for a simple partially observable linear system with A = np.array([[1.0 0.95], [0.0, -0.9]]), B = np.array([[0.0], [1.0]]) C = np.array([[1.0, 0]]) Q , R = I gaussian process noise, zero observation noise which is open loop stable, the controller acts like a zero controller. I tried to get a different response by setting the hyperparameters but they are mostly the same. Then I looked at the implementation at the deluca github and I noticed that the counterfactual cost is not defined correctly (if I am not wrong). According to Algorithm 1 in [1], we need to use M_t to compute y_t (which depends on the previous controls (u) using again M_t) but in the implementation, the previous controls based on M_{t-i} are used. Anyway, I implemented the algorithm using M_t but what I get after the simulation is either close to zero control or an unstable one.

I was wondering if you have any code example for the DRC algorithm that works? [1] Simchowitz, Max and Singh, Karan and Hazan, Elad, "Improper learning for non-stochastic control", COLT 2020.

Thanks a lot, Sincerely, Farnaz

opened by FarnazAdib 4

Releases(v0.0.17)

v0.0.17(Jul 19, 2021)

Please see https://readthedocs.org/projects/deluca for details about this release.
Source code(tar.gz)
Source code(zip)
v0.0.16(Jul 19, 2021)

Please see https://readthedocs.org/projects/deluca for details about this release.
Source code(tar.gz)
Source code(zip)
v0.0.15(Jul 19, 2021)

Please see https://readthedocs.org/projects/deluca for details about this release.
Source code(tar.gz)
Source code(zip)
v0.0.13(Jul 18, 2021)

Please see https://readthedocs.org/projects/deluca for details about this release.
Source code(tar.gz)
Source code(zip)
v0.0.12(Jul 18, 2021)

Please see https://readthedocs.org/projects/deluca for details about this release.
Source code(tar.gz)
Source code(zip)
v0.0.11(Jul 18, 2021)

Please see https://readthedocs.org/projects/deluca for details about this release.
Source code(tar.gz)
Source code(zip)

Owner

Google

Google ❤️ Open Source

GitHub Repository

African language Speech Recognition - Speech-to-Text

Swahili-Speech-To-Text Table of Contents Swahili-Speech-To-Text Overview Scenario Approach Project Structure data: models: notebooks: scripts tests: l

2 Jan 05, 2023

Credit fraud detection in Python using a Jupyter Notebook

Credit-Fraud-Detection - Credit fraud detection in Python using a Jupyter Notebook , using three classification models (Random Forest, Gaussian Naive Bayes, Logistic Regression) from the sklearn libr

4 Dec 28, 2021

Unsupervised Foreground Extraction via Deep Region Competition

Unsupervised Foreground Extraction via Deep Region Competition [Paper] [Code] The official code repository for NeurIPS 2021 paper "Unsupervised Foregr

28 Nov 06, 2022

Original code for "Zero-Shot Domain Adaptation with a Physics Prior"

Zero-Shot Domain Adaptation with a Physics Prior [arXiv] [sup. material] - ICCV 2021 Oral paper, by Attila Lengyel, Sourav Garg, Michael Milford and J

40 Dec 21, 2022

Code for our paper "Interactive Analysis of CNN Robustness"

Perturber Code for our paper "Interactive Analysis of CNN Robustness" Datasets Feature visualizations: Google Drive Fine-tuning checkpoints as saved m

0 Aug 17, 2021

The pytorch implementation of DG-Font: Deformable Generative Networks for Unsupervised Font Generation

DG-Font: Deformable Generative Networks for Unsupervised Font Generation The source code for 'DG-Font: Deformable Generative Networks for Unsupervised

130 Dec 05, 2022

This repository includes different versions of the prescribed-time controller as Simulink blocks and MATLAB script codes for engineering applications.

Prescribed-time Control Prescribed-time control (PTC) blocks in Simulink environment, MATLAB R2020b. For more theoretical details, refer to the papers

1 Mar 11, 2022

Measure WWjj polarization fraction

WlWl Polarization Measure WWjj polarization fraction Paper: arXiv:2109.09924 Notice: This code can only be used for the inference process, if you want

4 Apr 10, 2022

Official NumPy Implementation of Deep Networks from the Principle of Rate Reduction (2021)

Deep Networks from the Principle of Rate Reduction This repository is the official NumPy implementation of the paper Deep Networks from the Principle

49 Dec 16, 2022

FedMM: Saddle Point Optimization for Federated Adversarial Domain Adaptation

This repository contains the code accompanying the paper " FedMM: Saddle Point Optimization for Federated Adversarial Domain Adaptation" Paper link: R

20 Jun 29, 2022

Adversarial-Information-Bottleneck - Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck (NeurIPS21)

NeurIPS 2021 Title: Distilling Robust and Non-Robust Features in Adversarial Exa

35 Dec 26, 2022

Performant, differentiable reinforcement learning

Related tags

Overview

deluca

Notes

Comments

Releases(v0.0.17)

v0.0.17(Jul 19, 2021)

v0.0.16(Jul 19, 2021)

v0.0.15(Jul 19, 2021)

v0.0.13(Jul 18, 2021)

v0.0.12(Jul 18, 2021)

v0.0.11(Jul 18, 2021)

Owner

Google

African language Speech Recognition - Speech-to-Text

Credit fraud detection in Python using a Jupyter Notebook

Unsupervised Foreground Extraction via Deep Region Competition

Original code for "Zero-Shot Domain Adaptation with a Physics Prior"

Code for our paper "Interactive Analysis of CNN Robustness"

The pytorch implementation of DG-Font: Deformable Generative Networks for Unsupervised Font Generation

This repository includes different versions of the prescribed-time controller as Simulink blocks and MATLAB script codes for engineering applications.

Measure WWjj polarization fraction

Official NumPy Implementation of Deep Networks from the Principle of Rate Reduction (2021)

FedMM: Saddle Point Optimization for Federated Adversarial Domain Adaptation

Adversarial-Information-Bottleneck - Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck (NeurIPS21)

Machine learning and Deep learning models, deploy on telegram (the best social media)

Code for Contrastive-Geometry Networks for Generalized 3D Pose Transfer

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

Source code for deep symbolic optimization.

Efficiently Disentangle Causal Representations

🙄 Difficult algorithm, Simple code.

PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud, CVPR 2019.

This is the repo for Uncertainty Quantification 360 Toolkit.

Research on Event Accumulator Settings for Event-Based SLAM

`deluca`