Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

Last update: Jul 22, 2022

Related tags

Overview

Datashredder

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

You can chose the chance of corruption e.g i have a chance of 100 therfore there is a 1 in 100 chance of the next peice of data to be corrupted this allows you to controll how much corruption you want.

You can also chose to have a random peice of corruption data or random e.g Corruption data is FF

Not Corrupted: 30 32 35 53 f0 72

Corrupted: 30 FF 35 53 FF 72

A random corruption would chose a random corruption data each iteration

Examples

Cats

Each image has a corruption data of 00

There is 206824 iterations on this image

Not corrupted image

Corrupted images

Image #	Chance	Corruptions
1	2000	39
2	1500	133
3	1000	200
4	500	432
5	200	1020
6	100	2069

simple way to build the declarative and destributed data pipelines with python

unipipeline simple way to build the declarative and distributed data pipelines. Why you should use it Declarative strict config Scaffolding Fully type

0 Jan 26, 2022

A python package which can be pip installed to perform statistics and visualize binomial and gaussian distributions of the dataset

GBiStat package A python package to assist programmers with data analysis. This package could be used to plot : Binomial Distribution of the dataset p

4 Oct 17, 2022

Python data processing, analysis, visualization, and data operations

Python This is a Python data processing, analysis, visualization and data operations of the source code warehouse, book ISBN: 9787115527592 Descriptio

1 Jan 16, 2022

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift This project is composed of two parts: Part1 and Part2

1 Jan 19, 2022

A computer algebra system written in pure Python

SymPy See the AUTHORS file for the list of authors. And many more people helped on the SymPy mailing list, reported bugs, helped organize SymPy's part

9.9k Dec 31, 2022

Very basic but functional Kakuro solver written in Python.

kakuro.py Very basic but functional Kakuro solver written in Python. It uses a reduction to exact set cover and Ali Assaf's elegant implementation of

4 Jan 15, 2022

Catalogue data - A Python Scripts to prepare catalogue data

catalogue_data Scripts to prepare catalogue data. Setup Clone this repo. Install

3 Mar 3, 2022

Convert tables stored as images to an usable .csv file

Convert an image of numbers to a .csv file This Python program aims to convert images of array numbers to corresponding .csv files. It uses OpenCV for

711 Dec 26, 2022

fds is a tool for Data Scientists made by DAGsHub to version control data and code at once.

Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc

359 Dec 22, 2022

Releases(0.2.17)

0.2.17(Nov 18, 2021)
Changes:

Bug patches 9433cbf501bf18b2871df117121e8dbaed9a46dd

Removed tqdm 9ad0d65c49226755f5d7dffad99a5698ada68d22

Install Command: pip install pip install Datashredder==0.2.17

Full Changelog: https://github.com/awesomelewis2007/Datashredder/compare/0.2.15...0.2.17
Source code(tar.gz)
Source code(zip)
Datashredder-0.2.17-py3-none-any.whl(16.34 KB)
Datashredder-0.2.17.tar.gz(16.13 KB)
0.2.15(Nov 14, 2021)
Changes:

Added C installer

Added C help file

Added Makefile

Added pyproject.toml

Added setup.py

Improved Demo

Install Command: pip install pip install Datashredder==0.2.15

Full Changelog: https://github.com/awesomelewis2007/Datashredder/compare/0.1.10...0.2.15
Source code(tar.gz)
Source code(zip)
Datashredder-0.2.15-py3-none-any.whl(14.97 KB)
Datashredder-0.2.15.tar.gz(15.82 KB)
0.1.10(Oct 31, 2021)

This is the first release of datashredder

This release is not on pypi Full Changelog: https://github.com/awesomelewis2007/Datashredder/commits/0.1.10
Source code(tar.gz)
Source code(zip)

Owner

GitHub Repository

Geospatial data-science analysis on reasons behind delay in Grab ride-share services

Grab x Pulis Detailed analysis done to investigate possible reasons for delay in Grab services for NUS Data Analytics Competition 2022, to be found in

6 Jun 07, 2022

A columnar data container that can be compressed.

Unmaintained Package Notice Unfortunately, and due to lack of resources, the Blosc Development Team is unable to maintain this package anymore. During

944 Dec 09, 2022

fds is a tool for Data Scientists made by DAGsHub to version control data and code at once.

Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc

359 Dec 22, 2022

Stitch together Nanopore tiled amplicon data without polishing a reference

Stitch together Nanopore tiled amplicon data using a reference guided approach Tiled amplicon data, like those produced from primers designed with pri

14 Aug 30, 2022

A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms

MatrixProfile MatrixProfile is a Python 3 library, brought to you by the Matrix Profile Foundation, for mining time series data. The Matrix Profile is

302 Dec 29, 2022

pyhsmm MITpyhsmm - Bayesian inference in HSMMs and HMMs. MIT

Bayesian inference in HSMMs and HMMs This is a Python library for approximate unsupervised inference in Bayesian Hidden Markov Models (HMMs) and expli

527 Dec 04, 2022

Handle, manipulate, and convert data with units in Python

unyt A package for handling numpy arrays with units. Often writing code that deals with data that has units can be confusing. A function might return

304 Jan 02, 2023

DaDRA (day-druh) is a Python library for Data-Driven Reachability Analysis.

DaDRA (day-druh) is a Python library for Data-Driven Reachability Analysis. The main goal of the package is to accelerate the process of computing estimates of forward reachable sets for nonlinear dy

2 Nov 08, 2021

Fancy data functions that will make your life as a data scientist easier.

WhiteBox Utilities Toolkit: Tools to make your life easier Fancy data functions that will make your life as a data scientist easier. Installing To ins

3 Oct 03, 2022

Pypeln is a simple yet powerful Python library for creating concurrent data pipelines.

Pypeln Pypeln (pronounced as "pypeline") is a simple yet powerful Python library for creating concurrent data pipelines. Main Features Simple: Pypeln

1.4k Dec 31, 2022

This cosmetics generator allows you to generate the new Fortnite cosmetics, Search pak and search cosmetics!

COSMETICS GENERATOR This cosmetics generator allows you to generate the new Fortnite cosmetics, Search pak and search cosmetics! Remember to put the l

11 Dec 13, 2022

The lastest all in one bombing tool coded in python uses tbomb api

BaapG-Attack is a python3 based script which is officially made for linux based distro . It is inbuit mass bomber with sms, mail, calls and many more bombing

59 Dec 25, 2022

Spaghetti: an open-source Python library for the analysis of network-based spatial data

pysal/spaghetti SPAtial GrapHs: nETworks, Topology, & Inference Spaghetti is an open-source Python library for the analysis of network-based spatial d

203 Jan 03, 2023

A DSL for data-driven computational pipelines

"Dataflow variables are spectacularly expressive in concurrent programming" Henri E. Bal , Jennifer G. Steiner , Andrew S. Tanenbaum Quick overview Ne

1.9k Jan 03, 2023

Snakemake workflow for converting FASTQ files to self-contained CRAM files with maximum lossless compression.

Snakemake workflow: name A Snakemake workflow for description Usage The usage of this workflow is described in the Snakemake Workflow Catalog. If

1 Dec 16, 2021

Universal data analysis tools for atmospheric sciences

U_analysis Universal data analysis tools for atmospheric sciences Script written in python 3. This file defines multiple functions that can be used fo

1 Oct 10, 2021

t-SNE and hierarchical clustering are popular methods of exploratory data analysis, particularly in biology.

tree-SNE t-SNE and hierarchical clustering are popular methods of exploratory data analysis, particularly in biology. Building on recent advances in s

61 Nov 21, 2022

Multiple Pairwise Comparisons (Post Hoc) Tests in Python

scikit-posthocs is a Python package that provides post hoc tests for pairwise multiple comparisons that are usually performed in statistical data anal

264 Dec 30, 2022

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video. You can chose the cha

2 Jul 22, 2022

Building house price data pipelines with Apache Beam and Spark on GCP

This project contains the process from building a web crawler to extract the raw data of house price to create ETL pipelines using Google Could Platform services.

1 Nov 22, 2021

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

Related tags

Overview

Datashredder

Examples

Cats

Not corrupted image

Corrupted images

You might also like...

simple way to build the declarative and destributed data pipelines with python

A python package which can be pip installed to perform statistics and visualize binomial and gaussian distributions of the dataset

Python data processing, analysis, visualization, and data operations

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

A computer algebra system written in pure Python

Very basic but functional Kakuro solver written in Python.

Catalogue data - A Python Scripts to prepare catalogue data

Convert tables stored as images to an usable .csv file

fds is a tool for Data Scientists made by DAGsHub to version control data and code at once.

Releases(0.2.17)

0.2.17(Nov 18, 2021)

0.2.15(Nov 14, 2021)

0.1.10(Oct 31, 2021)

Owner

Geospatial data-science analysis on reasons behind delay in Grab ride-share services

A columnar data container that can be compressed.

fds is a tool for Data Scientists made by DAGsHub to version control data and code at once.

Stitch together Nanopore tiled amplicon data without polishing a reference

A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms

pyhsmm MITpyhsmm - Bayesian inference in HSMMs and HMMs. MIT

Handle, manipulate, and convert data with units in Python

DaDRA (day-druh) is a Python library for Data-Driven Reachability Analysis.

Fancy data functions that will make your life as a data scientist easier.

Pypeln is a simple yet powerful Python library for creating concurrent data pipelines.

This cosmetics generator allows you to generate the new Fortnite cosmetics, Search pak and search cosmetics!

The lastest all in one bombing tool coded in python uses tbomb api

Spaghetti: an open-source Python library for the analysis of network-based spatial data

A DSL for data-driven computational pipelines

Snakemake workflow for converting FASTQ files to self-contained CRAM files with maximum lossless compression.

Universal data analysis tools for atmospheric sciences

t-SNE and hierarchical clustering are popular methods of exploratory data analysis, particularly in biology.

Multiple Pairwise Comparisons (Post Hoc) Tests in Python

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

Building house price data pipelines with Apache Beam and Spark on GCP