Ricardo M. Ferraz Leal
Panel: 3-dimensional array: items, major_axis, minor_axis
Panel4D (Experimental): labels, items, major_axis, minor_axis
Cython operations!
Parallel computing: threading, multiprocessing, etc.
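As a minimal sketch of the threading option, the standard-library `concurrent.futures` module can map a function over several inputs in parallel. The `word_count` helper and the `chunks` data are hypothetical, chosen only for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def word_count(text):
    """Count whitespace-separated words in one chunk of text (hypothetical helper)."""
    return len(text.split())


# Hypothetical input: three independent chunks of work.
chunks = ["a b c", "d e", "f g h i"]

# Each chunk is handed to a worker thread; map() returns results in input order.
with ThreadPoolExecutor(max_workers=3) as pool:
    counts = list(pool.map(word_count, chunks))

total = sum(counts)
print(counts, total)
```

Threads mostly help with I/O-bound work because of the global interpreter lock. For CPU-bound work, `ProcessPoolExecutor` from the same module offers the same `map` interface on top of multiprocessing.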
Together, they allow for easy analysis of scientific datasets that don’t fit into memory.
DistArray provides general multidimensional NumPy-like distributed arrays to Python.
It intends to bring the strengths of NumPy to data-parallel high-performance computing.
DistArray has a similar API to NumPy.
Python with types...
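from numba import jit, vectorize, float64

# jit decorator tells Numba to compile this function.
# The argument types will be inferred by Numba when function is called.
@jit
def sum2d(arr):
    result = 0.0  # body lost in the source; a plain 2-D sum is assumed
    for i in range(arr.shape[0]):
        for j in range(arr.shape[1]):
            result += arr[i, j]
    return result

@vectorize([float64(float64, float64)])
def f(x, y):
    return x + y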
Castra is an on-disk, partitioned, compressed, column store. Castra provides efficient columnar range queries.
To convert file/data formats
Formats:
The Blaze ecosystem is a set of libraries that help users store, describe, query and process data. It is composed of the following core projects:
import seaborn as sns
sns.jointplot(data=df, kind="kde");