Ricardo M. Ferraz Leal
Panel: 3-dimensional array: items, major_axis, minor_axis
Panel4D (Experimental): labels, items, major_axis, minor_axis
cython operations!
Parallel computing: threading, multiprocessing, etc.
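As a rough illustration of the threading option mentioned above, here is a minimal sketch using only the standard library's `concurrent.futures`. The names `row_sum` and `parallel_row_sums` and the sample data are hypothetical, chosen for this example and not taken from the slides.

```python
from concurrent.futures import ThreadPoolExecutor


def row_sum(row):
    """Sum one row of numbers (hypothetical per-task work)."""
    return sum(row)


def parallel_row_sums(rows, max_workers=4):
    """Apply row_sum to each row on a thread pool.

    pool.map returns results in the same order as the input rows.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(row_sum, rows))


data = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(parallel_row_sums(data))
```

Threads in CPython share the global interpreter lock, so pure-Python CPU-bound work like this may not speed up; `ProcessPoolExecutor` from the same module offers the same `map` interface on top of multiprocessing.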
Together, they allow for easy analysis of scientific datasets that don’t fit into memory.
DistArray provides general multidimensional NumPy-like distributed arrays to Python.
It intends to bring the strengths of NumPy to data-parallel high-performance computing.
DistArray has a similar API to NumPy.
Python with types...
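from numba import jit, vectorize, float64

# The jit decorator tells Numba to compile this function.
# The argument types will be inferred by Numba when the function is called.
@jit
def sum2d(arr):
    # The body is not shown in the source; a plain nested-loop sum is assumed.
    total = 0.0
    for i in range(arr.shape[0]):
        for j in range(arr.shape[1]):
            total += arr[i, j]
    return total

# vectorize compiles f into a NumPy ufunc for the given type signature.
@vectorize([float64(float64, float64)])
def f(x, y):
    return x + y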
Castra is an on-disk, partitioned, compressed, column store. Castra provides efficient columnar range queries.
To convert file/data formats
Formats:
The Blaze ecosystem is a set of libraries that help users store, describe, query and process data. It is composed of the following core projects:
import seaborn as sns
sns.jointplot(data=df, kind="kde");
