In focus

    About Apache Arrow

    Apache Arrow enables execution engines to take advantage of the latest Single input multiple data operations included in modern processors and the native vectorized optimization of analytical data processing. Columnar layout of data also allows for a better use of CPU caches by placing all data relevant to a column operation in as compact of a format as possible. Apache Arrow is not standalone piece of software but rather a component used to accelerate analytics within a particular system and to allow Apache Arrow systems to exchange data with low overhead. It is sufficiently flexible to support most complex data models. Apache Arrow acts as a new high performance interface between various systems. It is also focused on supporting a wide variety of industry standard programming languages. Java, C, C++, Python are underway and more languages are expected soon.