Library for producing small, fast columnar storage for Hadoop workloads
ORC is a self-describing type-aware columnar file format designed
for Hadoop workloads. It is optimized for large streaming reads,
but with integrated support for finding required rows quickly.
Storing data in a columnar format lets the reader read, decompress,
and process only the values that are required for the current query.
Because ORC files are type-aware, the writer chooses the most
appropriate encoding for the type and builds an internal index as
the file is written. Predicate pushdown uses those indexes to
determine which stripes in a file need to be read for a particular
query and the row indexes can narrow the search to a particular set
of 10,000 rows. ORC supports the complete set of types in Hive,
including the complex types: structs, lists, maps, and unions.
- RPM
- liborc1-1.7.8-1.fc36.x86_64.rpm
- Summary
- Library for producing small, fast columnar storage for Hadoop workloads
- URL
- http://orc.apache.org/
- Group
- Unspecified
- License
- ASL 2.0
- Source
-
liborc-1.7.8-1.fc36.src.rpm
- Checksum
- 0b1dd8014955f7e51d7f10a5110471edde8606588453974f9c1fdcca91fd0a56
- Build Date
- 2023/03/07 01:29:31
- Requires
- Provides
-
liborc(x86-64) = 1.7.8-1.fc36
liborc.so.1
liborc1 = 1.7.8-1.fc36
liborc1(x86-64) = 1.7.8-1.fc36