orc()

Description:

Retrieve data from an ORC file.  

Syntax:

f.orc([col,…];[n])

Note:

External library function (see the External Library Guide).

The function retrieves data from a local ORC file or an ORC file in HDFS.

Parameter:

f

A file object.

col

Fields to be retrieved; all fields are returned by default.

n

A positive integer specifying the number of records to be retrieved; all records are returned when this parameter is absent. This parameter is ignored when the @c option is used.

Option:

@c

Return a cursor.

@m

Retrieve data using multiple threads. When the @c option is also present, retrieve data through multiple cursors.

Example:

|  | A |  |
|---|---|---|
| 1 | =file("F:/tmp/mytest.orc") | Open a local ORC file. |
| 2 | =A1.orc() | Retrieve data from A1’s file and return all fields. |
| 3 | =A1.orc@c() | Return a cursor. |
| 4 | =file("hdfs://localhost:9000/user/hive/warehouse/test1.orc") | Open an ORC file in HDFS. |
| 5 | =A4.orc("product","store";10) | Retrieve only the specified fields and return the first 10 records. |
| 6 | =hive_open("hdfs://localhost:9000","thrift://localhost:9083","hive","asus") | Connect to the Hive database. |
| 7 | =hive_table@o(A6) | Find all ORC tables. |
| 8 | =A7.select(tableName=="myOrc") | Select the myOrc table. |
| 9 | =file(A8.location) | Open the selected table’s file in HDFS. |
| 10 | =A9.orc(;10) | Retrieve data from A9’s file and return the first 10 records only. |
| 11 | =A9.orc@cm() | Retrieve data using multiple cursors. |
| 12 | =A11.fetch(10) | Fetch the first 10 records from the cursor; same result as A10. |
| 13 | =hive_close(A6) | Close the Hive connection. |