package ipc
Import Path
github.com/apache/arrow-go/v18/arrow/ipc
Dependency Relation
imports 26 packages, and is imported by 4 packages
Involved Source Files
compression.go
endian_swap.go
file_reader.go
file_writer.go
ipc.go
message.go
metadata.go
reader.go
writer.go
Package-Level Type Names (total 12)
FileReader is an Arrow file reader.
Close cleans up resources used by the File.
Close does not close the underlying reader.
(*FileReader) NumDictionaries() int
(*FileReader) NumRecords() int
Read reads the current record batch from the underlying stream, returning the batch and an error, if any.
When the Reader reaches the end of the underlying stream, it returns (nil, io.EOF).
The returned record batch value is valid until the next call to Read.
Users need to call Retain on that RecordBatch to keep it valid for longer.
ReadAt reads the i-th record batch from the underlying stream, returning the batch and an error, if any.
Record returns the i-th record from the file.
The returned value is valid until the next call to Record.
Users need to call Retain on that Record to keep it valid for longer.
Deprecated: Use [RecordBatch] instead.
RecordAt returns the i-th record from the file. Ownership is transferred to the
caller, which must call Release() to free the memory. This method is safe to
call concurrently.
Deprecated: Use [RecordBatchAt] instead.
RecordBatch returns the i-th record batch from the file.
The returned value is valid until the next call to RecordBatch.
Users need to call Retain on that RecordBatch to keep it valid for longer.
RecordBatchAt returns the i-th record batch from the file. Ownership is transferred to the
caller, which must call Release() to free the memory. This method is safe to
call concurrently.
(*FileReader) Schema() *arrow.Schema
(*FileReader) Version() MetadataVersion
*FileReader : github.com/apache/arrow-go/v18/arrow/arrio.Reader
*FileReader : github.com/apache/arrow-go/v18/arrow/arrio.ReaderAt
*FileReader : github.com/prometheus/common/expfmt.Closer
*FileReader : io.Closer
func NewFileReader(r ReadAtSeeker, opts ...Option) (*FileReader, error)
func NewMappedFileReader(data []byte, opts ...Option) (*FileReader, error)
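A minimal sketch of opening an Arrow file and iterating its record batches; the file name is illustrative, and RecordBatchAt is assumed to return (arrow.RecordBatch, error):

    package main

    import (
        "fmt"
        "log"
        "os"

        "github.com/apache/arrow-go/v18/arrow/ipc"
    )

    func main() {
        f, err := os.Open("data.arrow") // *os.File satisfies ReadAtSeeker
        if err != nil {
            log.Fatal(err)
        }
        defer f.Close()

        fr, err := ipc.NewFileReader(f)
        if err != nil {
            log.Fatal(err)
        }
        defer fr.Close() // Close does not close f

        fmt.Println("schema:", fr.Schema())
        for i := 0; i < fr.NumRecords(); i++ {
            rec, err := fr.RecordBatchAt(i) // ownership transferred to the caller
            if err != nil {
                log.Fatal(err)
            }
            fmt.Println("rows:", rec.NumRows())
            rec.Release()
        }
    }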
FileWriter is an Arrow file writer.
(*FileWriter) Close() error
(*FileWriter) Write(rec arrow.RecordBatch) error
*FileWriter : github.com/apache/arrow-go/v18/arrow/arrio.Writer
*FileWriter : github.com/prometheus/common/expfmt.Closer
*FileWriter : io.Closer
func NewFileWriter(w io.Writer, opts ...Option) (*FileWriter, error)
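A minimal sketch of writing record batches in the Arrow file format; it assumes the schema and batches were built elsewhere (for example with array.NewRecordBuilder) and that the schema is supplied via WithSchema:

    import (
        "os"

        "github.com/apache/arrow-go/v18/arrow"
        "github.com/apache/arrow-go/v18/arrow/ipc"
    )

    // writeArrowFile writes recs to path in the Arrow file format.
    func writeArrowFile(path string, schema *arrow.Schema, recs []arrow.RecordBatch) error {
        f, err := os.Create(path)
        if err != nil {
            return err
        }
        defer f.Close()

        fw, err := ipc.NewFileWriter(f, ipc.WithSchema(schema))
        if err != nil {
            return err
        }
        for _, rec := range recs {
            if err := fw.Write(rec); err != nil {
                fw.Close()
                return err
            }
        }
        return fw.Close() // Close writes the file footer; required for a valid file
    }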
Message is an IPC message, including metadata and body.
(*Message) BodyLen() int64
Release decreases the reference count by 1.
Release may be called simultaneously from multiple goroutines.
When the reference count goes to zero, the memory is freed.
Retain increases the reference count by 1.
Retain may be called simultaneously from multiple goroutines.
(*Message) Type() MessageType
(*Message) Version() MetadataVersion
*Message : github.com/apache/arrow-go/v18/arrow/scalar.Releasable
func NewMessage(meta, body *memory.Buffer) *Message
func MessageReader.Message() (*Message, error)
MessageReader is an interface for reading messages from a stream.
( MessageReader) Message() (*Message, error)
( MessageReader) Release()
( MessageReader) Retain()
MessageReader : github.com/apache/arrow-go/v18/arrow/scalar.Releasable
func NewMessageReader(r io.Reader, opts ...Option) MessageReader
func NewReaderFromMessageReader(r MessageReader, opts ...Option) (reader *Reader, err error)
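A hedged sketch of draining raw IPC messages from a stream; it assumes Message returns io.EOF at end of stream and that the reader manages each message's lifetime:

    import (
        "fmt"
        "io"

        "github.com/apache/arrow-go/v18/arrow/ipc"
    )

    func drainMessages(r io.Reader) error {
        mr := ipc.NewMessageReader(r)
        defer mr.Release()
        for {
            msg, err := mr.Message()
            if err == io.EOF {
                return nil // end of stream
            }
            if err != nil {
                return err
            }
            // msg is assumed valid until the next call to Message;
            // call msg.Retain() to keep it alive longer.
            fmt.Println(msg.Type(), msg.BodyLen())
        }
    }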
MessageType represents the type of Message in an Arrow format.
( MessageType) String() string
MessageType : expvar.Var
MessageType : fmt.Stringer
func (*Message).Type() MessageType
const MessageDictionaryBatch
const MessageNone
const MessageRecordBatch
const MessageSchema
const MessageSparseTensor
const MessageTensor
MetadataVersion represents the Arrow metadata version.
( MetadataVersion) String() string
MetadataVersion : expvar.Var
MetadataVersion : fmt.Stringer
func (*FileReader).Version() MetadataVersion
func (*Message).Version() MetadataVersion
const MetadataV1
const MetadataV2
const MetadataV3
const MetadataV4
const MetadataV5
Option is a functional option to configure opening or creating Arrow files
and streams.
func WithAllocator(mem memory.Allocator) Option
func WithCompressConcurrency(n int) Option
func WithDelayReadSchema(v bool) Option
func WithDictionaryDeltas(v bool) Option
func WithEnsureNativeEndian(v bool) Option
func WithFooterOffset(offset int64) Option
func WithLZ4() Option
func WithMinSpaceSavings(savings float64) Option
func WithSchema(schema *arrow.Schema) Option
func WithZstd() Option
func GetRecordBatchPayload(batch arrow.RecordBatch, opts ...Option) (Payload, error)
func NewFileReader(r ReadAtSeeker, opts ...Option) (*FileReader, error)
func NewFileWriter(w io.Writer, opts ...Option) (*FileWriter, error)
func NewMappedFileReader(data []byte, opts ...Option) (*FileReader, error)
func NewMessageReader(r io.Reader, opts ...Option) MessageReader
func NewReader(r io.Reader, opts ...Option) (*Reader, error)
func NewReaderFromMessageReader(r MessageReader, opts ...Option) (reader *Reader, err error)
func NewWriter(w io.Writer, opts ...Option) *Writer
func NewWriterWithPayloadWriter(pw PayloadWriter, opts ...Option) *Writer
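A hedged sketch of composing several of these options when constructing a stream writer; the helper name is hypothetical, and the schema is assumed to exist:

    import (
        "io"

        "github.com/apache/arrow-go/v18/arrow"
        "github.com/apache/arrow-go/v18/arrow/ipc"
        "github.com/apache/arrow-go/v18/arrow/memory"
    )

    // newCompressedWriter is a hypothetical helper composing options.
    func newCompressedWriter(w io.Writer, schema *arrow.Schema) *ipc.Writer {
        return ipc.NewWriter(w,
            ipc.WithSchema(schema),
            ipc.WithAllocator(memory.DefaultAllocator),
            ipc.WithZstd(),                 // ZSTD-compress body buffers
            ipc.WithCompressConcurrency(4), // compress on up to 4 goroutines
            ipc.WithMinSpaceSavings(0.1),   // skip compression saving < 10%
        )
    }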
Payload is the underlying message object that is passed to the payload writer
for actually writing out IPC messages.
Meta returns the buffer containing the metadata for this payload;
callers must call Release on the buffer.
(*Payload) Release()
SerializeBody serializes the body buffers and writes them to the provided
writer.
WritePayload serializes the payload in IPC format
into the provided writer.
func GetRecordBatchPayload(batch arrow.RecordBatch, opts ...Option) (Payload, error)
func GetSchemaPayload(schema *arrow.Schema, mem memory.Allocator) Payload
func PayloadWriter.WritePayload(Payload) error
PayloadWriter is an interface for injecting a custom payload writer,
allowing the Writer object to be reused in other scenarios,
such as with Flight data.
( PayloadWriter) Close() error
( PayloadWriter) Start() error
( PayloadWriter) WritePayload(Payload) error
PayloadWriter : github.com/prometheus/common/expfmt.Closer
PayloadWriter : io.Closer
func NewWriterWithPayloadWriter(pw PayloadWriter, opts ...Option) *Writer
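A hedged sketch of a custom PayloadWriter: a hypothetical sink that counts payloads and their metadata bytes rather than writing them anywhere; only methods documented above are used:

    import (
        "github.com/apache/arrow-go/v18/arrow/ipc"
    )

    // countingPayloadWriter is a hypothetical PayloadWriter that discards
    // payload bodies and records only counts and metadata sizes.
    type countingPayloadWriter struct {
        payloads  int
        metaBytes int
    }

    func (c *countingPayloadWriter) Start() error { return nil }

    func (c *countingPayloadWriter) WritePayload(p ipc.Payload) error {
        meta := p.Meta() // callers must Release the returned buffer
        c.metaBytes += meta.Len()
        meta.Release()
        c.payloads++
        return nil
    }

    func (c *countingPayloadWriter) Close() error { return nil }

Such a sink could be installed via ipc.NewWriterWithPayloadWriter(&countingPayloadWriter{}, ipc.WithSchema(schema)) to estimate the size of a stream, assuming the Writer drives Start, WritePayload, and Close in that order.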
ReadAtSeeker is an interface combining io.Reader, io.ReaderAt, and io.Seeker.
( ReadAtSeeker) Read(p []byte) (n int, err error)
( ReadAtSeeker) ReadAt(p []byte, off int64) (n int, err error)
( ReadAtSeeker) Seek(offset int64, whence int) (int64, error)
github.com/apache/arrow-go/v18/internal/utils.Reader (interface)
github.com/coreos/etcd/pkg/fileutil.LockedFile
*github.com/klauspost/compress/s2.ReadSeeker
*github.com/polarsignals/wal/fs.File
*bytes.Reader
*io.SectionReader
mime/multipart.File (interface)
*os.File
*strings.Reader
ReadAtSeeker : github.com/apache/arrow-go/v18/internal/utils.Reader
ReadAtSeeker : github.com/apache/arrow-go/v18/parquet.ReaderAtSeeker
ReadAtSeeker : io.Reader
ReadAtSeeker : io.ReaderAt
ReadAtSeeker : io.ReadSeeker
ReadAtSeeker : io.Seeker
func NewFileReader(r ReadAtSeeker, opts ...Option) (*FileReader, error)
Reader reads records from an io.Reader.
Reader expects a schema (plus any dictionaries) as the first messages
in the stream, followed by records.
Err returns the last error encountered during the iteration over the
underlying stream.
Next returns whether a RecordBatch could be extracted from the underlying stream.
Read reads the current record batch from the underlying stream, returning the batch and an error, if any.
When the Reader reaches the end of the underlying stream, it returns (nil, io.EOF).
Record returns the current record that has been extracted from the
underlying stream.
It is valid until the next call to Next.
Deprecated: Use [RecordBatch] instead.
RecordBatch returns the current record batch that has been extracted from the
underlying stream.
It is valid until the next call to Next.
Release decreases the reference count by 1.
When the reference count goes to zero, the memory is freed.
Release may be called simultaneously from multiple goroutines.
Retain increases the reference count by 1.
Retain may be called simultaneously from multiple goroutines.
(*Reader) Schema() *arrow.Schema
*Reader : github.com/apache/arrow-go/v18/arrow/array.RecordReader
*Reader : github.com/apache/arrow-go/v18/arrow/arrio.Reader
*Reader : github.com/apache/arrow-go/v18/arrow/compute/exec.ArrayIter[bool]
*Reader : github.com/apache/arrow-go/v18/arrow/scalar.Releasable
func NewReader(r io.Reader, opts ...Option) (*Reader, error)
func NewReaderFromMessageReader(r MessageReader, opts ...Option) (reader *Reader, err error)
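A minimal sketch of the streaming read loop using Next, RecordBatch, and Err; whether Err reports io.EOF at a normal end of stream is not documented here, so the check below tolerates it:

    import (
        "fmt"
        "io"

        "github.com/apache/arrow-go/v18/arrow/ipc"
    )

    func readStream(r io.Reader) error {
        rr, err := ipc.NewReader(r)
        if err != nil {
            return err
        }
        defer rr.Release()

        for rr.Next() {
            rec := rr.RecordBatch() // valid only until the next call to Next
            fmt.Println("rows:", rec.NumRows())
        }
        if err := rr.Err(); err != nil && err != io.EOF {
            return err
        }
        return nil
    }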
Writer is an Arrow stream writer.
(*Writer) Close() error
(*Writer) Write(rec arrow.RecordBatch) (err error)
*Writer : github.com/apache/arrow-go/v18/arrow/arrio.Writer
*Writer : github.com/prometheus/common/expfmt.Closer
*Writer : io.Closer
func NewWriter(w io.Writer, opts ...Option) *Writer
func NewWriterWithPayloadWriter(pw PayloadWriter, opts ...Option) *Writer
Package-Level Functions (total 21)
GetRecordBatchPayload produces the ipc payload for a given record batch.
The resulting payload itself must be released by the caller via the Release
method after it is no longer needed.
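A hedged sketch of producing and releasing a payload directly; the helper name is hypothetical:

    import (
        "github.com/apache/arrow-go/v18/arrow"
        "github.com/apache/arrow-go/v18/arrow/ipc"
    )

    // sendBatch is a hypothetical helper forwarding one batch as a payload.
    func sendBatch(pw ipc.PayloadWriter, rec arrow.RecordBatch) error {
        payload, err := ipc.GetRecordBatchPayload(rec, ipc.WithZstd())
        if err != nil {
            return err
        }
        defer payload.Release() // the caller of GetRecordBatchPayload must release
        return pw.WritePayload(payload)
    }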
GetSchemaPayload produces the ipc payload for a given schema.
NewFileReader opens an Arrow file using the provided reader r.
NewFileWriter opens an Arrow file using the provided writer w.
NewMappedFileReader is like NewFileReader but instead of using a ReadAtSeeker,
which will force copies through the Read/ReadAt methods, it uses a byte slice
and pulls slices directly from the data. This is useful specifically when
dealing with mmapped data so that you can lazily load the buffers and avoid
extraneous copies. The slices used for the record column buffers will simply
reference the existing data instead of performing copies via ReadAt/Read.
For example, syscall.Mmap returns a byte slice which could be referencing
a shared memory region or otherwise a memory-mapped file.
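A hedged sketch of pairing syscall.Mmap with NewMappedFileReader on Linux; the helper name is hypothetical, and the caller must unmap only after the reader and all returned records are released:

    import (
        "os"
        "syscall"

        "github.com/apache/arrow-go/v18/arrow/ipc"
    )

    // openMapped is a hypothetical helper returning a reader whose record
    // buffers reference the mapping directly, with no copies.
    func openMapped(path string) (*ipc.FileReader, []byte, error) {
        f, err := os.Open(path)
        if err != nil {
            return nil, nil, err
        }
        defer f.Close() // the mapping remains valid after the fd is closed

        st, err := f.Stat()
        if err != nil {
            return nil, nil, err
        }
        data, err := syscall.Mmap(int(f.Fd()), 0, int(st.Size()),
            syscall.PROT_READ, syscall.MAP_SHARED)
        if err != nil {
            return nil, nil, err
        }
        fr, err := ipc.NewMappedFileReader(data)
        if err != nil {
            syscall.Munmap(data)
            return nil, nil, err
        }
        return fr, data, nil // caller: fr.Close(), then syscall.Munmap(data)
    }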
NewMessage creates a new message from the metadata and body buffers.
NewMessage panics if any of these buffers is nil.
NewMessageReader returns a reader that reads messages from an input stream.
NewReader returns a reader that reads records from an input stream.
NewReaderFromMessageReader constructs a new reader object from the provided
MessageReader, allowing messages to be read by means other than simple byte
streaming, such as with Arrow Flight, which receives protobuf messages.
NewWriter returns a writer that writes records to the provided output stream.
NewWriterWithPayloadWriter constructs a writer with the provided payload writer
instead of the default stream payload writer. This makes the writer reusable
in other contexts, such as by the Arrow Flight writer.
WithAllocator specifies the Arrow memory allocator used while building records.
WithCompressConcurrency specifies the number of goroutines to spin up for
concurrent compression of the body buffers when writing compressed IPC records.
If n <= 1, compression is done serially, without goroutine
parallelization. Default is 1.
WithDelayReadSchema alters the ipc.Reader behavior to delay attempting
to read the schema from the stream until the first call to Next, instead
of immediately attempting to read a schema from the stream when created.
WithDictionaryDeltas specifies whether or not to emit dictionary deltas.
WithEnsureNativeEndian specifies whether or not to automatically byte-swap
buffers with endian-sensitive data if the schema's endianness is not the
platform-native endianness. This includes all numeric types, temporal types,
decimal types, as well as the offset buffers of variable-sized binary and
list-like types.
This is only relevant to ipc Reader objects, not to writers. This defaults
to true.
WithLZ4 tells the writer to use LZ4 Frame compression on the data
buffers before writing. Requires Arrow >= 1.0.0 to read/decompress.
WithMinSpaceSavings specifies the minimum fraction of space savings that
compression must achieve for it to be applied to a buffer.
Space savings is calculated as (1.0 - compressedSize / uncompressedSize).
For example, if minSpaceSavings = 0.1, a 100-byte body buffer won't
undergo compression if its expected compressed size exceeds 90 bytes.
If this option is unset, compression will be used indiscriminately. If
no codec was supplied, this option is ignored.
Values outside of the range [0,1] are handled as errors.
Note that enabling this option may result in unreadable data for Arrow
Go and C++ versions prior to 12.0.0.
WithSchema specifies the Arrow schema to be used for reading or writing.
WithZstd tells the writer to use ZSTD compression on the data
buffers before writing. Requires Arrow >= 1.0.0 to read/decompress.
Package-Level Variables (only one)
var Magic []byte
Magic string identifying an Apache Arrow file.
Package-Level Constants (total 13)
Constants for the extension type metadata keys, for the type name and for
any extension metadata to be passed to deserialize.
const ExtensionMetadataKeyName = "ARROW:extension:metadata"
const ExtensionTypeKeyName = "ARROW:extension:name"
const MessageDictionaryBatch MessageType = 2
const MessageNone MessageType = 0
const MessageRecordBatch MessageType = 3
const MessageSchema MessageType = 1
const MessageSparseTensor MessageType = 5
const MessageTensor MessageType = 4
const MetadataV1 MetadataVersion = 0 // version for Arrow Format-0.1.0
const MetadataV2 MetadataVersion = 1 // version for Arrow Format-0.2.0
const MetadataV3 MetadataVersion = 2 // version for Arrow Format-0.3.0 to 0.7.1
const MetadataV4 MetadataVersion = 3 // version for >= Arrow Format-0.8.0
const MetadataV5 MetadataVersion = 4 // version for >= Arrow Format-1.0.0, backward compatible with v4