Support stream Chunk reading in Python API

In the Python MCAP reader API, each chunk is read into memory in full before any of its records are processed. On memory-constrained systems, this often causes out-of-memory (OOM) errors.

Main reader loop: https://github.com/foxglove/mcap/blob/a68c76979d646fa90a24bff3457be2da0c371f0a/python/mcap/mcap/reader.py#L294-L318
Chunk data read line: https://github.com/foxglove/mcap/blob/0a06331980a47ef606cca06b6ee3fc987f5f2d52/python/mcap/mcap/records.py#L173-L173

	if isinstance(next_item, ChunkIndex):
	    self._stream.seek(next_item.chunk_start_offset + 1 + 8, io.SEEK_SET)
	    chunk = Chunk.read(ReadDataStream(self._stream))
	    for index, record in enumerate(
	        breakup_chunk(chunk, validate_crc=self._validate_crcs)
	    ):
	        if isinstance(record, Message):
	            channel = summary.channels[record.channel_id]
	            if topics is not None and channel.topic not in topics:
	                continue
	            if start_time is not None and record.log_time < start_time:
	                continue
	            if end_time is not None and record.log_time >= end_time:
	                continue
	            if channel.schema_id == 0:
	                schema = None
	            else:
	                schema = summary.schemas[channel.schema_id]
	            message_queue.push(
	                (
	                    (schema, channel, record),
	                    next_item.chunk_start_offset,
	                    index,
	                )
	            )
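
For discussion, here is a minimal sketch of what record-at-a-time reading could look like. It is not the library's implementation. It assumes only the MCAP record framing (a 1-byte opcode followed by a little-endian uint64 content length, which is what the `+ 1 + 8` in the seek above skips) and a file-like object that yields the chunk's uncompressed record bytes, for example a streaming decompressor wrapped around the chunk data. The names `read_exact`, `iter_chunk_records`, and `uncompressed_size` are hypothetical and are not part of the mcap package.

```python
import io
import struct
from typing import BinaryIO, Iterator, Tuple

# Record framing: 1-byte opcode, then a little-endian uint64 content length.
RECORD_HEADER = struct.Struct("<BQ")


def read_exact(stream: BinaryIO, size: int) -> bytes:
    """Read exactly `size` bytes; streaming readers may return short reads."""
    parts = []
    remaining = size
    while remaining > 0:
        part = stream.read(remaining)
        if not part:
            raise EOFError(f"stream ended with {remaining} bytes still expected")
        parts.append(part)
        remaining -= len(part)
    return b"".join(parts)


def iter_chunk_records(
    stream: BinaryIO, uncompressed_size: int
) -> Iterator[Tuple[int, bytes]]:
    """Hypothetical helper: yield (opcode, content) one record at a time.

    Only the current record is held in memory, rather than the whole chunk.
    """
    consumed = 0
    while consumed < uncompressed_size:
        header = read_exact(stream, RECORD_HEADER.size)
        opcode, length = RECORD_HEADER.unpack(header)
        content = read_exact(stream, length)
        consumed += RECORD_HEADER.size + length
        yield opcode, content


if __name__ == "__main__":
    # Two fake records standing in for a decompressed chunk body.
    body = RECORD_HEADER.pack(0x05, 3) + b"abc" + RECORD_HEADER.pack(0x05, 0)
    for opcode, content in iter_chunk_records(io.BytesIO(body), len(body)):
        print(hex(opcode), len(content))
```

With something like this, peak memory per chunk would be bounded by the largest single record plus the decompressor's internal buffers. CRC validation would presumably need to become incremental as well, since the checksum covers the whole uncompressed chunk body.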

Support stream Chunk reading in Python API #974
