feat: Initial Zip file and OME-Zarr Archive (RFC-9) support #306

mkitti · 2026-01-28T08:19:44Z

Purpose

Support OME-Zarr zip archives (RFC-9). I am an author of OME-Zarr RFC-9. Zip-based Zarr stores are widespread and supported by a number of Zarr implementations.

Notably, Neuroglancer implements a zip file key-value store.

Background

Zip files are documented by the PKWARE APPNOTE. A zip file contains local file entries followed by a central directory at the end of the file. While the local file entries at the beginning of the file could be read in a streaming fashion, it is often necessary to read the central directory at the end of the file first. For example, duplicate local file entries may exist earlier in the archive, and only the central directory can correctly indicate which one is the latest version.
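To illustrate why a reader starts at the end of the file, here is a minimal sketch that locates the end of central directory record and extracts the entry count, the central directory offset, and the archive comment. The function name is hypothetical and is not taken from this pull request. The sketch assumes a single-disk archive without ZIP64 extensions, and it takes the last occurrence of the signature, which can be wrong if the comment itself contains those bytes.

```python
import io
import struct
import zipfile

EOCD_SIGNATURE = b"PK\x05\x06"
EOCD_FIXED_SIZE = 22        # size of the record before the variable-length comment
MAX_COMMENT_SIZE = 0xFFFF   # the comment length is stored in a 16-bit field


def read_end_of_central_directory(f):
    """Return (entry_count, cd_size, cd_offset, comment) from a seekable binary file.

    Hypothetical helper: no ZIP64 support, single-disk archives only.
    """
    f.seek(0, io.SEEK_END)
    file_size = f.tell()
    # The record sits within the last 22 + 65535 bytes of the file.
    tail_size = min(file_size, EOCD_FIXED_SIZE + MAX_COMMENT_SIZE)
    f.seek(file_size - tail_size)
    tail = f.read(tail_size)
    pos = tail.rfind(EOCD_SIGNATURE)
    if pos < 0:
        raise ValueError("end of central directory record not found")
    (_sig, _disk, _cd_disk, _disk_entries, entry_count,
     cd_size, cd_offset, comment_len) = struct.unpack_from("<4s4H2LH", tail, pos)
    start = pos + EOCD_FIXED_SIZE
    comment = tail[start:start + comment_len]
    return entry_count, cd_size, cd_offset, comment


# Build a small in-memory archive with the standard library and inspect it.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("zarr.json", "{}")
    zf.writestr("c/0", b"\x00" * 8)
    zf.comment = b"example comment"

entry_count, cd_size, cd_offset, comment = read_end_of_central_directory(buf)
```

Only the tail of the file is read here, so the cost does not depend on the size of the stored data.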

OME-Zarr RFC-9 proposes a standard way to store OME-Zarr datasets in zip files with an OZX extension. A particular challenge for Zarr in zip files is the large number of files, which may make the central directory more difficult to parse. The RFC makes a number of recommendations to minimize the number of files, such as using sharding.

Another recommendation by the RFC is to list zarr.json metadata first in the central directory. This allows an application like fileglancer, which is mainly concerned with parsing metadata, to quickly access the metadata without having to parse the entire central directory. It also allows the entire tree of the hierarchy to be elucidated. To facilitate detection of this optimization, JSON is stored in the comment of the zip, which occurs at the end of the central directory. This contains a jsonFirst flag to indicate that the writer of the file placed the JSON files first in the central directory. If this flag is true, a reader may assume that no further zarr.json files exist within the archive once the first non-zarr.json file is read.
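The stop condition can be sketched as follows. Both function names are hypothetical and are not the ones used in this pull request. Because the exact nesting of the comment JSON is not reproduced here, the sketch searches for a `jsonFirst` key at any depth instead of assuming a fixed path. The `"example"` wrapper key in the sample comment is made up for illustration.

```python
import json


def find_json_first(comment: bytes) -> bool:
    """Return True if the archive comment is JSON containing a true jsonFirst flag.

    Assumption: the key is looked up at any depth, not at the RFC's exact path.
    """
    try:
        doc = json.loads(comment.decode("utf-8"))
    except (UnicodeDecodeError, json.JSONDecodeError):
        return False

    def search(node):
        if isinstance(node, dict):
            if node.get("jsonFirst") is True:
                return True
            return any(search(value) for value in node.values())
        if isinstance(node, list):
            return any(search(value) for value in node)
        return False

    return search(doc)


def collect_metadata_names(names, json_first):
    """Collect zarr.json entry names, given names in central directory order.

    When json_first is true, stop at the first entry that is not a zarr.json.
    """
    found = []
    for name in names:
        if name == "zarr.json" or name.endswith("/zarr.json"):
            found.append(name)
        elif json_first:
            break
    return found


# Sample comment and entry order; the last entry violates the jsonFirst promise
# on purpose to show that the early stop skips it.
comment = b'{"example": {"jsonFirst": true}}'
names = ["zarr.json", "0/zarr.json", "0/c/0/0", "late/zarr.json"]

with_flag = collect_metadata_names(names, find_json_first(comment))
without_flag = collect_metadata_names(names, find_json_first(b""))
```

With the flag set, the scan ends at `0/c/0/0`. Without it, every entry has to be visited.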

Design

While Python does provide the zipfile module in its standard library, a key performance optimization here is the ability to read a partial central directory. While browsing, only the metadata may be needed. As such, this pull request contains new code to read the central directory partially, which the standard library does not support.

If a need arises to stream the contents of the zip file for external applications, then some additional reading and caching of the central directory may be needed.

While I was initially working on this feature with OZX in mind, implementing generic zip file support also seemed useful. Thus the core implementation splits out generic zip support and then builds OME-Zarr support on top of that.
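A minimal sketch of partial reading, assuming the central directory offset and entry count are already known from the end of central directory record. The generator name is hypothetical and this is not the implementation in this pull request. It ignores ZIP64 extra fields and assumes nothing else moves the file position while it is being consumed.

```python
import io
import struct
import zipfile
from itertools import islice

# Fixed 46-byte part of a central directory file header.
CD_HEADER = struct.Struct("<4s4B4HL2L5H2L")
CD_SIGNATURE = b"PK\x01\x02"


def iter_central_directory(f, cd_offset, entry_count):
    """Lazily yield (name, compressed_size, uncompressed_size, local_header_offset).

    Hypothetical helper: entries are parsed only as the caller asks for them.
    """
    f.seek(cd_offset)
    for _ in range(entry_count):
        fields = CD_HEADER.unpack(f.read(CD_HEADER.size))
        if fields[0] != CD_SIGNATURE:
            raise ValueError("bad central directory header signature")
        flags = fields[5]
        name_len, extra_len, comment_len = fields[12:15]
        raw_name = f.read(name_len)
        # Skip the extra field and the per-entry comment.
        f.seek(extra_len + comment_len, io.SEEK_CUR)
        # Bit 11 of the general purpose flags marks UTF-8 names.
        encoding = "utf-8" if flags & 0x800 else "cp437"
        yield raw_name.decode(encoding), fields[10], fields[11], fields[18]


# Build a five-entry archive, then parse only the first two directory entries.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    for i in range(5):
        zf.writestr(f"entry{i}", b"x" * i)

# This archive has no comment, so the record is exactly the last 22 bytes.
data = buf.getvalue()
_, _, _, _, entry_count, _, cd_offset, _ = struct.unpack("<4s4H2LH", data[-22:])

first_two = list(islice(iter_central_directory(buf, cd_offset, entry_count), 2))
```

By contrast, `zipfile.ZipFile` reads the whole central directory when the archive is opened.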

Large archives with thousands of entries can be slow to load. This adds pagination support to load entries incrementally:

- Backend: /api/ozx-list now accepts offset/limit params and returns total_count, has_more for pagination
- Frontend: New useOzxFileEntriesInfiniteQuery hook with TanStack Query
- ZipBrowser: Shows "Load more" button and entry count progress

Initial load fetches 100 entries, with more loaded on demand.

Co-Authored-By: Claude Opus 4.5 <[email protected]>

mkitti · 2026-01-28T19:40:05Z

fileglancer/app.py

+            total_count = reader.cd_entries_count
+
+            # Parse entries up to offset + limit
+            reader.parse_central_directory(max_new_entries=offset + limit)


I think this should only be `limit`, given the caching and resumable parsing behavior, but I'm not completely sure whether the reader persists between requests.
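One way to make the call independent of whether the reader persists is to request only the entries still missing from the cache. The sketch below uses a hypothetical stand-in class, not the reader in this pull request, and it assumes that `max_new_entries` counts entries beyond those already cached.

```python
class ResumableReader:
    """Hypothetical stand-in for a reader that caches parsed entries."""

    def __init__(self, names):
        self._source = iter(names)
        self.cd_entries_count = len(names)
        self.entries = []

    def parse_central_directory(self, max_new_entries):
        """Parse at most max_new_entries entries beyond those already cached."""
        for _ in range(max(0, max_new_entries)):
            try:
                self.entries.append(next(self._source))
            except StopIteration:
                break


def list_page(reader, offset, limit):
    """Return one page, parsing only what the cache does not yet hold."""
    needed = offset + limit - len(reader.entries)
    reader.parse_central_directory(max_new_entries=needed)
    page = reader.entries[offset:offset + limit]
    return {
        "entries": page,
        "total_count": reader.cd_entries_count,
        "has_more": offset + len(page) < reader.cd_entries_count,
    }


names = [f"entry{i}" for i in range(250)]

# Reader reused across requests: the second call parses only the missing entries.
persistent = ResumableReader(names)
first_page = list_page(persistent, 0, 100)
last_page = list_page(persistent, 200, 100)

# Fresh reader per request: the same call still returns the same page.
fresh_last_page = list_page(ResumableReader(names), 200, 100)
```

If the reader is recreated on every request, `needed` equals `offset + limit`. If it persists and the client pages sequentially, it reduces to `limit`.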

mkitti and others added 8 commits January 28, 2026 01:49

Initial version of OZX support, with Claude

9e02239

Remove Zarr v2 support from OZX

33baf09

Factor out generic Zip support, improve stop condition

d9313d6

Update OZX implementation documentation

703e8a9

Add central directory caching

58fdc8d

Add frontend components to view within zip or ozx archives

f719b98

Rename generic ozx components to zip if not ozx specific

1a904d9

Fix JSON formatting for files in a OZX archive

11045d0
