The Markdown source in this directory is published as part of the Apache
Cloudberry website at https://cloudberry.apache.org/docs. The site build is
owned by apache/cloudberry-site
(Docusaurus-based); this repository keeps the source-of-truth Markdown so
documentation can be edited alongside the PXF code it describes.
-
Files use plain Markdown with Docusaurus-flavoured frontmatter:
--- title: Reading and Writing HDFS Parquet Data description: Read and write Parquet data in HDFS via PXF. sidebar_position: 6 ---
-
Internal links use relative
.mdpaths (for example,./hdfs_text.mdor../administering/cfg_server.md#about-the-pxffsbasepath-property). Heading anchors are auto-generated by Docusaurus from the heading text. -
Images live under
graphics/and are referenced with relative paths (for example,../graphics/pxfarch.png). -
The directory layout mirrors the website sidebar. Each sub-directory carries a
_category_.jsondescribing its label and position; new categories should follow the same pattern.
Clone apache/cloudberry-site, point its sync script at this checkout, and run
npm start. See the cloudberry-site repository for details.