-
Notifications
You must be signed in to change notification settings - Fork 195
Open
Description
Hello,
I tried to generate data contract from one GCS source file.
datacontract import --format parquet --source 'gs://some-server/news.parquet'
I get following error.
│
│ │
│ 88 │ if pyarrow.types.is_list(pyarrow_field.type): │
│ 89 │ │ return create_property(name=field_name, logical_type="array", physical_type="LIS │
│ 90 │ │
│ ❱ 91 │ raise DataContractException( │
│ 92 │ │ type="schema", │
│ 93 │ │ name="Parse parquet schema", │
│ 94 │ │ reason=f"{pyarrow_field.type} currently not supported.", │
│ │
│ ╭───────────────────────── locals ─────────────────────────╮ │
│ │ field_name = 'url' │ │
│ │ pyarrow_field = pyarrow.Field<url: large_string> │ │
│ ╰──────────────────────────────────────────────────────────╯ │
╰────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
DataContractException: Run operation failed: [schema] Parse parquet schema - None - ResultEnum.failed - large_string currently not supported. -
datacontract
Seems the tool is not supporting all Parqet types.
Metadata
Metadata
Assignees
Labels
No labels