we'll continue this project but I learnt from it that most of the openAPI specs are a mess with hundreds of endpoints, incomplete definitions, lack of relations between endpoints, unique constraints etc. so there's tons of heuristics needed anyway. but sometimes it works. and is quite amazning!
if you want to enforce schema and data contracts:
- you can use pydantic models to validate data (if you use pydantic model as a table definition, this is the default)
- we have soon-to-be-merged schema contract PR: https://github.com/dlt-hub/dlt/pull/594
My observations are that it is more than 1% of people that are fine with auto-generated schemas. But that could be selection bias (they use our library because they like it).