|
|
|
|
|
by MilStdJunkie
737 days ago
|
|
I got one. Say I gave it a corpus of structured[1] files that follow Schema X, then I gave it a pile of outputs (PDF, HTML) generated from that corpus, where StructureFileName.xml = StructuredFileName.pdf. Could you see this doohickey being able to take in just the PDF/HTML/Word output, then output its best guess at chucking that into a Schema X file? Pretty much everyone I work with are XML fetishists, and adore hard coded ontologies and taxonomies forged with many years of blood and sweat. I'm a bit more pragmatic and technology-minded. Even before AI I was pretty sure that using Python ML to generate a graph of keywords was a hell of a lot more useful than handcrafted ontology - doesn't cost hundreds of thousands of dollars in billable hours either. Now, with this stuff, we can get around hard coding all that structure itself, and maybe have source documents that normal people can read without about five zeroes worth of bespoke tools. [1] And when I say "structured" I mean *completely frickin bananas". |
|
If not we could easily fine tune a custom model for this task, particularly if you already have a bunch of input/outputs.
Can you send me an email at support [at] endtype.com?