|
|
|
|
|
by sansseriff
712 days ago
|
|
I find it interesting that their entity extraction method for building a knowledge graph does not use or require one of the 'in-vogue' extraction libraries like instructor, Marvin, or Guardrails (all of which build off of pydantic). They just tell the llm to list graph nodes and edges in a list, and do some basic delimiter parsing, and load the result right into a networkx graph [1]. Is this because GPT-4 and the like have become very reliable at following specific formatting instructions, like a certain .json schema? It looks like they just provide in the prompt a number of examples that follow the schema they want [2]. [1] https://github.com/microsoft/graphrag/blob/main/graphrag/ind... [2] https://github.com/microsoft/graphrag/blob/main/graphrag/ind... |
|
[1] https://www.promptingguide.ai/techniques/fewshot