|
|
|
|
|
by dalke
2223 days ago
|
|
While true (I make a living working with them), I don't think the OP is that interested in graphs which are that small. What makes molecular graphs a bit more challenging is that there are 1 billion or so "small" graphs to search in a very large database. There are also important issues in how to represent a molecule as a molecular graph. Concepts like stereochemistry are tricky to handle. "Search" here means subgraph isomorphism search. No one that I know of uses a graph isomorphism test to compare two structures. Instead, they convert the molecular graph into a string, where the atoms and bonds are in a canonical order - perhaps a SMILES string or InChI string - and use a string search. Graph canonicalization is therefore important for molecular search, but not for the "complicated graphs, say social networks" the OP mentioned. One specialized topic I worked on was the maximum common substructure problem. Given two molecules, what is the largest subgraph. Now extend that to N molecules. Now, find the largest subgraph which is in at least P% of those molecules. |
|