| "arbitrary layering and deep nesting are not good engineering practices" Perhaps, but that's irrelevant; I'm describing the difference in how hierarchical and relational models are designed. "I prefer to use hash-map as the table with the primary key hash index, with key as the primary key and val(colname-colval-hashmap) as the row content." And what if you need a second index? Your indexing should be separate from your data model, otherwise you can't write performant relational algebra operations that apply in the general case. "It can also be implemented very elegantly with clojure.core. using hash-map operation is simpler, clearer, smoother and high performance." No it can't. Suppose I have a relation with keys: a, b, c, d and e. I want to index on a, b and the pair (c, d). How would I do that in Clojure? What happens if I later decide I also want to index on e? This is the sort of problem that's trivial to solve in a relational database, and extremely hard in Clojure, because Clojure doesn't have the functions or data structures to support data modelled in this way. That's not to say that Clojure can't have these tools; just that they aren't built into clojure.core, because that's not what it's designed for. "set, vector, list is generally not a good default data container, only used when needed" Yes they are. Sets are the basis of relational algebra. You're complecting the ideas of data representation with data indexing. Sets are a good representation of a relation, but a poor index. We can get the best of both worlds by combining the two: (def a
(let [r1 {:a/id 1, :a/name "a1", :b/id 1}
r2 {:a/id 2, :a/name "a2", :b/id 1}]
{:relation #{r1 r2}
:index
{:a/id {1 #{r1}, 2 #{r2}}
:a/name {"a1" #{r1}, "a2" #{r2}}
:b/id {1 #{r1 r2}}}}))
A data structure like this allows us to start writing efficient relational algebra. For example, with a natural join we can look for the smallest index two relations have in common.So we can begin to construct the infrastructure we need to perform relational algebra in Clojure, but it's not there to begin with, and therefore Clojure isn't designed around the relational model. |
Implementing an RMDB is not equivalent to relational algebraic operations. I think it should be to use simple, direct and lightweight the relational logic model to solve real-world problems. Don't over-optimize, over-generalize and over-complicate, keep it simple and direct.
Just like we design a Database in RMDB to solve real-world projects, This is the normal way to use relational algebra and models. After the database design is complete, we don't need care how the index is implemented, and we don't need care the underlying storage of the data. I mean, clojure is used as RMDB , is not used as tool of construct RMDB .
Therefore, my method is to design the application-level data model. The problem you said does not exist. When I get the data from the database (or elsewhere), I simply transform the data to the target model, you can think of this model as a table or view, you don't need to transform again, so you don't need multiple indexes.