I'm trying to write a set of Scheme macros to simplify building simple graph structures using a declarative style, but as a beginning Schemer I'm struggling with macro hygiene.
I have defined two record types :graph
and :node
:
(define-record-type :graph
(make-graph nodes links)
graph?
(nodes graph-nodes)
(links node-links))
(define-record-type :node
(make-node name pins)
node?
(name node-name)
(pins node-pins))
A pin
in my example is just a string (pin name), a link
simply a cons-pair of endpoint
, and and endpoint
a cons-pair of a node name and a pin-name.
What I would like to be able to do is to define macro's graph
, node
and link
that allow my to create graphs in a declarative style, like this:
(define g (graph
(node "v0"
(pin "out"))
(node "v1"
(pin "out"))
(node "add"
(pin "lhs")
(pin "rhs")
(pin "out"))
(link (("v0" . "out") . ("add" . "lhs"))
(link (("v1" . "out") . ("add" . "rhs"))))
The result of this expression would be a value of type :graph
, where (graph-nodes g)
would evaluate to a list of 3 :node
values, (graph-links g)
would evaluate to ((("v0"."out") . "add" . "lhs")) (("v1" . "out") . ("add" . "rhs")))
, and each of the 3 nodes would be :node
values with the pins
they were defined with.
An additional requirement would be that only the graph
macro would be defined at global scope, I would very much like to avoid having to define global macro's for anything I can pass as the body
of the graph
invocation. These macro's make only sense in the context of an expansion of the graph
macro, and I'd like to maintain the principle of 'smallest possible scope' for them, and also not litter the global scope with generic names like node
or pin
.
Obviously it would be relatively simple to simply create the graph directly using make-graph
and passing the nodes and links directly as lists. I would prefer to use macro's though, because it allows a more declarative style to define the graph: you don't need to remember the order of the arguments to pass in, you could mix the order of node/links declarations, extend the graph record with more additional fields without breaking the graph definition code, trivially transform the macro call to/from other data types (XML, JSN), etc.
I tried to solve this problem using nested macro's: a global graph
macro, which has a local node
macro.
For simplicity, I will leave out defining links or pins in this example, to extend the example for this, the let-syntax
would have an additional macro link
, and the macro for node
would itself have another nested macro called pin
(so let-syntax
for pin
inside let-syntax
for node
inside define-syntax
for graph
):
(define-syntax graph
(syntax-rules ()
((graph . body)
(let ((nodes '())
(links '()))
(let-syntax ((node
(syntax-rules ()
((node name))
(make-node name '()))))
(begin . body))
(make-graph nodes links)))))
(graph (node "v0") (node "v1"))
Unfortunately, this does not work, as node
will be unbound where body
is inserted in the expansion of the graph
macro. I think I understand why: since Scheme macro's are hygienic, any variables (pattern variables or locals) will be expanded/renamed to macro-local variables, so the node
macro defined by let-syntax
will not actually be called node
anymore after expansion, so the node
in the body
I passed into the graph
macro will not see it.
A very similar question has been asked here, and while I sort of understand the explanation there, I am not certain how to interpret the proposed solution/workaround.
If I read correctly, both solutions in the other question involve a) explicitly passing/matching references to any functions I would like to call from the body
passed into graph
(so in my case, node
, link
, pin
), or b) defining node
, link
, pin
as global macro's.
Solution a) is not attractive as it defeats the purpose of the macro (which is to define graphs in declarative style, with minimal syntax), and solution b) feels like it defeats the whole point of having hygienic macro's, because now I need to have macro's like node
, link
and pin
in the global scope, which are very likely to collide with code that wants to define things (local variables, symbols) with the same name.
Considering people have been able to do the craziest things creating DSL's using Scheme macro's, I'm almost certain that what I want to do should be possible some way or other. I've not been able to figure out yet, which makes me think I'm missing some fundamental part of the puzzle and/or have reasoned myself into some dark corner of Scheme (nested/recursive macro's) I should not have to be in because there are much simpler solutions to achieve what I want.
Edit: Based on Shawn's answer, I came up with the following:
(define-record-type :graph
(make-graph nodes links)
graph?
(nodes graph-nodes)
(links graph-links))
(define-record-type :node
(make-node name pins)
node?
(name node-name)
(pins node-pins))
(define-record-type :link
(make-link from to)
link?
(from link-from)
(to link-to))
(define-record-type :pin
(make-pin name)
pin?
(name pin-name))
(define-syntax graph
(lambda (x)
(define node-def? (lambda (def) (eq? (car def) 'node)))
(define link-def? (lambda (def) (eq? (car def) 'link)))
(define pin-def? (lambda (def) (eq? (car def) 'pin)))
(define (parse-pin-definition pin-def)
#`(make-pin (quote #,(cdr pin-def))))
(define (parse-node-definition node-def)
(let ((pins (map parse-pin-definition (filter pin-def? (cddr node-def)))))
#`(make-node (quote #,(cadr node-def)) (list #,@pins))))
(define (parse-link-definition link-def)
#`(make-link (quote #,(cadr link-def)) (quote #,(cddr link-def))))
(define (parse-graph-definition graph-def)
(let ((nodes '())
(links '()))
(map (lambda (def)
(cond
((node-def? def)
(set! nodes (cons (parse-node-definition def) nodes))))
(cond
((link-def? def)
(set! links (cons (parse-link-definition def) links)))))
graph-def)
(values nodes links)))
(syntax-case x ()
((graph . graph-def)
(let-values (((nodes links) (parse-graph-definition (syntax->datum #'graph-def))))
#`(make-graph (list #,@nodes) (list #,@links)))))))
(display (graph
(node "v0"
(pin "out"))
(node "v1"
(pin "out"))
(node "add"
(pin "lhs")
(pin "rhs")
(pin "sum"))
(link ("v0" . "out") ("add" . "lhs"))
(link ("v1" . "out") ("add" . "rhs"))))
This seems to work well and is pretty readable. It lacks any kind of error checking/handling but it illustrates well the concept of using syntax-case
for this as recommended by Shawn :)
I'm not sure if you can do what you want with
syntax-rules
- if you can it's probably something long and complicated and tedious to come up with, but by usingsyntax-case
to validate the structure and types of the arguments to a macro, it becomes fairly easy to do in a single macro.Note I changed the syntax of the
link
clauses fromto
which is simpler and easier to work with. A list of 2-element lists of pairs is also stored in the generated graph record. You might also consider using symbols instead of strings for all those identifiers, which is more idiomatic in lisp family languages.