Is there a way to view the schema of a graph in a Neptune cluster using Jupyter Notebook?
Just as you would run "select * from tablename limit 10" against an RDS database using SQL, is there a way to get a sense of the graph data through Jupyter Notebook?
How well this performs depends on how large your graph is, but you can get a sense of the types of nodes and edges you have by counting them by label. From the tags you used, I assume you are using Gremlin.
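A minimal sketch of such label counts in Gremlin, assuming the traversal source is bound to `g` (in a Neptune notebook each query would typically go in its own cell starting with the `%%gremlin` magic; the comments are only for the reader):

```groovy
// Count vertices grouped by their label
g.V().groupCount().by(label)

// Count edges grouped by their label
g.E().groupCount().by(label)
```

Each query returns a map from label to the number of vertices or edges carrying it.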
If you have a very large graph, try putting something like `limit(100000)` before the `groupCount` step, for example `g.V().limit(100000).groupCount().by(label)`.
If you are using a programming language like Python (with the gremlinpython package installed), you will need to add a `next()` terminal step to the queries, as in `g.V().groupCount().by(T.label).next()` (where `T` is imported from `gremlin_python.process.traversal`).
Having found the labels and their distribution, you could pick one of them and explore some of its properties. Let's imagine there is a label called "person".
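For instance, the first few "person" vertices and their properties could be inspected with something like the following. The label and the limit of 10 are only illustrative:

```groovy
// Show the property maps of up to 10 vertices labelled 'person'
g.V().hasLabel('person').limit(10).valueMap()
```

From Python, a terminal step such as `toList()` would be appended to actually fetch the results.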
Remember that in a property graph, vertices with the same label do not necessarily all have the same properties, so it's good to look at more than one vertex to get a sense of that as well.
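One way to check this, sketched here under the same assumed "person" label and an arbitrary sample size of 1000, is to count how often each property key occurs across a sample of vertices:

```groovy
// For up to 1000 'person' vertices, count occurrences of each property key
g.V().hasLabel('person').limit(1000).properties().key().groupCount()
```

A key whose count is lower than the number of vertices sampled is missing from some of them. Multi-valued properties can push a count higher, so treat the numbers as a rough guide.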