Spider Web App - PDF Analysis with Neo4j AI Assisted
Description:
This one started as an exploration into graph databases and ended up being a pretty neat little desktop tool. The idea was simple enough - take a PDF, pull out the text and metadata, and store it all in a Neo4j graph database so you could start building relationships between documents over time. The kind of thing that could eventually turn into a personal knowledge graph.
The app is built with JavaFX so it has a real GUI - you click a button, pick a PDF from your file system, and it handles the extraction and storage automatically. Apache PDFBox does the heavy lifting for parsing the PDF content, and Neo4j runs in embedded mode so there's no separate database server to set up.
Tool kit
- JavaFX 21 - GUI framework
- Apache PDFBox 3.0 - PDF text extraction
- Neo4j Java Driver 5.14 - Graph database (embedded)
- Maven - Build tool
Cost to setup
Total Cost: $0 - all open source libraries