Convert, process, and create documents — Word, Excel, PDF, PowerPoint, images, and more
Craft Agent includes built-in tools for working with common document formats. Convert between formats, extract text, merge files, create reports, and process images — all from within your conversation.
The most common operation — extract readable text from any document:
"Read this Word document and summarise it"
"What's in this Excel file?"
"Extract the text from this PDF"
The agent uses markitdown as a universal converter. It handles Word documents, Excel spreadsheets, PowerPoint presentations, PDFs, HTML files, and Jupyter notebooks — converting them all to readable Markdown.
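For readers who want to reproduce the conversion outside a conversation, here is a minimal sketch using the open-source markitdown Python package. It assumes the package is installed (`pip install markitdown`). How Craft Agent invokes markitdown internally is not described here, so the helper name `document_to_markdown` and its `converter` parameter are illustrative only.

```python
def document_to_markdown(path, converter=None):
    """Return the Markdown text for a document such as .docx, .xlsx or .pdf.

    `converter` is a hypothetical injection point: any object with a
    `convert(path)` method whose result has a `text_content` attribute.
    """
    if converter is None:
        # Imported lazily so this module loads even without markitdown installed.
        from markitdown import MarkItDown
        converter = MarkItDown()
    result = converter.convert(str(path))
    return result.text_content
```

Calling `document_to_markdown("report.docx")` would return the document body as a Markdown string that can be summarised or searched like any other text.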
"What events are in this .ics file?"
"Read my exported calendar"
The ical-tool parses .ics calendar files and extracts event details (title, date, time, location, attendees).
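To show roughly what extracting those details involves, the sketch below parses event fields from iCalendar text using only the Python standard library. It is not the ical-tool's implementation. The function names and output keys are assumptions, and it ignores edge cases such as quoted parameter values that contain colons.

```python
# Map iCalendar property names to the (assumed) output keys.
FIELDS = {"SUMMARY": "title", "DTSTART": "start", "DTEND": "end", "LOCATION": "location"}

def unfold(text):
    """Join folded lines: a line starting with a space or tab continues the previous one."""
    lines = []
    for raw in text.splitlines():
        if raw[:1] in (" ", "\t") and lines:
            lines[-1] += raw[1:]
        else:
            lines.append(raw)
    return lines

def parse_events(text):
    """Return one dict per VEVENT block found in the calendar text."""
    events, current = [], None
    for line in unfold(text):
        if line == "BEGIN:VEVENT":
            current = {"attendees": []}
        elif line == "END:VEVENT" and current is not None:
            events.append(current)
            current = None
        elif current is not None and ":" in line:
            head, _, value = line.partition(":")
            name = head.split(";", 1)[0].upper()  # drop parameters such as ;TZID=...
            if name == "ATTENDEE":
                if value.lower().startswith("mailto:"):
                    value = value[len("mailto:"):]
                current["attendees"].append(value)
            elif name in FIELDS:
                current[FIELDS[name]] = value
    return events

SAMPLE = "\r\n".join([
    "BEGIN:VCALENDAR",
    "BEGIN:VEVENT",
    "SUMMARY:Quarterly",
    "  review",  # folded continuation of the SUMMARY line
    "DTSTART;TZID=Europe/London:20240305T090000",
    "LOCATION:Room 4",
    "ATTENDEE;CN=Ann:mailto:ann@example.com",
    "END:VEVENT",
    "END:VCALENDAR",
])

events = parse_events(SAMPLE)
```

Dates are left as raw iCalendar strings here. A fuller tool would also convert them to local times and expand recurring events.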
All document tools support --help for detailed usage information. If you’re curious about what a specific tool can do, just ask: “What can the pdf-tool do?”
What if the agent can't read a file directly?
If the agent encounters a binary file it can’t read natively (like .docx or .pptx), it automatically falls back to markitdown to convert the document to readable text. You don’t need to do anything — just ask the agent to read the file.
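The fallback can be pictured as "try plain text first, convert otherwise". The sketch below is one hypothetical way to express that idea. The UTF-8 check, the function names, and the `convert` callback are assumptions for illustration, not the agent's documented logic.

```python
from pathlib import Path
from typing import Callable, Optional

def text_or_none(data: bytes) -> Optional[str]:
    """Return the decoded text if the bytes are valid UTF-8, otherwise None."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        return None

def read_document(path: Path, convert: Callable[[Path], str]) -> str:
    """Read a file as text, falling back to a converter for binary formats.

    `convert` stands in for a markitdown-style converter that takes a path
    and returns Markdown text.
    """
    text = text_or_none(path.read_bytes())
    if text is not None:
        return text
    return convert(path)
```

A .docx or .pptx file is a ZIP archive, so it will normally fail the UTF-8 check and be routed to the converter, while a plain .txt or .md file is returned unchanged.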
Where do created files go?
When the agent creates or converts documents, the output files are saved in the current working directory (or a path you specify). The agent will tell you the exact file path after creation.
Can I chain operations?
Absolutely. You can ask for multi-step workflows like “Read this PDF, extract the financial data, and create an Excel spreadsheet with just the quarterly revenue figures.” The agent will use multiple tools in sequence.