Universal document parser workflow for n8n that converts multiple document formats into plain text using the Google Drive API.
This workflow eliminates the need for separate parsing nodes for each file type (PDF, DOCX, TXT, etc.) by leveraging Google's document conversion engine.
The workflow can process:
- DOCX
- DOC
- Google Docs
- TXT
How it works:
- The input is the ID of a file stored in Google Drive.
- The workflow calls the Google Drive API to create a converted copy of the file as a Google Doc.
- Google Drive converts the file content automatically while making the copy.
- The workflow retrieves the converted document as plain text through the text/plain entry of its exportLinks field.
- Final output is clean plain text, ready for:
  - AI processing
  - summarization
  - embeddings
  - vector databases
  - automation workflows
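The copy-and-export calls described above can be sketched outside n8n as plain HTTP request descriptions. This is a minimal sketch, assuming the Drive API v3 endpoints and a valid OAuth access token. The helper names (`build_copy_request`, `build_metadata_request`, `pick_plain_text_link`) are hypothetical and not part of the workflow, and the workflow itself may use a different API version or request shape.

```python
import json
from urllib.parse import quote, urlencode

# Assumption: Drive API v3; the actual workflow may target another version.
DRIVE_API = "https://www.googleapis.com/drive/v3"
GOOGLE_DOC_MIME = "application/vnd.google-apps.document"


def build_copy_request(file_id: str, access_token: str) -> dict:
    """Describe a files.copy call that asks Drive to store the copy as a Google Doc."""
    return {
        "method": "POST",
        "url": f"{DRIVE_API}/files/{quote(file_id, safe='')}/copy",
        "headers": {
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
        # Setting a Google Docs MIME type on the copy requests the conversion.
        "body": json.dumps({"mimeType": GOOGLE_DOC_MIME}),
    }


def build_metadata_request(doc_id: str, access_token: str) -> dict:
    """Describe a files.get call that returns only the exportLinks field."""
    query = urlencode({"fields": "exportLinks"})
    return {
        "method": "GET",
        "url": f"{DRIVE_API}/files/{quote(doc_id, safe='')}?{query}",
        "headers": {"Authorization": f"Bearer {access_token}"},
        "body": None,
    }


def pick_plain_text_link(metadata: dict) -> str:
    """Return the text/plain export URL from a file metadata response."""
    links = metadata.get("exportLinks") or {}
    if "text/plain" not in links:
        raise ValueError("file metadata has no text/plain export link")
    return links["text/plain"]
```

Fetching the URL returned by `pick_plain_text_link` with the same bearer token yields the plain text. Keeping the request construction separate from the HTTP client makes each step easy to inspect before wiring it into an HTTP Request node.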
Main steps:
- Manual Trigger
- Set File ID
- Google Drive API – Copy with Convert
- Export Converted Document as Plain Text
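For readers who want to trace the data flow between these nodes, the steps can be sketched as one function with the HTTP layer injected. This is an illustrative sketch, not the workflow's actual node configuration: `parse_document` and the `send` callable are hypothetical names, the endpoints assume Drive API v3, and the metadata lookup is split out as its own call for clarity.

```python
from typing import Any, Callable, Optional

# Hypothetical transport: send(method, url, json_body) returns parsed JSON
# for API calls and the raw response text for the export download.
Send = Callable[[str, str, Optional[dict]], Any]

DRIVE_API = "https://www.googleapis.com/drive/v3"  # assumption: API v3
GOOGLE_DOC_MIME = "application/vnd.google-apps.document"


def parse_document(file_id: str, send: Send) -> str:
    """Copy a Drive file as a Google Doc and return its plain-text export."""
    # "Google Drive API - Copy with Convert": create the converted copy.
    copy = send("POST", f"{DRIVE_API}/files/{file_id}/copy",
                {"mimeType": GOOGLE_DOC_MIME})
    doc_id = copy["id"]

    # Look up the export links of the converted document.
    meta = send("GET", f"{DRIVE_API}/files/{doc_id}?fields=exportLinks", None)
    link = meta["exportLinks"]["text/plain"]

    # "Export Converted Document as Plain Text": download the text.
    text = send("GET", link, None)

    # Drop a leading byte-order mark, if present, and surrounding whitespace.
    return text.lstrip("\ufeff").strip()
```

In n8n the `send` role is played by an HTTP Request node using the configured Google OAuth credentials. Injecting it here lets the sequence be exercised with a stub instead of live API calls.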
Use cases:
- AI document summarization
- RAG pipelines
- Knowledge base ingestion
- Resume parsing
- Contract analysis
- Email/document automation
Requirements:
- n8n
- Google Drive API access
- Google OAuth credentials configured in n8n
Setup:
- Import the workflow JSON into n8n
- Configure Google OAuth credentials
- Update the file_id field
- Run the workflow
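When updating the file_id field, the ID usually has to be taken out of a Drive sharing link. A minimal sketch of that extraction follows, assuming the common link shapes (`/file/d/<id>/...`, `/document/d/<id>/...`, and `open?id=<id>`). The function name `extract_file_id` is hypothetical and not part of the workflow.

```python
import re
from urllib.parse import parse_qs, urlparse

# Matches the ID segment in paths such as /file/d/<id>/view or /document/d/<id>/edit.
_PATH_ID = re.compile(r"/d/([A-Za-z0-9_-]+)")


def extract_file_id(value: str) -> str:
    """Return the Drive file ID from a sharing link, or the value itself if it is a bare ID."""
    value = value.strip()
    parsed = urlparse(value)
    if not parsed.scheme:
        # No URL scheme: treat the input as an ID that was pasted directly.
        return value
    match = _PATH_ID.search(parsed.path)
    if match:
        return match.group(1)
    ids = parse_qs(parsed.query).get("id")
    if ids:
        return ids[0]
    raise ValueError(f"no Drive file ID found in {value!r}")
```

The returned string is what goes into the file_id field of the Set File ID node.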
Files:
- workflow.json: n8n workflow
- workflow.png: visual workflow architecture
Author:
Sanket Gawankar
AI Automation | n8n | Communication Platforms | Cisco | Zoom