Toolchain to preprocess google docs for import into Notion
The Notion import tool is quite fragile and does not support many file formats. uW Google docs have some unsupported formats. This package converts google docs to files that Notion imports. The google docs download retains the original folder structure but Notion does not.
This utility prefixes the filename with their path so that a notion user can recreate the original folder structure as Notion pages. The following file types are supported.
- .docx, ,pdf, .html, .md,
- .pptx, .pptm,
- .png, .jpg, .jpeg, .gif,
- .mp4, .3gp, .m4v,
- .odt, .xlsx
- Download your google docs
- Unzip the downloaded files into a folder like: sharedTraining
- Place the following tools into the folder containing your extracted files
- run rfn to preprocess the files
- sh ./ppg sharedTraining
- Follow the instructions at the end of the program to do the actual import
- rfn - Refactor File Names to retain folder structure
- pdf2html - convert PDFs to HTML
- ppg - PreProcess Google docs for Notion Import
- ppt2md - Convert PPTX files to Markdown
You will need root password to install these:
- Bash or WSL for execution environment
- soffice for Libre Office cli conversion of ODT documents to DOCX
These will be auto installed but you will still need root password
- zip
- pptx2md