Saxon has EXpath and thus expath-zip or expath-archive, respectively. These can be used to work on docx directly. Create Flat OPC from this and adapt other scripts to work on that hierarchy