Hello all; I've been with Gengo for some time now, but I think this may be my first time posting! (Sup!)
So on to the topic of the day: the new .docx text extraction capabilities.
I can respect the decision to automatically convert .docx file formats to workbench-compatible jobs, even if I, personally, prefer working in .doc formats. (I find it useful to go back and review some of my older work to see what I can improve on with the files stored on my computer.)
That being the case, I wonder if it makes more sense to adopt this sort of functionality to .pdf file formats? Oftentimes I'll take a .pdf job from a customer that almost necessitates rebuilding the entire thing (especially when dealing with pamphlets) due to the vertical/horizontal compatibility issues that come with the JP-ENG language pair.
Even if it's a pipe-dream, I would almost prefer a function that solely extracts the text from a PDF to eliminate the need for finicky formatting updates. When we're dealing with the expectation of quick turn-arounds (especially in the Pro category), finagling with formatting can honestly either make or break meeting a deadline. We're translators here, not graphics designers! (Well, some of us. =D)
Anywho, any thoughts?