No it should process any file in the unprocessed folder.
Hmmm. I see that they don’t have an H1, so I think they are just overwriting one file in the HTML Files?
As a fallback, is it possible to have a <title> group be used as the H1, if no H1 is found? This is really old HTML, so no H1s.
Here again the slug pulled from the filename would help, as well.
Thanks, much.
good analysis.
yes the plugin uses the first H1 to determine the title and it uses the title to dertime if it is new or an update.
specifically changing the slug to be based on filename would not actually solve it as such, as still need a title.
what it probably needs is a fallback, as you suggest, title may be a good shout.
You have come up with some good issues, but please bear in mind, as work in this is in my spare time and ideas or changes I may choose to corporate may take some time, especially as it is holiday season.
if you are hand with php then there is a development repo you can contribute pull requests to.
I’ve offered a couple of changes in my fork:
https://github.com/ejnwebmaster/load-html-files
They address the slug and the post title issue.
Thanks for your efforts!
sed -i 's/<title>/<h1>/g' *.html
sed -i 's/<\/title/<\/h1>/g' *.html
The above, executed in the appropriate “processed” folder, and then moving the files back into “unprocessed” fixes the issue of a missing H1. Slugs are still wrong.