This is a gentle reminder that if you are NIH funded that you must comply with the NIH Public Access policy when publishing and citing your NIH funded work and deposit your peer-reviewed manuscripts into PubMed Central.
Nature Publishing Group will be archiving for authors for free into PubMed Central. Read their press release here.
Related to this topic is my personal opinion on full-text data mining for natural language processing or other purposes. I’m personally shocked that only a small fraction of the PubMed Central corpus is available under an open access policy which allows for distribution of the papers. It seems to me that if the entire world is freely able to read the works, then we should be able to mine the words to see if the articles are worth reading. Aren’t the words, in essence, the data from which we run our NLP processes? And if we are struggling to cope with understanding millions of articles in the biomedical literature, shouldn’t there be a better way of discovering which articles are, in fact, truly meaningful to us? Something to think about…