Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Abbreviations for over 38000 NCBI journal titles #8

Merged
merged 37 commits into from
Jun 8, 2021

Conversation

stroobandt
Copy link
Contributor

@stroobandt stroobandt commented Feb 22, 2021

Unfortunately, the early 2019 #6 PR by @retorquere went already stale, even before it ever got merged.

However, the current PR also fixes issue #4 and largely expands the amount of abbreviated NCBI journals to over 38000 by including not only the indexed journals, but also those that were referenced by NCBI.

JSON file extraction from XML is done in XSLT; which runs orders of magnitude faster than the previous interpreted Python script.

Finally, the use and contents of a number of different journal title abbreviation files is carefully documented.


NCBI Journal Title Abbreviations

The abbreviations of over 38000 journal titles referenced in the catalogue of the National Center for Biotechnology Information, part of the US National Library of Medicine, are provided in JSON format. This allows Pandoc and other programs to produce references with correctly abbreviated journal titles in accordance with the American Medical Association Manual of Style.

Topics not only cover the entire spectrum of biomedical and life sciences, but also include many related fields.

@retorquere
Copy link

@rmzelle I'm OK with this PR being merged instead of mine. I could use that as a starting point for prepping for npmjs releases.

@stroobandt
Copy link
Contributor Author

@retorquere Thank you for supporting this PR.
That is very kind of you.

@retorquere
Copy link

@rmzelle is it not desirable to have this transformation? I could really use it.

@rmzelle
Copy link
Member

rmzelle commented Jun 8, 2021

@stroobandt, @retorquere, apologies for the wait.

@adam3smith, looks fine to me.

@adam3smith adam3smith merged commit 1c1eb18 into citation-style-language:master Jun 8, 2021
@adam3smith
Copy link
Member

Thank you very much!

@retorquere
Copy link

The XML source files are all empty (zero bytes).

@stroobandt
Copy link
Contributor Author

@retorquere The XML input files are empty on purpose because of their file size and the fact that, because of this, these do not go down well with git. Moreover, these are not source code files, but rather input data files.

How to obtain a fresh copy of these XML input files, is explained in detail in the accompanying README.md.

The much smaller JSON output files are included. Most users only need these.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants