[Humanist] 29.198 end of digital humanities? parsing bibliographical reference lists?

Humanist Discussion Group willard.mccarty at mccarty.org.uk
Sun Aug 9 08:31:34 CEST 2015


                 Humanist Discussion Group, Vol. 29, No. 198.
            Department of Digital Humanities, King's College London
                       www.digitalhumanities.org/humanist
                Submit to: humanist at lists.digitalhumanities.org

  [1]   From:    Amir Simantov <wawina at gmail.com>                          (45)
        Subject: Parsing Bibliographic Reference Lists

  [2]   From:    Paul Fishwick <metaphorz at gmail.com>                       (14)
        Subject: the end?


--[1]------------------------------------------------------------------------
        Date: Tue, 28 Jul 2015 07:42:04 -0500
        From: Amir Simantov <wawina at gmail.com>
        Subject: Parsing Bibliographic Reference Lists


Dear scholars and information technologists,

I am a software developer, and I am currently looking for a tool or library
to parse bibliographic reference lists for a client of mine.

MY TASK

I need to import data from a website of static HTML pages into Drupal,
the content management system I use most often. Part of the data consists
of reference lists. I need to parse each reference into its metadata parts
(author, book title, journal, pages, etc.) according to its type
(article, book, etc.). An example of a page containing reference lists to be
parsed can be found here:
 http://faculty.washington.edu/kpotter/xtxt1.htm
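
To make the task concrete, here is a minimal sketch in Python of the kind of
parse described above. It is only an illustration under stated assumptions:
the citation layout "Author(s) (Year). Title. Journal Volume: Pages." is a
guess, not the format actually used on the page, and the sample reference,
the ARTICLE pattern and the parse_reference function are hypothetical names
made up for this example.

```python
import re

# Hypothetical pattern for ONE assumed citation layout:
#   Author(s) (Year). Title. Journal Volume: Pages.
# Real reference lists mix several layouts, so a real tool needs more than this.
ARTICLE = re.compile(
    r"^(?P<authors>.+?)\s+\((?P<year>\d{4})\)\.\s+"
    r"(?P<title>.+?)\.\s+"
    r"(?P<journal>.+?)\s+(?P<volume>\d+):\s*(?P<pages>\d+-\d+)\.?$"
)


def parse_reference(text):
    """Return a dict of metadata fields, or None if the pattern does not match."""
    match = ARTICLE.match(text.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["type"] = "article"  # only the assumed article layout is handled
    return fields


# Made-up reference, used only to exercise the pattern.
sample = "Smith, J. (1999). A made-up title. Journal of Examples 12: 34-56."
print(parse_reference(sample))
```

A single regular expression like this breaks as soon as the references vary
in style, which is why a dedicated tool or library is being sought rather
than hand-written patterns.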

