Updates to the Trove newspapers section of GLAM Workbench – adding links to app-ified versions of some notebooks, & direct links to @mybinderteam for everything. If you work with @TroveAustralia newspapers you might find it useful.

Updates to the Trove newspapers section of GLAM Workbench – adding links to app-ified versions of some notebooks, & direct links to @mybinderteam for everything. If you work with @TroveAustralia newspapers you might find it useful.
NSW State Archives publishes a number of detailed indexes containing data manually extracted from their records. These provide additional entry points to the records, such as a person’s name, or a place. But they also provide useful data for analysis. However, to explore the index data we need to get it out of the web interface and into a form that can be easily downloaded and manipulated. I’ve created a series of Jupyter notebooks to harvest the all the indexes and save the data in a series of CSV-formatted files.
Visualising CV-detected column widths across 100 volumes (30,000+ pages) of Sydney Stock Exchange records from @TheANUArchives…
New in GLAM Workbench! Notebooks to harvest, index, analyse, and aggregate transcripts of speeches & interviews by Australian prime ministers. Plus links to harvested data and aggregated files. #dhhacks
I’ve updated my harvest of the PM Transcripts site — 22,814 XML files with transcripts of speeches, interviews, media releases etc by Australian Prime Ministers. Now with added @TurnbullMalcolm…
Repository includes an index of the files, and aggregations by PM. #dhhacks
Reorganising things a little at GLAM Workbench. @statelibrarynsw gets its own section. Hansard and @datagovau GLAM datasets now under ‘Australian government’. Making some space for further additions…
What’s that? You want MORE GLAM data? Well, I’ve started a list of sources for Australian GLAM data. Metadata, full text, images & more. Contributions welcome! #dhhacks
I’ve updated my harvest of GLAM datasets from data.gov.au. Now there’s 584 CSV files available for download! #dhhacks
I’ve put a copy of my article on using @TroveAustralia for digital research/play, written for the @HTANSW journal, up on my blog. #dhhacks
I’ve updated the list of orgs who have supported the digitisation of @TroveAustralia’s journals. As usual @statelibrarynsw leads the way, but great to see @dvaaus supporting online access.
Today I finished updating a harvest of all OCRd text available from Trove’s digitised journals. That’s about 7gb of text from 30,462 issues of 384 different journals — a fab corpus for text analysis! Here’s all the metadata, links, and harvesting code. @TDHASSN #dhhacks
Update time! Yesterday I updated my Trove digitised journals app to include all the exciting new titles added to Trove in the last few months. This includes ABC Weekly, Current Notes on International Affairs & much more. #dhhacks
A quick interactive view of newspaper articles in @TroveAustralia by state and year. Click on the bars or legend to filter by state. Jupyter notebook on its way… vega.github.io/editor/
Anyone who’s been to one of my Trove workshops will be pleased to know that the WWI effect is still evident when viewing the total number of @TroveAustralia newspaper articles by year. As is the copyright cliff of death…
So there are now almost twice as many newspaper articles in @TroveAustralia from NSW as there are from any other state. (cc @statelibrarynsw)
Well look at that! – a selection of my @TroveAustralia related Jupyter notebooks turned into simple apps using Voila and delivered via Heroku. Save complete newspaper articles as images, create thumbnails, or download pages! #dhhacks
Kicked off a new GLAM Workbench repository dedicated to @SLSA with a quick notebook hack to get higher res versions of digitised photos. #dhhacks
Search @TroveAustralia newspapers without leaving Twitter using the updated and enhanced @TroveNewsBot! After 6 years of regular tweeting, TroveNewsBot needed an upgrade. Check out all its new features, including article thumbnails, here. #dhhacks
Recent additions to the Trove Newspapers section of the GLAM Workbench: getting images from @TroveAustralia newspaper articles, and uploading article to @Omeka-S: glam-workbench.github.io/trove-new…
Want to upload @TroveAustralia newspaper articles to @Omeka-S to create an exhibition or populate a research database? This notebook collects article references from a search, a Trove list, or Zotero, & uploads metadata, images & PDF to your Omeka site. #dhhacks