Tim Sherratt - Sharing recent updates and work-in-progress

03 Jan 2024

National Archives of Australia in 2023 – digitisation of files

In 2023 the National Archives of Australia digitised 416,602 files (down from 575,597 in 2022). This chart shows the number of files digitised per day in 2023. These files were drawn from 1,423 different series, but the vast bulk (81%) were...
02 Jan 2024

Trove newspapers in 2023

I’ve been capturing weekly snapshots of the Trove newspaper corpus for the last couple of years. You can see the latest results in the Trove Newspaper Data Dashboard. Using this data I’ve compiled a quick summary of changes over the last ye...
15 Sep 2023

Trove Data Guide update – accessing data from newspapers and gazettes

I’m continuing to slog away at the Trove Data Guide (part of the ARDC’s HASS Community Data Lab) – dumping everything I know about Trove into a format that I hope will be useful for researchers. I’ve just finished a first pass through the s...
31 Aug 2023

Some important updates for the Trove Newspaper & Gazette Harvester

Version 3 of the Trove API is out, and version 2 is scheduled to be decommissioned in early 2024 – that means I have a lot of code to update! First cab off the rank is the Trove Newspaper & Gazette Harvester with version 0.7.1 now available....
31 Aug 2023

Run GLAM Workbench notebooks on the ARDC’s new Binder service

There are a number of different ways to run the Jupyter notebooks in the GLAM Workbench depending on your needs and technical skills. But the easiest and quickest has always been the public, international Binder service, based in Europe. On...
26 Aug 2023

Trove Query Parser updated!

I’ve just updated the Trove Query Parser to work with version 3 of the Trove API. You just give it the URL of a search in Trove’s newspapers, and it translates the search into a set of parameters that the API will understand. So this: parse...
18 Aug 2023

Family history resources in the GLAM Workbench

It’s Family History Month, so I thought a brief post was in order describing some of the family history related resources in the GLAM Workbench. GLAM Name Index Search This is the biggie (in more ways than one). I’ve brought 263 datasets fr...
17 Aug 2023

Bye bye birdsite

In early June I pinned a “nobody’s home” post to my profile and said goodbye to Twitter. After 15 years, I was sad to leave behind friends and colleagues, but glad to get away from the hate, the nazis, and the transphobes. I hadn’t been pos...
08 Aug 2023

Exploring the front pages of newspapers (10 years on)

Way back in 2012, I used the brand new Trove API to download the details of 4 million articles published on the front pages of newspapers. I did it for two reasons: first, I wanted to see how the content of front pages changed over time; an...
18 Jul 2023

Trove API Console updates

The Trove API Console provides examples of the Trove API in action that you can run, edit, and share. It’s been online for 9 years now, and I’ve just updated it to use version 3 of the Trove API by default. I’ve also added a new ‘Share’ but...
03 Jul 2023

Getting to work on the Trove Data Guide

The ARDC has started work on the development of a HASS Community Data Lab to support digital research in the humanities. I’m part of the team of contractors, and my work package is focused on the development of a Trove Data Guide. My aim is...
08 May 2023

Updated harvest of NSW State Archives indexes – more than 2 million rows of data!

The NSW State Archives (now part of Museums of History NSW) publishes a series of useful indexes to its collections. The indexes include basic data transcribed from the records, such as names, dates, and places, providing fine-grained acces...
24 Mar 2023

A big milestone, Trove contributor data, and the coming of API v3 – recent GLAM Workbench updates

There have been quite a few GLAM Workbench updates over the last month, so here are some notes. (See February’s update for more recent changes…) General developments After many months of work, all thirteen Trove repositories within the GLAM Wor...
17 Feb 2023

Maps, people, lists & more – recent updates to Trove resources in the GLAM Workbench

Once again I’ve gotten a bit behind in noting GLAM Workbench updates, so here’s a quick catch up on some Trove-related changes from the last couple of months. Trove API introduction The section that introduces the Trove API (or APIs!) hasn’...
11 Dec 2022

Real Face of White Australia – updated site to transcribe records from the National Archives of Australia

Back in 2017, I worked with students from my ‘Exploring Digital Heritage’ class at the University of Canberra to develop and launch a site to transcribe records from the National Archives of Australia relating to the administration of the W...
10 Dec 2022

Recent presentations – Library of Congress Data Jam, Everyday Heritage, Wikidata, and GLAM Workbench!

October and November brought a flurry of presentations from which I’m still recovering. Here’s a few details and links. Library of Congress Data Jam In October, the Computing Cultural Heritage in the Cloud project at the Library of Congress...
21 Nov 2022

The Australian history industry and the impact of digitisation (open access preprint chapter)

The Australian History Industry was published recently. Edited by Paul Ashton and Paula Hamilton, the book ‘explores the complex, multi-roomed house of Australian history’, exploring academic, school, and public history, the impact of digit...
21 Nov 2022

Recent updates to trove-newspaper-harvester and trove-newspaper-images

Catching up on some software package updates over the last few months. The trove-newspaper-harvester package is now at v0.6.5. Recent changes include: Fix to handle articles with missing metadata Don’t try to re-download existing text and ...
22 Sep 2022

Do you want your Trove newspaper articles in bulk? Meet the new Trove Newspaper Harvester Python package!

The Trove Newspaper Harvester has been around in different forms for more than a decade. It helps you download all the articles in a Trove newspaper search, opening up new possibilities for large-scale analysis. You can use it as a command-...
15 Sep 2022

From 48 PDFs to one searchable database – opening up the Tasmanian Post Office Directories with the GLAM Workbench

A few weeks ago I created a new search interface to the NSW Post Office Directories from 1886 to 1950. Since then, I’ve used the same process on the Sydney Telephone Directories from 1926 to 1954. Both of these publications had been digitis...
05 Sep 2022

Fresh harvest of OCRd text from Trove's digitised periodicals – 9GB of text to explore and analyse!

I’ve updated the GLAM Workbench’s harvest of OCRd text from Trove’s digitised periodicals. This is a completely fresh harvest, so should include any corrections made in recent months. It includes: 1,430 periodicals OCRd text from 41,645 is...
05 Sep 2022

Explore Trove's digitised newspapers by place

I’ve updated my map displaying places where Trove digitised newspapers were published or distributed. You can view all the places on a single map – zoom in for more markers, and click on a marker for title details and a link back to Trove. If...
01 Sep 2022

Making NSW Postal Directories (and other digitised directories) easier to search with the GLAM Workbench and Datasette

As part of my work on the Everyday Heritage project I’m looking at how we can make better use of digitised collections to explore the everyday experiences woven around places such as Parramatta Road in Sydney. For example, the NSW Postal Di...
29 Aug 2022

Interested in Victorian shipwrecks? Kim Doyle and Mitchell Harrop have added a new notebook to the Heritage Council of Victoria section of the GLAM Workbench exploring shipwrecks in the Victorian Heritage Database: glam-workbench.net/heritage-…

29 Aug 2022

Updates!