Category: R Programming

Aug 02 2017

On Word Embeddings and Fish – Letters of 1916

Letters of 1916 is a public humanities project run by Maynooth University and directed by Professor Susan Schreibman. The project is creating a crowd-sourced digital collection of letters written between 1st November 1915 and 31st October 1916. For my internship, I have been working on the project as a data scientist, analysing the collection. Following …

Continue reading

Jul 06 2017

Topic Modelling: PoS Tagging – Letters of 1916

Letters of 1916 is a public humanities project run by Maynooth University and directed by Professor Susan Schreibman. The project is creating a crowd-sourced digital collection of letters written between 1st November 1915 and 31st October 1916. For my internship, I have been working on the project as a data scientist, analysing the collection. In …

Continue reading

Jun 26 2017

Topic Modelling – Letters of 1916

Letters of 1916 is a public humanities project run by Maynooth University and directed by Professor Susan Schreibman. The project is creating a crowd-sourced digital collection of letters written between 1st November 1915 and 31st October 1916. For my internship, I have been working on the project as a data scientist, analysing the collection. One …

Continue reading

Oct 03 2016

Jane Austen in Vector Space – Presentation at JADH

JADH 2016

In September, I presented a paper which discussed the application of vector space models to a corpus of Jane Austen’s published novels at the Japanese Association for Digital Humanities Conference in Tokyo. The paper was titled ‘Jane Austen in Vector Space: Applying vector space models to 19th century literature’ and outlined some of the findings …

Continue reading

Mar 10 2016

Beyond the Word Cloud

Possibly the most common entry level visualisation in computational textual analysis is the word cloud. There are multiple online tools (the best known probably being Wordle) which allow you to create them in all sorts of styles. A word cloud can be very useful for identifying frequent terms at a glance, or to indicate possible …

Continue reading

Jul 10 2015

Marine Lives – R and The Silver Ships – Frequencies

In this second blog post on the Marine Lives Three Silver Ships project, (see the first post here) I look at how to identify the folio pages in the HCA 13/70 Depositions which mention the three ships (Salvador, Sampson and Saint George). Processing and Calculating Raw Frequencies Using the .txt file downloaded in the last …

Continue reading

Jul 09 2015

Marine Lives – R and The Silver Ships – Extracting Data

Within the larger Marine Lives project there are a number of smaller sub-projects which focus on specific areas of interest. One of these smaller projects is the Three Silver Ships project: The purpose of the project is to identify relevant references to cases involving the Three Silver Ships in the various depositions and papers and …

Continue reading

Apr 02 2015

Sentiment Analysis – Further Down the ‘R’abbit Hole

It seems rather strange to think that, just under eight months ago, I had not written any computer code (I’m not including little bits of BASIC from the ’80s), and yet lines of code or the blinking cursor of Terminal no longer instil a sense of rising panic. Although programming has a very steep learning …

Continue reading

Nov 13 2014

Rubber Duck Debugging

In the second week of the Coursera R Programming course and things were getting decidedly tough with the focus being on creating functions. The shift from basic calculations to writing functions is a very steep learning curve. I had made the mistake of reading a ‘warning’ post which highlighted the difficulties of creating the assignment …

Continue reading

Nov 13 2014

Learning R – Next Steps

My initial foray into the world of programming seemed to go fairly well, at least, I managed to get my head around the basics and haven’t run screaming from my laptop. There were a few areas which I found more tricky than others, but I think that some of that is because I don’t have a maths …

Continue reading