Ever wondered where people from Europe’s cities went on Erasmus to? Look no further.
What am I looking at here?
This is a map of a lot of the Erasmus student exchanges between 2008 and 2019 (a lot, not all, due to technical reasons explained below). The map is city-based, so all universities in one city are grouped together. You can also select time periods and also go through the data year by year.
The map only shows exchanges between Erasmus member states, so for example not the Eastern Partnership countries or Erasmus Mundus exchanges. It only shows student exchanges, not work placements (which would be another interesting map, but would require some further work).
Where did you get the data from?
The Erasmus data is published by the European Commission. Further data for geocoding the various universities is also taken from the Commission and from Geonames.
What did you do to the data?
While the data provided by the Commission is very nice, it doesn’t show geographic location. This means I had to geocode each exchange in the dataset. For this I had to start on two different levels: The first few years showed home and host universities, the other years cities. So, for the first few years, I first had to convert the codes of universities to the cities the universities were located in. Then I only had the names of cities, and those I could then geocode.
But this is of course much easier said then done: The data was hand-filled, so I had to deal with a lot of different ways of writing the names of cities. I did my best, but overall I was only able to identify around 84% of them. That sounds alright until you remember that an exchange involves two cities, so only 70% (0,84² = 0,7) of exchanges could be completely geocoded.
As you might notice, the map is already quite laggy because there are so many different exchanges on there. In order to help out a bit with that, I removed all exchanges that only included a single student.
In total, there are 4,9 million exchanges in the data. Out of those, 4,7 million were usable (mostly excluding non-student exchanges), and out of those, 3,3 million could successfully be geocoded. I couldn’t find data on how many people actually went on exchange between 2008 and 2013, but for 2014 to 2019, that means around 55% of all exchanges are included in the map.
You can check what I did with the data yourself in this project’s repository.
Why should I care about this?
Well, you probably shouldn’t. But if you are so inclined, it might be interesting to check the relations of cities: What cities do students in Marseille like to go on Erasmus to? How far does the Erasmus network of Sofia stretch to? And you can also check developments over time: How popular has Budapest been as an Erasmus destination? Has this changed, and if yes, when and why?
But that would just be my suggestions.