sometimes I'm asked by colleagues from other public administrations what they should ask candidates they want to recruit for data science roles... my go to practical interview question is a link to this data:
https://ec.europa.eu/eurostat/databrowser/bookmark/fb7169d9-11d5-47de-9335-fa64107e66ea?lang=en
and asking to reproduce the graph in the picture using the tools they prefer, but the data is not immediately in a format they can use for plotting, and a surprising amount of candidates struggle quite a lot!
=> More informations about this toot | More toots from brodriguesco@fosstodon.org
@brodriguesco The thing has an API and spits out a nice zipped CSV. Rather surprised that this would trip up people.
=> More informations about this toot | More toots from koen_hufkens@mastodon.social
@koen_hufkens candidates would get a laptop with both python and R and the usual plotting packaging, and the data already downloaded as a csv. But as I said, the data in the csv is not quite plotting ready, so it requires some preparing. Many people actually never have to deal with messy data it seems!
=> More informations about this toot | More toots from brodriguesco@fosstodon.org
@brodriguesco Selecting the route? Damn, we're at the comprehensive reading level then.
=> More informations about this toot | More toots from koen_hufkens@mastodon.social
@koen_hufkens there's that, and also not forgetting to replace the ":" characters that are used as NA, but the step that people really don't get is that they should get the data into a long format. The downloaded data is wide, with each month-year, quarter-year and year, a separate column. Only selecting month-years and pivoting the data is a major hurdle!
=> More informations about this toot | More toots from brodriguesco@fosstodon.org
@brodriguesco I would consider those really entry level issues, something I would really expect people not to trip over if applying for a data science role though.
=> More informations about this toot | More toots from koen_hufkens@mastodon.social
@koen_hufkens totally! suffice to say we have issues finding people 🤣
=> More informations about this toot | More toots from brodriguesco@fosstodon.org
@brodriguesco @koen_hufkens the wideness was my first observation. I hope my students would know to do a pivot-longer first and then figure out what else needs to be done.
=> More informations about this toot | More toots from nxskok@mastodon.cloud
@brodriguesco I would prefer access to the raw data or some "scratch" like app with dplyr verbs than those web forms 😅
=> More informations about this toot | More toots from jrosell@mastodon.social
@jrosell you would get a laptopt with both python and R and all usual packages for plotting and the data would already be downloaded as a csv on that laptop.
=> More informations about this toot | More toots from brodriguesco@fosstodon.org
@brodriguesco I would hate to reproduce those x-axis labels, though 😂👌
=> More informations about this toot | More toots from jrosell@mastodon.social This content has been proxied by September (3851b).Proxy Information
text/gemini