Who am I? – Curtin Institute for Computation – Your digital research toolbox



Who am I? – Curtin Institute for Computation – Your digital research toolbox

0 0


slides-20160404-cbs-hdr

Slides for presentation for CBS HDR students

On Github CurtinIC / slides-20160404-cbs-hdr

Your digital research toolbox

Andrea Bedini, Curtin Institute for Computation, @andreabedini

Who am I?

  • PhD in Physics @ Università di Milano in 2010
  • Postdoc @ UniMelb 2010–2015
  • Lots of numerical research workmathematical combinatorics/statistical physics/genomics/traffic modelling
  • Self-taught programmer / Open Source contributor
  • Joined Curtin University this year

Curtin Institute for Computation

  • Started last year
  • Gathers ~70 researchers
  • Truly interdisciplinary
    • Humanities
    • Health
    • Science & Engineering
    • Business School
  • Foster collaboration to solve complex problems using
    • computational modelling
    • data analysis
    • data visualisation
    • machine learning
  • Help to access and utilise existing facilitiese.g. Pawsey, HIVE, Nectar Research Cloud
  • Provide basic computing and data skills training

Team of computational specialists hired to help researchers

  • Rebecca Lange (Humanitiles)
  • Kevin Chai (Health)
  • Shiv Meka (Sci&Eng)
  • Andrea Bedini (Business School) me!

Your digital research toolbox

  • IT is changing how people do research
  • Researchers today:
    • compute
    • visualise
    • collaborate (faster, larger groups)
    • collect/manage data (in larger quantites)
    • write software
Many students start PhDs at Curtin without even the most basic of computing and data skills And it's not their fault ... Using Excel won't give you an edge anymore!

What should you be in your toolbox?

  • Two data wrangling skills
    • Manage possibly large datasets

      MovieLens

    • Know what a database is and when you need it

      Often a big CSV file is enough

      Year,Make,Model,Description,Price 1997,Ford,E350,,3000 1999,Chevy,Extended Edition,,4900 1999,Chevy,Extended Edition,,5000 1996,Jeep,Grand Cherokee,"MUST SELL!",4799 ...
If I can't Google you, you don't exist
Unix Shell • Version Control with Git • Using Databases and SQL • Programming with Python • Programming with R • R for Reproducible Scientific Analysis • Automation and Make Data Cleaning • Data Analysis and Visualization in Python/R • Data wrangling and processing • Geospatial Data • Social sciences text mining

Training sessions coming soon

Hacky Hour

  • Every Wednesday 3pm at Common Ground
  • Follow us on Twitter: @CUHackyHour

Research morning tea

Questions / Feedback

  • What data problems do you have?
  • What is your workflow?

Find the slides at http://curtinic.github.io/slides-20160404-cbs-hdr/

Your digital research toolbox Andrea Bedini, Curtin Institute for Computation, @andreabedini