h4sci / h4sci-course

Licence: other
ETH PhD Program course

About this Course

The vast majority of data has been created within the last decade. As a result, more and more fields of research are starting to consider and embrace programming to process and analyse data. This course teaches applied programming with data and aims to leverage the open source tech stack to deal with this new wealth and complexity of data.

The idea behind Hacking for Social Sciences is to build a solid understanding of core technologies and concepts that helps researchers develop a data processing strategy and expands what they can do with data. The course's approach is to single out those concepts from software development that are easy to adopt and useful to social scientists. The course has three major learning objectives:

  • Be able to evaluate the role of focal components in a data science tech toolbox and pick tools suitable for the problem. Learn how technologies like R, Python, Git version control, Docker or cloud computing could play together in your research project.
  • Learn how to manage and version control source code. Hacking for Social Sciences teaches how to use git version control to collaborate professionally, make your research reproducible and your code base persistent.
  • Applied data sourcing and data transformation: Learn how to communicate with SQL databases. Learn how to consume data from different sources using machine-to-machine communication interfaces (APIs) such as the OpenStreetMap geocoding API / Routing Engine or the KOF data API for macroeconomic time series.
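
To make the third objective a little more concrete, here is a minimal sketch of what "communicating with a SQL database" can look like from a script. It is an illustration only, not course material: it uses Python's built-in sqlite3 module with an in-memory database, and the table name `series`, its columns, and the sample values are all made up for this example.

```python
import sqlite3


def load_series(conn, rows):
    """Create a (hypothetical) time series table and insert the given rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS series (key TEXT, period TEXT, value REAL)"
    )
    # Parameterized inserts keep values separate from the SQL text.
    conn.executemany("INSERT INTO series VALUES (?, ?, ?)", rows)
    conn.commit()


def mean_by_key(conn):
    """Let the database do the aggregation and return {key: mean value}."""
    cur = conn.execute(
        "SELECT key, AVG(value) FROM series GROUP BY key ORDER BY key"
    )
    return dict(cur.fetchall())


# Made-up sample data, for illustration only.
conn = sqlite3.connect(":memory:")
load_series(
    conn,
    [
        ("gdp", "2020-Q1", 100.0),
        ("gdp", "2020-Q2", 92.0),
        ("cpi", "2020-Q1", 101.5),
    ],
)
result = mean_by_key(conn)
print(result)
```

The same pattern (connect, send a parameterized query, fetch the result into a native data structure) carries over to other SQL databases and to R, although the driver and connection details will differ.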

Non-Goals: Hacking for Social Sciences is not a Statistics, Econometrics or Machine Learning course. Experience in these fields helps insofar as students will find it easier to motivate investing in programming and to come up with their own application examples, but profound methodological knowledge is not a prerequisite.

Hacking for Social Sciences is a course taught within the PhD program of ETH Zurich's D-MTEC Department. It was first taught in the fall semester of 2020.

Resources

Read Online

Source Code

Server & Community

Schedule

This course originated from the concepts and tricks that I wish I had known when I started my own PhD (back in 2012). While some of these concepts have only become more indispensable, others may no longer apply today. Hence, it is up to you to bring your questions and problems and help create the most useful course experience. The course will always be centered around the open source data science stack, but its blocks will be adapted according to popular demand.

REQUIRED BEFORE THE START OF THE COURSE: Make sure you have checked out the course's RStudio server. Participants with R and git experience are kindly asked to install git and R locally in advance of the course. Please also make sure you have a free, working github.com account before the start of the course.

Block 1: General Overview, How to Git

October 1, 10:00 a.m. - 1:00 p.m. CEST (online): The Big Picture

October 2, 10:00 a.m. - 2:00 p.m. CEST (online): Git & Workflows

Block 2

October 22, 10:00 a.m. - 1:00 p.m. CEST (online): R Programming Crash Course

October 23, 10:00 a.m. - 2:00 p.m. CEST (online): Programming with Data

Block 3

November 19, 10:00 a.m. - 1:00 p.m. CEST (online): Infrastructure

November 20, 10:00 a.m. - 2:00 p.m. CEST (online): Infrastructure

Block 4

December 3, 10:00 a.m. - 1:00 p.m. CEST (online): Semester Projects

December 4, 10:00 a.m. - 2:00 p.m. CEST (online): Semester Projects

Format

The 4 blocks of the course contain

  • short live sessions
  • interactive questionnaires / apps
  • pre-recorded videos
  • own reading / research
  • programming or setup tasks

All 4 blocks will contain most of these elements if not all.

Exam (Leistungsnachweis)

Ungraded programming tasks and active participation in class. The final programming task will be to create a production-ready CI/CD pipeline in a group. This could be a regular data update (ETL process) or the automated build/testing of a package. Group size will depend on course registrations.
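
As a rough idea of the kind of job such a pipeline might run on a schedule, the sketch below walks through a tiny extract-transform-load cycle. It is only an illustration under stated assumptions, not the required solution: the CSV text, the table name `obs` and all function names are hypothetical, and it uses Python's standard library rather than any particular CI/CD service.

```python
import csv
import io
import sqlite3

# Hypothetical raw input; a real job would download or read this file.
RAW_CSV = """date,value
2020-10-01,1.5
2020-10-02,
2020-10-03,2.5
"""


def extract(text):
    """Parse CSV text into a list of dicts, one per data row."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(records):
    """Drop rows with a missing value and convert the rest to floats."""
    return [(r["date"], float(r["value"])) for r in records if r["value"].strip()]


def load(conn, rows):
    """Upsert rows so that re-running the job does not duplicate data."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS obs (date TEXT PRIMARY KEY, value REAL)"
    )
    conn.executemany("INSERT OR REPLACE INTO obs VALUES (?, ?)", rows)
    conn.commit()


def run_pipeline(conn, text):
    """Run all three steps and return the number of stored observations."""
    load(conn, transform(extract(text)))
    return conn.execute("SELECT COUNT(*) FROM obs").fetchone()[0]


conn = sqlite3.connect(":memory:")
n_first = run_pipeline(conn, RAW_CSV)
n_second = run_pipeline(conn, RAW_CSV)  # a second run must not add duplicates
print(n_first, n_second)
```

Keying the table on the date and using an upsert makes the load step idempotent, which matters for a regular data update because a scheduled job may be re-run after a failure.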

License

This work by Dr. Matthias Bannert is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. This includes all illustrations unless stated otherwise. Logos of software products or companies are used only to refer to these companies and products and are not shared under a CC license.
