Welcome!
From all of us at Bombora's DSML team, we're excited to get to know you! To do so, we'd like to begin by engaging in a few exercises we're interested in (and hope you are as well).
NOTE: We'll be leveraging open tools and public repositories. Ethics and professional aspirations should drive you to work creatively, enthusiastically, and independently. Clearly, you should leverage all resources you can, but please, do not share this exam or your solutions with any other entity, human or machine. :)
Thank you so much! Now, get comfortable and let's go!
Instructions
Overview
We're going to leverage typical analysis and development workflows used by our team to conduct an interview assessment. In short, you'll:
- clone this repository locally or use colab
- open the exam Jupyter notebook in your preferred enviornment (web or local)
- answer three questions of your choice, one from each section
- compress your results and email them to the people specified on the email you recieved
Details
1. Make your own copy of repo
Create a local clone of the repo on your local machine, with git
installed:
git clone https://github.com/bomboradata/datascience-exam.git
cd datascience-exam
2. Setup Jupyter Notebook Environment
- We prefer Python due to its rich ecosystem but please feel free to choose your own language
- Use your local development environment or open the exam notebook using Colab:
- Assumning you are using pandas, you can also download the data directly from the repo using the following
pd.read_csv("https://raw.githubusercontent.com/bomboradata/datascience-exam/master/option_1.csv")
pd.read_csv("https://raw.githubusercontent.com/bomboradata/datascience-exam/master/option_2.csv")
- Or see External data: Local Files, Drive, Sheets, and Cloud Storage for details on how to upload data in your colab notebook
- Assumning you are using pandas, you can also download the data directly from the repo using the following
3. Solve THREE problems, one from each section.
-
Take note that there are three sections, they are:
- 1. Algo + Data Structures
- 2. Prob + Stats
- 3. NLP Algorithm Development
-
Select and answer ONE question from EACH section, or to be totally explicit:
- select and answer a question from section 1 (e.g., either Q1.1 or Q1.2).
- select and answer a question from section 2 (e.g., either Q2.1, Q2.2, or Q2.3).
- select and answer a question from section 3 (e.g., either Q3.1, or Q3.2)
4. Jupyter Notebook Submission
Please compress your results and email them. We ask you to use one Jupyter Notebook. Use markdown cells to label each portion with the questions chosen. Also, feel free to use markdown cells to walk through your thought process or document pieces you would explore in the future.
Please submit your results, even if you do not completely finish a question from each section. There is no correct answer and we evaluate your approach over outputs.
Questions?
Please email the people specified on the email.
Finished, what's next?
Go enjoy your day, you deserve it!
<script async defer src="https://buttons.github.io/buttons.js"></script>