Related Posts
More Posts
Additional Posts in Data & Analytics Consultants
today I choose violence

Thought this was interesting. Across 160 teams of researchers, just about all failed to make good life outcome predictions on things like GPA, evictions, layoffs, and others. Data followed 4.5k families across 15 years, with 13k features (varied over time). Haven't looked at it directly yet, but will be turning the docs and data inside out... In the meantime, authors claim this as showing the limits of ML. Oh, and it's published in PNAS, so you know there's some big publication energy there.
https://www.pnas.org/content/117/15/8398
Got messaged by a C3 . ai recruiter. Read that wlb is bad and that the interview process is absurdly long, but the Glassdoor reviews are 4.2 and can't find actual hours worked posted by anyone. How's the culture really? I'd be aiming for DS consulting, something more functional but with DS/ML concepts as my differentiator.
C3.ai, Inc.
New to Fishbowl?
unlock all discussions on Fishbowl.





Sql and python
Pick a cloud platform to learn the pipeline and orchestration tool
Try apache beam
DE is pretty broad. I’d say a good place to start is learning python, pandas and pyspark. Airflow is increasingly popular for building data pipelines. Familiarize yourself with SQL and NoSQL (Mongo). If you’re interested in ML, scikit-learn has the most easy to understand and comprehensive learning material.
SQL. Everything else is role/firm specific.
Get comfortable with ETL. Start with the concept itself and the million variations that the concept entails.
A lot of ETL is done thru SQL. So the next concept to master after SQL is ETL.