6 Meetup - USF
6.1 Analysing & Preventing Unconscious Bias in Machine Learning. Rachel Thomas - 2018-10-19
https://www.meetup.com/USF-Seminar-Series-in-Data-Science/events/254217548/ Video: facebook.com/usfca.msds/
Abstract: Increasingly AI is finding its way into nearly every product we use (everything from photo sharing apps to criminal justice decision algorithms), but often various types of bias are buried in the underlying data and models. This can have a damaging impact on both individuals and society. Through the lens of 3 case studies, we will look at how to diagnose bias, identify some sources, and some steps towards addressing it.
- Check out gendershades.org
- Good example of use of data & diversity at varying levels of technical detail
- Word Embeddings
- Word2Vec - google library of word embedings
- Stanford has a similar libraries
- Rachel Thomas - word embeddings youtube
- github: fastai/word-embeddings-workshop
ML can amplify bias
- Compass software:
- Determining who has to post bail
- sentencing
- parole
- Problems:
- Runaway feedback loops (predictive policing, etc.)
- Ethical variables to include - recedivism algorithms?
- Solutions:
- AI ethics resources
- Fastai Practical Deep Learning For Coders course
- De-bias word embeddings - at level of perception vs level of action
- Need to be looking for bias throughout
- “Datasheets for Datasets” - great paper/resource
- List of good questions about data sets
- Identify “human” elements of data sets
- Case study of history of datasets and regulations
- Meetup talk youtube: Evan Estola - When Recommendations Systems Go Bad - MLconf SEA 2016
- Talk to domain experts and those impacted
- Think about unintended consequences in advance:
- trolls/harassers
- authoritarian governments
- propaganda/disinformation
- Questions:
- Bias in data
- Code auditable? Open source?
- Error rates for different sub-groups
- Accuracy of simple, rule-based alternative?
- Appeals process for mistakes?
- How diverse is the team building it?
- Diverse teams perform better
- Believing you are meritocratic INCREASES bias
- How do we address more nuanced biases once low hanging fruit are addressed?
- Can be an interesting conversation, but don’t let perfect be the enemy of the good.
6.2 USF Intro session - 2018-10-19
- FAQ
- Recommended Linear Algebra course:
- 12 Month accelerated program
- 6 7-week modules & 1 2-week intersession course
- 7/8/19 - 6/28/19
- 60-80 hrs/wk
- Practicum:
- Similar to an internship, but with faculty mentors
- Want to ensure good probems / work mentors
- Highly recommend applications by 12/5 - better scholarship opportunities, ore thorough review of applications
- Pre-reqs: linear algebra @ accredited university
- Don’t have to have completed prereqs, but need a defined plan.
- Send info ?s to info@datascience
- Faculty interview:
- Programming
- inferential statistics
- linear algebra
- Personal statement: only 2 pages (won’t even read page 3)
- show genuine interest in why this particular program
- where below average, address the issues
- want self-awareness - everyone struggles in the program - need to be able to self-evaluate
- Letters of rec
- Strong letter of rec with detail of why fit for this program
- Work experience = 1 academic, 1 professional preferred
- Both can be from work if no reasonably current academic references
6.3 Python & Other Stuff Links flagged on mobile that need to be integrated in blog
D&D 5E Random Character Generator
How You Can Switch Languages and Get Up to Speed With Python as Quickly as Possible
The Open Source Computer Science Degree
I Made an IDE for Python, Written in Python. Check It Out and Give Me Feedback!
Huge Recommendation: Python Crash Course by Eric Matthes.
Can We Have «How to Get Started With Linux» Megathread?
Comprehensive Python Cheatsheet
PSA: Many of Berkeley’s Courses Have Lectures and Materials Free Online
How to Make a Desktop GUI Application in Python?
Are There Any Books on Getting Into the Mindset of a Coder or to Think Like a Coder?
Essential Books That Every Programmer Should Read
Analyzing My Weight Loss Journey With Machine Learning
What Are Some Very Useful, Lesser Known Python Libraries for Data Science?
I Just Finished a 48 Hour Game Jam Using Python and Pygame! (Source in Comments.)
28 Jupyter Notebook Tips, Tricks, and Shortcuts
Ipython-Contrib/Jupyter_Contrib_Nbextensions
I Just Published a 17-Part Video Series on Learning Regex in Python
10 Steps to Set Up Your Python Project for Success
10 Steps to Set Up Your Python Project for Success
400GB Data-Set of Conversational Audio. (Unlabeled) Cross-Post From R/NLP [Project]
From ‘R vs Python’ to ‘R and Python’
[D] Machine Learning Productivity Hacks
Build Your Own AlphaZero AI Using Python and Keras
“I Wish I’d Found This Tool Earlier!!” One Example: Python Tutor
Dr. Mike Pwned Explains Hashing and Other Cool Stuff (Python Script in Comments)
Productivity Tips for Jupyter When Working in Python & R
I Made a Simple Tool to Fight Computer Vision Syndrome.
Comprehensive Python Cheatsheet
How I’m Able to Take Notes in Mathematics Lectures Using LaTeX and Vim
Python (Django) Source Code of missinboxalready.com
The Resources That Helped Me Learn How to Be a Data Engineer and Data Scientist
I Made This Figure in Python as a Masters Student and It’s Still My Favorite
Learning Data Science: Our Favorite Resources From Free to Not
Develop and Share R Shiny Content With This Docker Stack
I Wrote a Python Package to Do Adaptive Sampling of Functions in Parallel [OC]
[D] Which GPU(S) to Get for Deep Learning: My Experience and Advice for Using GPUs in Deep Learning
Machine Learning | Face Recognition in 10 Min
Purrleterian/Image-to-ASCII-Art-Converter
[P] Curated List of Python Resources
Openlibrary: One Webpage for Every Book Ever Published!
Which Are the 3-5 PEPs That Should I Read?
Eat for Free in NYC Using Python, Automation, Artificial Intelligence, and Instagram
Curated List of Data Science Resources
Online Courses Recommended by Hacker News Users.
I Converted MIT’s OpenCourseWare ”Learn Python Course” to a Layout Which Views Better on All Devices
13 Project Ideas for Intermediate Python Developers
36 Amazing Python Open Source Projects (V.2019)
Favorite Data Science Podcast for Fundamental, Advanced Topics?
Recommend a Data Science Courses
Notes on Data Science, Machine Learning, Artificial Intelligence, History, and Social Science.
Getting Started With Kaggle: Explore the Dataset
R to Python: Data Wrangling With Dplyr and Pandas · GitHub
Let’s Write a Data Science Ethics Syllabus!
Complete Guide to Python Package Creation, Automated Testing and Deployment to PyPI
The Reason I Am Using Altair for Most of My Visualization in Python
[D] Machine Learning - WAYR (What Are You Reading) - Week 62
How to Start Your Own Remote Business & Make Money as a Freelancer With Upwork
15 Types of Regression You Should Know
[AI Application] Let Your Machine Teach Itself to Play Flappy Bird!
100-basic-machine-learning-interview
Learning Machine Learning Resources
Best Books or Courses to Learn Sql for a Beginner?
is_it_possible_to_quantify_my_life_with_python
What Are Underrated YouTube Channels for Python?
Resource Dump ; Linear Algebra for Biologists + Stats
List of Data Science and Machine Learning GitHub Repositories to Try In 2019
beginner_tip_deploy_your_commandline_apps_on
Introduction to Machine Learning in Python With Scikit-Learn (Video Series)
[AI Application] Let Your Machine Play Super Mario Bros!
Learn React by Creating a Fully Functional Chat Application
Help With Sources for Starting!
How We Can Prepare Now for Catastrophically Dangerous AI—and Why We Can’t Wait
Yao Yao MSDS Alum the Job Search Interview Offer Letter Experience for Data Science
The Third Annual AI Now Report: 10 More Ways to Make AI Safe for Human Flourishing
Predictim Claims Its AI Can Flag ‘Risky’ Babysitters. So I Tried It on the People Who Watch My Kids.
What Is Your Favorite DS/ML/Statistics/KD Event?
Effective Presentations Techniques for Scientific and Technical Content
20 YouTube Channels for AI & Data Science
My First Arduino Project: Voice Controlled Organizer Using EasyVR 3.0 and Some WS2811S.
I Made List of Resources for R/learnProgramming
Using Survival Analysis to Predict Customer Churn
Creating a Simple App From Start the Finish
The Joel Test: 12 Steps to Better Code
I Want to Watch a Programmers Workflow From the Start of a Project to the End. Does This Exist?
Advice/ Links/ Tips to Learning Python for DS
Facebook Open Sources PyText NLP Framework
The Hundred-Page Machine Learning Book Manuscript Is Complete
Detect ANY Object With Raspberry Pi and TensorFlow
5 Papers and Articles About Project Management in Data Science
Find Usernames Across Over 75 Social Networks. Very Useful for Reconnaissance
Comprehensive Python Cheatsheet
23 Awesome Programming Blogs to Follow In 2019
Homm3 on Raspberry Pi 3 WITHOUT Exagear
Prof. Mike Gelbart’s (UBC) DS Lectures Are on YouTube and They’re Amazing
Looking for Advice on Resources to Learn Python…
My Agile Workflow (Suggested by the Experts) (Episode 1)
Git for Your Shortcuts — Make Readable, Versioned Shortcut Backups With Git (via Working Copy)
Best Data Science Talk One Can Watch?
Over 500 Top PDFs Posted to Hacker News In 2018
An Ethics Checklist for Data Scientists
Compiled Notes for People Who Are Learning Python.
How Software, Data, and a Hell of a Lot of Work Helped Me Lose 110 Pounds in 25 Months
How Software, Data, and a Hell of a Lot of Work Helped Me Lose 110 Pounds in 25 Months
I Never Took a Bayesian Class. What Book or Video Series Should I Watch to Get Up to Speed?
Python 3’S F-Strings: an Improved String Formatting Syntax (Guide)