6 Meetup - USF

6.1 Analysing & Preventing Unconscious Bias in Machine Learning. Rachel Thomas - 2018-10-19

https://www.meetup.com/USF-Seminar-Series-in-Data-Science/events/254217548/ Video: facebook.com/usfca.msds/

Abstract: Increasingly AI is finding its way into nearly every product we use (everything from photo sharing apps to criminal justice decision algorithms), but often various types of bias are buried in the underlying data and models. This can have a damaging impact on both individuals and society. Through the lens of 3 case studies, we will look at how to diagnose bias, identify some sources, and some steps towards addressing it.

Check out gendershades.org
- Good example of use of data & diversity at varying levels of technical detail
Word Embeddings
- Word2Vec - google library of word embedings
- Stanford has a similar libraries
- Rachel Thomas - word embeddings youtube
- github: fastai/word-embeddings-workshop
ML can amplify bias
Compass software:
- Determining who has to post bail
- sentencing
- parole
Problems:
- Runaway feedback loops (predictive policing, etc.)
- Ethical variables to include - recedivism algorithms?
Solutions:
- AI ethics resources
- Fastai Practical Deep Learning For Coders course
- De-bias word embeddings - at level of perception vs level of action
  - Need to be looking for bias throughout
- “Datasheets for Datasets” - great paper/resource
  - List of good questions about data sets
  - Identify “human” elements of data sets
  - Case study of history of datasets and regulations
- Meetup talk youtube: Evan Estola - When Recommendations Systems Go Bad - MLconf SEA 2016
- Talk to domain experts and those impacted
- Think about unintended consequences in advance:
  - trolls/harassers
  - authoritarian governments
  - propaganda/disinformation
Questions:
- Bias in data
- Code auditable? Open source?
- Error rates for different sub-groups
- Accuracy of simple, rule-based alternative?
- Appeals process for mistakes?
- How diverse is the team building it?
  - Diverse teams perform better
  - Believing you are meritocratic INCREASES bias
- How do we address more nuanced biases once low hanging fruit are addressed?
  - Can be an interesting conversation, but don’t let perfect be the enemy of the good.

6.2 USF Intro session - 2018-10-19

FAQ
- Recommended Linear Algebra course:
  - University of North Dakota
12 Month accelerated program
- 6 7-week modules & 1 2-week intersession course
- 7/8/19 - 6/28/19
- 60-80 hrs/wk
- Practicum:
  - Similar to an internship, but with faculty mentors
  - Want to ensure good probems / work mentors
Highly recommend applications by 12/5 - better scholarship opportunities, ore thorough review of applications
Pre-reqs: linear algebra @ accredited university
- Don’t have to have completed prereqs, but need a defined plan.
Send info ?s to info@datascience
Faculty interview:
- Programming
- inferential statistics
- linear algebra
Personal statement: only 2 pages (won’t even read page 3)
- show genuine interest in why this particular program
- where below average, address the issues
  - want self-awareness - everyone struggles in the program - need to be able to self-evaluate
Letters of rec
- Strong letter of rec with detail of why fit for this program
- Work experience = 1 academic, 1 professional preferred
- Both can be from work if no reasonably current academic references

6.3 Python & Other Stuff Links flagged on mobile that need to be integrated in blog

D&D 5E Random Character Generator

How You Can Switch Languages and Get Up to Speed With Python as Quickly as Possible

Python Programming Course

Google Colaboratory

The Open Source Computer Science Degree

I Made an IDE for Python, Written in Python. Check It Out and Give Me Feedback!

Huge Recommendation: Python Crash Course by Eric Matthes.

Minimally Sufficient Pandas

After Nearly 100,000 Subscribers, We Still Don’t Have a Wiki Answering the Most Basic Questions. Help Us Fix It.

My D&D5 Encounter Simulator Now Allows Alignments to Be Altered After Many Requests: Thanks You Passionate Visitors and Sorry for Tardiness (C. 3 Years)

Can We Have «How to Get Started With Linux» Megathread?

Comprehensive Python Cheatsheet

PSA: Many of Berkeley’s Courses Have Lectures and Materials Free Online

How to Make a Desktop GUI Application in Python?

Best Data Science Blogs?

Best Data Science Books?

Are There Any Books on Getting Into the Mindset of a Coder or to Think Like a Coder?

Essential Books That Every Programmer Should Read

Analyzing My Weight Loss Journey With Machine Learning

What Are Some Very Useful, Lesser Known Python Libraries for Data Science?

I Just Finished a 48 Hour Game Jam Using Python and Pygame! (Source in Comments.)

28 Jupyter Notebook Tips, Tricks, and Shortcuts

Ipython-Contrib/Jupyter_Contrib_Nbextensions

I Just Published a 17-Part Video Series on Learning Regex in Python

Useful Python Tricks

I Just Published a Video on How to Discover Hidden Web APIs So That You Can Use Them Through Your Python Programs. More Interesting Examples Coming Soon!

10 Steps to Set Up Your Python Project for Success

10 Steps to Set Up Your Python Project for Success

First Contributions

400GB Data-Set of Conversational Audio. (Unlabeled) Cross-Post From R/NLP [Project]

From ‘R vs Python’ to ‘R and Python’

Where Are the GOOD Books? I Don’t Need Something That Spends Ten Whole Chapters on Control Structures, Functions, Lists, Etc. I’m Not Looking for an Introduction to Syntax. I Want the Dirty and Hard Stuff. Decorators, Classes, Inheritance, Recursion, Making Packages, Etc. Any Recommendations

[D] Machine Learning Productivity Hacks

Build Your Own AlphaZero AI Using Python and Keras

“I Wish I’d Found This Tool Earlier!!” One Example: Python Tutor

Dr. Mike Pwned Explains Hashing and Other Cool Stuff (Python Script in Comments)

I Started Awesome-Dev-Articles to Curate Great Dev Articles / Blog Posts. 50+ So Far, Contributions Welcome!

Write Yourself a Git!

Productivity Tips for Jupyter When Working in Python & R

I Made a Simple Tool to Fight Computer Vision Syndrome.

[D] Best Practice and Tips & Tricks to Write Scientific Papers in LaTeX, With Figures Generated in Python or Matlab

Comprehensive Python Cheatsheet

How I’m Able to Take Notes in Mathematics Lectures Using LaTeX and Vim

Hi, I’m Alan Smith, Data Visualisation Editor at the Financial Times. I’ve Just Finished an Experimental Project at the FT to Both Visualise and Sonify the Historical Yield Curve - a Large Dataset of Over 100,000 Data Points. AMA!

“How’s That Movie?” — Neural Collaborative Filtering With FastAI (Build a State-of-the-Art Recommendation Engine With Just 10 Lines of Code)

Python (Django) Source Code of missinboxalready.com

The Resources That Helped Me Learn How to Be a Data Engineer and Data Scientist

I Recently Had to Rebuild My Raspberry Pi Hosting My OpenVPN Server, So I Decided to Write a Tutorial This Time (to Make It Easier to Rebuild Next Time) (X-Post From R/Raspberry_Pi)

I Made This Figure in Python as a Masters Student and It’s Still My Favorite

Learning Data Science: Our Favorite Resources From Free to Not

Develop and Share R Shiny Content With This Docker Stack

I Wrote a Python Package to Do Adaptive Sampling of Functions in Parallel [OC]

[D] Which GPU(S) to Get for Deep Learning: My Experience and Advice for Using GPUs in Deep Learning

Machine Learning | Face Recognition in 10 Min

Taiga- Love Your Project

Purrleterian/Image-to-ASCII-Art-Converter

[P] Curated List of Python Resources

Openlibrary: One Webpage for Every Book Ever Published!

Which Are the 3-5 PEPs That Should I Read?

Eat for Free in NYC Using Python, Automation, Artificial Intelligence, and Instagram

Local Maxima and the Fallacy of Jumping to Fixed-Points: Examples From Economics, Evolution, and Computer Science.

Curated List of Data Science Resources

Online Courses Recommended by Hacker News Users.

I Converted MIT’s OpenCourseWare ”Learn Python Course” to a Layout Which Views Better on All Devices

13 Project Ideas for Intermediate Python Developers

36 Amazing Python Open Source Projects (V.2019)

Favorite Data Science Podcast for Fundamental, Advanced Topics?

Recommend a Data Science Courses

Notes on Data Science, Machine Learning, Artificial Intelligence, History, and Social Science.

I Created Tool That Generates Python Model Classes (Attrs, Dataclasses) Based on JSON Datasets With Typing Module Support

Getting Started With Kaggle: Explore the Dataset

R to Python: Data Wrangling With Dplyr and Pandas · GitHub

Let’s Write a Data Science Ethics Syllabus!

Complete Guide to Python Package Creation, Automated Testing and Deployment to PyPI

The Reason I Am Using Altair for Most of My Visualization in Python

[D] Machine Learning - WAYR (What Are You Reading) - Week 62

How to Start Your Own Remote Business & Make Money as a Freelancer With Upwork

15 Types of Regression You Should Know

[AI Application] Let Your Machine Teach Itself to Play Flappy Bird!

Useful Python Cheat Sheets

100-basic-machine-learning-interview

Learning Machine Learning Resources

Best Books or Courses to Learn Sql for a Beginner?

is_it_possible_to_quantify_my_life_with_python

What Are Underrated YouTube Channels for Python?

Possibly the Most Commonly Asked Front End Interview Question: Build a Progress Bar! a Video Tutorial With 2 Different Approaches

Resource Dump ; Linear Algebra for Biologists + Stats

List of Data Science and Machine Learning GitHub Repositories to Try In 2019

Cornell’s Entire Machine Learning Class (CS 4780) Is Now Entirely on You Tube. Taught by One of the Funniest and Best Professors I Have Ever Had.

beginner_tip_deploy_your_commandline_apps_on

Introduction to Machine Learning in Python With Scikit-Learn (Video Series)

[AI Application] Let Your Machine Play Super Mario Bros!

Learn React by Creating a Fully Functional Chat Application

Made a Video on How to Create a Basic Neural Network in Python, Wanted to Share! I’d Appreciate Any Honest Criticism as I’m Going to Create Another Video <3

Help With Sources for Starting!

Erlemar/Erlemar.github.io

How We Can Prepare Now for Catastrophically Dangerous AI—and Why We Can’t Wait

Yao Yao MSDS Alum the Job Search Interview Offer Letter Experience for Data Science

The Third Annual AI Now Report: 10 More Ways to Make AI Safe for Human Flourishing

Predictim Claims Its AI Can Flag ‘Risky’ Babysitters. So I Tried It on the People Who Watch My Kids.

What Is Your Favorite DS/ML/Statistics/KD Event?

Effective Presentations Techniques for Scientific and Technical Content

[New Resource] Top 50 Matplotlib Visualizations for Data Analysis - the Master Plots (W/ Full Python Code)

20 YouTube Channels for AI & Data Science

My First Arduino Project: Voice Controlled Organizer Using EasyVR 3.0 and Some WS2811S.

The Data Science Workflow

Data Science Workflow Tools

The Data Science Workflow

Dfply - the Dplyr of Python

I Made List of Resources for R/learnProgramming

Using Survival Analysis to Predict Customer Churn

Creating a Simple App From Start the Finish

Share Code From Any Device

The Joel Test: 12 Steps to Better Code

I Want to Watch a Programmers Workflow From the Start of a Project to the End. Does This Exist?

Learn Python the Fun Way

Advice/ Links/ Tips to Learning Python for DS

Using Statistics to Estimate the True Scope of the Secret Killings at the End of the Sri Lankan Civil War

A Beautiful Book About Numpy

Facebook Open Sources PyText NLP Framework

Hey All. I’m a Data Scientist Who Gave Up Learning Many Times Because of the Overload of Materials and Lack of Structured Road Map. So I Wrote This Article to Help Those Who Want to Achieve Their Learning Goals Next Year With a Simple Timetable They Can Replicate Every Month. I Hope It Helps.

The Hundred-Page Machine Learning Book Manuscript Is Complete

DataScienceJobs

Detect ANY Object With Raspberry Pi and TensorFlow

5 Papers and Articles About Project Management in Data Science

Find Usernames Across Over 75 Social Networks. Very Useful for Reconnaissance

🤖Python Examples of Popular Machine Learning Algorithms With Interactive Jupyter Demos and Math Being Explained

Comprehensive Python Cheatsheet

23 Awesome Programming Blogs to Follow In 2019

Reminder for Students Here, You Can Get Github Student Developer Pack Which Includes a Lot of Useful Free Software

Cookiecutter Data Science

A Short Guide I Wrote on Learning iOS Development Using Swift - Do Check It Out if You Are Interested in the Field!

Homm3 on Raspberry Pi 3 WITHOUT Exagear

Prof. Mike Gelbart’s (UBC) DS Lectures Are on YouTube and They’re Amazing

Data Science for Blog Content

Looking for Advice on Resources to Learn Python…

My Agile Workflow (Suggested by the Experts) (Episode 1)

Git for Your Shortcuts — Make Readable, Versioned Shortcut Backups With Git (via Working Copy)

Best Data Science Talk One Can Watch?

Over 500 Top PDFs Posted to Hacker News In 2018

An Ethics Checklist for Data Scientists

Compiled Notes for People Who Are Learning Python.

How Software, Data, and a Hell of a Lot of Work Helped Me Lose 110 Pounds in 25 Months

How Software, Data, and a Hell of a Lot of Work Helped Me Lose 110 Pounds in 25 Months

Yosemite Moonbow

I Never Took a Bayesian Class. What Book or Video Series Should I Watch to Get Up to Speed?

Python 3’S F-Strings: an Improved String Formatting Syntax (Guide)

Phd_Resume_Cover_Letters

Documenting Datasci Setup for Personal Computers

Former Google Engineer Breaks Down Interview Problems He Used to Use to Screen Candidates. Lots of Good Programming Tips and Advice.

Cruces13/DND-Char-Gen