SlideShare a Scribd company logo
How You Can Use Open
Source Materials to Learn
Python & Data Science
Kamila Stępniowska, EuroPython 2018
github.com/KStepniowska/EuroPython2018
CC-BY
What can you expect?
- Sociology
- Diversity: Geek Girls Carrots, Women Who Code, She’s Coding
- New Business Manager @10Clouds
Please don’t ask me about:
1. How can you become a data scientist in 3 weeks?
2. Which algorithm will solve an “A” or “B” problem?
EuroPython 2018 Kamila Stępniowska, CC-BY
I hope that you will learn more about...
RESOURCES - Python and Data Science
github.com/KStepniowska/EuroPython2018
Open Source - basics
Data Science Workflow
PROJECTS & COOPERATION & CONTRIBUTION
EuroPython 2018 Kamila Stępniowska, CC-BY
Shall we?
EuroPython 2018 Kamila Stępniowska, CC-BY
Fernando José Ignacio
Gárate Parra
https://bit.ly/2A5MoOW
CC BY-NC 2.0
Open Source
EuroPython 2018 Kamila Stępniowska, CC-BY
“Open data and content can be freely
used, modified, and shared by anyone
for any purpose”
https://opendefinition.org/
EuroPython 2018 Kamila Stępniowska, CC-BY
Educational Materials
Text, Pictures, Videos, Audio Records...
https://creativecommons.org/licenses/
EuroPython 2018 Kamila Stępniowska, CC-BY
As a User
Code
https://opensource.org/licenses
https://www.gnu.org/graphics/license-logos.en.html
EuroPython 2018 Kamila Stępniowska, CC-BY
As a User
Pick yours
General: choosealicense.com
Text: creativecommons.org/licenses/
Code: opensource.org/licenses
EuroPython 2018 Kamila Stępniowska, CC-BY
As a Creator
Python
EuroPython 2018 Kamila Stępniowska, CC-BY
Why Python?
COMMUNITY
Welcoming & Supportive
Global & Diverse
...
If there is a problem, there is a great chance that someone has written and
shared the solution already.
EuroPython 2018 Kamila Stępniowska, CC-BY
Learning Experience
Find Your Project -> learn by building
Find Your People -> Cooperate
Find a way to Contribute -> help others
https://bugs.python.org/
EuroPython 2018 Kamila Stępniowska, CC-BY
For Beginners
PEP 8 python.org/dev/peps/pep-0008/
“PEP 8 — the Style Guide for Python Code
This stylized presentation of the well-established PEP 8 was created by Kenneth
Reitz (for humans).”
*PEP=Python Enhancement Proposal pep8.org/#fn1
EuroPython 2018 Kamila Stępniowska, CC-BY
For Beginners
PEP 20 - The Zen of Python
python.org/dev/peps/pep-0020/
...
EuroPython 2018 Kamila Stępniowska, CC-BY
Resources: For Beginners
Python, Programming, Open Source
Knowledge: python.org -> Beginner’s Guide
for Non-Programmers https://bit.ly/1Iv5glG
for Programmers https://bit.ly/1UIBJMJ
How to learn: Lynn Rooth “Sink or swim”
http://www.roguelynn.com/words/The-New-Coder-A-path-to-Software-Engineering/
EuroPython 2018 Kamila Stępniowska, CC-BY
xkcd
CC BY-NC 2.5
https://xkcd.com/1838/
EuroPython 2018 Kamila Stępniowska, CC-BY
Data Science
EuroPython 2018 Kamila Stępniowska, CC-BY
Use of Python in Data Science
Python Developer - Survey 2017 Results
(9,500 developers, 150 countries)
“What do you use Python for?
(multiple answers)”
50% Data analysis, 31% Machine learning
jetbrains.com/research/python-developers-survey-2017/
EuroPython 2018 Kamila Stępniowska, CC-BY
jetbrains.com/research/python-developers-survey-2017/
EuroPython 2018 Kamila Stępniowska, CC-BY
Python & Data Science - what’s more?
Jupiter Notebook jupyter.org/
PyCharm jetbrains.com/pycharm/
Spyder pythonhosted.org/spyder/
EuroPython 2018 Kamila Stępniowska, CC-BY
Use Python to build your tools to explore data
You need to know Python to be able to freely
build experiments.
EuroPython 2018 Kamila Stępniowska, CC-BY
“Data”
Gathering, cleaning and data preparation is crucial.
Typical issues:
- there is not enough data
- data is messy
- we actually don’t know what is in the data set...
Gill Press, Forbes https://bit.ly/2OgNM4D
EuroPython 2018 Kamila Stępniowska, CC-BY
Data preparation is
even 80% of a
Data Scientist Work
“Science” - on the hunt for the right questions
● Understand what I want to achieve
● Define the problem that I want to solve
● Define what is the input and what I want to be an output
● Looking for helpful algorithms
● Compare the chosen algorithms
● Choose the algorithm/s to be used
● Choose the evaluation metrics
● Choose parameters set for experiments
● Run experiments
● Analyse the results
● Define the conclusions and/or get back to previous points
Anna Gut, Python Developer & Team Lead @10Clouds
EuroPython 2018 Kamila Stępniowska, CC-BY
UNDERSTAND
SEARCH
EXPERIMENT
How to find the right algorithm?
The Internet…
- the resource - do you define the source as trusted? (eg. scikit-learn)
- number of stars, forks, when was the last commit? (GitHub)
- the code
- is it aligned with the Python standards? (PEP 8)
- check the particular functions
- ...
- does it fit to the general architecture of a project?
- ask a friend
Anna Gut, Python Developer & Team Lead @10Clouds
EuroPython 2018 Kamila Stępniowska, CC-BY
Hacks - what was your steps & how did you get there
Step is a wrapper over the transformer and handles multiple aspects of the
execution of the pipeline, such as saving intermediate results (if needed),
checkpointing the model during training and more.
Transformer is purely computational, data scientist-defined piece that takes an
input data and produces some output data. Typical Transformers are neural
network, machine learning algorithms and pre- or post-processing routines.
github.com/neptune-ml/steppy
EuroPython 2018 Kamila Stępniowska, CC-BY
Resources
Data Science, Open Source
All… datasciencemasters.org/
Transformation from Math & Phys into Data Science:
p.migdal.pl/2016/03/15/data-science-intro-for-math-phys-background.html
EuroPython 2018 Kamila Stępniowska, CC-BY
Projects
Cooperation
Contribution
EuroPython 2018 Kamila Stępniowska, CC-BY
Projects
Find your project
- newcoder.io/tutorials/
- www.kaggle.com/
- devmesh.intel.com/
EuroPython 2018 Kamila Stępniowska, CC-BY
POSSIBLE?
DRIVING
ME?
Cooperation
Online:
- pyslackers.com (14,757 members)
- mail.python.org/mailman/listinfo/tutor
- https://www.facebook.com/groups/python.programmers
Offline:
- PyData, PyWaw
- PyLadies, Girl Geek, Geek Girls Carrots (Krakow)
- Django Carrots, Django Girls
EuroPython 2018 Kamila Stępniowska, CC-BY
Contribution
- Bag Tracker bugs.python.org
- Open Source Projects opensource.guide/how-to-contribute
- Answer questions at pyslackers.com
- Become a speaker/mentor pydata.org
- Organize Django Girls djangogirls.org/organize
- ...
EuroPython 2018 Kamila Stępniowska, CC-BY
Even More Resources...
EuroPython 2018 Kamila Stępniowska, CC-BY
Open Education
jose.theoj.org
EuroPython 2018 Kamila Stępniowska, CC-BY
Thank you!
EuroPython 2018 Kamila Stępniowska, CC-BY
github.com/KStepniowska/EuroPython2018
kamila.stepniowska@10clouds.com
@kstepniowska
Ad

Recommended

Python and BIG Data analytics | Python Fundamentals | Python Architecture
Python and BIG Data analytics | Python Fundamentals | Python Architecture
Skillspeed
 
Python in Data Science Work
Python in Data Science Work
Rick. Bahague
 
Python for Big Data Analytics
Python for Big Data Analytics
Edureka!
 
Collaborations in the Extreme: 
The rise of open code development in the scie...
Collaborations in the Extreme: 
The rise of open code development in the scie...
Kelle Cruz
 
Power of Python with Big Data
Power of Python with Big Data
Edureka!
 
Scikit-learn: the state of the union 2016
Scikit-learn: the state of the union 2016
Gael Varoquaux
 
Big Data com Python
Big Data com Python
Marcel Caraciolo
 
Turbocharge your data science with python and r
Turbocharge your data science with python and r
Kelli-Jean Chun
 
Introduction to Python Syntax and Semantics
Introduction to Python Syntax and Semantics
Adam Cook
 
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
USDSI
 
The-Power-of-Python-in-Computer-Science.pptx
The-Power-of-Python-in-Computer-Science.pptx
Attitude Tally Academy
 
PyData: Past, Present Future (PyData SV 2014 Keynote)
PyData: Past, Present Future (PyData SV 2014 Keynote)
Peter Wang
 
Introduction to python
Introduction to python
Rajesh Rajamani
 
What Is The Future of Data Science With Python?
What Is The Future of Data Science With Python?
SofiaCarter4
 
Programming for data science in python
Programming for data science in python
UmmeSalmaM1
 
S2-Programming_with_Data_Computational_Physics.pdf
S2-Programming_with_Data_Computational_Physics.pdf
CARLOSANDRESVIDALBET
 
Python Essentials For Dummies John C Shovic Alan Simpson
Python Essentials For Dummies John C Shovic Alan Simpson
cundyfema
 
python-programming-3-books-in-ryan-turner_compress.pdf
python-programming-3-books-in-ryan-turner_compress.pdf
Ahmed Attyub
 
Introduction to the intermediate Python - v1.1
Introduction to the intermediate Python - v1.1
Andrei KUCHARAVY
 
Using_python_webdevolopment_datascience.pdf
Using_python_webdevolopment_datascience.pdf
Sudipta Bhattacharya
 
Exploring and Using the Python Ecosystem
Exploring and Using the Python Ecosystem
Adam Cook
 
Python-data-science.pptx
Python-data-science.pptx
KabileshCm
 
Python Developer Roadmap 2023
Python Developer Roadmap 2023
Simplilearn
 
python-for-advanced-data-science-techniques-and-best-practices-20240911071850...
python-for-advanced-data-science-techniques-and-best-practices-20240911071850...
abhishekprasadabhima
 
Data science presentation
Data science presentation
MSDEVMTL
 
Why choose python for data science_.pptx
Why choose python for data science_.pptx
Avinash Sharma
 
Python in Industry
Python in Industry
Dharmit Shah
 
pycon-2015-liza-daly
pycon-2015-liza-daly
Liza Daly
 
University of Ghana Cracks Down on Misconduct: Over 100 Students Sanctioned
University of Ghana Cracks Down on Misconduct: Over 100 Students Sanctioned
Kweku Zurek
 
Birnagar High School Platinum Jubilee Quiz.pptx
Birnagar High School Platinum Jubilee Quiz.pptx
Sourav Kr Podder
 

More Related Content

Similar to How You Can Use Open Source Materials to Learn Python & Data Science - EuroPython 2018 (20)

Introduction to Python Syntax and Semantics
Introduction to Python Syntax and Semantics
Adam Cook
 
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
USDSI
 
The-Power-of-Python-in-Computer-Science.pptx
The-Power-of-Python-in-Computer-Science.pptx
Attitude Tally Academy
 
PyData: Past, Present Future (PyData SV 2014 Keynote)
PyData: Past, Present Future (PyData SV 2014 Keynote)
Peter Wang
 
Introduction to python
Introduction to python
Rajesh Rajamani
 
What Is The Future of Data Science With Python?
What Is The Future of Data Science With Python?
SofiaCarter4
 
Programming for data science in python
Programming for data science in python
UmmeSalmaM1
 
S2-Programming_with_Data_Computational_Physics.pdf
S2-Programming_with_Data_Computational_Physics.pdf
CARLOSANDRESVIDALBET
 
Python Essentials For Dummies John C Shovic Alan Simpson
Python Essentials For Dummies John C Shovic Alan Simpson
cundyfema
 
python-programming-3-books-in-ryan-turner_compress.pdf
python-programming-3-books-in-ryan-turner_compress.pdf
Ahmed Attyub
 
Introduction to the intermediate Python - v1.1
Introduction to the intermediate Python - v1.1
Andrei KUCHARAVY
 
Using_python_webdevolopment_datascience.pdf
Using_python_webdevolopment_datascience.pdf
Sudipta Bhattacharya
 
Exploring and Using the Python Ecosystem
Exploring and Using the Python Ecosystem
Adam Cook
 
Python-data-science.pptx
Python-data-science.pptx
KabileshCm
 
Python Developer Roadmap 2023
Python Developer Roadmap 2023
Simplilearn
 
python-for-advanced-data-science-techniques-and-best-practices-20240911071850...
python-for-advanced-data-science-techniques-and-best-practices-20240911071850...
abhishekprasadabhima
 
Data science presentation
Data science presentation
MSDEVMTL
 
Why choose python for data science_.pptx
Why choose python for data science_.pptx
Avinash Sharma
 
Python in Industry
Python in Industry
Dharmit Shah
 
pycon-2015-liza-daly
pycon-2015-liza-daly
Liza Daly
 
Introduction to Python Syntax and Semantics
Introduction to Python Syntax and Semantics
Adam Cook
 
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
PYTHON FOR DATA SCIENCE- EXPLAINED IN 6 EASY STEPS
USDSI
 
The-Power-of-Python-in-Computer-Science.pptx
The-Power-of-Python-in-Computer-Science.pptx
Attitude Tally Academy
 
PyData: Past, Present Future (PyData SV 2014 Keynote)
PyData: Past, Present Future (PyData SV 2014 Keynote)
Peter Wang
 
What Is The Future of Data Science With Python?
What Is The Future of Data Science With Python?
SofiaCarter4
 
Programming for data science in python
Programming for data science in python
UmmeSalmaM1
 
S2-Programming_with_Data_Computational_Physics.pdf
S2-Programming_with_Data_Computational_Physics.pdf
CARLOSANDRESVIDALBET
 
Python Essentials For Dummies John C Shovic Alan Simpson
Python Essentials For Dummies John C Shovic Alan Simpson
cundyfema
 
python-programming-3-books-in-ryan-turner_compress.pdf
python-programming-3-books-in-ryan-turner_compress.pdf
Ahmed Attyub
 
Introduction to the intermediate Python - v1.1
Introduction to the intermediate Python - v1.1
Andrei KUCHARAVY
 
Using_python_webdevolopment_datascience.pdf
Using_python_webdevolopment_datascience.pdf
Sudipta Bhattacharya
 
Exploring and Using the Python Ecosystem
Exploring and Using the Python Ecosystem
Adam Cook
 
Python-data-science.pptx
Python-data-science.pptx
KabileshCm
 
Python Developer Roadmap 2023
Python Developer Roadmap 2023
Simplilearn
 
python-for-advanced-data-science-techniques-and-best-practices-20240911071850...
python-for-advanced-data-science-techniques-and-best-practices-20240911071850...
abhishekprasadabhima
 
Data science presentation
Data science presentation
MSDEVMTL
 
Why choose python for data science_.pptx
Why choose python for data science_.pptx
Avinash Sharma
 
Python in Industry
Python in Industry
Dharmit Shah
 
pycon-2015-liza-daly
pycon-2015-liza-daly
Liza Daly
 

Recently uploaded (20)

University of Ghana Cracks Down on Misconduct: Over 100 Students Sanctioned
University of Ghana Cracks Down on Misconduct: Over 100 Students Sanctioned
Kweku Zurek
 
Birnagar High School Platinum Jubilee Quiz.pptx
Birnagar High School Platinum Jubilee Quiz.pptx
Sourav Kr Podder
 
List View Components in Odoo 18 - Odoo Slides
List View Components in Odoo 18 - Odoo Slides
Celine George
 
ENGLISH_Q1_W1 PowerPoint grade 3 quarter 1 week 1
ENGLISH_Q1_W1 PowerPoint grade 3 quarter 1 week 1
jutaydeonne
 
SCHIZOPHRENIA OTHER PSYCHOTIC DISORDER LIKE Persistent delusion/Capgras syndr...
SCHIZOPHRENIA OTHER PSYCHOTIC DISORDER LIKE Persistent delusion/Capgras syndr...
parmarjuli1412
 
June 2025 Progress Update With Board Call_In process.pptx
June 2025 Progress Update With Board Call_In process.pptx
International Society of Service Innovation Professionals
 
Hurricane Helene Application Documents Checklists
Hurricane Helene Application Documents Checklists
Mebane Rash
 
Code Profiling in Odoo 18 - Odoo 18 Slides
Code Profiling in Odoo 18 - Odoo 18 Slides
Celine George
 
INDUCTIVE EFFECT slide for first prof pharamacy students
INDUCTIVE EFFECT slide for first prof pharamacy students
SHABNAM FAIZ
 
ENGLISH-5 Q1 Lesson 1.pptx - Story Elements
ENGLISH-5 Q1 Lesson 1.pptx - Story Elements
Mayvel Nadal
 
Tanja Vujicic - PISA for Schools contact Info
Tanja Vujicic - PISA for Schools contact Info
EduSkills OECD
 
Aprendendo Arquitetura Framework Salesforce - Dia 02
Aprendendo Arquitetura Framework Salesforce - Dia 02
Mauricio Alexandre Silva
 
Public Health For The 21st Century 1st Edition Judy Orme Jane Powell
Public Health For The 21st Century 1st Edition Judy Orme Jane Powell
trjnesjnqg7801
 
THE PSYCHOANALYTIC OF THE BLACK CAT BY EDGAR ALLAN POE (1).pdf
THE PSYCHOANALYTIC OF THE BLACK CAT BY EDGAR ALLAN POE (1).pdf
nabilahk908
 
LDMMIA Shop & Student News Summer Solstice 25
LDMMIA Shop & Student News Summer Solstice 25
LDM & Mia eStudios
 
Peer Teaching Observations During School Internship
Peer Teaching Observations During School Internship
AjayaMohanty7
 
Pests of Maize: An comprehensive overview.pptx
Pests of Maize: An comprehensive overview.pptx
Arshad Shaikh
 
Gladiolous Cultivation practices by AKL.pdf
Gladiolous Cultivation practices by AKL.pdf
kushallamichhame
 
Paper 106 | Ambition and Corruption: A Comparative Analysis of ‘The Great Gat...
Paper 106 | Ambition and Corruption: A Comparative Analysis of ‘The Great Gat...
Rajdeep Bavaliya
 
IIT KGP Quiz Week 2024 Sports Quiz (Prelims + Finals)
IIT KGP Quiz Week 2024 Sports Quiz (Prelims + Finals)
IIT Kharagpur Quiz Club
 
University of Ghana Cracks Down on Misconduct: Over 100 Students Sanctioned
University of Ghana Cracks Down on Misconduct: Over 100 Students Sanctioned
Kweku Zurek
 
Birnagar High School Platinum Jubilee Quiz.pptx
Birnagar High School Platinum Jubilee Quiz.pptx
Sourav Kr Podder
 
List View Components in Odoo 18 - Odoo Slides
List View Components in Odoo 18 - Odoo Slides
Celine George
 
ENGLISH_Q1_W1 PowerPoint grade 3 quarter 1 week 1
ENGLISH_Q1_W1 PowerPoint grade 3 quarter 1 week 1
jutaydeonne
 
SCHIZOPHRENIA OTHER PSYCHOTIC DISORDER LIKE Persistent delusion/Capgras syndr...
SCHIZOPHRENIA OTHER PSYCHOTIC DISORDER LIKE Persistent delusion/Capgras syndr...
parmarjuli1412
 
Hurricane Helene Application Documents Checklists
Hurricane Helene Application Documents Checklists
Mebane Rash
 
Code Profiling in Odoo 18 - Odoo 18 Slides
Code Profiling in Odoo 18 - Odoo 18 Slides
Celine George
 
INDUCTIVE EFFECT slide for first prof pharamacy students
INDUCTIVE EFFECT slide for first prof pharamacy students
SHABNAM FAIZ
 
ENGLISH-5 Q1 Lesson 1.pptx - Story Elements
ENGLISH-5 Q1 Lesson 1.pptx - Story Elements
Mayvel Nadal
 
Tanja Vujicic - PISA for Schools contact Info
Tanja Vujicic - PISA for Schools contact Info
EduSkills OECD
 
Aprendendo Arquitetura Framework Salesforce - Dia 02
Aprendendo Arquitetura Framework Salesforce - Dia 02
Mauricio Alexandre Silva
 
Public Health For The 21st Century 1st Edition Judy Orme Jane Powell
Public Health For The 21st Century 1st Edition Judy Orme Jane Powell
trjnesjnqg7801
 
THE PSYCHOANALYTIC OF THE BLACK CAT BY EDGAR ALLAN POE (1).pdf
THE PSYCHOANALYTIC OF THE BLACK CAT BY EDGAR ALLAN POE (1).pdf
nabilahk908
 
LDMMIA Shop & Student News Summer Solstice 25
LDMMIA Shop & Student News Summer Solstice 25
LDM & Mia eStudios
 
Peer Teaching Observations During School Internship
Peer Teaching Observations During School Internship
AjayaMohanty7
 
Pests of Maize: An comprehensive overview.pptx
Pests of Maize: An comprehensive overview.pptx
Arshad Shaikh
 
Gladiolous Cultivation practices by AKL.pdf
Gladiolous Cultivation practices by AKL.pdf
kushallamichhame
 
Paper 106 | Ambition and Corruption: A Comparative Analysis of ‘The Great Gat...
Paper 106 | Ambition and Corruption: A Comparative Analysis of ‘The Great Gat...
Rajdeep Bavaliya
 
IIT KGP Quiz Week 2024 Sports Quiz (Prelims + Finals)
IIT KGP Quiz Week 2024 Sports Quiz (Prelims + Finals)
IIT Kharagpur Quiz Club
 
Ad

How You Can Use Open Source Materials to Learn Python & Data Science - EuroPython 2018

  • 1. How You Can Use Open Source Materials to Learn Python & Data Science Kamila Stępniowska, EuroPython 2018 github.com/KStepniowska/EuroPython2018 CC-BY
  • 2. What can you expect? - Sociology - Diversity: Geek Girls Carrots, Women Who Code, She’s Coding - New Business Manager @10Clouds Please don’t ask me about: 1. How can you become a data scientist in 3 weeks? 2. Which algorithm will solve an “A” or “B” problem? EuroPython 2018 Kamila Stępniowska, CC-BY
  • 3. I hope that you will learn more about... RESOURCES - Python and Data Science github.com/KStepniowska/EuroPython2018 Open Source - basics Data Science Workflow PROJECTS & COOPERATION & CONTRIBUTION EuroPython 2018 Kamila Stępniowska, CC-BY
  • 4. Shall we? EuroPython 2018 Kamila Stępniowska, CC-BY Fernando José Ignacio Gárate Parra https://bit.ly/2A5MoOW CC BY-NC 2.0
  • 5. Open Source EuroPython 2018 Kamila Stępniowska, CC-BY
  • 6. “Open data and content can be freely used, modified, and shared by anyone for any purpose” https://opendefinition.org/ EuroPython 2018 Kamila Stępniowska, CC-BY
  • 7. Educational Materials Text, Pictures, Videos, Audio Records... https://creativecommons.org/licenses/ EuroPython 2018 Kamila Stępniowska, CC-BY As a User
  • 9. Pick yours General: choosealicense.com Text: creativecommons.org/licenses/ Code: opensource.org/licenses EuroPython 2018 Kamila Stępniowska, CC-BY As a Creator
  • 10. Python EuroPython 2018 Kamila Stępniowska, CC-BY
  • 11. Why Python? COMMUNITY Welcoming & Supportive Global & Diverse ... If there is a problem, there is a great chance that someone has written and shared the solution already. EuroPython 2018 Kamila Stępniowska, CC-BY
  • 12. Learning Experience Find Your Project -> learn by building Find Your People -> Cooperate Find a way to Contribute -> help others https://bugs.python.org/ EuroPython 2018 Kamila Stępniowska, CC-BY
  • 13. For Beginners PEP 8 python.org/dev/peps/pep-0008/ “PEP 8 — the Style Guide for Python Code This stylized presentation of the well-established PEP 8 was created by Kenneth Reitz (for humans).” *PEP=Python Enhancement Proposal pep8.org/#fn1 EuroPython 2018 Kamila Stępniowska, CC-BY
  • 14. For Beginners PEP 20 - The Zen of Python python.org/dev/peps/pep-0020/ ... EuroPython 2018 Kamila Stępniowska, CC-BY
  • 15. Resources: For Beginners Python, Programming, Open Source Knowledge: python.org -> Beginner’s Guide for Non-Programmers https://bit.ly/1Iv5glG for Programmers https://bit.ly/1UIBJMJ How to learn: Lynn Rooth “Sink or swim” http://www.roguelynn.com/words/The-New-Coder-A-path-to-Software-Engineering/ EuroPython 2018 Kamila Stępniowska, CC-BY
  • 17. Data Science EuroPython 2018 Kamila Stępniowska, CC-BY
  • 18. Use of Python in Data Science Python Developer - Survey 2017 Results (9,500 developers, 150 countries) “What do you use Python for? (multiple answers)” 50% Data analysis, 31% Machine learning jetbrains.com/research/python-developers-survey-2017/ EuroPython 2018 Kamila Stępniowska, CC-BY
  • 20. Python & Data Science - what’s more? Jupiter Notebook jupyter.org/ PyCharm jetbrains.com/pycharm/ Spyder pythonhosted.org/spyder/ EuroPython 2018 Kamila Stępniowska, CC-BY
  • 21. Use Python to build your tools to explore data You need to know Python to be able to freely build experiments. EuroPython 2018 Kamila Stępniowska, CC-BY
  • 22. “Data” Gathering, cleaning and data preparation is crucial. Typical issues: - there is not enough data - data is messy - we actually don’t know what is in the data set... Gill Press, Forbes https://bit.ly/2OgNM4D EuroPython 2018 Kamila Stępniowska, CC-BY Data preparation is even 80% of a Data Scientist Work
  • 23. “Science” - on the hunt for the right questions ● Understand what I want to achieve ● Define the problem that I want to solve ● Define what is the input and what I want to be an output ● Looking for helpful algorithms ● Compare the chosen algorithms ● Choose the algorithm/s to be used ● Choose the evaluation metrics ● Choose parameters set for experiments ● Run experiments ● Analyse the results ● Define the conclusions and/or get back to previous points Anna Gut, Python Developer & Team Lead @10Clouds EuroPython 2018 Kamila Stępniowska, CC-BY UNDERSTAND SEARCH EXPERIMENT
  • 24. How to find the right algorithm? The Internet… - the resource - do you define the source as trusted? (eg. scikit-learn) - number of stars, forks, when was the last commit? (GitHub) - the code - is it aligned with the Python standards? (PEP 8) - check the particular functions - ... - does it fit to the general architecture of a project? - ask a friend Anna Gut, Python Developer & Team Lead @10Clouds EuroPython 2018 Kamila Stępniowska, CC-BY
  • 25. Hacks - what was your steps & how did you get there Step is a wrapper over the transformer and handles multiple aspects of the execution of the pipeline, such as saving intermediate results (if needed), checkpointing the model during training and more. Transformer is purely computational, data scientist-defined piece that takes an input data and produces some output data. Typical Transformers are neural network, machine learning algorithms and pre- or post-processing routines. github.com/neptune-ml/steppy EuroPython 2018 Kamila Stępniowska, CC-BY
  • 26. Resources Data Science, Open Source All… datasciencemasters.org/ Transformation from Math & Phys into Data Science: p.migdal.pl/2016/03/15/data-science-intro-for-math-phys-background.html EuroPython 2018 Kamila Stępniowska, CC-BY
  • 28. Projects Find your project - newcoder.io/tutorials/ - www.kaggle.com/ - devmesh.intel.com/ EuroPython 2018 Kamila Stępniowska, CC-BY POSSIBLE? DRIVING ME?
  • 29. Cooperation Online: - pyslackers.com (14,757 members) - mail.python.org/mailman/listinfo/tutor - https://www.facebook.com/groups/python.programmers Offline: - PyData, PyWaw - PyLadies, Girl Geek, Geek Girls Carrots (Krakow) - Django Carrots, Django Girls EuroPython 2018 Kamila Stępniowska, CC-BY
  • 30. Contribution - Bag Tracker bugs.python.org - Open Source Projects opensource.guide/how-to-contribute - Answer questions at pyslackers.com - Become a speaker/mentor pydata.org - Organize Django Girls djangogirls.org/organize - ... EuroPython 2018 Kamila Stępniowska, CC-BY
  • 31. Even More Resources... EuroPython 2018 Kamila Stępniowska, CC-BY
  • 32. Open Education jose.theoj.org EuroPython 2018 Kamila Stępniowska, CC-BY
  • 33. Thank you! EuroPython 2018 Kamila Stępniowska, CC-BY github.com/KStepniowska/EuroPython2018 [email protected] @kstepniowska