Skip to content

Scala to Python - sparkSql folder #2

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 6 commits into from
Sep 30, 2017
Merged

Conversation

pedromb
Copy link
Collaborator

@pedromb pedromb commented Sep 30, 2017

  • Small change to regex on commons/Utils.py

  • Converted the scala files on the sparkSql folder to python

  • Ignored sparkSql/TypedDataset.scala

  • Changed sparkSql/RddDatasetConversion to sparkSql/RddDataframeConversion

@@ -1,5 +1,5 @@
import re

class Utils():

COMMA_DELIMITER = re.compile(''',(?=(?:[^'"]|'[^']*'|"[^"]*")*$)''')
Copy link
Owner

@jleetutorial jleetutorial Sep 30, 2017

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why do we need to make this small change?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I made some changes to the original one in scala to run in python (because the languages parse regex in a different way), and among those changes I, mistakenly, made the new one not match commas within single quotations too. But this raised some problems when processing the stackoverflow survey. I just removed that.

@jleetutorial jleetutorial merged commit 9fe453e into master Sep 30, 2017
evross pushed a commit to evross/python-spark-tutorial that referenced this pull request Dec 3, 2018
* Add env/ and .vscode/ to .gitignore

* Add IAM requirements.txt

* Python Service account keys (jleetutorial#5)

Add IAM service account keys snippets

* Python Quickstart (jleetutorial#3)

Add IAM quickstart

* Python Service accounts (jleetutorial#4)

Add IAM service account snippets

* Add query grantable roles and tests (jleetutorial#2)

* Add whitespace and correct string formatting

* Add extra CR

* Lint
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants