Skip to content

Issue #27/ingest drug synonyms#58

Merged
007vasy merged 13 commits intomasterfrom
Issue-#27/IngestDrugSynonyms
Apr 17, 2020
Merged

Issue #27/ingest drug synonyms#58
007vasy merged 13 commits intomasterfrom
Issue-#27/IngestDrugSynonyms

Conversation

@007vasy
Copy link
Copy Markdown
Collaborator

@007vasy 007vasy commented Apr 10, 2020

The code is working e2e, the big questions is what is the best way to include the neo4j class created in this issue into the master one.

After that the URL parsing and enrichment could be done easily.

@007vasy 007vasy requested review from bechbd and lmeyerov April 10, 2020 10:33
@007vasy 007vasy linked an issue Apr 10, 2020 that may be closed by this pull request
bechbd
bechbd previously approved these changes Apr 10, 2020
Comment thread modules/Neo4jDataAccess.py Outdated
Comment thread modules/Neo4jDataAccess.py Outdated
Copy link
Copy Markdown
Contributor

@lmeyerov lmeyerov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. did you mean to rm .gitignore ?

  2. is this creating a :Url node with all props? (see slack conv..)

property_value = '''"''' + property_value.replace('"',r"\"") + '''"'''
elif not property_value:
property_value = ""
property_value = "''"
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@bechbd not sure of best practices here wrt neo4j

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this is only if the property value is None

if count_node > prev_count_node + 1000:
prev_count_node = count_node
logger.info("> {} nodes already imported".format(count_node))
logger.info("> {} nodes already merged".format(count_node))
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

non-blocker, but generally better to do
logger.info("blah %s blah", myvar) so when in production mode, string interpolation methods don't execute ( => faster), esp. good when in data loops like this

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do you want me to update it right now?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Up to you. Fine merging as-is, I just like being helpful in code reviews as a mutual learning process :)

@lmeyerov lmeyerov self-requested a review April 13, 2020 02:04
Copy link
Copy Markdown
Contributor

@lmeyerov lmeyerov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IMO fine to land vs. drift

From here, we should work w/ @bechbd to get into actual neo4j then w/ @ZiyaoWei as always-on prefect flow, and we can test/tweak further as needed through that

@007vasy 007vasy merged commit d725d65 into master Apr 17, 2020
@007vasy 007vasy deleted the Issue-#27/IngestDrugSynonyms branch April 17, 2020 00:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

3 participants