-
Notifications
You must be signed in to change notification settings - Fork 13
Webcoderz twint patch 1 #79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
Closed
Changes from 1 commit
Commits
Show all changes
183 commits
Select commit
Hold shift + click to select a range
c83e06f
Add files via upload
webcoderz 41bf592
Update TwintPool.py
webcoderz ddba27c
delete twint source
webcoderz e164212
comment out testing func left in
webcoderz cd18fb2
adding acct write
webcoderz 099da1f
trying to fix tweet type inference
webcoderz 247cfa0
trying to fix tweet type inference
webcoderz fdff8af
get info fix
webcoderz 4ebacb6
get info fix
webcoderz f5499be
changed hydration status to partial
webcoderz 2692d17
enrich usr info fx change can enrich any df with df["user_screen_name…
webcoderz 3a827e7
enrich usr info fx change can enrich any df with df["user_screen_name…
webcoderz 575096a
enrich usr info fx change can enrich any df with df["user_screen_name…
webcoderz 1f1d8f7
enrich usr info fx change can enrich any df with df["user_screen_name…
webcoderz 40552bc
enrich usr info fx change can enrich any df with df["user_screen_name…
webcoderz 7dcf8fc
enrich usr info fx change can enrich any df with df["user_screen_name…
webcoderz 7462633
enrich usr info fx change can enrich any df with df["user_screen_name…
webcoderz b1396a4
drop dupes b4 user acct enrichment
webcoderz 984b556
drop dupes b4 user acct enrichment
webcoderz f72c1db
drop dupes b4 fix user acct enrichment and before write
webcoderz 61bcf68
drop dupes b4 fix user acct enrichment and before write
webcoderz 4577311
usr info enrichment fx improvement
webcoderz 8f2eead
usr info enrichment fx improvement
webcoderz 7b7d4a8
usr info enrichment fx improvement
webcoderz 879b4d5
usr info enrichment fx improvement
webcoderz e27ec6d
usr info enrichment fx improvement
webcoderz 16b07b1
twint egg to be installed via pip instead of twint directly from pip …
webcoderz ac7e91f
removed twint get just use fh.search_time_range
webcoderz b41dd05
removed twint get just use fh.search_time_range
webcoderz 43f7736
removed twint get just use fh.search_time_range
webcoderz daf0fb8
added scale and changed version of nonrapids compose to 2
webcoderz 0634538
added scale and changed version of nonrapids compose to 2
webcoderz ec37636
added scale and changed version of nonrapids compose to 2
webcoderz e755491
added replicas and changed version of nonrapids compose to 3
webcoderz 14f4cc9
uncommenting out batch to improve on periodic write frequency
webcoderz 568c82a
recommenting out batch to improve on periodic write frequency
webcoderz 48735df
Merge branch 'master' into webcoderz-twint-patch-1
webcoderz bc28029
recommenting out batch to improve on periodic write frequency
webcoderz 79e0f3c
removed container name from prefect agent for scaling
webcoderz 464acd8
added jobs container to nonrapids-docker-compose.yml
webcoderz 6c71492
added jobs container to nonrapids-docker-compose.yml
webcoderz 2889dc3
added jobs container to nonrapids-docker-compose.yml
webcoderz 5416a68
added jobs container to nonrapids-docker-compose.yml
webcoderz 2ff4507
added jobs container to nonrapids-docker-compose.yml
webcoderz 6e4ffee
added jobs container to nonrapids-docker-compose.yml
webcoderz 5ae817f
refining jobs container
webcoderz d55e7e0
refining jobs container
webcoderz 04e34bd
refining jobs container
webcoderz 821d73d
fixing twintpool import
webcoderz aa2c6b2
fixing twintpool import
webcoderz 80b7b0b
fixing jobs container import
webcoderz 7ea96dc
logger type error fix
webcoderz 6fe89aa
logger type error fix
webcoderz e888f63
added datastream-docker-compose.yml
webcoderz 9f38550
command to kick job off on compose
webcoderz d2cb695
command to kick job off on compose
webcoderz 55b2cab
fix fh debug
webcoderz 92fe851
fix fh debug
webcoderz eb6af3b
change dir for check hydrate creds
webcoderz 559ac27
fixed nonrapids compose template
webcoderz 6743fa1
async loop not needed on job in container
webcoderz cb81014
changed agent job run interval to 10 seconds
webcoderz 1d1ecb4
renamed job dockerfile to datastream-Dockerfile for uniformity
webcoderz c6faf8f
changing logging.info to logging.debug to minimize log production
webcoderz f2d4eb4
added relationship writer
webcoderz 056a144
added relationship writer
webcoderz 3837d77
fix
webcoderz 4e15e20
fix
webcoderz 7161af1
writer fix- debug still says its writing relationships yet, not regis…
webcoderz 54e375e
writer fix- debug still says its writing relationships yet, not regis…
webcoderz d49b6d2
writer fix- debug still says its writing relationships yet, not regis…
webcoderz c065688
added tor node into agent
webcoderz 2feb3c3
twint import fix
webcoderz ac5c011
datastream tor fix to not expose 9050 outside container
webcoderz d99a3ea
fix
webcoderz d78a942
change directory check hydrate checks for creds
webcoderz c8784ba
removed tor dir just put dockerfile with rest
webcoderz 077e8da
removed tor dir just put dockerfile with rest
webcoderz ef66b13
fixing unused continuation in dockerfile
webcoderz 3269790
adding tor to datastream compose
webcoderz e00cf1a
added iptables bash script to route all container traffic through tor
webcoderz 1ffab43
tor container name to compose
webcoderz 44053c2
tor container name to compose
webcoderz b690c12
tor container name to compose
webcoderz 7bf36f9
tor container name to tor compose
webcoderz fb1adb8
tor container name to tor compose
webcoderz 64c6110
tor container fix for iptables and tor proxy
webcoderz 6e09494
ra
webcoderz 299bd92
remove restart policy for data stream compose
webcoderz a09b270
adding info logs into writer
webcoderz ebe46c4
reset_tables.sh
webcoderz eafdf9a
reset_tables.sh
webcoderz 60edc21
reset_tables.sh
webcoderz 2ea3ad7
writer related logs to info
webcoderz ab0cb04
writer related fixes
webcoderz d28eb7d
writer related df cleanups
webcoderz 68dea2e
writer related df cleanups
webcoderz 61a8d1b
writer related df cleanups
webcoderz 44b5211
fixed relationship writer, using old twarc writer.
webcoderz f813b95
, using old twarc writer.
webcoderz 993906b
, using old twarc writer.
webcoderz a44ffb9
added logs to writer
webcoderz 6e48b12
added logs to writer
webcoderz 51cd8e6
added logs to writer
webcoderz d05c3be
writer improvement
webcoderz 41a4434
writer improvement
webcoderz 6583550
writer improvement
webcoderz d209e27
writer improvement
webcoderz 97c6ca6
writer improvements cleanup
webcoderz 49b698c
added clocks to enrichment fx
webcoderz 67c55e1
aiohttp_socks
webcoderz 6abbf07
aiohttp_socks
webcoderz 1ed1d70
removing debug forloop in twintdf enrichment fx
webcoderz 1ff9b37
removing debug forloop in twintdf enrichment fx
webcoderz abe83a3
removing debug forloop in twintdf enrichment fx
webcoderz 8d5ed3a
removing debug forloop in twintdf enrichment fx
webcoderz c980ab8
increased default limit to 1000
webcoderz eed898d
added clocks to writer and each enrichment from twint to neo
webcoderz b180bdd
added clocks to twintpool and each enrichment from twint to neo
webcoderz ebbb353
added clocks to twintpool and each enrichment from twint to neo
webcoderz 8c484d7
commented out acct enrichment
webcoderz 1773635
commented out acct enrichment
webcoderz a07e6a2
logger to info
webcoderz ccb3864
logger to info
webcoderz fa17653
logger to info
webcoderz 6651382
logger to info
webcoderz 7930fc8
logger to info
webcoderz 1c5f7e6
set parameters hydrated status to PARTIAL
webcoderz 52983b3
fix(twint): container and tor setup
lmeyerov ae2eb1d
adding is_tor flag in and setting in twintpool in checkhydrate proper…
webcoderz ff45ea0
adding is_tor flag in and setting in twintpool in checkhydrate proper…
webcoderz 3c279f1
adding is_tor flag in and setting in twintpool in checkhydrate proper…
webcoderz 10b241e
adding is_tor flag in and setting in twintpool in checkhydrate proper…
webcoderz 14017fd
updating job
webcoderz 3d7c969
updating job
webcoderz 43b02d9
updating job
webcoderz 9230edc
updating job
webcoderz 88ed1ce
setting up datastream-Dockerfile for twitterscraper library
webcoderz 06e322c
setting up datastream-Dockerfile for twitterscraper library
webcoderz 9580d40
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz c36852b
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz a8ac909
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 5ecbedd
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz df9bd0b
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 2f84bcb
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 047430e
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 59d512b
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz a282d8d
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 540efe2
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 5ea76b8
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 58a66c7
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 7461321
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 332a4d9
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 431af04
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 07a7181
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 9d25744
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz c1e0eb9
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz a6983a4
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 10e600b
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz fc18003
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 0bd26b6
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 47297c5
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 84060b6
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz fad3bce
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 15463b4
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz dec74eb
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 631eb57
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 1aa8191
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 580a697
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 779cc99
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz b572c8e
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 0f0290f
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz a292cfe
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz b6a8ef3
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 2fed016
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz e862eef
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz b8e9757
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz 39bc1e9
setting up datastream-Dockerfile for twitterscraper library w/ geckod…
webcoderz badca9f
ublock extension
webcoderz b03e85b
ublock extension
webcoderz b269256
ublock extension
webcoderz 2fa3400
fix
webcoderz e4ca790
fix
webcoderz File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
tor container name to tor compose
- Loading branch information
commit 7bf36f95e54da6c0c80b7109c1dda2b9dda657cb
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,22 @@ | ||
| ######## | ||
| # | ||
| # Run from git root's parent: with .env in local folder and ProjectDomino/ inside | ||
| # | ||
| # $ touch .env | ||
| # $ sudo docker-compose -f ./ProjectDomino/infra/pipelines/docker/docker-compose.yml up -d prefect-agent | ||
| # | ||
| ######## | ||
|
|
||
| version: '3' | ||
|
|
||
| services: | ||
| ################################################################ | ||
| tor: | ||
| build: | ||
| context: ../../../ | ||
| dockerfile: ./infra/pipelines/docker/tor-Dockerfile | ||
| container_name: tor | ||
| network_mode: 'bridge' | ||
| restart: always | ||
| ports: | ||
| - 9050:9050 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should be clear about docker-compose minor version as it is tied to minimal required docker-compose version