Commit Graph

67 Commits

Author SHA1 Message Date
bsimpson63
2a05f17161 No intermediate storage step in mr_process_hour.pig.
Made possible by upgrading pig to 0.10.
2013-12-05 04:04:35 -05:00
Neil Williams
c3c2f200fc read_secrets: Don't transform INI options on output. 2013-11-19 12:43:28 -08:00
Neil Williams
3366083663 Create a vault for secret tokens and move some into it.
This is intended to reduce the number of critical secrets stored in the
INI file.  An initial subset of secrets is moved into the vault to test
things out.
2013-11-19 12:16:58 -08:00
Neil Williams
342ad24409 Factor out INI file parsing into util. 2013-11-19 12:16:58 -08:00
Chad Birch
05c06df947 Userpage gildings tab: start writing
Includes backfill script to be run before the other commit that actually
adds the tab
2013-10-22 12:43:19 -06:00
Max Goodman
aaf1e51abb write-live-config: Update to reflect PluginLoader API change. 2013-10-21 11:37:17 -07:00
Jason Harvey
96fdb4dcb5 Cleanup temp files if dying due to replication lag. 2013-10-07 16:57:18 -07:00
Chad Birch
7722a73d15 Userpage gilded tab: start writing
Includes backfill script to be run before second commit to actually add
tab is deployed.
2013-10-07 15:45:06 -06:00
Neil Williams
f2cf180597 Remove remnants of legacy stylesheet systems. 2013-10-04 11:53:47 -07:00
Neil Williams
d81b953053 Revert "upload_static_files_to_s3: Don't list the whole bucket to find changes."
This reverts commit c67d688c9da437a87e8cf2fa1f94aa32be4de29f.
2013-10-04 11:53:47 -07:00
Andre D
544e4994a5 ModAction: Add a migrate script. 2013-09-18 08:56:55 -04:00
Neil Williams
4303841341 Add ImagesByWikiPage model and start dual-writing.
This model will initially be used to transfer subreddit images (used in
the stylesheet) off to a new system, but is intended to be used for
per-wikipage images in the future as well.
2013-08-19 16:11:50 -07:00
Neil Williams
bcea327b51 manage-consumers: Allow for subtyped consumers.
This is immediately useful for commentstree_q but may also help if we
move vote_q procs onto a single box.
2013-06-19 16:12:24 -07:00
Neil Williams
2265908f20 upload_static_files_to_s3: Don't list the whole bucket to find changes.
The static files S3 bucket has been getting a lot larger recently,
between subreddit stylesheets being in there and the static file cleaner
being disabled due to a bug. This is causing the deploy process to take
upwards of 3 minutes just to determine that no files need to be uploaded
to the bucket.

As a short-term workaround, this changes the uploader to check each key
individually with an S3 HEAD request rather than listing the whole
bucket. This is slower than best case of listing the bucket, but is
significantly faster than the current condition (~25 second runtime
now).
2013-06-16 22:35:40 -07:00
Neil Williams
8af415476b Update and fix license headers for 2013.
Angst!
2013-03-19 20:03:40 -07:00
bsimpson63
c886c59cc8 Add promoted_link tools. 2013-02-27 17:16:11 -05:00
Ricky Ramirez
4aec9618e6 write_live_config: Fail correctly if there is an error. 2013-02-19 14:49:56 -05:00
Ricky Ramirez
a9b5d2a7b0 write_live_config: Exit with non-zero code on user abort. 2013-02-19 14:49:50 -05:00
Neil Williams
77efa9ab43 wrap-job: Don't cd into r2/r2/ when we just want to be in r2/.
This was causing Pygments to fail to import correctly in jobs since
r2/r2/ has a commands.py which was getting relatively imported by
accident. Yuck.
2013-02-14 14:18:47 -08:00
Neil Williams
cd88393fa6 Fix manage-consumers not correctly iterating over files. 2013-01-28 13:39:01 -08:00
Neil Williams
73f518adea fetch_trackers: Use X-Forwarded-For when behind a trusted reverse proxy. 2013-01-28 12:08:22 -08:00
Neil Williams
417e8e6bb5 fetch_trackers: Make trackers.py self-packaging for Elastic Beanstalk.
This should have no effect on actually running the script, but is useful
if you do want to use Eb or likely some other service like GAE.
2013-01-28 12:08:22 -08:00
Roger Ostrander
2ab402c41d Domainban: Use Zookeeper as a backend 2013-01-25 12:35:08 -08:00
Neil Williams
afeabf204f Move consumer count info to individual files.
This is more easily composable among multiple puppet recipes etc.
2013-01-22 14:52:31 -08:00
bsimpson63
a70a6eac62 Process traffic logs on Amazon EMR. 2013-01-22 13:58:19 -05:00
Neil Williams
3c3b9e9594 Add per-subreddit gilding listings. 2013-01-14 10:36:50 -08:00
Neil Williams
e028c35676 Add a new listing of gilded comments. 2012-12-14 12:52:36 -08:00
Jason Harvey
8b4b3961d9 Rework queries and add sanity checks. 2012-12-07 14:09:24 -08:00
Neil Williams
efa3bb4009 Add migration script for regenerating CommentParticipationByAccount CF. 2012-12-04 11:41:24 -08:00
Neil Williams
68291521b5 Move query_cache scripts to a more general "migrate" directory.
Not just for the query cache anymore!
2012-12-04 11:41:24 -08:00
Jason Harvey
d476208e38 Re-work the gen_time_listings query WHERE clause. 2012-11-15 15:22:55 -08:00
Neil Williams
e3952a5555 Add script for cleaning up the static files bucket on S3. 2012-11-08 15:04:25 -08:00
Neil Williams
8fc311487d Add a Pig-based map/reduce system for filling the new query cache. 2012-10-19 12:56:17 -07:00
maxwellhansen
ca39abfa9b upload_static: Use dictionary comprehension for readability. 2012-08-13 10:21:37 -07:00
Neil Williams
f495dad2a8 Add ZooKeeper-based dynamic configuration system.
The dynamic configuration system has two components: the app, which
reads, and the writer script. The latter is meant for use by humans, and
converts a [live_config] section of the INI file into JSON for storage
in ZooKeeper. The app will read this data on startup and place a watch
on the node to be notified, by ZooKeeper, of changes. This means that
running the writer script with new data will automatically propagate the
changes to every app very quickly, without restart.

The writer script relies on a human-entered password to authenticate
with ZooKeeper. The reddit app uses a different set of credentials
(specified in the INI file) to obtain read-only access to the
configuration data.

Also adds a new "live_config" spec to reddit and plugins. This spec is
parsed at write-time only and the parsed values are stored as JSON in
ZooKeeper.
2012-08-02 17:14:36 -07:00
Neil Williams
914b949286 Update / add license headers.
I'm actually quite excited to do this.
2012-06-19 14:02:42 -07:00
Neil Williams
165cd3c62e Delete old scripts, daemontools run files, and configs.
NOTE: if you're using an old-style install with symlinks to any of these
files, please make sure to copy them out of the repo before merging this
commit.
2012-06-18 15:13:56 -07:00
Neil Williams
da3bd2ee69 Get rid of problematic buffering in job wrapper. 2012-05-31 15:57:57 -07:00
Neil Williams
533e695d55 Tweak wrap-job to support queue consumer naming conventions. 2012-05-31 15:57:57 -07:00
Neil Williams
5c80560e51 Add script for starting/restarting queue consumer jobs. 2012-05-31 15:57:57 -07:00
Neil Williams
1c1863e464 Add script to wrap cron jobs. 2012-05-29 12:15:59 -07:00
Keith Mitchell
c209cc2f32 Switch to cloudsearch as primary search provider
Also, remove unused indextank files
2012-05-15 08:21:01 -07:00
Neil Williams
ab6be6c95f Make static file uploader use custom ini file for S3 credentials. 2012-04-17 16:00:55 -07:00
Keith Mitchell
3ff6d3a66c Avoid iri_to_uri in flask redirect 2012-02-09 11:54:24 -08:00
Neil Williams
d48284c4cf Make mr_top jobs more portable. 2011-12-15 22:16:26 -08:00
David King
a39f0bcc8f Update cron scripts to reflect reality.
Still out of date, but less so.
2011-12-15 22:16:26 -08:00
Neil Williams
10d19f3140 Make S3 uploader aware of bizarre .gzip extension. 2011-11-29 16:23:28 -08:00
Neil Williams
ab243a7bcd upload_static: Put the unmangled files on S3 as well. 2011-11-15 15:03:59 -08:00
Neil Williams
af61995c48 Add script for uploading static files to S3. 2011-11-14 20:45:01 -08:00
Neil Williams
93a65a6a3a Move onload out of the app.
We need onload to be able to get a client-IP based
tracking validation hash, but it doesn't actually need
to do any database work to make those hashes. It also
fails a huge number of times per day. This moves the hash
calculation out to an external app with no db dependencies.
2011-10-06 08:22:16 -07:00