5.2. Building from source

5.2.1. Clone the Socorro repository

If you haven’t already, you’ll need to clone the Socorro git repository:

git clone git://github.com/mozilla/socorro.git
cd socorro

5.2.2. Setting up environment

Socorro can install python dependencies into a virtualenv for you. You only need to run this once:

export PATH=$PATH:/usr/pgsql-9.3/bin/
make dev

Before running any Socorro components, always make sure that the virtualenv is activated:

. socorro-virtualenv/bin/activate

5.2.3. Add a new superuser account to PostgreSQL

Create a superuser account for yourself. Make sure to put your username and desired password instead of YOURNAME and YOURPASS. As the postgres user:

psql template1 -c \
  "create user YOURNAME with encrypted password 'YOURPASS' superuser"

For running unit tests, you’ll want a test user as well (make sure to remove this for production installs).

psql template1 -c \
  "create user test with encrypted password 'aPassword' superuser"

5.2.4. Configure migrations (Alembic)

Also, before you run unit tests or make, be sure to copy and edit this file:

cp config/alembic.ini-dist config/alembic.ini
vi config/alembic.ini

The important line to update is sqlalchemy.url, it should be changed to match the superuser account you created above:

sqlalchemy.url = postgresql://YOURNAME:YOURPASS@localhost/socorro_migration_test

5.2.5. Run unit/functional tests

From inside the Socorro checkout

make test database_username=test database_password=aPassword

5.2.6. Populate PostgreSQL Database

Load Socorro schema plus test products:

socorro setupdb --database_name=breakpad --fakedata --createdb

5.2.7. Create partitioned tables

Normally this is handled automatically by the cronjob scheduler crontabber but can be run as a one-off:

python socorro/cron/crontabber_app.py --job=weekly-reports-partitions --force

5.2.8. Populate Elasticsearch database


See the chapter about ElasticSearch for more information.

First you need to setup your Elasticsearch database:

cd scripts && python ./setup_supersearch_app.py

Then, and once you have populated your PostgreSQL database with “fake data”, you can migrate that data into Elasticsearch:

python socorro/external/postgresql/crash_migration_app.py

5.2.9. Sync Django database

Django needs to write its ORM tables:

export SECRET_KEY="..."
cd webapp-django
./manage.py migrate auth
./manage.py migrate

5.2.10. Run Socorro in dev mode

Copy default config files

cp config/alembic.ini-dist config/alembic.ini
cp config/collector.ini-dist config/collector.ini
cp config/processor.ini-dist config/processor.ini
cp config/middleware.ini-dist config/middleware.ini

You may need to edit these config files - for example collector (which is generally a public service) might need listen on the correct IP address.

In particular, for login to work you want to modify the following either as environment variables or in a ./webapp-django/.env file:


Run Socorro services using Honcho (configured in Procfile)

honcho start

The port numbers will be printed near the start of the output. The web UI will be on port 5000, collector on 5100, middleware on 5200.

Alternatively you can also start individual services:

honcho start web
honcho start collector
honcho start middleware
honcho start processor

Note the port number when they start up, it will be different than if you start all services together (starts at port 5000)

If you want to modify something that is common across config files like PostgreSQL username/hostname/etc, refer to config/common_database.ini-dist and the “+include” line in the service-specific config files (such as collector.ini and processor.ini). This is optional but recommended.

5.2.11. Troubleshooting

If you are seeing errors after starting Socorro with honcho, it may be that a previous unsuccessful run didn’t clean up all the Python processes.

You can inspect for such stray processes using ps:

ps ax | grep python