Home Help Me Help You Transparency Report Greetz

Follow _hyp3ri0n

Scylla Crowdsourced Development

I need help in turning the data dumps I own of a variety of formats, to structured data! This sh** drives you nuts sometimes. Sometimes you'll get sql files you'll that you have to import then export, then reformat, sometimes you'll get an organized csv file, or sometimes a horribly formatted csv-like thing . This isn't very time consuming, but multiply by 100 and it sucks enough that you have to convince yourself that throwing money at the problem is "totally just the responsible thing to do for the project". Anyway, the deal I'm offering is 20 dollars for every database you normalize. Note I am not paying for the data (that is available free via the download link), I simply need help normalizing it. I would appreciate it if you did not download every file as my bandwidth costs will go through the roof. If you'd like the databases please just email me at acaceres@0-sec.net and I will provide you a link to all databases in the system.

  name <-- this is the "real name" of a person as they have entered it

The above are the valid JSON keys in a single-layer JSON object. The fields are fairly self explanatory, if a dump doesn't have a particular field please just leave it out of the object. Here is an example of what your file should look something like at the end:

    ubuntu@ip-172-31-37-193:~/bighd/normalized/05Aug_normed$ head -n 10 linkedin.jl
    {"Email": "mnaaderi323211@gmail.com", "PassHash": "043bab64ffa7cba2dfeddc20a376ba1cab279225", "Domain": "linkedin.com"}
    {"Email": "edgar323t2314@casarealtyinvestments.com", "PassHash": "a22d86b4ca2e489f67bf083e623a2558f1eb2a31", "Domain": "linkedin.com"}
    {"Email": "i_workrrelated66432@yahoo.com", "Password": "xxx", "Domain": "linkedin.com"}
    {"Email": "blkankgh5423@gmail.com", "PassHash": "0be150f6e0b05e5e8135130bbef021c46ce3d089", "Domain": "linkedin.com"}
    {"Email": "scpo2rfirio1@hotmail.com", "PassHash": "b4b99d86e27d43e3740f7167a5c5b59cda044d73", "Domain": "linkedin.com"}
    {"Email": "cha3rlesmfeldman2452@aol.com", "PassHash": "15acb58ea5737e06a1a40953575d0b6fbdbd78ba", "Domain": "linkedin.com"}
    {"Email": "wzerdofgz32468264@gmail.com", "PassHash": "47d4d399df36393755181d1095b548461a3e3096", "Domain": "linkedin.com"}
    {"Email": "awanweeshpande33343@yahoo.com", "PassHash": "0644bf09114451f6d11026728442ebbdcdb015f3", "Domain": "linkedin.com"}
    {"Email": "dmosee@meaer.net", "PassHash": "835ed0ee9f8b21ed70e37febb008d2d6a5cac8a1", "Domain": "linkedin.com"}
    {"Email": "aida.marcial@exitsouth.com2542221", "PassHash": "0e08bfa7a71090ed259506f01eb7d8158d6282ad", "Domain": "linkedin.com"}

(those aren't real emails by the way)

How it works

You can choose up to two databases at a time and "claim" them. In which case they will be marked as claimed (i.e. you will see the word claimed next to it). To claim a database just email acaceres@0-sec.net with the databases you want to help out with, please wait for me to respond before starting work as I am keeping track of conflicts and such. After I get back to you, I offer 20 dollars per database normalized, flat rate despite size or complexity and all that. It's kinda like playing Russian Roulette when unzipping the file - Some of them are in .sql format, some csv, and some are just very badly organized and formatted altogether. Sometimes you have to get fiddly with the string manipulation, and that part can suck a bit (it's not common), but I don't believe I've ever spent more than 10 minutes or so on a db. If you're feeling extra generous and just want to help out that's cool too, I'm glad to put your name up as a contributor on the front page.

**Important note**: This is not meant for data that I do not already own. I provide you the data, and you normalize for me is the deal. I am not and will never be paying for data. That said, if you have data you want to just give me, that's awesome. **Other important note**: This is on hold for a little bit while I build out some automation. It became very difficult to manually track everyone's files and pay them.

When finished please simply drop me a link to the file wherever you have chosen to upload it. *Very important*: please make the file extension a .jl extension (json lines),I won't download files if they are not in this format.

Payment Details

Just send me the database and I can pay you through paypal, venmo, or any cryptocurrency (I prefer btc or xmr, but whatever is fine). You'll just have to trust me that this isn't some half-assed scheme to get free normalized databases. Anyway thanks for participating if you choose to do so!

Other ways to help Scylla

In general if you help me out I'm glad to put your twitter handle, name, resume, tax returns, insta model handle, Tinder profile link, or whatever you want as a contributor on the front page. Below are some ways you could help out!

Give us your Databases!

Got a database we don't have? I'm sure we're missing even some of the larger dumps. With your help we can make the scylla.sh community more powerful than ever - and free to boot! If you have a db we don't have please drop me an email at acaceres@0-sec.net, I will give you credentials to where I store stuff and you can upload it there. Every db helps! Please check relevant laws in your country to ensure that you are not breaking any by sharing this data. I *will not* pay for a database so please don't ask.

Sponsor scylla.sh

Do you hate money and love OSS? We offer sponsorship packages that will get your name front and center on this project if you would like. We are happy to take anonymous sponsors, but 1k/mo. will get your logo on the front page as a contributor. ALL donations go to hosting costs. If donations ever exceed the cost of scylla.sh (ha) I will return the money and keep people as sponsors, asking them to reduce their sponsorship amount.

Follow me on Twitter

It may seem like a small gesture, but if you follow me on twitter you can both stay abreast of scylla news along with my various other OSS. It's a huge help to be able to amplify my message here, so if you can't help with any of the above and happen to be a twitter user, every follow, RT, like, or whatever helps!

Storage Space

If you happen to have a large storage space device that's Internet accessible (cloud or otherwise) and I could use longish-term for this project, that's always helpful. I currently host about 8 TB of data, and that's only going to get bigger as more and more dumps are released. Please check relevant laws in your country first, I am passing around sensitive info that is fine in the US but may be questionable elsewhere.

"Gimme, gimme, gimme, I neeeed, I neeeed!"

-What About Bob

Follow _hyp3ri0n