- From: Eric Mill <eric@sunlightfoundation.com>
- Date: Wed, 13 Mar 2013 11:40:10 -0400
- To: public-opengov@w3.org
- Message-ID: <CAO1GrFv-qoc1h87J-0g8zbT2YbJzNSMbkhU5ie4nabyq5So=7A@mail.gmail.com>
Hi all, Nice to meet you - I'm Eric Mill, a developer at the Sunlight Foundation. I work on a few things here relevant to this project, including our Congress API <http://sunlightlabs.github.com/congress/> and on the github.com/unitedstates project (along with several non-Sunlighters) that gathers the data around members of Congress. I was just having a conversation with James off-list that he wisely suggested would be good to have on-list. :) My concern was that the data schema was a little too intermingled with implementation details. Namely, I was offput by the spec suggesting specific MongoDB collection names, and discouraging the use of specific out-of-spec fields like "_type" because of how they might interfere with common patterns in MongoDB object document mappers (ODMs). I use these ODMs myself and understand the concern, but I also believe that as a new data schema attempting to get buy in from people and organizations with vastly differing tech stacks, it'd be smaller and less intimidating to make it clear the minimum requirements necessary to publish data that complies with Popolo as a data spec. People interested in building higher level implementations - like interoperable Rails engines, Django apps, CPAN modules, whatever - would probably also want to agree on some best practices there, but those to me are separate from how to publish bulk data whose schema agrees with each other. James, like the good man he is, has already taken my concerns to heart and said he plans <https://github.com/opennorth/popolo-standard/issues/24> to update the spec to make the separation more clear. Still, if anyone else has opinions on this sort of thing, we should talk about them! -- Eric -- Developer | sunlightfoundation.com
Received on Wednesday, 13 March 2013 19:48:16 UTC