- From: Guglielmo Celata <guglielmo.celata@gmail.com>
- Date: Sat, 7 Sep 2013 08:22:08 +0200
- To: James McKinney <james@opennorth.ca>
- Cc: public-opengov@w3.org
- Message-Id: <D8D64EF8-D2C8-4430-BA3B-A7DDD6D52A57@gmail.com>
You're right, the children property is not necessary at all in the protocol.
It is an implementation mechanism I use it as a cache, in order to reduce the number of queries in the DB.
I agree with the proposed implementation, as shown in the ticket (the child_id refers to the current document).
I would add a current_parent_id property, though, just to keep compatibility with code using the standard parent_id representation for trees.
Guglielmo
Il giorno 06/set/2013, alle ore 20:43, James McKinney <james@opennorth.ca> ha scritto:
> Thanks, Guglielmo.
>
> Is a "children" property necessary? It's possible to traverse a tree using only a "parents" property. It's a little more error-prone to have to maintain the organizational hierarchy in two fields instead of one.
>
> I've created an issue in the tracker: https://github.com/opennorth/popolo-spec/issues/41
>
> James
>
> On 2013-09-04, at 5:53 AM, Guglielmo Celata wrote:
>
>> I understand the *preoccupations* (forgive my limited english vocabulary) regarding the standard parent_id case, which indeed would cover 9 out of 10 use cases.
>> What we came up with in some of the projects is de-normalizing the database, and it's pretty much the solution you're proposing.
>>
>> So, for example, the Organization model would still have a parent (or parent_id) attribute, that I would call current_parent, for clarity.
>> The JSON serialization explicitly would contain both an array of parents and children, with start and end dates, extracted from the external Relation model.
>> The current_parent would usually be the last element of the parents list, and it must have a Null end_date.
>>
>> This would allow to represent time-dependend father-child compositions.
>>
>>
>> An example (pseudo-python) for an organization with parentships changing dynamically over time:
>>
>> Organization
>> { id: ID,
>> current_parent_id: PID3,
>> parents:
>> [
>> { id: PID1, start_date: '2006/07', end_date: '2008/09/01' },
>> { id: PID2, start_date: '2008/09/02', end_date: '2012/04' },
>> { id: PID3, start_date: '2012/04', end_date: NULL },
>> ],
>> childresn: []
>> }
>>
>>
>>
>>
>> As for N-N aggregations, it's a very rare use-case, in the institutional context we're focusing on, I can only think of
>> a rather stretched example.
>>
>> Since in Italy there is a minimum number of MPs required to form a group (in both chambers of the parliament),
>> we have a so called mixed group, with members from various small (usually regional) electoral parties.
>> Now, from time to time, an MP exits from a big group and enters into the mixed group, usually a few months before
>> passing into another different big group alltogether, just in order to disguise the actual flip.
>>
>> If I want to know the composition of the mixed group at any given time, in terms of electoral parties, a single party could easily be into two groups.
>> The electoral party as an organization exists independently of the parliament groups.
>>
>> Of course I could just count the memberships and obtain the same result, but I was just trying to make an example.
>> In other contexts these situation could arise more frequently.
>>
>> I would agree in considering aggregation issue a minor one.
>> Given the focus and context of the popolo project, it could be left out of the specs.
>>
>>
>> Guglielmo
>>
>>
>>
>> Il giorno 04/set/2013, alle ore 00:46, James McKinney <james@opennorth.ca> ha scritto:
>>
>>> Hi Guglielmo,
>>>
>>> For your second question about aggregations (N-N relations between organizations), can you give an example from your work where this is the case?
>>>
>>> For the first question: indeed, there is an issue in the tracker: https://github.com/opennorth/popolo-spec/issues/27 Very few existing standards handle changes over time, so we will likely have to come up with our own solution like the one you suggest.
>>>
>>> The relation you propose would work. It's actually very similar to a Membership in Popolo. (Perhaps an eventual solution would have a Relation superclass with your new class and Membership as subclasses.)
>>>
>>> The challenge when dealing with historical use cases is to make sure that the common use cases are still easy to implement. Here, a common use case is to represent the *current* organizational hierarchy/tree/graph. There already exist many treelibraries in various languages for storing tree structures in databases. Most of these have no method of tracking changes over time, and use a single field like "parent_id" to track the tree structure. An ideal solution to the challenge would allow people to continue to use such libraries.
>>>
>>> Perhaps a solution would be to maintain "parent_id" and "parent" as-is, and to add a new "parents" property, whose value is an array of Relation objects? Implementations can then choose whether to implement either "parent_id" or "parents" or both.
>>>
>>> Depending on how the aggregations issue is resolved, it may make sense to encourage the use of "parents" only.
>>>
>>> Would anyone be against eliminating parent_id, if that were part of a solution?
>>>
>>> James
>>>
>>> On 2013-09-03, at 9:38 AM, Guglielmo Celata wrote:
>>>
>>>> James,
>>>> the Popolo protocol currently allows hierarchical relations between organizations to be mapped through the parent_id attribute.
>>>>
>>>> One possible shortcoming is that this is a permanent relation (it has no start nor end dates), and sometimes, especially in political groups, relations do depend on time.
>>>>
>>>> Another lesser shortcoming is it maps compositions (a group, or a big company and its departments), but leaves out aggregations (members can join more than one group).
>>>>
>>>> In a relational world, I would map it with an external entity:
>>>>
>>>> ------------
>>>> 1| |N
>>>> ----- -----
>>>> | Org |------| Rel |
>>>> ----- 1 N -----
>>>>
>>>> Where Rel is the relation and it would have these fields:
>>>> id
>>>> from_id
>>>> to_id
>>>> start_date
>>>> end_date
>>>>
>>>> from_id and to_id are references to the Org, organizaiton entities.
>>>>
>>>> Don't know how it would translate into the protocol and if the complexity it introduces are worth the issues it tries to solve.
>>>>
>>>>
>>>> Any ideas?
>>>>
>>>> Guglielmo Celata
>>>> Developer
>>>> Associazione Openpolis
>>>>
>>>
>>
>
Received on Saturday, 7 September 2013 06:22:54 UTC