privacysandbox/aggregation-service

Confused about the output_domain.avro

Closed this issue · 2 comments

Hi aggregation-service team,

I'm confused about the "output_domain.avro" file used for producing a summary report locally. In your Node.js example code, how can I generate an "output_domain.avro" for the aggregatable report?

Here is your sample doc: https://github.com/privacysandbox/aggregation-service/blob/main/docs/collecting.md#collecting-and-batching-aggregatable-reports

{
    "bucket": "\u0005Y"
}

Will this "output_domain.avro" work for your Node.js example?

If convenient, could you explain what the contents of this domain file are based on? Thanks a lot!

Hi @yanghuang1028,

The output_domain.avro file contains the domain keys that you want to retrieve from the aggregatable reports generated by measurement APIs such as the Attribution Reporting API and the Private Aggregation API. Currently, you must pre-declare the keys that you want to get from the aggregatable reports. When you run a batch in the Aggregation Service, you need to include the location of your output domain files.

The output domain is created using an Avro schema with a single key-value pair: the key is "bucket" and the value is the domain key, for example:

{
    "bucket": "\u0005Y"
}

The domain key format is a 128-bit integer encoded as a 16-byte big-endian bytestring.
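As a rough illustration of that encoding (a sketch only, not the service's own tooling), the Python snippet below turns an integer key into a 16-byte big-endian bytestring. It also decodes the sample string above, assuming the usual Avro JSON convention that a bytes value is written as a string with one code point (U+0000 to U+00FF) per byte. The function names are made up for this example.

```python
def bucket_to_bytes(bucket: int) -> bytes:
    """Encode a bucket key as a 16-byte big-endian bytestring."""
    if not 0 <= bucket < 2**128:
        raise ValueError("bucket must fit in 128 bits")
    return bucket.to_bytes(16, "big")


def bytes_to_avro_json(raw: bytes) -> str:
    """Map each byte to the code point of the same value (assumed Avro JSON form)."""
    return raw.decode("latin-1")


def avro_json_to_bucket(text: str) -> int:
    """Read an Avro-JSON-style bytes string back as a big-endian integer."""
    return int.from_bytes(text.encode("latin-1"), "big")


key = bucket_to_bytes(1369)
print(key.hex())                       # 00000000000000000000000000000559
print(avro_json_to_bucket("\u0005Y"))  # 1369
```

Read this way, the sample value "\u0005Y" is the two bytes 0x05 0x59, which is the integer 1369 in big-endian order.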

Domain keys are 128-bit keys designed by you which will contain different dimensions that you would wish to collect/aggregate. This section of the Attribution Reporting API handbook should help explain more about keys.
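To make the idea of packing dimensions into one key concrete, here is a minimal Python sketch. The layout is purely hypothetical (a campaign ID in the upper 64 bits and a geo ID in the lower 64 bits), and `make_bucket_key`, `campaign_id` and `geo_id` are names invented for this example. How you split the 128 bits is up to you.

```python
def make_bucket_key(campaign_id: int, geo_id: int) -> int:
    """Pack two 64-bit dimensions into one 128-bit key (hypothetical layout)."""
    if not 0 <= campaign_id < 2**64:
        raise ValueError("campaign_id must fit in 64 bits")
    if not 0 <= geo_id < 2**64:
        raise ValueError("geo_id must fit in 64 bits")
    # Upper 64 bits: campaign, lower 64 bits: geo.
    return (campaign_id << 64) | geo_id


key = make_bucket_key(campaign_id=5, geo_id=89)
# Each distinct key you want in the summary report becomes one 16-byte
# big-endian "bucket" entry in the output domain.
encoded = key.to_bytes(16, "big")
print(encoded.hex())  # 00000000000000050000000000000059
```

Every combination of dimension values that you want to see in the summary report needs its own key declared in the output domain.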

Thanks a lot! This really helped me!