DMelt:DataAnalysis/8 Metadata
From HandWiki
Member
Metadata
Metadata is a short record which captures main characteristics of an entire data record or objects inside each data record. These characteristics appear to be useful when one needs to find a necessary record as fast as possible, without reading the entire data records. Read Metadata article.
Let us consider a script which fills an event record with many events. It can look as
from jhplot.io import * from jhplot import * ps=math.Poisson(500) def event(entry): p1,p2,p3=P0D('a'),P0D('b'),P0D('c') p1.randomUniform(ps.next(),0,1) p2.randomNormal(ps.next(),0,1) p3.randomNormal(ps.next(),0,1) return [str(entry),p1,p2,p3] f=HFile('data.jser','w') for i in range(5000): ev=event(i) if (i%100 == 0): print 'event=',ev[0] f.write(ev) f.close()
In this example, event consists of 3 data containers with Poisson random numbers. We write all such events into a file. in a zipped serialized form. How to insert a data record characterizing the sum of 3 random values, x+y+x, such that after reading this small record one can navigate to the necessary event without running over all 50000 events?