Ako imam ulazni tekstualni fajl u kome se nalaze ovi podaci:
1226908800 5782717 230 68.189.11.162 snagajob.com/job-search-engine.aspx gv 253
1226908800 6223791 230 99.148.149.0 gayvideotgp.com/tgp/st/st.php?id=162651&script=1&url=http://www.gaysexmovies.us/885amnsh/alotmpgporn.html&p=75 gv 253
1226908800 4030941 198 173.96.76.204 shop.heaven666.org/free_stuff.php gv 235
1226908800 4664251 230 69.154.95.111 gtoons.info/my-wild-raunchy-son-3/NUL gv 253
1226908800 5727035 230 76.93.202.116 dynimages.neopets.com/nh/spotlight/entry/a_and_cz_mom/1.png?lm=1225240228 gv 253
1226908800 5464520 226 193.159.167.113 google.com /bookmarks/?output=xml&num=10000&zx=13313 gv 253
1226908800 3463527 226 24.4.211.9 google.com /search?hl=en&q=porn&btnI=I'm Feeling Lucky&aq=f&oq= gv 235
1226908800 5267389 230 41.246.217.190 ask.com /inc/js/webadvancedsearch_c.js?v=1.2 gv 253
1226908800 5170696 230 76.174.253.183 gpassionpsp.foros.ws/admin_867a98syysf2/admin_styles.php?sid=206ac79e3839e94fbd45b83b23502e5a gv 253
1226908800 3205614 226 161.51.11.6 search.live.com
http://search.live.com/cashbac...;form=CM&p=2&q=carlson civil suite 2009 resale gv 204
Ako bi znacenje kolona bilo u redosljedu:
timestamp, clientid, locationid, IP, search_engine, url, client_version.
search_engine je npr. google, yahoo...Kada se racunaju domeni ova kolona se spaja sa url za dobijanje pravog url.
Kako bih u php mogao da napisem skripte za racunanje broj unique klijenata po koloni (clientid),
statistiku koliko ima klijenata iz npr.US koliko NON-US, npr top 10 US states na osnovu broja unique klijenata,
npr. top 10 posjecenih domena racunajuci samo unique posjete, npr top 10 posjecenih domena (bez subdomena) racunajuci ukupan broj request-a, te npr. top 10 locationid-a racunajuci samo unique klijente.
Nadam se da moze neko nesto pomoci, treba mi sto je moguce prije. Pozz