From nch@roe.ac.uk Fri Mar 12 16:18:25 2010
Date: Fri, 26 Feb 2010 17:57:05 +0000 (GMT)
From: Nigel Hambly <nch@roe.ac.uk>
To: WFCAM Science Archive Team -- Eckhard Sutorius <etws@roe.ac.uk>,
     Mike Read <mar@roe.ac.uk>, Mark Holliman <msh@roe.ac.uk>,
     Nigel Hambly <nch@roe.ac.uk>, Nicholas Cross <njc@roe.ac.uk>,
     Rob Blake <rpb@roe.ac.uk>, Ross Collins <rsc@roe.ac.uk>
Cc: CCs for WSA weekly meeting minutes distribution -- Andrew Lawrence
    <al@roe.ac.uk>, Andy Adamson <a.adamson@jach.hawaii.edu>,
     Jim Emerson <j.p.emerson@qmul.ac.uk>, Keith Noddle <ktn@star.le.ac.uk>,
     Lorenzo Rimoldini <lgr@roe.ac.uk>, Mike Irwin <mike@ast.cam.ac.uk>,
     Peredur Williams <pmw@roe.ac.uk>, Bob Mann <rgm@roe.ac.uk>,
     Stephen Warren <s.j.warren@ic.ac.uk>
Subject: WFAU VDFS Science Archive weekly project meeting mins, 26/02/10

Minutes of WFAU VDFS Science Archive meeting:  February 26th 2010
-------------------------------------------------------------------------
-------------------------------------------------------------------------

Present:       NCH, PMW, ETWS, RGM, RSC, RPB, MSH, MAR, KTN
Apologies:     JPE, AL, NJC

DoNM: 10am, Friday March 5th 2010 in the VISTA Hut


Actions discharged this week:
-----------------------------

ACTION: NCH to book the Vista Hut meeting room for 2pm Wed 24th.
Discharged; see below for summary of meeting.

ACTION: All to consider possible poster/talk contributions by next
         Friday at which point we'll have a look-see and plug any gaps.
Discharged:
  NCH: SIA services poster contributed to Software parallel session;
  MSH: VO      "      "        "       "      "        "        "
  RSC: VSA talk/poster contribution to the VISTA session
All attendees should register asap to get early-bird discount on day-rates.


Actions partly discharged but continuing:
-----------------------------------------

None this week.


Actions carried forward from 19/02/10 meeting:
----------------------------------------------

ACTION: RPB & NJC (with help when required from RSC) set up cross-neighbour
         tables between Stripe 82 and WSA/VSA surveys as appropriate.
Done for the WSA; same should be done for VISTA VHS, VIKING (& possibly 
VIDEO) in the VSA.


Specific points and new actions:
--------------------------------

Project management:

NCH welcomed KTN to the team (official start date is May 1st at which
point early doors will be arranged).


WFCAM & VISTA updates:

NCH noted the constructive meeting earlier in the week concerning 
operation/procedural/SW mods required following on from the VISTA PSPIs
meeting in Cambridge. The priorities discussed were:

1) Survey team support during the notional proprietary period. MAR noted
that a science-ready DB (not just metadata with possibly basic catalogue
tables) needs to be released to the survey teams well within their
proprietary period to enable early science exploitation. Hence in addition
to the daily/monthly updating ingest DB mirrors (enabling for example
input into QC etc.) the proposal is that WFAU will create for the survey
science teams a science-ready release DB (merged sources, cross-neighbours
and all the usual bells and whistles associated with UKIDSS releases)
within N months of the last data to be included arriving processed at
the archive, where N depends on the amount of data required to be included.
(The suggestion from WFAU is to make the first release a small Early Data
Release for hopefully rapid turn-around.) The remaining 18 (12) - N months
is the survey team's proprietary opportunity to do headline science etc.
for the first (subsequent) data releases, in line with the "general 
conditions" laid down by ESO. Alignment of releases with standard ESO
observing periods, and/or follow-up telescope proposal deadlines is a
possibility, but to start the ball rolling a possible schedule could be:

| 2010              | 2011              | 2012              |
|                   |                   |                   |
| Q1 | Q2 | Q3 | Q4 | Q1 | Q2 | Q3 | Q4 | Q1 | Q2 | Q3 | Q4 | ...
   a     --b--     c              d
   <--EDR-->
   <--DR1-------------->?             e              f

where:

a: start of survey ops
b: cut-off in contents of EDRs: somewhere in Q2/Q3 2010
c: EDRs release to PIs
d: EDRs release to the world (within 18 months of start of survey ops)
e: DR1s release to the PIs
f: DR1s release to the world (within 18+12 months of start of survey ops).

and we would be interested in hearing reaction from upstream in VDFS
on this.
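
A minimal sketch of the schedule arithmetic above, for illustration only.
The start date used for point a, the example values of N and the function
names are all assumptions; only the 18 (12) - N rule and the 18 and 18+12
month limits come from the discussion.

```python
from datetime import date


def add_months(start: date, months: int) -> date:
    """Return the first day of the month lying `months` after start's month."""
    index = start.year * 12 + (start.month - 1) + months
    return date(index // 12, index % 12 + 1, 1)


def proprietary_months(window: int, n_build: int) -> int:
    """Months left to the survey team after WFAU takes n_build months (N)
    to build the release; window is 18 (first) or 12 (subsequent release)."""
    if not 0 <= n_build <= window:
        raise ValueError("N must lie between 0 and the proprietary window")
    return window - n_build


# Assumed date for point a (start of survey ops, somewhere in Q1 2010).
survey_start = date(2010, 2, 1)

edr_world_deadline = add_months(survey_start, 18)       # point d
dr1_world_deadline = add_months(survey_start, 18 + 12)  # point f
```

For example, a hypothetical N of 6 months for the first release would leave
proprietary_months(18, 6) = 12 months for the survey team.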

2) Tile issues: the team discussed the required schema changes related to
the baseline assumption that catalogues will be delivered from all of
paw-prints, unfiltered and filtered tiles. The team agreed that all
detections should go into one detection table but with a new attribute
frameCode to make it possible to unpick the required catalogue detections
without recourse to a relational join with Multiframe. Some specific
requirements from RGMcM for VHS were discussed and NJC sought and received
clarification (thanks!) that some kind of pointers linking the detections
would be good for QC and science: the proposal is to define new tables
for these and to use the existing source merging SW (rather than inefficient
neighbour tables) to create the necessary info. But WFAU notes that
rolling catalogue updates to PIs ingest DB mirror are only possible on
the same period as the data are delivered from the pipeline (i.e. monthly,
not daily).
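
A rough sketch of the frameCode idea, using an in-memory SQLite DB as a
stand-in for SQL Server. The frameCode values, the cut-down table layouts
and the column names other than frameCode are assumptions for illustration;
the actual encoding and schema have not been decided.

```python
import sqlite3

# Hypothetical frameCode encoding (not agreed at the meeting).
PAWPRINT, TILE_UNFILTERED, TILE_FILTERED = 0, 1, 2


def build_demo_db() -> sqlite3.Connection:
    """Create a toy Multiframe/Detection pair holding a few detections."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE Multiframe (
            multiframeID INTEGER PRIMARY KEY,
            frameType    TEXT
        );
        CREATE TABLE Detection (
            objID        INTEGER PRIMARY KEY,
            multiframeID INTEGER REFERENCES Multiframe(multiframeID),
            frameCode    INTEGER NOT NULL
        );
        """
    )
    conn.executemany(
        "INSERT INTO Multiframe VALUES (?, ?)",
        [(1, "pawprint"), (2, "tile"), (3, "filtered tile")],
    )
    conn.executemany(
        "INSERT INTO Detection VALUES (?, ?, ?)",
        [
            (101, 1, PAWPRINT),
            (102, 1, PAWPRINT),
            (201, 2, TILE_UNFILTERED),
            (301, 3, TILE_FILTERED),
            (302, 3, TILE_FILTERED),
        ],
    )
    return conn


def detections_by_code(conn: sqlite3.Connection, frame_code: int) -> list:
    """Select one catalogue type from the single detection table.
    Only Detection is touched: no join with Multiframe is needed."""
    rows = conn.execute(
        "SELECT objID FROM Detection WHERE frameCode = ? ORDER BY objID",
        (frame_code,),
    )
    return [row[0] for row in rows]
```

Here detections_by_code(build_demo_db(), TILE_FILTERED) picks out only the
filtered-tile detections with a simple predicate on the detection table.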

3) ESO-SAF interface: all agreed (in the absence of political arguments
from RGM/AL) that this is not a high priority at present, and we should
concentrate on doing what we do, and doing it well.

4) QC changes: MAR noted that generally the PIs seemed happy with his
presentation at the meeting, and that the baseline approach will be to
apply those QC filters appropriate to VIRCam as are currently defined
for WFCAM, then to iterate with the PIs.

Finally, the team went through the detailed points noted by the attendees
on the WFAU TWiki topic. Some noteworthy items:
i) ppErrBits for VIRCam: RSC noted that new bits can be defined for
detections from underexposed regions of tiles ("ears" now is it?!), for
detections from poor regions of detectors (e.g. upper third of detector
16?) and for propagation of the new average confidence level from the
standard catalogues.
ii) detailed  metadata/catalogue schema changes arising from pipeline
developments (e.g. background subtraction option keywords, filtering
parameters, ESO VIRCam QC1 data from the headers as written by CASU)
iii) integration of ISIS difference imaging, maybe defined in collaboration
with VVV and/or Eamonn Kerins [TBC]
iv) Split of VHS into separate survey DBs for DES (JHK), GPS (JK) and 
ATLAS (YJHK)
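
As an illustration of item i), a sketch of how new quality bits might be
packed into a ppErrBits-style integer. The bit positions, flag names and
the confidence threshold are hypothetical; no assignments were agreed at
the meeting.

```python
from enum import IntFlag


class PpErrBits(IntFlag):
    """Hypothetical bit assignments for the new VIRCam quality flags."""
    UNDEREXPOSED_TILE_REGION = 1 << 0  # detection in a tile "ear"
    POOR_DETECTOR_REGION = 1 << 1      # e.g. a known bad area of a detector
    LOW_AVERAGE_CONFIDENCE = 1 << 2    # average confidence below threshold


def flag_detection(in_tile_ear: bool, in_poor_region: bool,
                   avg_confidence: float,
                   conf_threshold: float = 80.0) -> int:
    """Combine the individual quality conditions into one integer.
    conf_threshold is an assumed value, used here for illustration only."""
    bits = PpErrBits(0)
    if in_tile_ear:
        bits |= PpErrBits.UNDEREXPOSED_TILE_REGION
    if in_poor_region:
        bits |= PpErrBits.POOR_DETECTOR_REGION
    if avg_confidence < conf_threshold:
        bits |= PpErrBits.LOW_AVERAGE_CONFIDENCE
    return int(bits)
```

A user query can then test a single condition with a bitwise AND, e.g.
flag_detection(False, True, 50.0) & PpErrBits.POOR_DETECTOR_REGION.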

Subject to the reaction from upstream in VDFS, some kind of communique
with the PIs (possibly via JPE to coordinate conflicts in priorities) 
will be made on all the above.


Comments and issues arising from CASU minutes:

The team noted the minutes of the meeting of 16th Feb; there were no
major comments.


Networking:

ETWS noted that the last lot of transfers from 09B (Jan 2010) from CASU
are grinding to a halt for some reason. MSH volunteered to investigate
the network cards at the WFAU NAS box end to see if there is a problem
since that's where we're writing to and there were network problems
earlier in the week...


WSA/VSA Operations:

Ingests of 09B catalogue data are going ahead. RPB and ETWS noted
that this is the main bottleneck in operations these days, and that
we should seriously consider splitting the monolithic VSA into
separate survey DBs to enable parallelisation of ingests on different
instances of SQL Server. Since this is such a big operational
change, RPB and ETWS will do a few tests and think a bit more about
the ramifications before we plough ahead with any changes.

ETWS noted:
- Updated the thumbnail creation software to use multiple processors; this
closes the software upgrade of early CU1/CU2 quality control checks.
- Together with MAR, fixed the problems in the list-driven photometry tool
where some of the data weren't returned.
- Started transfers of 09B January data, but transfer speed is very slow.
Also started CU2 on the data that has arrived.


Hardware and Systems:

NCH and RPB noted that the current public catalogue server design of
multiply-attaching release DBs to several SQL Server instances for
performance has fallen foul of the inability of SQL Server to keep query plan
execution statistics up-to-date as nothing can be written to the 
read-only DBs. RPB has a bodged solution whereby the UI logs that
report missing statistics can be parsed to create scripts that must be
run periodically on the servers. This has been done for DR7+ for all
column stats logged so far, but this has not prevented one of the WSA
paper standard queries from timing out in the current release. Obviously
this is one to keep an eye on over the coming months.
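
A sketch of the general log-parsing approach, not RPB's actual script. The
log line format, the statistics naming scheme and the table/column names in
the usage example are assumptions; CREATE STATISTICS itself is standard
T-SQL.

```python
import re

# Hypothetical UI log format: "... missing statistics: <table>.<column>"
MISSING_STATS = re.compile(
    r"missing statistics:\s*(?P<table>\w+)\.(?P<column>\w+)",
    re.IGNORECASE,
)


def statistics_script(log_lines) -> list:
    """Turn logged missing-statistics reports into T-SQL statements,
    emitting each table/column pair once, in order of first appearance."""
    seen = set()
    statements = []
    for line in log_lines:
        match = MISSING_STATS.search(line)
        if match is None:
            continue  # not a missing-statistics report
        table, column = match.group("table"), match.group("column")
        if (table, column) in seen:
            continue  # already scripted
        seen.add((table, column))
        statements.append(
            f"CREATE STATISTICS stat_{table}_{column} ON {table} ({column});"
        )
    return statements


example_log = [
    "2010-02-24 10:01 WARN missing statistics: lasSource.ra",
    "2010-02-24 10:02 INFO query completed",
    "2010-02-24 10:05 WARN missing statistics: lasSource.ra",
    "2010-02-24 10:07 WARN missing statistics: gcsSource.dec",
]
script = statistics_script(example_log)
```

The resulting statements would still have to be run periodically on each
server, as noted above, since new missing columns keep turning up in the
logs.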

MSH noted that he's trying to update the Win infiniband drivers at
the moment in the hope that it cures some of the file-copy performance
bugs when transferring data between SQL Servers.

A beefy new curation server (8-core, 16GB RAM) and also a slightly lower
spec VO Services server are in the process of being ordered; the new
96TB (raw) NAS box has been ordered.


Software:

RSC/NJC have continued to discuss and work on the curation infrastructure
mods necessary to support more flexibly the requirements for the VISTA
surveys.


Survey Data Release:

UKIDSS DR7+ is released (LAS, DXS and GCS) and has been announced to
wsa-announce (but not yet to the UKIDSS consortium - we assume SJW
will do the usual).


Non-survey Data Release:

RSC asked that the retrospective deblend fix for non-survey data
earlier than 08A be closed off if at all possible. NCH noted that
09B non-survey prepared catalogue DB releases will be done once
all the 09B data are ingested. (All 09A non-surveys have been
released apart from one by RPB and MAR.)


Astrogrid deployment & Data Analysis services:

MSH noted that a DSA will be set up for secure access to UKIDSS DR7+
via the VO.


Miscellaneous:

Nothing else this week.


=============================================================
Nigel Hambly                            Tel: +44-131-668-8234
Institute for Astronomy                 Fax: +44-131-668-8416
School of Physics and Astronomy
University of Edinburgh               Email: nch@roe.ac.uk
Royal Observatory
Blackford Hill
Edinburgh EH9 3HJ

The University of Edinburgh is a charitable body, registered
in Scotland, with registration number SC005336.
=============================================================