Sdss id zway #425

zachway1996 · 2024-04-18T13:54:45Z

Added functionality for creating the catalogidx_to_catalogidy match between crossmatch versions.

Added method to append new sdss_ids to the tables sdss_id_flat and sdss_id_stacked

…, 31

…ments

albireox · 2024-04-23T18:28:59Z

HI @zachway1996. Sorry for not commenting on this earlier.

What is the status of this PR? It seems to be mostly in good shape, but if I wanted to test the procedure under append_to_sdss_id.py to add catalogids to the sdss_id tables, is that ready to go?

I have a few comments, some of which maybe are not really relevant or you're already working on them:

Right now, if I understand the code, you need to provide a list of new catalogids to add to sdss_id. I think that's mostly fine but I wonder if we should have a mode in which the code automatically looks for missing catalogids from targetdb.target and catalogdb.target_non_carton and add them. Maybe that's what append_to_tables without a catalogid_list does? There is not documentation for the parameters of that class which would be useful.
To keep in line with the rest of the package, could you make the classes CamelCase (e.g., append_to_tables -> AppendToTables)? A name as append_to_tables is a bit confusing because it seems that it's a function that does something, while it's a class on which you need to call some methods, but that's not really a bit deal.
Could we consolidate all the sdss_id functions and classes under a single sdss_id.py file? Or if you prefer to split things up, feel free to create a sdss_id/ submodule. I also would rename files that seem imperative but are not actual scripts, like create_catalogidx_to_catalogidy.py, but again, not a big deal.
The package uses flake8 and isort linting. I think you have most of it, but I would install it and check what's missing; it seems to be mostly spaces around operators.
Is there documentation for this procedure? I think Pramod and I will like a tour once you think it's ready, but some written documentation may be useful.
Something that doesn't worry me too much, but I think is worth thinking about is that the code is very hardcoded for the three versions we have right now ... That's probably fine as we most likely won't need to sdss_id-match another version, but it also means that if we ever need to do it a very deep knowledge of the code is needed ...

zachway1996 · 2024-04-29T15:18:52Z

Howdy @albireox,

What is the status of this PR? It seems to be mostly in good shape, but if I wanted to test the procedure under append_to_sdss_id.py to add catalogids to the sdss_id tables, is that ready to go?

Yes, the code is functional. But given some of your suggestions below, I should probably change some things.

Right now, if I understand the code, you need to provide a list of new catalogids to add to sdss_id. I think that's mostly fine but I wonder if we should have a mode in which the code automatically looks for missing catalogids from targetdb.target and catalogdb.target_non_carton and add them. Maybe that's what append_to_tables without a catalogid_list does? There is not documentation for the parameters of that class which would be useful.

This should work for both cases, providing a list of catalogids and linking to a table to search for new sdss_ids. However, most of my testing has been on the former case. I'll test the latter and write some documentation on how to use that.

To keep in line with the rest of the package, could you make the classes CamelCase (e.g., append_to_tables -> AppendToTables)? A name as append_to_tables is a bit confusing because it seems that it's a function that does something, while it's a class on which you need to call some methods, but that's not really a bit deal.

That's an easy fix.

Could we consolidate all the sdss_id functions and classes under a single sdss_id.py file? Or if you prefer to split things up, feel free to create a sdss_id/ submodule. I also would rename files that seem imperative but are not actual scripts, like create_catalogidx_to_catalogidy.py, but again, not a big deal.

I think I would prefer to keep the two separate because they serve different purposes. create_catalogidx_to_catalogidy searches for catalogid pairs whereas all of the sdss_id logic is in append_to_sdss_id. I'll put them in a submodule.

The package uses flake8 and isort linting. I think you have most of it, but I would install it and check what's missing; it seems to be mostly spaces around operators.

Realized this after I submitted the pull request, I've downloaded a linter now.

Is there documentation for this procedure? I think Pramod and I will like a tour once you think it's ready, but some written documentation may be useful.

There is documentation for how the schema is set up here, but I would agree that it needs to be fully fleshed out.

One worry I have is that assigning the sdss_id is not definite anymore in the way I have written things. This is primarily because I assigned sdss_id-s based on a SERIAL sequence as new matches were appended to the list. But, since there have been changes to the v31 crossmatch, I'm not sure that sdss_id-s would match up. I can email you and Pramod about this.

Something that doesn't worry me too much, but I think is worth thinking about is that the code is very hardcoded for the three versions we have right now ... That's probably fine as we most likely won't need to sdss_id-match another version, but it also means that if we ever need to do it a very deep knowledge of the code is needed ...

This is something I've thought about but haven't found a way around. A lot of the sdss_id logic is based on assuming that v21, v25, and v31 already (mostly) exist. Adding a new crossmatch would work within the crossmatch but many new rows could be created (e.g. a stellar source in v31 is resolved into two sources in v??).

I could write some text about how one would go about adding a v??, if that would help. Also could make comments in the code along with the text.

… into sdss_id_zway

Targetdb

zachway1996 added 15 commits September 25, 2023 10:12

Added creation of catalogidx_to_catalogidy for xmatch versions 21, 25…

00564ab

…, 31

Removed old files

d85f940

Added append_* and some create_I functionality

a5f3dfe

Fixing weird peewee things and changing some queries over to raw SQL

85999cf

Finished ability to append sdss_ids

3f74327

Switched to single model for sdss_id_*

ce64f11

Transferred sdss_id scripts to /python and fixed bugs

18ba174

Turned off saving log to file

e444618

Switched from sandbox to catalogdb

d7b6334

Switched back to sandbox because of the fear

4a36f93

Switched to catalogdb instead of the sandbox. Removed unnecessary com…

f0e542d

…ments

Documented sdss_id pk sequences

4ad9f44

Testing git rm --cached

09ecc0f

Removing unnecessary files

e91bf50

Added comments to add_pk_defaults_to_sdss_id_tables.sql

99f9e97

zachway1996 requested a review from albireox as a code owner April 18, 2024 13:54

Zachary Way added 6 commits April 18, 2024 16:07

Fixed some linting issues

9e7f758

Cloned python scripts from python/ to documentation

ab56557

Added documentation and fixed linting issues

c3cd017

Fixed more linting issues

d24b446

Fixed more linting issues

60f26da

Fixed isort errors

2c4d087

Zachary Way and others added 5 commits April 29, 2024 16:24

Converted functions to CamelCase

1b0b21a

Moving scripts to sdss_id submodule

394f9ce

Fixed __init__ imports

ed7874d

Fixed local import for __init__ and append_to_sdss_id

eb8352b

Fixed linting issue

488d833

albireox approved these changes May 2, 2024

View reviewed changes

Zachary Way added 28 commits July 29, 2024 17:55

Removed panstarrs split_query

c10e5ae

Merge branch 'sdss_id_zway' of https://github.com/sdss/target_selection…

9167e8f

… into sdss_id_zway

Added last_updated column to sdss_id_stacked

17758d7

Expanded Examples at end of append_to_sdss_id

83c8d62

Fixed typo

e929465

Fixed temp_catalogid_v? missing catalogids

84d0fcd

Added default option of running on targetdb

6213fa1

Added example of running on targetdb

fe95396

Linting and changed to TempMatch.create_table()

85c9d64

Changed to UniqueMatch.create_table()

3557ca5

Added indexes to SdssIdStackedAddendum

dcf5a10

Manually added indexing

d0b1910

Editted create_temp_catalogid_lists to allow for

3b888cd

Targetdb

Typo 'catalgid'

2010f27

Shortened TempMatch index labels

ad77c67

Shortened index labels in Unique Match

12aaa5e

Added new catalogs to individual_crossmatches

5f03345

Added Rank to SdssIdFlatAddendum

1c7f8b6

rank in SdssIdFlatAddendum is nullable

2cf7a29

Rank joins on sdss_id/catalogid pair instead of pk

a9365e0

Typo "Rank" to "rank"

50c6972

Added documentation

89405f7

Linting issues

c1fce48

More Linting?

7d9e578

More more linting

efda4a9

Merge branch 'main' into sdss_id_zway

eaf2039

Linting issues

45c6f14

Added README to catalog_to_catalog

8d6c865

albireox merged commit a6569d6 into main Nov 14, 2024
4 of 5 checks passed

albireox deleted the sdss_id_zway branch November 14, 2024 19:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sdss id zway #425

Sdss id zway #425

zachway1996 commented Apr 18, 2024

albireox commented Apr 23, 2024

zachway1996 commented Apr 29, 2024 •

edited

Loading

Sdss id zway #425

Sdss id zway #425

Conversation

zachway1996 commented Apr 18, 2024

albireox commented Apr 23, 2024

zachway1996 commented Apr 29, 2024 • edited Loading

zachway1996 commented Apr 29, 2024 •

edited

Loading