feat(websoc-scraper): Dynamic Resolution for Enrollment History by aadi-shanker · Pull Request #300 · icssc/anteater-api

aadi-shanker · 2026-02-05T21:52:14Z

Description

Implements variable-frequency enrollment snapshots based on academic calendar periods. The scraper now captures enrollment data at different frequencies depending on enrollment activity:

ENROLLMENT period (Week 8-10): Every 3 hours, 7am-7pm only (~5 snapshots/day)
ADD_DROP period (Week 1-2): Every 6 hours 24/7, with hourly snapshots on Week 2 Friday 12pm-5pm (~4-8 snapshots/day)
REGULAR period (Week 3-7): Once per week (168 hours)
Between quarters: Once per week for late enrollment

Database-driven frequency tracking checks hours elapsed since the last snapshot and only inserts when thresholds are met. This prevents missed snapshots during scraper outages (self-correcting behavior) while optimizing storage usage.

Related Issue

#134

Motivation and Context

Missing intraday enrollment trends -Want to increase and visualize hourly changes during critical periods

How Has This Been Tested?

I ran a test file with multiple detection unit tests based on given fake dates. I also ran the scraper using fake dates just to double check the processing logic is correct.

Screenshots (if appropriate):

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

My code involves a change to the database schema.
My code requires a change to the documentation.

…date createdAt to timestamp

…ollment management

… and period

…nt duplicated in lib.ts

…ction

… scrapeTerm

…ng in websoc scraper

ParzivalPerhaps

Aadi owes me $10,000

lgtm

…T ON

…ent history query

… clarify data availability

laggycomputer

re-migrate

aadi-shanker · 2026-05-20T00:42:10Z

re-migrate

Done

laggycomputer · 2026-05-20T00:47:27Z

+      .enum(websocSectionTypes, { error: (_issue) => "Invalid sectionType provided" })
+      .optional(),
+    from: z.iso.datetime({ error: "Invalid from date provided" }).optional().openapi({
+      description: "Start of the time range (ISO 8601 timestamp). Optional.",


No need to mark this as optional in text

laggycomputer · 2026-05-20T00:48:18Z

+  units: z.string(),
+  instructors: z.string().array(),
+  meetings: z.object({ bldg: z.string().array(), days: z.string(), time: z.string() }).array(),


I don't think there's a reason to serve these fields and anything besides identifying the section uniquely. Thoughts?

laggycomputer · 2026-05-20T00:49:00Z

      .where(inArray(websocSectionEnrollment.sectionId, transformedSectionRows.keys().toArray()))
-      .orderBy(websocSectionEnrollment.createdAt);
+      .orderBy(
+        sql`DATE(${websocSectionEnrollment.createdAt})`,


There's no need to convert this to DATE before ordering.

laggycomputer · 2026-05-20T00:50:16Z

+      .orderBy(
+        sql`DATE(${websocSectionEnrollment.createdAt})`,
+        websocSectionEnrollment.sectionId,
+        desc(websocSectionEnrollment.createdAt),


I'm not comfortable with two ORDER BYs like this without showing it's not a major performance issue. And shouldn't the snapshots be in increasing order anyway?

laggycomputer · 2026-05-20T00:51:42Z

+    for (const [sectionId, section] of transformedSectionRows) {
+      granularMapping.set(sectionId, {
+        year: section.year,
+        quarter: section.quarter,
+        sectionCode: section.sectionCode,
+        department: section.department,
+        courseNumber: section.courseNumber,
+        sectionType: section.sectionType,
+        sectionNum: section.sectionNum,
+        units: section.units,
+        instructors: Array.from(section.instructors),
+        meetings: section.meetings.map(({ bldg, ...rest }) => ({
+          bldg: Array.from(bldg),
+          ...rest,
+        })),
+        finalExam: section.finalExam,
+        snapshots: [],
+      });


See above about excess fields. That would probably also make this unnecessary.

Why can't we ARRAY_AGG this?

aadi-shanker added 5 commits February 2, 2026 10:15

Dependencies Update

af0eecb

feat(db): schema update add enrollment_res_timestamp migration and up…

b57eee3

…date createdAt to timestamp

feat(scraper): add week helper functions and period detection for enr…

c6fe3a6

…ollment management

fix(scraper): enhance enrollment snapshot logic based on current term…

b74d5fa

… and period

fix(scraper): implement dynamic enrollment snapshot frequency

e8e8bcb

aadi-shanker temporarily deployed to staging-300 February 5, 2026 21:52 — with GitHub Actions Inactive

Merge branch 'main' into enrollmentdatafix

7d7ad2b

aadi-shanker temporarily deployed to staging-300 February 5, 2026 22:36 — with GitHub Actions Inactive

laggycomputer requested review from HwijungK and ParzivalPerhaps February 6, 2026 03:33

aadi-shanker added 4 commits February 11, 2026 11:35

feat(refactor): Moved the week calculation logic into stdlib so it is…

0458569

…nt duplicated in lib.ts

feat(fix): fixing migration

bbe9722

Merge branch 'main' into enrollmentdatafix

9a26118

feat(migration): Migration update

4e80f7f

aadi-shanker temporarily deployed to staging-300 February 12, 2026 01:11 — with GitHub Actions Inactive

aadi-shanker added 2 commits February 12, 2026 17:42

feat(fix): correct week range for ADD_DROP period in detectPeriod fun…

9bd0a71

…ction

feat(fix): refactor snapshot logic by moving it from doChunkUpsert to…

e21f2c7

… scrapeTerm

aadi-shanker temporarily deployed to staging-300 February 13, 2026 01:49 — with GitHub Actions Inactive

aadi-shanker marked this pull request as ready for review February 13, 2026 01:53

This comment was marked as resolved.

Sign in to view

feat(fix): update week calculation logic and snapshot decision handli…

81f0fe0

…ng in websoc scraper

aadi-shanker temporarily deployed to staging-300 February 13, 2026 20:14 — with GitHub Actions Inactive

ParzivalPerhaps approved these changes Feb 16, 2026

View reviewed changes

laggycomputer requested changes Feb 16, 2026

View reviewed changes

HwijungK reviewed Feb 16, 2026

View reviewed changes

Comment thread apps/data-pipeline/websoc-scraper/src/lib.ts Outdated

HwijungK reviewed Feb 16, 2026

View reviewed changes

Comment thread apps/data-pipeline/websoc-scraper/src/lib.ts Outdated

feat(fix): fixed timezone dependency problem by just converting to PST

1817954

aadi-shanker temporarily deployed to staging-300 February 17, 2026 22:06 — with GitHub Actions Inactive

aadi-shanker requested a review from laggycomputer February 17, 2026 22:10

laggycomputer requested changes Feb 19, 2026

View reviewed changes

aadi-shanker added 4 commits May 16, 2026 15:20

feat: add granular enrollment history endpoint

9615991

Merge branch 'main' into enrollmentdatafix

dd1392e

refactor: remove onConflictDoNothing now that createdAt is timestamp

e264c85

refactor: downsample existing enrollment history endpoint via DISTINC…

19dabdf

…T ON

aadi-shanker temporarily deployed to staging-300 May 18, 2026 00:52 — with GitHub Actions Inactive

Merge branch 'main' into enrollmentdatafix

577e7ad

aadi-shanker temporarily deployed to staging-300 May 18, 2026 02:48 — with GitHub Actions Inactive

aadi-shanker requested a review from laggycomputer May 18, 2026 02:51

laggycomputer removed request for ParzivalPerhaps and laggycomputer May 18, 2026 02:52

HwijungK reviewed May 18, 2026

View reviewed changes

Comment thread apps/api/src/services/enrollment-history.ts

Comment thread apps/api/src/services/enrollment-history.ts Outdated

HwijungK reviewed May 18, 2026

View reviewed changes

aadi-shanker added 4 commits May 18, 2026 12:51

refactor: extract section rows query into helper

a3d2c32

refactor: Updated doc description

8d94a6d

refactor: make date fields optional and update validation for enrollm…

08087ed

…ent history query

refactor: update description for granular enrollment history route to…

a38bebe

… clarify data availability

aadi-shanker temporarily deployed to staging-300 May 18, 2026 20:11 — with GitHub Actions Inactive

aadi-shanker requested a review from HwijungK May 18, 2026 20:13

HwijungK approved these changes May 19, 2026

View reviewed changes

HwijungK added 3 commits May 19, 2026 14:37

remigrating

762edbf

Merge branch 'main' into enrollmentdatafix

e4d676c

remigrated

7447c62

HwijungK temporarily deployed to staging-300 May 19, 2026 21:38 — with GitHub Actions Inactive

laggycomputer requested changes May 20, 2026

View reviewed changes

aadi-shanker added 3 commits May 19, 2026 17:37

migration delete

26cc24d

Merge branch 'main' into enrollmentdatafix

50bcf02

remigrate

3e77a5e

aadi-shanker deployed to staging-300 May 20, 2026 00:41 — with GitHub Actions View deployment

laggycomputer requested changes May 20, 2026

View reviewed changes

Conversation

aadi-shanker commented Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issue

Motivation and Context

How Has This Been Tested?

Screenshots (if appropriate):

Types of changes

Checklist:

Uh oh!

This comment was marked as resolved.

Uh oh!

ParzivalPerhaps left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

laggycomputer left a comment

Choose a reason for hiding this comment

Uh oh!

aadi-shanker commented May 20, 2026

Uh oh!

laggycomputer May 20, 2026

Choose a reason for hiding this comment

Uh oh!

laggycomputer May 20, 2026

Choose a reason for hiding this comment

Uh oh!

laggycomputer May 20, 2026

Choose a reason for hiding this comment

Uh oh!

laggycomputer May 20, 2026

Choose a reason for hiding this comment

Uh oh!

laggycomputer May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

aadi-shanker commented Feb 5, 2026 •

edited

Loading