Possible Efficiency Improvements Related to BeautifulSoup? #74

Open
opened 2023-08-01 17:01:12 -04:00 by hz · 0 comments

Right now, running BeautifulSoup on the output HTML page (and running through the tree to find our custom tags) takes up the bulk of page load times. This is especially true for large/complex HTML output. I wonder if it would be better to run these processes a bunch of times, on small pieces of the page as they're being assembled (e.g., as they're being put into cs_problem_spec) rather than doing one big pass at the end.

Maybe worth experimenting.

Right now, running BeautifulSoup on the output HTML page (and running through the tree to find our custom tags) takes up the bulk of page load times. This is especially true for large/complex HTML output. I wonder if it would be better to run these processes a bunch of times, on small pieces of the page as they're being assembled (e.g., as they're being put into `cs_problem_spec`) rather than doing one big pass at the end. Maybe worth experimenting.
hz added the
investigate
label 2023-08-01 17:01:12 -04:00
Sign in to join this conversation.
No Milestone
No Assignees
1 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: catsoop/catsoop#74
There is no content yet.