Posted Thursday, March 7 at 10:27 PM (9 months ago)
I’ve been working on getting Pushl much more stable and reliable, particularly around a persistent “too many open files” error I was having, which turned out to be primarily due to a fd leak in the caching routines. Oops.
Anyway, there’s also seemingly a problem with how
aiohttp manages its connection pool, at least on macOS, so I’ve disabled connection keep-alive by default. However, if you still want to use keep-alive, there’s now a
--keepalive option to allow you to do that. I’m finding that it doesn’t really improve performance all that much anyway.
This is feeling beta-ready but I’ll give it a few days for other issues to shake out first.
Posted Thursday, March 7 at 12:05 AM (9 months ago)
So, I just released v0.2.0 of Pushl. It was a pretty big change, in that I pretty much rewrote all the networking stuff, and fixed some pretty ridiculous bugs with the caching implementation as well.
The main thing is now it’s using async I/O instead of thread-per-connection, so it’s way more efficient and also times out correctly.
And oh gosh, I had so many tiny but critical errors in the way caching was implemented – no wonder it kept on acting as if there was no cached state. Yeesh.
Anyway, I’ll let this run on my site for a few days and if I like what I see I’ll upgrade it to beta status on PyPI.
Posted Monday, March 4 at 3:47 PM (9 months ago)
I’ve released Publ v0.3.19, which now finally has a tagging system, which is only one of the oldest issues that was still open.
Here’s a list of what’s been added or changed since 0.3.18:
I want to thank Karina Antonio for implementing image cropping.
Posted Wednesday, February 27 at 9:38 PM (10 months ago)
I’ve just released v0.3.18, with the following changes:
- Add date grouping properties to entry
- Add a
pages property to
- Provide the current
category object to the error handler
- Support linking to non-image/non-entry local files
- Added, then removed, some performance micro-optimizations that only caused problems
More details about the major changes below!
Update: I released a hotfix as 0.3.18.1 because there was a last-minute bug that snuck in while I was trying to silence a new pylint error. Oops.
Posted Wednesday, February 13 at 6:20 PM (10 months ago)
It’s been a while since I’ve had a chance to work on Publ, but the great thing is that I actually had a reason to work on it for my day job. Which is to say I’m finally being paid to work on Publ. ;)
Changes since 0.3.14:
- Add requirement for Arrow 0.13.0 (issue 41)
- Fix a dumb tpyo that was the cause of issue 158
- Don’t rewrite DRAFT files; fixes 137
- Move sample-site files back to the library repo rather than in the doc repo
- Fix the way we map malformed category URLs (issue 156)
- Update upstream library versions
- Move version number to publ module
- Allow empty slug-text in entry route (fixes 161)
- Process HTML entries, to finally handle issues 136 and 154.
Some more information about that last one under the cut!
Posted Monday, January 14 at 9:28 PM (11 months ago)
I ended up doing some more work on Pushl and have now released v0.1.7. The major changes:
- Did a bunch of refactoring to make the code a little cleaner and handle configuration more appropriately
- Added a configurable timeout for connections (which now defaults to 15)
- Added a
--version option on the command line arguments
Also, some suggested usage ideas below the cut!
Posted Sunday, January 13 at 8:48 PM (11 months ago)
It’s been a while since I’ve updated Pushl but today I released v0.1.6. It includes the following fixes:
- Now it supports Pingback as well as Webmention
- Improved the threading defaults and connection pooling
- Also checks entries for updates even if the feed didn’t change (in case something changed in the more text or page metadata or whatever)
Anyway, it should just be a
pip install --upgrade pushl (or
pipenv update) away.
Posted Saturday, December 22 at 1:35 AM (a year ago)
While I’m fixing random stuff in Publ, I figured I’d finally fix some problems with Pushl too. Nothing major here, just:
- Stability: Fixed a bug where feeds that don’t declare links caused the worker to die before entries got processed
- Performance: Now we use a global connection pool (so connections can be reused)
- Fixed a minor correctness issue with archive feeds (which actually doesn’t make any difference in the real world but whatever)
Posted Saturday, December 15 at 1:08 AM (a year ago)
v0.3.11 is now released, with the following changes:
- A more complete fix for how to handle image sets and inline images with respect to paragraphs
- Better cleanup for spurious empty paragraphs
- Improved internal entry link handling
Detailed descriptions of the changes are below.
Posted Wednesday, November 28 at 3:33 PM (a year ago)
This entry marks the release of Publ v0.3.9. It has the following changes:
more_text and related functionality to image sets (an example being visible over here)
- Improved and simplified the caching behavior (fixing some fiddly cases around how ETags and last-modified worked, or rather didn’t)
I also made, and then soon reverted, a change around how entry IDs and publish dates were automatically assigned to non-published entries. I thought it was going to simplify some workflow things but it only complicated the code and added more corner cases to deal with, all for something that doesn’t actually address the use case I was worried about. So never mind on that.
(What happened to v0.3.8? I goofed and forgot to merge the completed
more_text et al changes into my build system first. Oops.)
See below for more on the caching changes.
Posted Wednesday, November 28 at 1:18 AM (a year ago)
I just released Pushl v0.1.3, which adds some minor performance optimizations and a bug fix.
Originally I was hoping to have a major performance optimization, in the form of having rewritten Pushl from thread-per-connection to async operation, but unfortunately I ran into a bunch of problems with it. Mostly that I was running into a “too many open files” error and I couldn’t figure out what was causing a descriptor leak. I have the work-in-progress branch online if anyone wants to take a look at it.
Anyway, the reason I went down this route is because I added WebSub subscriber support to my fork of Feed-On-Feeds, which makes it so that WebSub-enabled RSS and Atom feeds will push their updates to your reader instead of having to wait for a polling interval.
You can read more about some of my other thoughts on a blog entry that quickly devolves into a rant, if you’re so inclined.
Posted Wednesday, October 10 at 10:41 PM (a year ago)
I’ve gotten Pushl to the point that I’m confident in releasing it as a full alpha and having a cron job run it every 15 minutes. Whew.
I’ve also added some
h-entry markup to the Publ site templates so that hopefully the notifications appear at least somewhat reasonably elsewhere.
I really need to redo these site templates now that I have a better idea of how they go together. Also the quickstart guide could be a lot better.
Posted Monday, October 8 at 11:53 PM (a year ago)
I finally got around to releasing a very rough prototype of Pushl to pypi. It only sends out WebSub notifications for now (does anyone even use those?), but I’ll work on actually implementing WebMention soon.
Also, recently someone pointed out to me fed.brid.gy which makes it easy to turn a static site into an ActivityPub source. At some point I’ll experiment with setting up Publ for this; it looks like it’s just a matter of adding a couple of additional route rules to Publ, so that will probably go into an advanced configuration guide if I ever get around to making such a thing. (Or it could actually be added to Publ directly but there isn’t much of a reason for that, IMO.)
Posted Monday, October 1 at 11:16 PM (a year ago)
I’ve started working on Pushl in earnest now, and one thing that was really bugging me about this is that anything which polls feeds and entries would really benefit from having client-side cache control working. Which was a big missing feature in Publ.
Well, I finally implemented it, and I’m pretty happy with how I did it.
The short version: for any given view it figures out (pessimistically) what’s the most recent file that would have affected the view (well, within reason; it only looks at the current template rather than any included templates, which is pretty difficult to do correctly) and uses that to generate an ETag (via metadata fingerprint) and a Last-Modified time (based either on the file modification time or the time the entry was actually published).
There’s probably a few corner cases this misses but in general this makes client-side caching of feeds and such work nicely.
Posted Tuesday, September 25 at 2:55 PM (a year ago)
I found a few more annoying bugs that were shaken out from the whole PonyORM transition, as well as a couple of bugs in the new shape functionality. There’s probably a few more of these bugs lurking in the codebase (I mean, in addition to the existing bugs I know about), but here’s what’s changed:
- Image shape bugs:
- Fix some
FileNotFound handling on images (so
shape errors propagate correctly)
class work correctly per the documentation
- PonyORM bugs:
Posted Thursday, September 20 at 10:58 PM (a year ago)
Did you know that CSS3 has a style called
shape-outline? It’s pretty neat, it makes it so that a floated object gets a shape based on the alpha channel of its specified image. But it’s kind of a pain to set up; in plain HTML it looks something like this:
<img src="/path/to/image.png" width="320" height="320"
and if you want a different shape mask for your image than its own alpha channel, you have to do a bunch of stuff like making sure that the image sizes are the same and whatever.
Posted Wednesday, September 19 at 2:27 AM (a year ago)
For a number of reasons, I have replaced the backing ORM. Previously I was using peewee, but now I’m using PonyORM. The primary reason for this is purely ideological; I do not want to use software which is maintained by someone with a track record of toxic behavior. peewee’s maintainer responds to issues and feature requests with shouting and dismissive snark; PonyORM’s maintainer responds with helpfulness and grace. I am a strong proponent of the latter.
PonyORM’s API is also significantly more Pythonic, and rather than abusing operator overloads for clever query building purposes, it abuses Python’s AST functionality to parse actual Python expressions into SQL queries. Seriously, look at this explanation of it and tell me that isn’t just amazing.
Posted Wednesday, June 27 at 8:01 PM (a year ago)
So, sorry to anyone who was subscribed to the RSS feed for this and got spammed with v0.1.24 release announcements. I made a mistake and pushed a version of the entry that didn’t have a canonical ID assigned yet, and as a result, every time Heroku spun up, it assigned a new ID. This is something that’s happened before and I really ought to do something about it.
Three things come to mind:
- Figuring out how to always make IDs get assigned in an idempotent manner (hard to do correctly)
- Don’t run on Heroku so the assignments persist between executions (easy)
- Add a pre-push hook to the repo that verifies that all entries alread have an assigned ID (???)
2 seems like the easiest approach for now, so that’s what I’ll probably do.
Posted Friday, May 18 at 12:00 PM (2 years ago)
There’s an old joke in programming, that the two hardest things to do are naming things, cache invalidation, and off-by-one errors. But this doesn’t pay sufficient respect to one of the other hardest things, namely handling date and time.
Posted Tuesday, May 15 at 5:21 PM (2 years ago)
Today I got two major bits of functionality in: Publ will now asynchronously scan the content index (which speeds up startup and fixes some annoying race conditions with entry creation), and it also asynchronously generates image renditions (which makes pages not take forever to load on first render, and will also use multiple CPU cores if available). Seems to work well so far.
I was running into scaling problems with beesbuzz.biz (what with there being a couple thousand entries and some pages with hundreds of images on it) and this keeps it feeling pretty good.
So, this brings us up to version 0.1.14.