69 Commits

Author SHA1 Message Date
Athou dab26af294 allow feeds without entries (fix #565) 2014-03-15 04:24:40 +01:00
Athou f4f3d9ca48 handle invalid feeds having unescaped html entities 2014-03-01 18:19:49 +01:00
Athou 8a633aa648 if link is empty, use guid instead if able (fix #551) 2014-02-26 08:36:40 +01:00
Athou 57abee6cf0 use the url of the feed as the base url to resolve relative entry links when the declared link in the feed is relative 2013-10-03 12:42:05 +02:00
Athou 0d730128f7 use lombok slf4j annotation 2013-08-11 11:45:32 +02:00
Athou 71bb33d710 small cleanup 2013-07-31 12:07:29 +02:00
Athou 3dd4f140e2 refactored the way we handle feed refresh queue 2013-07-27 15:45:03 +02:00
Athou 9cdc364fde index title hash 2013-07-26 08:15:23 +02:00
Athou 8845c54d0c apply formatter 2013-07-25 09:17:33 +02:00
Athou dd0fdfc89e prevent exception when hashing content 2013-07-24 19:19:50 +02:00
Athou 645164997d move author to content 2013-07-24 15:50:05 +02:00
Athou e759d095ed propagate exception message 2013-07-03 07:56:52 +02:00
Athou b8a62777be always store validate date 2013-07-02 09:44:16 +02:00
Athou 9e2892cbe0 strip html from the author field (fix #351) 2013-06-27 16:54:58 +02:00
Athou 158abadf19 use date constructor directly when a calendar is not needed, better performances 2013-06-25 16:57:48 +02:00
Athou cabea47995 throw an exception if there are no items in the feed 2013-06-12 05:12:38 +02:00
Athou 2baa2bbfe5 store average entry interval and use it to decide when to reschedule feed refresh when it returns 304 2013-06-10 12:53:46 +02:00
Athou 703fd91856 validate published date only if not null 2013-06-09 06:42:25 +02:00
Athou ca64278c08 use entry date if more recent than feed published date 2013-06-08 22:35:19 +02:00
Athou 3572c0344e use entry date if more recent than feed published date 2013-06-08 22:31:34 +02:00
Athou 349ed7f223 add a day to make sure we don't have issues with timezones 2013-06-08 22:13:44 +02:00
Athou d820d9738e make sure the date is in the past 2013-06-08 22:00:22 +02:00
Athou a9dc2f05f7 validate the date 2013-06-08 21:56:01 +02:00
Athou 19a4a58d9e use publishedDate from feed to determine if we need to handle feed entries or not 2013-06-08 21:47:19 +02:00
Athou 87d3cb6c34 handle entries with no title 2013-06-04 15:11:35 +02:00
Athou 145285dd20 handle entry content only if the entry is new 2013-05-29 16:42:45 +02:00
Athou 5dad786802 no guid and no link, skip entry 2013-05-28 21:53:58 +02:00
Athou 5b0c9e940b first resolve absolute url then truncate 2013-05-27 11:44:14 +02:00
Athou f04b24796c resolve entry link when url is relative (fix #181) 2013-05-26 07:29:10 +02:00
Athou 02c0ef175d make sure the guid fits in the database field 2013-05-25 18:45:24 +02:00
Athou 9dc67ba6f9 handle all entry content sanity check at the same place 2013-05-25 00:21:00 +02:00
Athou 20a2c90241 truncate url if too long 2013-05-23 15:35:12 +02:00
Athou 43caaab007 null handling 2013-05-22 13:33:22 +02:00
Athou 23512e95a7 add atom links for rss feeds 2013-05-21 13:36:20 +02:00
Athou 185df1d3d7 change log level for some logging lines 2013-05-20 18:12:11 +02:00
Athou c88d3021b8 initial pubsubhubbub support (#44) 2013-05-20 17:44:38 +02:00
Athou badc830535 use the first entry date as publish date 2013-05-20 12:37:37 +02:00
Athou f59198e018 persist entry author 2013-05-19 06:47:37 +02:00
Athou effa009241 new fetchedfeed class 2013-04-25 12:30:21 +02:00
Athou 92e76d469f check for nulls before querying the database 2013-04-23 09:06:35 +02:00
Athou 64747881de detect encoding and trim invalid characters before parsing xml (#60) 2013-04-23 07:20:21 +02:00
Athou bb5d76109a Integer.MAX_VALUE is actually the maximum value in seconds, construct date with milliseconds (#56) 2013-04-22 09:05:04 +02:00
Athou d28c74ea25 use a default date instead of null 2013-04-21 23:07:19 +02:00
Athou 0204dadc3f make sure dates are in an acceptable interval 2013-04-21 16:56:09 +02:00
Athou 7502707930 check if the feed has changed by using publishDate of the feed or the first entry 2013-04-19 12:53:25 +02:00
Athou a4a7f6389c faster way of extracting declared encoding 2013-04-19 09:36:58 +02:00
Athou 5be7c487ad optimizations 2013-04-14 18:51:12 +02:00
Athou 86edd54a21 index hashes 2013-04-14 18:28:48 +02:00
Athou 84f055b67a store entry content externally and load only when needed 2013-04-11 10:31:19 +02:00
Athou 7d982a440a fix #29 2013-04-09 13:37:00 +02:00