66 Commits

Author SHA1 Message Date
Athou 57abee6cf0 use the url of the feed as the base url to resolve relative entry links when the declared link in the feed is relative 2013-10-03 12:42:05 +02:00
Athou 0d730128f7 use lombok slf4j annotation 2013-08-11 11:45:32 +02:00
Athou 71bb33d710 small cleanup 2013-07-31 12:07:29 +02:00
Athou 3dd4f140e2 refactored the way we handle feed refresh queue 2013-07-27 15:45:03 +02:00
Athou 9cdc364fde index title hash 2013-07-26 08:15:23 +02:00
Athou 8845c54d0c apply formatter 2013-07-25 09:17:33 +02:00
Athou dd0fdfc89e prevent exception when hashing content 2013-07-24 19:19:50 +02:00
Athou 645164997d move author to content 2013-07-24 15:50:05 +02:00
Athou e759d095ed propagate exception message 2013-07-03 07:56:52 +02:00
Athou b8a62777be always store validate date 2013-07-02 09:44:16 +02:00
Athou 9e2892cbe0 strip html from the author field (fix #351) 2013-06-27 16:54:58 +02:00
Athou 158abadf19 use date constructor directly when a calendar is not needed, better performances 2013-06-25 16:57:48 +02:00
Athou cabea47995 throw an exception if there are no items in the feed 2013-06-12 05:12:38 +02:00
Athou 2baa2bbfe5 store average entry interval and use it to decide when to reschedule feed refresh when it returns 304 2013-06-10 12:53:46 +02:00
Athou 703fd91856 validate published date only if not null 2013-06-09 06:42:25 +02:00
Athou ca64278c08 use entry date if more recent than feed published date 2013-06-08 22:35:19 +02:00
Athou 3572c0344e use entry date if more recent than feed published date 2013-06-08 22:31:34 +02:00
Athou 349ed7f223 add a day to make sure we don't have issues with timezones 2013-06-08 22:13:44 +02:00
Athou d820d9738e make sure the date is in the past 2013-06-08 22:00:22 +02:00
Athou a9dc2f05f7 validate the date 2013-06-08 21:56:01 +02:00
Athou 19a4a58d9e use publishedDate from feed to determine if we need to handle feed entries or not 2013-06-08 21:47:19 +02:00
Athou 87d3cb6c34 handle entries with no title 2013-06-04 15:11:35 +02:00
Athou 145285dd20 handle entry content only if the entry is new 2013-05-29 16:42:45 +02:00
Athou 5dad786802 no guid and no link, skip entry 2013-05-28 21:53:58 +02:00
Athou 5b0c9e940b first resolve absolute url then truncate 2013-05-27 11:44:14 +02:00
Athou f04b24796c resolve entry link when url is relative (fix #181) 2013-05-26 07:29:10 +02:00
Athou 02c0ef175d make sure the guid fits in the database field 2013-05-25 18:45:24 +02:00
Athou 9dc67ba6f9 handle all entry content sanity check at the same place 2013-05-25 00:21:00 +02:00
Athou 20a2c90241 truncate url if too long 2013-05-23 15:35:12 +02:00
Athou 43caaab007 null handling 2013-05-22 13:33:22 +02:00
Athou 23512e95a7 add atom links for rss feeds 2013-05-21 13:36:20 +02:00
Athou 185df1d3d7 change log level for some logging lines 2013-05-20 18:12:11 +02:00
Athou c88d3021b8 initial pubsubhubbub support (#44) 2013-05-20 17:44:38 +02:00
Athou badc830535 use the first entry date as publish date 2013-05-20 12:37:37 +02:00
Athou f59198e018 persist entry author 2013-05-19 06:47:37 +02:00
Athou effa009241 new fetchedfeed class 2013-04-25 12:30:21 +02:00
Athou 92e76d469f check for nulls before querying the database 2013-04-23 09:06:35 +02:00
Athou 64747881de detect encoding and trim invalid characters before parsing xml (#60) 2013-04-23 07:20:21 +02:00
Athou bb5d76109a Integer.MAX_VALUE is actually the maximum value in seconds, construct date with milliseconds (#56) 2013-04-22 09:05:04 +02:00
Athou d28c74ea25 use a default date instead of null 2013-04-21 23:07:19 +02:00
Athou 0204dadc3f make sure dates are in an acceptable interval 2013-04-21 16:56:09 +02:00
Athou 7502707930 check if the feed has changed by using publishDate of the feed or the first entry 2013-04-19 12:53:25 +02:00
Athou a4a7f6389c faster way of extracting declared encoding 2013-04-19 09:36:58 +02:00
Athou 5be7c487ad optimizations 2013-04-14 18:51:12 +02:00
Athou 86edd54a21 index hashes 2013-04-14 18:28:48 +02:00
Athou 84f055b67a store entry content externally and load only when needed 2013-04-11 10:31:19 +02:00
Athou 7d982a440a fix #29 2013-04-09 13:37:00 +02:00
Athou e6a0247bf1 generate default date when no date is found while parsing entry 2013-04-09 12:52:50 +02:00
Athou b2421eb1ff fix encoding issues with idiots using word for writing content 2013-04-06 16:55:52 +02:00
Athou 648b4017d7 fix NPE when content is empty and fix character encoding issues 2013-04-05 22:38:35 +02:00