apache/incubator-stormcrawler
A scalable, mature and versatile web crawler based on Apache Storm
JavaApache-2.0
Issues
- 0
Exclude "__files" from Source Release Artifacts
#1313 opened by rzo1 - 0
sha512 hash of source release is missing the file part
#1312 opened by rzo1 - 0
add build doc for the source release
#1301 opened by pjfanning - 3
files in jars have odd dates
#1300 opened by pjfanning - 1
add DISCLAIMER to jars
#1299 opened by pjfanning - 0
- 0
Add workflow to publish SNAPSHOTS to repository.a.o
#1295 opened by rzo1 - 2
Incubator Branding Policy Compliance
#1254 opened by rzo1 - 0
Add close/cleanup method to ParseFilters
#1290 opened by rzo1 - 0
Storm 2.6.4
#1257 opened by rzo1 - 0
- 0
Enable Dependabot
#1259 opened by rzo1 - 0
Update to Storm 2.6.3
#1251 opened by rzo1 - 2
Inquiry About StormCrawler Features and Capabilities
#1253 opened by alikaz3mi - 0
- 2
HttpProtocol (both okhttp and apache) race condition while having different proxies in different threads
#1247 opened by chhsiao90 - 2
To allow ProxyManager return null (or empty proxy) for not using a proxy for some specific requests
#1246 opened by chhsiao90 - 1
Add RAT Exclusion File for standalone RAT
#1216 opened by rzo1 - 1
Ensure SC can be build without a Docker environment
#1241 opened by rzo1 - 0
Migrate to JUnit 5
#1244 opened by rzo1 - 0
Avoid use of star imports
#1238 opened by rzo1 - 0
Fix Typos in SC
#1236 opened by rzo1 - 0
- 0
Update RAT exclusions
#1215 opened by rzo1 - 0
Switch release source artifact to tar.gz
#1221 opened by rzo1 - 0
Add a disclaimer for binary (test) artifacts
#1220 opened by rzo1 - 1
Redirected sitemaps in SiteMapParserBolt / SiteMapFilter
#1230 opened by mvolikas - 0
Add FileSpout TestCase for Custom Meta Data Injections
#1226 opened by rzo1 - 0
Switch next release version to 3.1.0
#1219 opened by rzo1 - 1
SOLR StatusUpdaterBolt "deletion" stream
#1223 opened by mvolikas - 0
Update Release Docs with Feedback from 3.0 RC2 Vote Thread
#1214 opened by rzo1 - 0
Upgrade dependency Storm 2.6.2
#1188 opened by jnioche - 1
Check license of opensearch/Constants.java
#1211 opened by ayushtkn - 2
- 1
Allow to add a custom DNS suffix to OpenSearch Node addresses returned by Sniffer
#1198 opened by rzo1 - 2
Add forbidden-apis
#1207 opened by tballison - 3
Apple Silicon emulation issue in unit tests
#1209 opened by joshfischer1108 - 0
Add Release Documentation
#1202 opened by rzo1 - 1
Documentation site has path collision on case sensitive file systems.
#1204 opened by joshfischer1108 - 2
Update Releases Page in GitHub and mark non-ASF releases
#1196 opened by rzo1 - 0
Fix license headers
#1200 opened by jnioche - 1
- 3
add incubator footer to stormcrawler web site
#1193 opened by pjfanning - 8
Build website
#1187 opened by jnioche - 0
Delete branch gh-pages
#1194 opened by jnioche - 2
- 4
Continuing ES use 2.11+
#1191 opened by sam-ulrich1 - 1
Set version to 3.0 snapshot?
#1178 opened by jnioche - 0
replace storm-crawler by stormcrawler in pom files
#1176 opened by jnioche - 4
Remove developer list section in pom
#1179 opened by jnioche