什么免费/付费搜索API允许对结果数据进行编程查询和缓存/存储?

时间:2021-08-11 16:23:07

If you've done any serious research into search API's, you know that most of them have a huge slew of TOS/TOU restrictions that make them nearly impossible to use in anything but the most inane applications.

如果你已经对搜索API做了任何认真的研究,你就会知道他们中的大多数都有大量的TOS / TOU限制,这使得他们几乎不可能在除了最愚蠢的应用程序之外的任何东西中使用。

Bing's 2.0 API, Yahoo Search BOSS, Google Places, Google AJAX Search (dead), et al, are far too restrictive for us. I need to run a finite and relatively small number of queries (perhaps 500k) one time only, storing specific data from the results for use within our application.

Bing的2.0 API,Yahoo Search BOSS,Google Places,Google AJAX Search(死机)等等对我们来说太过限制了。我需要一次运行有限且相对较少的查询(可能是500k),从结果中存储特定数据以便在我们的应用程序中使用。

For example, we need to match up business names with their target websites (we have written the algorithm to make a 'best guess' from a set of results if necessary; we just need a vanilla result set). Also, we need to match an address to this company in question.

例如,我们需要将业务名称与其目标网站进行匹配(我们已经编写了算法,以便在必要时从一组结果中进行“最佳猜测”;我们只需要一个vanilla结果集)。此外,我们需要将地址与该公司的地址相匹配。

Unfortunately, I can find ZERO search API's that will allow us to fire off queries in a programmatic, non-user-initiated manner.

不幸的是,我可以找到ZERO搜索API,它将允许我们以程序化,非用户启动的方式启动查询。

We're even quite eager to give someone cold, hard cash for access to this kind of data; Google, Bing, Yahoo, and others simply seem to not want our money (as evidenced by their TOSes)...

我们甚至非常渴望给某人一些冷酷的现金来获取这类数据;谷歌,必应,雅虎和其他人似乎根本不想要我们的钱(正如他们的TOS所证明的那样)......

Any thoughts?

有什么想法吗?

2 个解决方案

#1


3  

A freely accessible index of 5 billion web pages, their page rank, their link graphs and other metadata, hosted on Amazon EC2.

在Amazon EC2上托管的可*访问的50亿网页索引,页面排名,链接图和其他元数据。

http://commoncrawl.org/

http://commoncrawl.org/

Their Terms of Service (or TOU) are pretty reasonable and unrestricted too:

他们的服务条款(或TOU)非常合理且不受限制:

http://commoncrawl.org/about/terms-of-use/

http://commoncrawl.org/about/terms-of-use/

#2


0  

If you know some visual basic I'd suggest playing around with Bing Ad Intelligence. It's a free Excel plugin and all you need to use it is a free Microsoft account.

如果你知道一些视觉基础我会建议玩Bing Ad Intelligence。这是一个免费的Excel插件,您需要使用它是一个免费的Microsoft帐户。

The query limit is 20,000 words per query. You can get information on Clicks, Impressions, CTR, CPC, Average Bid and Total Cost. The query limit is a little lower if you use the more advanced keyword research features.

每个查询的查询限制为20,000个单词。您可以获取有关点击次数,展示次数,点击率,每次点击费用,平均出价和总费用的信息。如果您使用更高级的关键字研究功能,查询限制会略低一些。

#1


3  

A freely accessible index of 5 billion web pages, their page rank, their link graphs and other metadata, hosted on Amazon EC2.

在Amazon EC2上托管的可*访问的50亿网页索引,页面排名,链接图和其他元数据。

http://commoncrawl.org/

http://commoncrawl.org/

Their Terms of Service (or TOU) are pretty reasonable and unrestricted too:

他们的服务条款(或TOU)非常合理且不受限制:

http://commoncrawl.org/about/terms-of-use/

http://commoncrawl.org/about/terms-of-use/

#2


0  

If you know some visual basic I'd suggest playing around with Bing Ad Intelligence. It's a free Excel plugin and all you need to use it is a free Microsoft account.

如果你知道一些视觉基础我会建议玩Bing Ad Intelligence。这是一个免费的Excel插件,您需要使用它是一个免费的Microsoft帐户。

The query limit is 20,000 words per query. You can get information on Clicks, Impressions, CTR, CPC, Average Bid and Total Cost. The query limit is a little lower if you use the more advanced keyword research features.

每个查询的查询限制为20,000个单词。您可以获取有关点击次数,展示次数,点击率,每次点击费用,平均出价和总费用的信息。如果您使用更高级的关键字研究功能,查询限制会略低一些。