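Article overview
- Get the title from a URL
- Get the title from an HTML file
- Get all links from a URL
- Get meta information from a URL
- Get all images from a URL
- Get form parameters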
Get the title from a URL
Document doc = Jsoup.connect("http://www.srcmini.com").get();
String title = doc.title();
Get the title from an HTML file
Document doc = Jsoup.parse(new File("e:\\register.html"), "utf-8");
// assumes register.html is located on the E: drive
String title = doc.title();
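If no file is at hand, the same call can be tried on an HTML string. The following is a minimal sketch, not part of the original example: the class name `TitleSketch`, the helper `titleOf`, and the inline HTML are all hypothetical and only illustrate `Jsoup.parse(String)` together with `Document.title()`.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;

// Hypothetical helper class, used here for illustration only.
final class TitleSketch {

    // Parses an HTML string and returns the text of its <title> element.
    // Document.title() returns an empty string when no <title> is present.
    static String titleOf(String html) {
        Document doc = Jsoup.parse(html);
        return doc.title();
    }

    public static void main(String[] args) {
        // Inline HTML stands in for the contents of register.html (an assumption).
        String html = "<html><head><title>Register</title></head><body></body></html>";
        System.out.println("title : " + titleOf(html));
    }
}
```

Parsing from a string keeps the example independent of the file system, which can be handy for quick experiments or unit tests.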
Get all links from a URL
Document doc = Jsoup.connect("http://www.srcmini.com").get();
Elements links = doc.select("a[href]");
for (Element link : links) {
System.out.println("\nlink : " + link.attr("href"));
System.out.println("text : " + link.text());
}
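The loop above prints each `href` exactly as written in the page, so relative links stay relative. As a hedged sketch of one way to get absolute URLs, the snippet below parses with a base URI and calls `Element.absUrl("href")`. The class `AbsoluteLinksSketch`, the method `absoluteLinks`, and the sample HTML are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// Hypothetical helper class, used here for illustration only.
final class AbsoluteLinksSketch {

    // Returns every a[href] target resolved against baseUri.
    static List<String> absoluteLinks(String html, String baseUri) {
        // The base URI lets jsoup resolve relative href values.
        Document doc = Jsoup.parse(html, baseUri);
        List<String> urls = new ArrayList<>();
        for (Element link : doc.select("a[href]")) {
            urls.add(link.absUrl("href"));
        }
        return urls;
    }

    public static void main(String[] args) {
        // Sample markup (an assumption), with one relative and one absolute link.
        String html = "<a href=\"/about\">About</a><a href=\"http://example.com/x\">X</a>";
        for (String url : absoluteLinks(html, "http://www.srcmini.com/")) {
            System.out.println("link : " + url);
        }
    }
}
```

When the document comes from `Jsoup.connect(...).get()`, the base URI is set from the fetched URL, so `absUrl` can be used directly on the selected elements.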
Get meta information from a URL
Document doc = Jsoup.connect("http://www.srcmini.com").get();
String keywords = doc.select("meta[name=keywords]").first().attr("content");
System.out.println("Meta keyword : " + keywords);
String description = doc.select("meta[name=description]").get(0).attr("content");
System.out.println("Meta description : " + description);
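Note that `first()` returns `null` when the page has no matching `meta` tag, in which case the chained `attr` call would throw a `NullPointerException`. A defensive variant might look like the sketch below. The class `MetaSketch`, the method `metaContent`, and the sample HTML are hypothetical names chosen for this illustration.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// Hypothetical helper class, used here for illustration only.
final class MetaSketch {

    // Returns the content attribute of <meta name="..."> or "" if the tag is absent.
    // The name is assumed to be a simple token such as "keywords" or "description".
    static String metaContent(String html, String name) {
        Document doc = Jsoup.parse(html);
        // Elements.first() yields null for an empty result, so check before use.
        Element meta = doc.select("meta[name=" + name + "]").first();
        return meta == null ? "" : meta.attr("content");
    }

    public static void main(String[] args) {
        // Sample markup (an assumption) that has keywords but no description.
        String html = "<head><meta name=\"keywords\" content=\"java, jsoup\"></head>";
        System.out.println("Meta keyword : " + metaContent(html, "keywords"));
        System.out.println("Meta description : " + metaContent(html, "description"));
    }
}
```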
Get all images from a URL
Document doc = Jsoup.connect("http://www.srcmini.com").get();
Elements images = doc.select("img[src~=(?i)\\.(png|jpe?g|gif)]");
for (Element image : images) {
System.out.println("src : " + image.attr("src"));
System.out.println("height : " + image.attr("height"));
System.out.println("width : " + image.attr("width"));
System.out.println("alt : " + image.attr("alt"));
}
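To see what the regular-expression selector actually matches, it can help to run it on a small, known input. The sketch below is an illustration under stated assumptions: the class `ImageSketch`, the method `imageSources`, and the sample markup are hypothetical, while the selector is the one used above.

```java
import java.util.ArrayList;
import java.util.List;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// Hypothetical helper class, used here for illustration only.
final class ImageSketch {

    // Collects the src of every <img> whose src contains .png, .jpg, .jpeg or .gif,
    // ignoring case because of the (?i) flag in the pattern.
    static List<String> imageSources(String html) {
        Document doc = Jsoup.parse(html);
        List<String> sources = new ArrayList<>();
        for (Element image : doc.select("img[src~=(?i)\\.(png|jpe?g|gif)]")) {
            sources.add(image.attr("src"));
        }
        return sources;
    }

    public static void main(String[] args) {
        // Sample markup (an assumption): the .svg image and the img without src are skipped.
        String html = "<img src=\"a.png\"><img src=\"photo.JPG\">"
                + "<img src=\"icon.svg\"><img alt=\"none\">";
        for (String src : imageSources(html)) {
            System.out.println("src : " + src);
        }
    }
}
```

The pattern is not anchored to the end of the value, so a `src` such as `a.png?v=2` would match as well.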
Get form parameters
Document doc = Jsoup.parse(new File("e:\\register.html"), "utf-8");
Element loginform = doc.getElementById("registerform");
Elements inputElements = loginform.getElementsByTag("input");
for (Element inputElement : inputElements) {
String key = inputElement.attr("name");
String value = inputElement.attr("value");
System.out.println("Param name: "+key+" \nParam value: "+value);
}
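The snippet above assumes the form exists and prints every `input`, including ones without a `name`. As a rough sketch of a more defensive version, the code below collects named inputs into an ordered map and returns an empty map when the form id is not found. The class `FormParamsSketch`, the method `formParams`, and the inline form are hypothetical and stand in for `register.html`.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// Hypothetical helper class, used here for illustration only.
final class FormParamsSketch {

    // Maps each named <input> inside the form with the given id to its value attribute.
    static Map<String, String> formParams(String html, String formId) {
        Document doc = Jsoup.parse(html);
        Map<String, String> params = new LinkedHashMap<>(); // keeps document order
        Element form = doc.getElementById(formId);
        if (form == null) {
            return params; // no such form: nothing to report
        }
        for (Element input : form.getElementsByTag("input")) {
            String name = input.attr("name");
            if (!name.isEmpty()) { // skip inputs without a name, e.g. a bare submit button
                params.put(name, input.attr("value"));
            }
        }
        return params;
    }

    public static void main(String[] args) {
        // Inline form (an assumption) in place of the file on disk.
        String html = "<form id=\"registerform\">"
                + "<input name=\"user\" value=\"alice\">"
                + "<input name=\"email\" value=\"\">"
                + "<input type=\"submit\" value=\"Go\">"
                + "</form>";
        formParams(html, "registerform").forEach((key, value) ->
                System.out.println("Param name: " + key + " \nParam value: " + value));
    }
}
```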