Where is web page text generally stored? (html/css, WEB-ITnose)

Jun 24, 2016 pm 12:10 PM

My graduation project is about statistics-based extraction of text from web pages, so I need to know which elements web pages generally put their text in.


Reply to the discussion (Solution)

Haha
It’s hard to say, it’s inside the body anyway
Haha

Haha
It’s hard to say, it’s inside the body anyway
Haha
But I read a paper saying that it is usually placed in a table

"Table" here means the HTML <table> element. In the past, tables were commonly used both for page layout and for holding text. Many websites now use DIV+CSS layout, so the text may be in a <div> rather than in a table.

It may also be stored in a database, which makes it easy to update and maintain.

I feel the question is a bit vague... There are two possibilities: 1. The displayed text, which of course means the content between <body> and </body>. 2. The text of the whole page, that is, everything that makes up the page, between <html> and </html> (whatever comes before that is probably the same on every page, right? Not sure). The second seems to be what web crawlers search. Judging by your topic (statistics-based extraction of web page text), you presumably extract the page content and then search it for specific content to compute statistics... so it should be the second case... Haha
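To make the first possibility concrete, here is a minimal sketch using only Python's standard `html.parser`. It assumes "displayed text" means the character data inside `<body>`, excluding `<script>` and `<style>`. The class name `VisibleTextParser`, the function `visible_text`, and the sample page are made up for illustration. Real pages with CSS-hidden elements would need more work.

```python
from html.parser import HTMLParser


class VisibleTextParser(HTMLParser):
    """Collects text found inside <body>, skipping <script> and <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.in_body = False
        self.skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "body":
            self.in_body = True
        elif tag in self.SKIPPED:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "body":
            self.in_body = False
        elif tag in self.SKIPPED and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        # Keep only non-empty text that is inside <body> and not in a skipped tag.
        if self.in_body and self.skip_depth == 0 and text:
            self.chunks.append(text)


def visible_text(html):
    """Return the body text of an HTML string as one space-joined string."""
    parser = VisibleTextParser()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Hypothetical sample page: the title, CSS and JavaScript are not displayed text.
sample = (
    "<html><head><title>Demo</title><style>p {color: red}</style></head>"
    "<body><h1>News</h1><p>Main text here.</p>"
    "<script>var x = 1;</script></body></html>"
)
print(visible_text(sample))  # News Main text here.
```

The second possibility (everything between the html tags) is simply the raw source string, so no parsing is needed for it.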

This calls for case-by-case analysis of each website. Some sites keep their main content in a table, while others may keep it in a div, or even in dl, ol, or ul elements.

It is placed in the HTML, haha.
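One rough way to do that per-site analysis is to tally how many text characters sit directly inside each kind of container. The sketch below is only an illustration using the standard `html.parser`. The names `ContainerTally` and `tally_text`, the choice of container tags, and the sample markup are assumptions, not anything taken from the paper mentioned in this thread.

```python
from collections import Counter
from html.parser import HTMLParser

# Container tags named in this thread; adjust per site.
CONTAINERS = {"table", "div", "p", "dl", "ol", "ul"}
SKIPPED = {"script", "style"}


class ContainerTally(HTMLParser):
    """Counts text characters by the nearest enclosing container tag."""

    def __init__(self):
        super().__init__()
        self.stack = []        # currently open container tags
        self.skip_depth = 0    # > 0 while inside <script> or <style>
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        if tag in CONTAINERS:
            self.stack.append(tag)
        elif tag in SKIPPED:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in CONTAINERS and tag in self.stack:
            # Pop back to the matching open tag; tolerates unclosed <p>.
            while self.stack.pop() != tag:
                pass
        elif tag in SKIPPED and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self.stack and self.skip_depth == 0:
            self.counts[self.stack[-1]] += len(text)


def tally_text(html):
    """Return {container tag: number of text characters directly inside it}."""
    parser = ContainerTally()
    parser.feed(html)
    parser.close()
    return dict(parser.counts)


# Hypothetical sample markup.
sample = (
    "<body><div><p>Hello world</p><ul><li>one</li><li>two</li></ul></div>"
    "<table><tr><td>cell</td></tr></table></body>"
)
print(tally_text(sample))  # {'p': 11, 'ul': 6, 'table': 4}
```

Running this over a few pages from one site gives a quick picture of whether that site keeps its main text in tables, divs, paragraphs, or lists.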

in is placed in




This is all nonsense

Just put it wherever you like

Quoting the reply from xming4321 on the 1st floor:
Haha
It’s hard to say, it’s in the body anyway
Haha

I saw a paper saying that it is usually placed in a table.

Generally, text is in paragraph <p> elements, because <p> is the standards-compliant terminal block element.
Modern web pages use DIV+CSS for layout,
so the data placed in a <table> is all data with a row-and-column tabular relationship.
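As a small illustration of that distinction, the sketch below collects paragraph text and table rows separately using the standard `html.parser`. The class `ParagraphsAndRows`, the function `split_page`, and the sample markup are invented for this example. It deliberately ignores nested cases such as a paragraph inside a table cell.

```python
from html.parser import HTMLParser


class ParagraphsAndRows(HTMLParser):
    """Collects <p> text and <table> rows separately (no nesting support)."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []   # one string per <p>
        self.rows = []         # one list of cell strings per <tr>
        self._target = None    # "p" or "cell" while collecting text
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag == "p" or (tag in ("td", "th") and self.rows):
            self._target = "p" if tag == "p" else "cell"
            self._buffer = []

    def handle_endtag(self, tag):
        if self._target is None:
            return
        # Join the pieces and normalise runs of whitespace.
        text = " ".join("".join(self._buffer).split())
        if tag == "p" and self._target == "p":
            self.paragraphs.append(text)
            self._target = None
        elif tag in ("td", "th") and self._target == "cell":
            self.rows[-1].append(text)
            self._target = None

    def handle_data(self, data):
        if self._target is not None:
            self._buffer.append(data)


def split_page(html):
    """Return (paragraphs, rows) extracted from an HTML string."""
    parser = ParagraphsAndRows()
    parser.feed(html)
    parser.close()
    return parser.paragraphs, parser.rows


# Hypothetical sample markup.
sample = (
    "<div><p>Text usually lives in <b>paragraphs</b>.</p>"
    "<table><tr><th>Tag</th><th>Use</th></tr>"
    "<tr><td>table</td><td>tabular data</td></tr></table></div>"
)
paragraphs, rows = split_page(sample)
print(paragraphs)  # ['Text usually lives in paragraphs.']
print(rows)        # [['Tag', 'Use'], ['table', 'tabular data']]
```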

My graduation project is about statistics-based extraction of text from web pages, so I need to know which elements web pages generally put their text in.
Have you finished the text extraction program? If so, could you send me a copy for reference? Thank you very much!
